Picture for Zhiheng Liu

Zhiheng Liu

See, Plan, Rewind: Progress-Aware Vision-Language-Action Models for Robust Robotic Manipulation

Add code
Mar 10, 2026
Viaarxiv icon

VecGlypher: Unified Vector Glyph Generation with Language Models

Add code
Feb 25, 2026
Viaarxiv icon

HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming

Add code
Dec 24, 2025
Figure 1 for HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
Figure 2 for HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
Figure 3 for HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
Figure 4 for HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming
Viaarxiv icon

OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory

Add code
Dec 08, 2025
Figure 1 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Figure 2 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Figure 3 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Figure 4 for OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory
Viaarxiv icon

ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine

Add code
Aug 20, 2025
Figure 1 for ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine
Figure 2 for ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine
Figure 3 for ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine
Figure 4 for ShizhenGPT: Towards Multimodal LLMs for Traditional Chinese Medicine
Viaarxiv icon

Scaling Law for Quantization-Aware Training

Add code
May 20, 2025
Figure 1 for Scaling Law for Quantization-Aware Training
Figure 2 for Scaling Law for Quantization-Aware Training
Figure 3 for Scaling Law for Quantization-Aware Training
Figure 4 for Scaling Law for Quantization-Aware Training
Viaarxiv icon

DanceGRPO: Unleashing GRPO on Visual Generation

Add code
May 12, 2025
Figure 1 for DanceGRPO: Unleashing GRPO on Visual Generation
Figure 2 for DanceGRPO: Unleashing GRPO on Visual Generation
Figure 3 for DanceGRPO: Unleashing GRPO on Visual Generation
Figure 4 for DanceGRPO: Unleashing GRPO on Visual Generation
Viaarxiv icon

Fuzzy Clustering for Low-Complexity Time Domain Chromatic Dispersion Compensation Scheme in Coherent Optical Fiber Communication Systems

Add code
Mar 16, 2025
Viaarxiv icon

Soundwave: Less is More for Speech-Text Alignment in LLMs

Add code
Feb 18, 2025
Viaarxiv icon

VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization

Add code
Jan 16, 2025
Figure 1 for VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization
Figure 2 for VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization
Figure 3 for VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization
Figure 4 for VanGogh: A Unified Multimodal Diffusion-based Framework for Video Colorization
Viaarxiv icon