Picture for Ming-Hsuan Yang

Ming-Hsuan Yang

LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs

Add code
Mar 19, 2026
Viaarxiv icon

Streaming Autoregressive Video Generation via Diagonal Distillation

Add code
Mar 11, 2026
Viaarxiv icon

ReCoSplat: Autoregressive Feed-Forward Gaussian Splatting Using Render-and-Compare

Add code
Mar 10, 2026
Viaarxiv icon

FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning

Add code
Mar 05, 2026
Viaarxiv icon

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Add code
Mar 03, 2026
Viaarxiv icon

Human Video Generation from a Single Image with 3D Pose and View Control

Add code
Feb 24, 2026
Viaarxiv icon

PromptCD: Test-Time Behavior Enhancement via Polarity-Prompt Contrastive Decoding

Add code
Feb 24, 2026
Viaarxiv icon

Learning Situated Awareness in the Real World

Add code
Feb 18, 2026
Viaarxiv icon

Unlocking Prototype Potential: An Efficient Tuning Framework for Few-Shot Class-Incremental Learning

Add code
Feb 05, 2026
Viaarxiv icon

Context Forcing: Consistent Autoregressive Video Generation with Long Context

Add code
Feb 05, 2026
Viaarxiv icon