Ming-Hsuan Yang

SAMOFT: Robust Multi-Object Tracking via Region and Flow

May 10, 2026
Via arXiv

AlbumFill: Album-Guided Reasoning and Retrieval for Personalized Image Completion

May 04, 2026
Via arXiv

Evolution of Video Generative Foundations

Apr 07, 2026
Via arXiv

Interactive Tracking: A Human-in-the-Loop Paradigm with Memory-Augmented Adaptation

Apr 02, 2026
Via arXiv

Finding Distributed Object-Centric Properties in Self-Supervised Transformers

Mar 27, 2026
Via arXiv

LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs

Mar 19, 2026
Via arXiv

Streaming Autoregressive Video Generation via Diagonal Distillation

Mar 11, 2026
Via arXiv

ReCoSplat: Autoregressive Feed-Forward Gaussian Splatting Using Render-and-Compare

Mar 10, 2026
Via arXiv

FaceCam: Portrait Video Camera Control via Scale-Aware Conditioning

Mar 05, 2026
Via arXiv

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Mar 03, 2026
Via arXiv