Picture for Ying Shan

Ying Shan

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Add code
Jul 28, 2025
Viaarxiv icon

IC-Custom: Diverse Image Customization via In-Context Learning

Add code
Jul 02, 2025
Viaarxiv icon

DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation

Add code
Jul 02, 2025
Viaarxiv icon

LoRA-Gen: Specializing Large Language Model via Online LoRA Generation

Add code
Jun 13, 2025
Viaarxiv icon

Aligning Latent Spaces with Flow Priors

Add code
Jun 05, 2025
Viaarxiv icon

Sci-Fi: Symmetric Constraint for Frame Inbetweening

Add code
May 27, 2025
Viaarxiv icon

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Add code
May 27, 2025
Viaarxiv icon

TensorAR: Refinement is All You Need in Autoregressive Image Generation

Add code
May 22, 2025
Viaarxiv icon

MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO

Add code
May 19, 2025
Viaarxiv icon

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

Add code
May 08, 2025
Viaarxiv icon