Picture for Ying Shan

Ying Shan

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Add code
Aug 27, 2025
Viaarxiv icon

ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing

Add code
Aug 14, 2025
Viaarxiv icon

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Add code
Jul 28, 2025
Viaarxiv icon

IC-Custom: Diverse Image Customization via In-Context Learning

Add code
Jul 02, 2025
Viaarxiv icon

DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation

Add code
Jul 02, 2025
Viaarxiv icon

LoRA-Gen: Specializing Large Language Model via Online LoRA Generation

Add code
Jun 13, 2025
Viaarxiv icon

Aligning Latent Spaces with Flow Priors

Add code
Jun 05, 2025
Viaarxiv icon

Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

Add code
May 27, 2025
Viaarxiv icon

Sci-Fi: Symmetric Constraint for Frame Inbetweening

Add code
May 27, 2025
Viaarxiv icon

TensorAR: Refinement is All You Need in Autoregressive Image Generation

Add code
May 22, 2025
Viaarxiv icon