Ying Shan

TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

May 08, 2025
Via arXiv

FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios

May 06, 2025
Via arXiv

Cobra: Efficient Line Art COlorization with BRoAder References

Apr 16, 2025
Via arXiv

GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors

Apr 01, 2025
Via arXiv

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Apr 01, 2025
Via arXiv

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Mar 31, 2025
Via arXiv

Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion

Mar 28, 2025
Via arXiv

GenHancer: Imperfect Generative Models are Secretly Strong Vision-Centric Enhancers

Mar 25, 2025
Via arXiv

BlobCtrl: A Unified and Flexible Framework for Element-level Image Generation and Editing

Mar 17, 2025
Via arXiv

TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models

Mar 07, 2025
Via arXiv