Limin Wang

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Apr 21, 2025
Via arXiv

DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging

Apr 16, 2025
Via arXiv

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Apr 15, 2025
Via arXiv

Shrinkage Initialization for Smooth Learning of Neural Networks

Apr 12, 2025
Via arXiv

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

Apr 10, 2025
Via arXiv

DDT: Decoupled Diffusion Transformer

Apr 09, 2025
Via arXiv

MiLA: Multi-view Intensive-fidelity Long-term Video Generation World Model for Autonomous Driving

Mar 20, 2025
Via arXiv

Make Your Training Flexible: Towards Deployment-Efficient Video Models

Mar 18, 2025
Via arXiv

History-Aware Transformation of ReID Features for Multiple Object Tracking

Mar 16, 2025
Via arXiv

VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining

Mar 16, 2025
Via arXiv