Picture for Mingyu Liu

Mingyu Liu

Generative Video Matting

Add code
Aug 11, 2025
Viaarxiv icon

ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks

Add code
Aug 11, 2025
Viaarxiv icon

SDEval: Safety Dynamic Evaluation for Multimodal Large Language Models

Add code
Aug 08, 2025
Viaarxiv icon

Affordance-R1: Reinforcement Learning for Generalizable Affordance Reasoning in Multimodal Large Language Model

Add code
Aug 08, 2025
Viaarxiv icon

VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers

Add code
Jul 01, 2025
Viaarxiv icon

Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO

Add code
May 27, 2025
Viaarxiv icon

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Add code
May 26, 2025
Viaarxiv icon

CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning

Add code
May 22, 2025
Viaarxiv icon

PerturboLLaVA: Reducing Multimodal Hallucinations with Perturbative Visual Training

Add code
Mar 09, 2025
Viaarxiv icon

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

Add code
Feb 25, 2025
Viaarxiv icon