Chen Gao

Future Forcing: Future-aware Training-free KV Cache Policy for Autoregressive Video Generation

May 28, 2026
Via arXiv

GE-Sim 2.0: A Roadmap Towards Comprehensive Closed-loop Video World Simulators for Robotic Manipulation

May 26, 2026
Via arXiv

EventPrune: Cascaded Event-Assisted Token Pruning for Efficient First-Person Dynamic Spatial Reasoning

May 19, 2026
Via arXiv

ManiSoft: Towards Vision-Language Manipulation for Soft Continuum Robotics

May 18, 2026
Via arXiv

UniVLR: Unifying Text and Vision in Visual Latent Reasoning for Multimodal LLMs

May 12, 2026
Via arXiv

LoViF 2026 The First Challenge on Holistic Quality Assessment for 4D World Model (PhyScore)

May 06, 2026
Via arXiv

iWorld-Bench: A Benchmark for Interactive World Models with a Unified Action Generation Framework

May 06, 2026
Via arXiv

A Benchmark for Interactive World Models with a Unified Action Generation Framework

May 05, 2026
Via arXiv

How Far Are Large Multimodal Models from Human-Level Spatial Action? A Benchmark for Goal-Oriented Embodied Navigation in Urban Airspace

Apr 09, 2026
Via arXiv

WorldMAP: Bootstrapping Vision-Language Navigation Trajectory Prediction with Generative World Models

Apr 09, 2026
Via arXiv