Picture for Jiangmiao Pang

Jiangmiao Pang

VFlowOpt: A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization

Add code
Aug 07, 2025
Viaarxiv icon

InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation

Add code
Jul 23, 2025
Viaarxiv icon

Yume: An Interactive World Generation Model

Add code
Jul 23, 2025
Viaarxiv icon

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning

Add code
Jul 17, 2025
Viaarxiv icon

Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities

Add code
Jul 17, 2025
Viaarxiv icon

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Add code
Jul 10, 2025
Viaarxiv icon

UniTracker: Learning Universal Whole-Body Motion Tracker for Humanoid Robots

Add code
Jul 10, 2025
Viaarxiv icon

CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation

Add code
Jun 24, 2025
Viaarxiv icon

Sekai: A Video Dataset towards World Exploration

Add code
Jun 18, 2025
Viaarxiv icon

GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

Add code
Jun 12, 2025
Viaarxiv icon