Picture for Yinchuan Li

Yinchuan Li

WorldLines: Benchmarking and Modeling Long-Horizon Stateful Embodied Agents

Add code
Jun 17, 2026
Viaarxiv icon

AffordanceVLA: A Vision-Language-Action Model Empowering Action Generation through Affordance-Aware Understanding

Add code
Jun 04, 2026
Viaarxiv icon

The Right Inference Strategy Is All You Need: Nearly Training-Free Domain-Wise Inference for EgoCross Challenge

Add code
May 30, 2026
Viaarxiv icon

RoboStressBench: Benchmarking VLM Robustness to Physical Visual Stress in Embodied Scenes

Add code
May 30, 2026
Viaarxiv icon

Panoramic Affordance Prediction

Add code
Mar 16, 2026
Viaarxiv icon

DVD: Deterministic Video Depth Estimation with Generative Priors

Add code
Mar 12, 2026
Viaarxiv icon

ActionCodec: What Makes for Good Action Tokenizers

Add code
Feb 17, 2026
Viaarxiv icon

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs

Add code
Oct 10, 2025
Viaarxiv icon

Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills

Add code
Jun 12, 2025
Figure 1 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 2 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 3 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 4 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Viaarxiv icon

STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

Add code
Jun 04, 2025
Figure 1 for STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization
Figure 2 for STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization
Figure 3 for STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization
Figure 4 for STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization
Viaarxiv icon