Picture for Yinchuan Li

Yinchuan Li

Panoramic Affordance Prediction

Add code
Mar 16, 2026
Viaarxiv icon

DVD: Deterministic Video Depth Estimation with Generative Priors

Add code
Mar 12, 2026
Viaarxiv icon

ActionCodec: What Makes for Good Action Tokenizers

Add code
Feb 17, 2026
Viaarxiv icon

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs

Add code
Oct 10, 2025
Viaarxiv icon

Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills

Add code
Jun 12, 2025
Figure 1 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 2 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 3 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Figure 4 for Mirage-1: Augmenting and Updating GUI Agent with Hierarchical Multimodal Skills
Viaarxiv icon

STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

Add code
Jun 04, 2025
Figure 1 for STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization
Figure 2 for STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization
Figure 3 for STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization
Figure 4 for STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization
Viaarxiv icon

Proximalized Preference Optimization for Diverse Feedback Types: A Decomposed Perspective on DPO

Add code
May 29, 2025
Figure 1 for Proximalized Preference Optimization for Diverse Feedback Types: A Decomposed Perspective on DPO
Figure 2 for Proximalized Preference Optimization for Diverse Feedback Types: A Decomposed Perspective on DPO
Figure 3 for Proximalized Preference Optimization for Diverse Feedback Types: A Decomposed Perspective on DPO
Figure 4 for Proximalized Preference Optimization for Diverse Feedback Types: A Decomposed Perspective on DPO
Viaarxiv icon

UltraVSR: Achieving Ultra-Realistic Video Super-Resolution with Efficient One-Step Diffusion Space

Add code
May 26, 2025
Viaarxiv icon

GUI-explorer: Autonomous Exploration and Mining of Transition-aware Knowledge for GUI Agent

Add code
May 22, 2025
Viaarxiv icon

Conditioning Matters: Training Diffusion Policies is Faster Than You Think

Add code
May 16, 2025
Viaarxiv icon