Picture for Yong-Lu Li

Yong-Lu Li

A Pragmatic VLA Foundation Model

Add code
Jan 26, 2026
Viaarxiv icon

The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents

Add code
Jan 16, 2026
Viaarxiv icon

IPR-1: Interactive Physical Reasoner

Add code
Nov 19, 2025
Viaarxiv icon

exUMI: Extensible Robot Teaching System with Action-aware Task-agnostic Tactile Representation

Add code
Sep 18, 2025
Viaarxiv icon

SIME: Enhancing Policy Self-Improvement with Modal-level Exploration

Add code
May 02, 2025
Viaarxiv icon

GarmageNet: A Dataset and Scalable Representation for Generic Garment Modeling

Add code
Apr 02, 2025
Figure 1 for GarmageNet: A Dataset and Scalable Representation for Generic Garment Modeling
Figure 2 for GarmageNet: A Dataset and Scalable Representation for Generic Garment Modeling
Figure 3 for GarmageNet: A Dataset and Scalable Representation for Generic Garment Modeling
Figure 4 for GarmageNet: A Dataset and Scalable Representation for Generic Garment Modeling
Viaarxiv icon

Reconstructing In-the-Wild Open-Vocabulary Human-Object Interactions

Add code
Mar 20, 2025
Viaarxiv icon

Dense Policy: Bidirectional Autoregressive Learning of Actions

Add code
Mar 17, 2025
Viaarxiv icon

Interacted Object Grounding in Spatio-Temporal Human-Object Interactions

Add code
Dec 27, 2024
Figure 1 for Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
Figure 2 for Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
Figure 3 for Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
Figure 4 for Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
Viaarxiv icon

M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation

Add code
Dec 19, 2024
Figure 1 for M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Figure 2 for M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Figure 3 for M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Figure 4 for M$^3$-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation
Viaarxiv icon