Picture for Li Yi

Li Yi

Full-Body Motion Reconstruction with Sparse Sensing from Graph Perspective

Add code
Jan 22, 2024
Viaarxiv icon

CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point Cloud Video Understanding

Add code
Jan 17, 2024
Viaarxiv icon

TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding

Add code
Jan 16, 2024
Viaarxiv icon

GenH2R: Learning Generalizable Human-to-Robot Handover via Scalable Simulation, Demonstration, and Imitation

Add code
Jan 01, 2024
Viaarxiv icon

Interactive Humanoid: Online Full-Body Motion Reaction Synthesis with Social Affordance Canonicalization and Forecasting

Add code
Dec 30, 2023
Viaarxiv icon

Semantic Complete Scene Forecasting from a 4D Dynamic Point Cloud Sequence

Add code
Dec 15, 2023
Figure 1 for Semantic Complete Scene Forecasting from a 4D Dynamic Point Cloud Sequence
Figure 2 for Semantic Complete Scene Forecasting from a 4D Dynamic Point Cloud Sequence
Figure 3 for Semantic Complete Scene Forecasting from a 4D Dynamic Point Cloud Sequence
Figure 4 for Semantic Complete Scene Forecasting from a 4D Dynamic Point Cloud Sequence
Viaarxiv icon

Look Before You Leap: Unveiling the Power of GPT-4V in Robotic Vision-Language Planning

Add code
Nov 29, 2023
Figure 1 for Look Before You Leap: Unveiling the Power of GPT-4V in Robotic Vision-Language Planning
Figure 2 for Look Before You Leap: Unveiling the Power of GPT-4V in Robotic Vision-Language Planning
Figure 3 for Look Before You Leap: Unveiling the Power of GPT-4V in Robotic Vision-Language Planning
Figure 4 for Look Before You Leap: Unveiling the Power of GPT-4V in Robotic Vision-Language Planning
Viaarxiv icon

NSM4D: Neural Scene Model Based Online 4D Point Cloud Sequence Understanding

Add code
Oct 12, 2023
Viaarxiv icon

DreamLLM: Synergistic Multimodal Comprehension and Creation

Add code
Sep 20, 2023
Viaarxiv icon

TransTouch: Learning Transparent Objects Depth Sensing Through Sparse Touches

Add code
Sep 18, 2023
Viaarxiv icon