Siheng Zhao

ResMimic: From General Motion Tracking to Humanoid Whole-body Loco-Manipulation via Residual Learning

Oct 06, 2025
Via arXiv

RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning

Apr 26, 2025
Via arXiv

Learning from Massive Human Videos for Universal Humanoid Pose Control

Dec 18, 2024
Via arXiv

GRUtopia: Dream General Robots in a City at Scale

Jul 15, 2024
Via arXiv

TieBot: Learning to Knot a Tie from Visual Demonstration through a Real-to-Sim-to-Real Approach

Jul 03, 2024
Via arXiv

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Apr 11, 2024
Via arXiv

Lemur: Harmonizing Natural Language and Code for Language Agents

Oct 10, 2023
Via arXiv

Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning

Sep 21, 2023
Via arXiv

ClothesNet: An Information-Rich 3D Garment Model Repository with Simulated Clothes Environment

Aug 19, 2023
Via arXiv

kNN-BOX: A Unified Framework for Nearest Neighbor Generation

Feb 27, 2023
Via arXiv