Picture for Siyuan Huang

Siyuan Huang

Learning Human-Humanoid Coordination for Collaborative Object Carrying

Add code
Oct 16, 2025
Viaarxiv icon

GWM: Towards Scalable Gaussian World Models for Robotic Manipulation

Add code
Aug 25, 2025
Figure 1 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Figure 2 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Figure 3 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Figure 4 for GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
Viaarxiv icon

Spatial-Temporal Multi-Scale Quantization for Flexible Motion Generation

Add code
Aug 12, 2025
Viaarxiv icon

Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Add code
Aug 07, 2025
Figure 1 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Figure 2 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Figure 3 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Figure 4 for Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation
Viaarxiv icon

LEO-VL: Towards 3D Vision-Language Generalists via Data Scaling with Efficient Representation

Add code
Jun 11, 2025
Figure 1 for LEO-VL: Towards 3D Vision-Language Generalists via Data Scaling with Efficient Representation
Figure 2 for LEO-VL: Towards 3D Vision-Language Generalists via Data Scaling with Efficient Representation
Figure 3 for LEO-VL: Towards 3D Vision-Language Generalists via Data Scaling with Efficient Representation
Figure 4 for LEO-VL: Towards 3D Vision-Language Generalists via Data Scaling with Efficient Representation
Viaarxiv icon

Cross-Spectral Body Recognition with Side Information Embedding: Benchmarks on LLCM and Analyzing Range-Induced Occlusions on IJB-MDF

Add code
Jun 10, 2025
Viaarxiv icon

CLONE: Closed-Loop Whole-Body Humanoid Teleoperation for Long-Horizon Tasks

Add code
Jun 10, 2025
Viaarxiv icon

RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation

Add code
Jun 07, 2025
Viaarxiv icon

InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing

Add code
May 30, 2025
Figure 1 for InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
Figure 2 for InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
Figure 3 for InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
Figure 4 for InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
Viaarxiv icon

Pretraining Language Models to Ponder in Continuous Space

Add code
May 27, 2025
Viaarxiv icon