Picture for Siteng Huang

Siteng Huang

Articulat3D: Reconstructing Articulated Digital Twins From Monocular Videos with Geometric and Motion Constraints

Add code
Mar 12, 2026
Viaarxiv icon

RynnBrain: Open Embodied Foundation Models

Add code
Feb 13, 2026
Viaarxiv icon

HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models

Add code
Dec 10, 2025
Figure 1 for HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models
Figure 2 for HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models
Figure 3 for HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models
Figure 4 for HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models
Viaarxiv icon

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Add code
Sep 18, 2025
Viaarxiv icon

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Add code
Sep 11, 2025
Viaarxiv icon

Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation

Add code
Aug 28, 2025
Viaarxiv icon

Towards Affordance-Aware Robotic Dexterous Grasping with Human-like Priors

Add code
Aug 12, 2025
Viaarxiv icon

WorldVLA: Towards Autoregressive Action World Model

Add code
Jun 26, 2025
Viaarxiv icon

VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL

Add code
May 21, 2025
Figure 1 for VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL
Figure 2 for VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL
Figure 3 for VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL
Figure 4 for VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL
Viaarxiv icon

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

Add code
May 18, 2025
Figure 1 for SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
Figure 2 for SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
Figure 3 for SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
Figure 4 for SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning
Viaarxiv icon