Rong Xiong

Toward Embodiment Equivariant Vision-Language-Action Policy

Sep 18, 2025
Via arXiv

BEV-ODOM2: Enhanced BEV-based Monocular Visual Odometry with PV-BEV Fusion and Dense Flow Supervision for Ground Robots

Sep 18, 2025
Via arXiv

TOP: Time Optimization Policy for Stable and Accurate Standing Manipulation with Humanoid Robots

Aug 01, 2025
Via arXiv

Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions

Apr 07, 2025
Via arXiv

UnIRe: Unsupervised Instance Decomposition for Dynamic Urban Scene Reconstruction

Apr 01, 2025
Via arXiv

Disambiguate Gripper State in Grasp-Based Tasks: Pseudo-Tactile as Feedback Enables Pure Simulation Learning

Mar 31, 2025
Via arXiv

Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter

Mar 12, 2025
Via arXiv

Natural Humanoid Robot Locomotion with Generative Motion Prior

Mar 12, 2025
Via arXiv

BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground

Feb 27, 2025
Via arXiv

CarPlanner: Consistent Auto-regressive Trajectory Planning for Large-scale Reinforcement Learning in Autonomous Driving

Feb 27, 2025
Via arXiv