Xiang Deng

UAV-ON: A Benchmark for Open-World Object Goal Navigation with Aerial Agents

Aug 01, 2025
Via arXiv

Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards

Jun 13, 2025
Via arXiv

STAR: Learning Diverse Robot Skill Abstractions through Rotation-Augmented Vector Quantization

Jun 04, 2025
Via arXiv

Few-Shot Vision-Language Action-Incremental Policy Learning

Apr 22, 2025
Via arXiv

Graph-based Diffusion Model for Collaborative Filtering

Apr 07, 2025
Via arXiv

Spatial-Temporal Graph Diffusion Policy with Kinematic Modeling for Bimanual Robotic Manipulation

Mar 13, 2025
Via arXiv

Embodied Crowd Counting

Mar 11, 2025
Via arXiv

3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds

Feb 27, 2025
Via arXiv

Stereo-Talker: Audio-driven 3D Human Synthesis with Prior-Guided Mixture-of-Experts

Oct 31, 2024
Via arXiv

EPD: Long-term Memory Extraction, Context-awared Planning and Multi-iteration Decision @ EgoPlan Challenge ICML 2024

Jul 28, 2024
Via arXiv