Picture for He Wang

He Wang

University College London

Robust Differentiable Collision Detection for General Objects

Add code
Nov 09, 2025
Viaarxiv icon

The Robustness of Differentiable Causal Discovery in Misspecified Scenarios

Add code
Oct 14, 2025
Viaarxiv icon

TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking

Add code
Oct 08, 2025
Viaarxiv icon

RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI

Add code
Sep 18, 2025
Figure 1 for RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI
Figure 2 for RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI
Figure 3 for RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI
Figure 4 for RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI
Viaarxiv icon

Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI

Add code
Sep 18, 2025
Viaarxiv icon

Track Any Motions under Any Disturbances

Add code
Sep 17, 2025
Figure 1 for Track Any Motions under Any Disturbances
Figure 2 for Track Any Motions under Any Disturbances
Figure 3 for Track Any Motions under Any Disturbances
Figure 4 for Track Any Motions under Any Disturbances
Viaarxiv icon

FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer

Add code
Aug 07, 2025
Figure 1 for FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer
Figure 2 for FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer
Figure 3 for FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer
Figure 4 for FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer
Viaarxiv icon

DexVLG: Dexterous Vision-Language-Grasp Model at Scale

Add code
Jul 03, 2025
Viaarxiv icon

OctoNav: Towards Generalist Embodied Navigation

Add code
Jun 11, 2025
Viaarxiv icon

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition

Add code
May 29, 2025
Viaarxiv icon