He Wang

TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking

Oct 08, 2025
Via arXiv

Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI

Sep 18, 2025
Via arXiv

RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI

Sep 18, 2025
Via arXiv

Track Any Motions under Any Disturbances

Sep 17, 2025
Via arXiv

FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer

Aug 07, 2025
Via arXiv

DexVLG: Dexterous Vision-Language-Grasp Model at Scale

Jul 03, 2025
Via arXiv

OctoNav: Towards Generalist Embodied Navigation

Jun 11, 2025
Via arXiv

TrackVLA: Embodied Visual Tracking in the Wild

May 29, 2025
Via arXiv

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition

May 29, 2025
Via arXiv

Adversarially Robust AI-Generated Image Detection for Free: An Information Theoretic Perspective

May 28, 2025
Via arXiv