Picture for He Wang

He Wang

TrackVLA: Embodied Visual Tracking in the Wild

Add code
May 29, 2025
Viaarxiv icon

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition

Add code
May 29, 2025
Viaarxiv icon

Adversarially Robust AI-Generated Image Detection for Free: An Information Theoretic Perspective

Add code
May 28, 2025
Viaarxiv icon

Learning and Interpreting Gravitational-Wave Features from CNNs with a Random Forest Approach

Add code
May 26, 2025
Viaarxiv icon

Your Classifier Can Do More: Towards Bridging the Gaps in Classification, Robustness, and Generation

Add code
May 26, 2025
Viaarxiv icon

Recent Deep Learning in Crowd Behaviour Analysis: A Brief Review

Add code
May 23, 2025
Viaarxiv icon

Large-Scale Multi-Character Interaction Synthesis

Add code
May 20, 2025
Viaarxiv icon

DSMentor: Enhancing Data Science Agents with Curriculum Learning and Online Knowledge Accumulation

Add code
May 20, 2025
Viaarxiv icon

Self-Learning Hyperspectral and Multispectral Image Fusion via Adaptive Residual Guided Subspace Diffusion Model

Add code
May 17, 2025
Viaarxiv icon

Unleashing Humanoid Reaching Potential via Real-world-Ready Skill Space

Add code
May 16, 2025
Viaarxiv icon