Picture for Tao Huang

Tao Huang

DiffCap-Bench: A Comprehensive, Challenging, Robust Benchmark for Image Difference Captioning

Add code
May 06, 2026
Viaarxiv icon

Seeing Realism from Simulation: Efficient Video Transfer for Vision-Language-Action Data Augmentation

Add code
May 04, 2026
Viaarxiv icon

AFFormer: Adaptive Feature Fusion Transformer for V2X Cooperative Perception under Channel Impairments

Add code
May 03, 2026
Viaarxiv icon

Scalable Hyperparameter-Divergent Ensemble Training with Automatic Learning Rate Exploration for Large Models

Add code
Apr 27, 2026
Viaarxiv icon

Real-time Neural Six-way Lightmaps

Add code
Apr 04, 2026
Viaarxiv icon

SMASH: Mastering Scalable Whole-Body Skills for Humanoid Ping-Pong with Egocentric Vision

Add code
Apr 01, 2026
Viaarxiv icon

Feel Robot Feels: Tactile Feedback Array Glove for Dexterous Manipulation

Add code
Mar 30, 2026
Viaarxiv icon

DASH: Dynamic Audio-Driven Semantic Chunking for Efficient Omnimodal Token Compression

Add code
Mar 15, 2026
Viaarxiv icon

WaterVideoQA: ASV-Centric Perception and Rule-Compliant Reasoning via Multi-Modal Agents

Add code
Feb 26, 2026
Viaarxiv icon

DP-aware AdaLN-Zero: Taming Conditioning-Induced Heavy-Tailed Gradients in Differentially Private Diffusion

Add code
Feb 26, 2026
Viaarxiv icon