Picture for Xiaoshuai Hao

Xiaoshuai Hao

I$^2$S-TFCKD: Intra-Inter Set Knowledge Distillation with Time-Frequency Calibration for Speech Enhancement

Add code
Jun 16, 2025
Viaarxiv icon

Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought

Add code
Jun 12, 2025
Viaarxiv icon

Your Classifier Can Do More: Towards Bridging the Gaps in Classification, Robustness, and Generation

Add code
May 26, 2025
Viaarxiv icon

VTLA: Vision-Tactile-Language-Action Model with Preference Learning for Insertion Manipulation

Add code
May 14, 2025
Viaarxiv icon

Multi-Modal Molecular Representation Learning via Structure Awareness

Add code
May 09, 2025
Viaarxiv icon

RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration

Add code
May 06, 2025
Viaarxiv icon

MEGA: Second-Order Gradient Alignment for Catastrophic Forgetting Mitigation in GFSCIL

Add code
Apr 18, 2025
Viaarxiv icon

FastRSR: Efficient and Accurate Road Surface Reconstruction from Bird's Eye View

Add code
Apr 13, 2025
Viaarxiv icon

Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning

Add code
Mar 27, 2025
Viaarxiv icon

TLA: Tactile-Language-Action Model for Contact-Rich Manipulation

Add code
Mar 11, 2025
Viaarxiv icon