Picture for Chenhui Xu

Chenhui Xu

Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation

Add code
May 27, 2025
Viaarxiv icon

FP64 is All You Need: Rethinking Failure Modes in Physics-Informed Neural Networks

Add code
May 16, 2025
Viaarxiv icon

Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense

Add code
Mar 10, 2025
Viaarxiv icon

Towards Understanding Multi-Round Large Language Model Reasoning: Approximability, Learnability and Generalizability

Add code
Mar 05, 2025
Viaarxiv icon

Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data

Add code
Jan 25, 2025
Figure 1 for Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Figure 2 for Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Figure 3 for Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Figure 4 for Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data
Viaarxiv icon

Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge

Add code
Nov 21, 2024
Figure 1 for Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
Figure 2 for Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
Figure 3 for Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
Figure 4 for Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge
Viaarxiv icon

Large Language Models have Intrinsic Self-Correction Ability

Add code
Jun 21, 2024
Viaarxiv icon

PI-Whisper: An Adaptive and Incremental ASR Framework for Diverse and Evolving Speaker Characteristics

Add code
Jun 21, 2024
Figure 1 for PI-Whisper: An Adaptive and Incremental ASR Framework for Diverse and Evolving Speaker Characteristics
Figure 2 for PI-Whisper: An Adaptive and Incremental ASR Framework for Diverse and Evolving Speaker Characteristics
Figure 3 for PI-Whisper: An Adaptive and Incremental ASR Framework for Diverse and Evolving Speaker Characteristics
Figure 4 for PI-Whisper: An Adaptive and Incremental ASR Framework for Diverse and Evolving Speaker Characteristics
Viaarxiv icon

Infinite-Dimensional Feature Interaction

Add code
May 22, 2024
Figure 1 for Infinite-Dimensional Feature Interaction
Figure 2 for Infinite-Dimensional Feature Interaction
Figure 3 for Infinite-Dimensional Feature Interaction
Figure 4 for Infinite-Dimensional Feature Interaction
Viaarxiv icon

QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation

Add code
May 06, 2024
Viaarxiv icon