Chenhui Xu

Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation

May 27, 2025
Via arXiv

FP64 is All You Need: Rethinking Failure Modes in Physics-Informed Neural Networks

May 16, 2025
Via arXiv

Combating Partial Perception Deficit in Autonomous Driving with Multimodal LLM Commonsense

Mar 10, 2025
Via arXiv

Towards Understanding Multi-Round Large Language Model Reasoning: Approximability, Learnability and Generalizability

Mar 05, 2025
Via arXiv

Recognize Any Surgical Object: Unleashing the Power of Weakly-Supervised Data

Jan 25, 2025
Via arXiv

Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge

Nov 21, 2024
Via arXiv

Large Language Models have Intrinsic Self-Correction Ability

Jun 21, 2024
Via arXiv

PI-Whisper: An Adaptive and Incremental ASR Framework for Diverse and Evolving Speaker Characteristics

Jun 21, 2024
Via arXiv

Infinite-Dimensional Feature Interaction

May 22, 2024
Via arXiv

QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation

May 06, 2024
Via arXiv