Picture for Berlin Chen

Berlin Chen

An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition

Add code
Sep 10, 2024
Figure 1 for An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
Figure 2 for An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
Figure 3 for An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
Figure 4 for An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
Viaarxiv icon

Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation

Add code
Sep 03, 2024
Viaarxiv icon

Optimizing Automatic Speech Assessment: W-RankSim Regularization and Hybrid Feature Fusion Strategies

Add code
Jun 16, 2024
Viaarxiv icon

ConPCO: Preserving Phoneme Characteristics for Automatic Pronunciation Assessment Leveraging Contrastive Ordinal Regularization

Add code
Jun 05, 2024
Viaarxiv icon

An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution

Add code
Apr 12, 2024
Figure 1 for An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution
Figure 2 for An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution
Figure 3 for An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution
Figure 4 for An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution
Viaarxiv icon

DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition

Add code
Apr 11, 2024
Figure 1 for DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
Figure 2 for DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
Figure 3 for DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
Figure 4 for DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition
Viaarxiv icon

Speech-Aware Neural Diarization with Encoder-Decoder Attractor Guided by Attention Constraints

Add code
Mar 21, 2024
Viaarxiv icon

What do neural networks listen to? Exploring the crucial bands in Speech Enhancement using Sinc-convolution

Add code
Mar 04, 2024
Viaarxiv icon

ConSep: a Noise- and Reverberation-Robust Speech Separation Framework by Magnitude Conditioning

Add code
Mar 04, 2024
Viaarxiv icon

An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement

Add code
Feb 27, 2024
Figure 1 for An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement
Figure 2 for An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement
Figure 3 for An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement
Figure 4 for An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement
Viaarxiv icon