Picture for Berlin Chen

Berlin Chen

Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition

Add code
Dec 15, 2023
Viaarxiv icon

Preserving Phonemic Distinctions for Ordinal Regression: A Novel Loss Function for Automatic Pronunciation Assessment

Add code
Oct 04, 2023
Figure 1 for Preserving Phonemic Distinctions for Ordinal Regression: A Novel Loss Function for Automatic Pronunciation Assessment
Figure 2 for Preserving Phonemic Distinctions for Ordinal Regression: A Novel Loss Function for Automatic Pronunciation Assessment
Figure 3 for Preserving Phonemic Distinctions for Ordinal Regression: A Novel Loss Function for Automatic Pronunciation Assessment
Figure 4 for Preserving Phonemic Distinctions for Ordinal Regression: A Novel Loss Function for Automatic Pronunciation Assessment
Viaarxiv icon

AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning

Add code
Sep 04, 2023
Figure 1 for AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning
Figure 2 for AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning
Figure 3 for AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning
Figure 4 for AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning
Viaarxiv icon

Naaloss: Rethinking the objective of speech enhancement

Add code
Aug 24, 2023
Figure 1 for Naaloss: Rethinking the objective of speech enhancement
Figure 2 for Naaloss: Rethinking the objective of speech enhancement
Figure 3 for Naaloss: Rethinking the objective of speech enhancement
Figure 4 for Naaloss: Rethinking the objective of speech enhancement
Viaarxiv icon

A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment

Add code
Jun 07, 2023
Figure 1 for A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment
Figure 2 for A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment
Figure 3 for A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment
Figure 4 for A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment
Viaarxiv icon

3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment

Add code
Aug 19, 2022
Figure 1 for 3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment
Figure 2 for 3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment
Figure 3 for 3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment
Figure 4 for 3M: An Effective Multi-view, Multi-granularity, and Multi-aspect Modeling Approach to English Pronunciation Assessment
Viaarxiv icon

Geometric Learning of Hidden Markov Models via a Method of Moments Algorithm

Add code
Jul 02, 2022
Figure 1 for Geometric Learning of Hidden Markov Models via a Method of Moments Algorithm
Figure 2 for Geometric Learning of Hidden Markov Models via a Method of Moments Algorithm
Figure 3 for Geometric Learning of Hidden Markov Models via a Method of Moments Algorithm
Viaarxiv icon

Conversational speech recognition leveraging effective fusion methods for cross-utterance language modeling

Add code
Nov 05, 2021
Figure 1 for Conversational speech recognition leveraging effective fusion methods for cross-utterance language modeling
Figure 2 for Conversational speech recognition leveraging effective fusion methods for cross-utterance language modeling
Figure 3 for Conversational speech recognition leveraging effective fusion methods for cross-utterance language modeling
Figure 4 for Conversational speech recognition leveraging effective fusion methods for cross-utterance language modeling
Viaarxiv icon

Exploring Non-Autoregressive End-To-End Neural Modeling For English Mispronunciation Detection And Diagnosis

Add code
Nov 01, 2021
Figure 1 for Exploring Non-Autoregressive End-To-End Neural Modeling For English Mispronunciation Detection And Diagnosis
Figure 2 for Exploring Non-Autoregressive End-To-End Neural Modeling For English Mispronunciation Detection And Diagnosis
Figure 3 for Exploring Non-Autoregressive End-To-End Neural Modeling For English Mispronunciation Detection And Diagnosis
Figure 4 for Exploring Non-Autoregressive End-To-End Neural Modeling For English Mispronunciation Detection And Diagnosis
Viaarxiv icon

Improving End-To-End Modeling for Mispronunciation Detection with Effective Augmentation Mechanisms

Add code
Oct 17, 2021
Figure 1 for Improving End-To-End Modeling for Mispronunciation Detection with Effective Augmentation Mechanisms
Figure 2 for Improving End-To-End Modeling for Mispronunciation Detection with Effective Augmentation Mechanisms
Figure 3 for Improving End-To-End Modeling for Mispronunciation Detection with Effective Augmentation Mechanisms
Figure 4 for Improving End-To-End Modeling for Mispronunciation Detection with Effective Augmentation Mechanisms
Viaarxiv icon