"speech recognition": models, code, and papers

End-to-End Code Switching Language Models for Automatic Speech Recognition

Jun 16, 2020
Ahan M. R., Shreyas Sunil Kulkarni

Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition

Jun 08, 2021
Max W. Y. Lam, Jun Wang, Chao Weng, Dan Su, Dong Yu

Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures

Apr 12, 2021
Nick Rossenbach, Mohammad Zeineldeen, Benedikt Hilmes, Ralf Schlüter, Hermann Ney

Attention-based Transducer for Online Speech Recognition

May 18, 2020
Bin Wang, Yan Yin, Hui Lin

A Policy-based Approach to the SpecAugment Method for Low Resource E2E ASR

Oct 16, 2022
Rui Li, Guodong Ma, Dexin Zhao, Ranran Zeng, Xiaoyu Li, Hao Huang

Multi-head Monotonic Chunkwise Attention For Online Speech Recognition

May 01, 2020
Baiji Liu, Songjun Cao, Sining Sun, Weibin Zhang, Long Ma

Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition

Feb 02, 2021
Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong

Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation

Jul 16, 2022
Viet Anh Trinh, Pegah Ghahremani, Brian King, Jasha Droppo, Andreas Stolcke, Roland Maas

AISHELL-NER: Named Entity Recognition from Chinese Speech

Feb 17, 2022
Boli Chen, Guangwei Xu, Xiaobin Wang, Pengjun Xie, Meishan Zhang, Fei Huang

ESSumm: Extractive Speech Summarization from Untranscribed Meeting

Sep 14, 2022
Jun Wang
