Alert button

"speech recognition": models, code, and papers
Alert button

Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition

Jan 26, 2022
Piotr Żelasko, Siyuan Feng, Laureano Moro Velazquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak

Figure 1 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 2 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 3 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Figure 4 for Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition
Viaarxiv icon

TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices

Aug 11, 2020
Alexander Wong, Mahmoud Famouri, Maya Pavlova, Siddharth Surana

Figure 1 for TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
Figure 2 for TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
Figure 3 for TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
Viaarxiv icon

Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data

May 26, 2023
Aryan Patil, Varad Patwardhan, Abhishek Phaltankar, Gauri Takawane, Raviraj Joshi

Figure 1 for Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data
Figure 2 for Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data
Figure 3 for Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data
Figure 4 for Comparative Study of Pre-Trained BERT Models for Code-Mixed Hindi-English Data
Viaarxiv icon

Robustifying automatic speech recognition by extracting slowly varying features

Dec 14, 2021
Matias Pizarro, Dorothea Kolossa, Asja Fischer

Figure 1 for Robustifying automatic speech recognition by extracting slowly varying features
Figure 2 for Robustifying automatic speech recognition by extracting slowly varying features
Figure 3 for Robustifying automatic speech recognition by extracting slowly varying features
Figure 4 for Robustifying automatic speech recognition by extracting slowly varying features
Viaarxiv icon

Neural Architecture Search: Insights from 1000 Papers

Jan 25, 2023
Colin White, Mahmoud Safari, Rhea Sukthanker, Binxin Ru, Thomas Elsken, Arber Zela, Debadeepta Dey, Frank Hutter

Figure 1 for Neural Architecture Search: Insights from 1000 Papers
Figure 2 for Neural Architecture Search: Insights from 1000 Papers
Figure 3 for Neural Architecture Search: Insights from 1000 Papers
Figure 4 for Neural Architecture Search: Insights from 1000 Papers
Viaarxiv icon

Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition

Mar 26, 2022
Xichen Pan, Peiyu Chen, Yichen Gong, Helong Zhou, Xinbing Wang, Zhouhan Lin

Figure 1 for Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
Figure 2 for Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
Figure 3 for Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
Figure 4 for Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition
Viaarxiv icon

Spell my name: keyword boosted speech recognition

Oct 06, 2021
Namkyu Jung, Geonmin Kim, Joon Son Chung

Figure 1 for Spell my name: keyword boosted speech recognition
Figure 2 for Spell my name: keyword boosted speech recognition
Figure 3 for Spell my name: keyword boosted speech recognition
Figure 4 for Spell my name: keyword boosted speech recognition
Viaarxiv icon

Conversion of Acoustic Signal (Speech) Into Text By Digital Filter using Natural Language Processing

Sep 09, 2022
Abhiram Katuri, Sindhu Salugu, Gelli Tharuni, Challa Sri Gouri

Figure 1 for Conversion of Acoustic Signal (Speech) Into Text By Digital Filter using Natural Language Processing
Figure 2 for Conversion of Acoustic Signal (Speech) Into Text By Digital Filter using Natural Language Processing
Figure 3 for Conversion of Acoustic Signal (Speech) Into Text By Digital Filter using Natural Language Processing
Viaarxiv icon

Language Agnostic Data-Driven Inverse Text Normalization

Jan 24, 2023
Szu-Jui Chen, Debjyoti Paul, Yutong Pang, Peng Su, Xuedong Zhang

Figure 1 for Language Agnostic Data-Driven Inverse Text Normalization
Figure 2 for Language Agnostic Data-Driven Inverse Text Normalization
Figure 3 for Language Agnostic Data-Driven Inverse Text Normalization
Figure 4 for Language Agnostic Data-Driven Inverse Text Normalization
Viaarxiv icon

The neural dynamics of auditory word recognition and integration

May 22, 2023
Jon Gauthier, Roger Levy

Figure 1 for The neural dynamics of auditory word recognition and integration
Figure 2 for The neural dynamics of auditory word recognition and integration
Figure 3 for The neural dynamics of auditory word recognition and integration
Figure 4 for The neural dynamics of auditory word recognition and integration
Viaarxiv icon