Alert button

"speech recognition": models, code, and papers
Alert button

Accent Recognition with Hybrid Phonetic Features

Add code
Bookmark button
Alert button
May 05, 2021
Zhan Zhang, Xi Chen, Yuehai Wang, Jianyi Yang

Figure 1 for Accent Recognition with Hybrid Phonetic Features
Figure 2 for Accent Recognition with Hybrid Phonetic Features
Figure 3 for Accent Recognition with Hybrid Phonetic Features
Figure 4 for Accent Recognition with Hybrid Phonetic Features
Viaarxiv icon

T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events

Feb 07, 2022
Shu Wang, Yuhuang Hu, Shih-Chii Liu

Figure 1 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Figure 2 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Figure 3 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Figure 4 for T-NGA: Temporal Network Grafting Algorithm for Learning to Process Spiking Audio Sensor Events
Viaarxiv icon

Dynamic Layer Normalization for Adaptive Neural Acoustic Modeling in Speech Recognition

Add code
Bookmark button
Alert button
Jul 19, 2017
Taesup Kim, Inchul Song, Yoshua Bengio

Figure 1 for Dynamic Layer Normalization for Adaptive Neural Acoustic Modeling in Speech Recognition
Figure 2 for Dynamic Layer Normalization for Adaptive Neural Acoustic Modeling in Speech Recognition
Figure 3 for Dynamic Layer Normalization for Adaptive Neural Acoustic Modeling in Speech Recognition
Figure 4 for Dynamic Layer Normalization for Adaptive Neural Acoustic Modeling in Speech Recognition
Viaarxiv icon

ViDA-MAN: Visual Dialog with Digital Humans

Oct 26, 2021
Tong Shen, Jiawei Zuo, Fan Shi, Jin Zhang, Liqin Jiang, Meng Chen, Zhengchen Zhang, Wei Zhang, Xiaodong He, Tao Mei

Figure 1 for ViDA-MAN: Visual Dialog with Digital Humans
Figure 2 for ViDA-MAN: Visual Dialog with Digital Humans
Viaarxiv icon

Multitask vocal burst modeling with ResNets and pre-trained paralinguistic Conformers

Jun 24, 2022
Josh Belanich, Krishna Somandepalli, Brian Eoff, Brendan Jou

Figure 1 for Multitask vocal burst modeling with ResNets and pre-trained paralinguistic Conformers
Figure 2 for Multitask vocal burst modeling with ResNets and pre-trained paralinguistic Conformers
Viaarxiv icon

Optimizing expected word error rate via sampling for speech recognition

Jun 08, 2017
Matt Shannon

Figure 1 for Optimizing expected word error rate via sampling for speech recognition
Figure 2 for Optimizing expected word error rate via sampling for speech recognition
Figure 3 for Optimizing expected word error rate via sampling for speech recognition
Figure 4 for Optimizing expected word error rate via sampling for speech recognition
Viaarxiv icon

Late reverberation suppression using U-nets

Oct 05, 2021
Diego León, Felipe Tobar

Figure 1 for Late reverberation suppression using U-nets
Figure 2 for Late reverberation suppression using U-nets
Figure 3 for Late reverberation suppression using U-nets
Figure 4 for Late reverberation suppression using U-nets
Viaarxiv icon

Senone-aware Adversarial Multi-task Training for Unsupervised Child to Adult Speech Adaptation

Add code
Bookmark button
Alert button
Feb 23, 2021
Richeng Duan, Nancy F. Chen

Figure 1 for Senone-aware Adversarial Multi-task Training for Unsupervised Child to Adult Speech Adaptation
Figure 2 for Senone-aware Adversarial Multi-task Training for Unsupervised Child to Adult Speech Adaptation
Figure 3 for Senone-aware Adversarial Multi-task Training for Unsupervised Child to Adult Speech Adaptation
Figure 4 for Senone-aware Adversarial Multi-task Training for Unsupervised Child to Adult Speech Adaptation
Viaarxiv icon

Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems

Dec 10, 2021
Manaal Faruqui, Dilek Hakkani-Tür

Figure 1 for Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems
Figure 2 for Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems
Figure 3 for Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems
Figure 4 for Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems
Viaarxiv icon

CLSRIL-23: Cross Lingual Speech Representations for Indic Languages

Add code
Bookmark button
Alert button
Jul 15, 2021
Anirudh Gupta, Harveen Singh Chadha, Priyanshi Shah, Neeraj Chimmwal, Ankur Dhuriya, Rishabh Gaur, Vivek Raghavan

Figure 1 for CLSRIL-23: Cross Lingual Speech Representations for Indic Languages
Figure 2 for CLSRIL-23: Cross Lingual Speech Representations for Indic Languages
Figure 3 for CLSRIL-23: Cross Lingual Speech Representations for Indic Languages
Figure 4 for CLSRIL-23: Cross Lingual Speech Representations for Indic Languages
Viaarxiv icon