Alert button

"speech recognition": models, code, and papers
Alert button

Learning linearly separable features for speech recognition using convolutional neural networks

Apr 16, 2015
Dimitri Palaz, Mathew Magimai Doss, Ronan Collobert

Figure 1 for Learning linearly separable features for speech recognition using convolutional neural networks
Figure 2 for Learning linearly separable features for speech recognition using convolutional neural networks
Figure 3 for Learning linearly separable features for speech recognition using convolutional neural networks
Figure 4 for Learning linearly separable features for speech recognition using convolutional neural networks
Viaarxiv icon

Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR

Jul 03, 2022
Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma

Figure 1 for Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR
Figure 2 for Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR
Figure 3 for Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR
Figure 4 for Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR
Viaarxiv icon

"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations

Add code
Bookmark button
Alert button
Sep 28, 2021
Seokhwan Kim, Yang Liu, Di Jin, Alexandros Papangelis, Karthik Gopalakrishnan, Behnam Hedayatnia, Dilek Hakkani-Tur

Figure 1 for "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations
Figure 2 for "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations
Figure 3 for "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations
Figure 4 for "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations
Viaarxiv icon

Variable-rate hierarchical CPC leads to acoustic unit discovery in speech

Add code
Bookmark button
Alert button
Jun 07, 2022
Santiago Cuervo, Adrian Łańcucki, Ricard Marxer, Paweł Rychlikowski, Jan Chorowski

Figure 1 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Figure 2 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Figure 3 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Figure 4 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Viaarxiv icon

Distilling Knowledge Using Parallel Data for Far-field Speech Recognition

Feb 20, 2018
Jiangyan Yi, Jianhua Tao, Zhengqi Wen, Bin Liu

Figure 1 for Distilling Knowledge Using Parallel Data for Far-field Speech Recognition
Figure 2 for Distilling Knowledge Using Parallel Data for Far-field Speech Recognition
Figure 3 for Distilling Knowledge Using Parallel Data for Far-field Speech Recognition
Viaarxiv icon

Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jul 09, 2019
Yonatan Belinkov, Ahmed Ali, James Glass

Figure 1 for Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition
Figure 2 for Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition
Figure 3 for Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition
Figure 4 for Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition
Viaarxiv icon

Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping

Jun 19, 2022
Jenthe Thienpondt, Kris Demuynck

Figure 1 for Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping
Figure 2 for Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping
Figure 3 for Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping
Figure 4 for Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping
Viaarxiv icon

On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era

Add code
Bookmark button
Alert button
Apr 20, 2021
Shahin Amiriparian, Artem Sokolov, Ilhan Aslan, Lukas Christ, Maurice Gerczuk, Tobias Hübner, Dmitry Lamanov, Manuel Milling, Sandra Ottl, Ilya Poduremennykh, Evgeniy Shuranov, Björn W. Schuller

Figure 1 for On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era
Figure 2 for On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era
Figure 3 for On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era
Figure 4 for On the Impact of Word Error Rate on Acoustic-Linguistic Speech Emotion Recognition: An Update for the Deep Learning Era
Viaarxiv icon

Decoupled Federated Learning for ASR with Non-IID Data

Jun 18, 2022
Han Zhu, Jindong Wang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

Figure 1 for Decoupled Federated Learning for ASR with Non-IID Data
Figure 2 for Decoupled Federated Learning for ASR with Non-IID Data
Figure 3 for Decoupled Federated Learning for ASR with Non-IID Data
Figure 4 for Decoupled Federated Learning for ASR with Non-IID Data
Viaarxiv icon

3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition

Add code
Bookmark button
Alert button
Apr 07, 2022
Zhao You, Shulin Feng, Dan Su, Dong Yu

Figure 1 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Figure 2 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Figure 3 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Figure 4 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Viaarxiv icon