Alert button

"speech recognition": models, code, and papers
Alert button

Extending RNN-T-based speech recognition systems with emotion and language classification

Jul 28, 2022
Zvi Kons, Hagai Aronowitz, Edmilson Morais, Matheus Damasceno, Hong-Kwang Kuo, Samuel Thomas, George Saon

Figure 1 for Extending RNN-T-based speech recognition systems with emotion and language classification
Figure 2 for Extending RNN-T-based speech recognition systems with emotion and language classification
Figure 3 for Extending RNN-T-based speech recognition systems with emotion and language classification
Figure 4 for Extending RNN-T-based speech recognition systems with emotion and language classification
Viaarxiv icon

Machine Unlearning: A Survey

Jun 06, 2023
Heng Xu, Tianqing Zhu, Lefeng Zhang, Wanlei Zhou, Philip S. Yu

Figure 1 for Machine Unlearning: A Survey
Figure 2 for Machine Unlearning: A Survey
Figure 3 for Machine Unlearning: A Survey
Figure 4 for Machine Unlearning: A Survey
Viaarxiv icon

FonMTL: Towards Multitask Learning for the Fon Language

Add code
Bookmark button
Alert button
Aug 28, 2023
Bonaventure F. P. Dossou, Iffanice Houndayi, Pamely Zantou, Gilles Hacheme

Figure 1 for FonMTL: Towards Multitask Learning for the Fon Language
Figure 2 for FonMTL: Towards Multitask Learning for the Fon Language
Figure 3 for FonMTL: Towards Multitask Learning for the Fon Language
Figure 4 for FonMTL: Towards Multitask Learning for the Fon Language
Viaarxiv icon

Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise

Add code
Bookmark button
Alert button
Nov 03, 2022
Christian Heider Nielsen, Zheng-Hua Tan

Figure 1 for Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise
Figure 2 for Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise
Figure 3 for Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise
Figure 4 for Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise
Viaarxiv icon

pMCT: Patched Multi-Condition Training for Robust Speech Recognition

Add code
Bookmark button
Alert button
Jul 11, 2022
Pablo Peso Parada, Agnieszka Dobrowolska, Karthikeyan Saravanan, Mete Ozay

Figure 1 for pMCT: Patched Multi-Condition Training for Robust Speech Recognition
Figure 2 for pMCT: Patched Multi-Condition Training for Robust Speech Recognition
Figure 3 for pMCT: Patched Multi-Condition Training for Robust Speech Recognition
Figure 4 for pMCT: Patched Multi-Condition Training for Robust Speech Recognition
Viaarxiv icon

Robustness of Multi-Source MT to Transcription Errors

Add code
Bookmark button
Alert button
May 26, 2023
Dominik Macháček, Peter Polák, Ondřej Bojar, Raj Dabre

Figure 1 for Robustness of Multi-Source MT to Transcription Errors
Figure 2 for Robustness of Multi-Source MT to Transcription Errors
Figure 3 for Robustness of Multi-Source MT to Transcription Errors
Figure 4 for Robustness of Multi-Source MT to Transcription Errors
Viaarxiv icon

HMM vs. CTC for Automatic Speech Recognition: Comparison Based on Full-Sum Training from Scratch

Oct 18, 2022
Tina Raissi, Wei Zhou, Simon Berger, Ralf Schlüter, Hermann Ney

Figure 1 for HMM vs. CTC for Automatic Speech Recognition: Comparison Based on Full-Sum Training from Scratch
Figure 2 for HMM vs. CTC for Automatic Speech Recognition: Comparison Based on Full-Sum Training from Scratch
Figure 3 for HMM vs. CTC for Automatic Speech Recognition: Comparison Based on Full-Sum Training from Scratch
Figure 4 for HMM vs. CTC for Automatic Speech Recognition: Comparison Based on Full-Sum Training from Scratch
Viaarxiv icon

The Far Side of Failure: Investigating the Impact of Speech Recognition Errors on Subsequent Dementia Classification

Add code
Bookmark button
Alert button
Nov 11, 2022
Changye Li, Trevor Cohen, Serguei Pakhomov

Figure 1 for The Far Side of Failure: Investigating the Impact of Speech Recognition Errors on Subsequent Dementia Classification
Figure 2 for The Far Side of Failure: Investigating the Impact of Speech Recognition Errors on Subsequent Dementia Classification
Figure 3 for The Far Side of Failure: Investigating the Impact of Speech Recognition Errors on Subsequent Dementia Classification
Figure 4 for The Far Side of Failure: Investigating the Impact of Speech Recognition Errors on Subsequent Dementia Classification
Viaarxiv icon

Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset

Add code
Bookmark button
Alert button
Sep 11, 2022
H. A. Z. Sameen Shahgir, Khondker Salman Sayeed, Tanjeem Azwad Zaman

Figure 1 for Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset
Figure 2 for Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset
Figure 3 for Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset
Figure 4 for Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset
Viaarxiv icon

Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition

Add code
Bookmark button
Alert button
Apr 18, 2023
Maurits Bleeker, Pawel Swietojanski, Stefan Braun, Xiaodan Zhuang

Figure 1 for Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition
Figure 2 for Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition
Figure 3 for Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition
Figure 4 for Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition
Viaarxiv icon