"speech recognition": models, code, and papers

wav2letter++: The Fastest Open-source Speech Recognition System

Dec 18, 2018
Vineel Pratap, Awni Hannun, Qiantong Xu, Jeff Cai, Jacob Kahn, Gabriel Synnaeve, Vitaliy Liptchinsky, Ronan Collobert

Multiple-hypothesis CTC-based semi-supervised adaptation of end-to-end speech recognition

Mar 31, 2021
Cong-Thanh Do, Rama Doddipatla, Thomas Hain

BembaSpeech: A Speech Recognition Corpus for the Bemba Language

Feb 09, 2021
Claytone Sikasote, Antonios Anastasopoulos

FullStop: Punctuation and Segmentation Prediction for Dutch with Transformers

Jan 09, 2023
Vincent Vandeghinste, Oliver Guhr

Training Autoregressive Speech Recognition Models with Limited in-domain Supervision

Oct 27, 2022
Chak-Fai Li, Francis Keith, William Hartmann, Matthew Snover

SynthASR: Unlocking Synthetic Data for Speech Recognition

Jun 14, 2021
Amin Fazel, Wei Yang, Yulan Liu, Roberto Barra-Chicote, Yixiong Meng, Roland Maas, Jasha Droppo

GM-TCNet: Gated Multi-scale Temporal Convolutional Network using Emotion Causality for Speech Emotion Recognition

Oct 28, 2022
Jia-Xin Ye, Xin-Cheng Wen, Xuan-Ze Wang, Yong Xu, Yan Luo, Chang-Li Wu, Li-Yan Chen, Kun-Hong Liu

Adapting End-to-End Speech Recognition for Readable Subtitles

May 25, 2020
Danni Liu, Jan Niehues, Gerasimos Spanakis

Adapt-and-Adjust: Overcoming the Long-Tail Problem of Multilingual Speech Recognition

Dec 03, 2020
Genta Indra Winata, Guangsen Wang, Caiming Xiong, Steven Hoi

Disentangling Prosody Representations with Unsupervised Speech Reconstruction

Dec 14, 2022
Leyuan Qu, Taihao Li, Cornelius Weber, Theresa Pekarek-Rosin, Fuji Ren, Stefan Wermter
