Alert button

"speech recognition": models, code, and papers
Alert button

End-to-End Speech Emotion Recognition: Challenges of Real-Life Emergency Call Centers Data Recordings

Oct 28, 2021
Théo Deschamps-Berger, Lori Lamel, Laurence Devillers

Figure 1 for End-to-End Speech Emotion Recognition: Challenges of Real-Life Emergency Call Centers Data Recordings
Figure 2 for End-to-End Speech Emotion Recognition: Challenges of Real-Life Emergency Call Centers Data Recordings
Figure 3 for End-to-End Speech Emotion Recognition: Challenges of Real-Life Emergency Call Centers Data Recordings
Figure 4 for End-to-End Speech Emotion Recognition: Challenges of Real-Life Emergency Call Centers Data Recordings
Viaarxiv icon

Long Short-Term Memory based Convolutional Recurrent Neural Networks for Large Vocabulary Speech Recognition

Oct 11, 2016
Xiangang Li, Xihong Wu

Figure 1 for Long Short-Term Memory based Convolutional Recurrent Neural Networks for Large Vocabulary Speech Recognition
Figure 2 for Long Short-Term Memory based Convolutional Recurrent Neural Networks for Large Vocabulary Speech Recognition
Figure 3 for Long Short-Term Memory based Convolutional Recurrent Neural Networks for Large Vocabulary Speech Recognition
Figure 4 for Long Short-Term Memory based Convolutional Recurrent Neural Networks for Large Vocabulary Speech Recognition
Viaarxiv icon

Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge

Add code
Bookmark button
Alert button
May 14, 2022
Tanel Alumäe, Kunnar Kukk

Figure 1 for Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge
Figure 2 for Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge
Figure 3 for Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge
Figure 4 for Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge
Viaarxiv icon

Speaker adaptation for Wav2vec2 based dysarthric ASR

Add code
Bookmark button
Alert button
Apr 02, 2022
Murali Karthick Baskar, Tim Herzig, Diana Nguyen, Mireia Diez, Tim Polzehl, Lukáš Burget, Jan "Honza'' Černocký

Figure 1 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Figure 2 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Figure 3 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Figure 4 for Speaker adaptation for Wav2vec2 based dysarthric ASR
Viaarxiv icon

Building an ASR Error Robust Spoken Virtual Patient System in a Highly Class-Imbalanced Scenario Without Speech Data

Add code
Bookmark button
Alert button
Apr 11, 2022
Vishal Sunder, Prashant Serai, Eric Fosler-Lussier

Figure 1 for Building an ASR Error Robust Spoken Virtual Patient System in a Highly Class-Imbalanced Scenario Without Speech Data
Figure 2 for Building an ASR Error Robust Spoken Virtual Patient System in a Highly Class-Imbalanced Scenario Without Speech Data
Figure 3 for Building an ASR Error Robust Spoken Virtual Patient System in a Highly Class-Imbalanced Scenario Without Speech Data
Figure 4 for Building an ASR Error Robust Spoken Virtual Patient System in a Highly Class-Imbalanced Scenario Without Speech Data
Viaarxiv icon

Revisiting End-to-End Speech-to-Text Translation From Scratch

Add code
Bookmark button
Alert button
Jun 09, 2022
Biao Zhang, Barry Haddow, Rico Sennrich

Figure 1 for Revisiting End-to-End Speech-to-Text Translation From Scratch
Figure 2 for Revisiting End-to-End Speech-to-Text Translation From Scratch
Figure 3 for Revisiting End-to-End Speech-to-Text Translation From Scratch
Figure 4 for Revisiting End-to-End Speech-to-Text Translation From Scratch
Viaarxiv icon

Improving Language Identification of Accented Speech

Apr 01, 2022
Kunnar Kukk, Tanel Alumäe

Figure 1 for Improving Language Identification of Accented Speech
Figure 2 for Improving Language Identification of Accented Speech
Figure 3 for Improving Language Identification of Accented Speech
Figure 4 for Improving Language Identification of Accented Speech
Viaarxiv icon

Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching

Dec 23, 2018
Chih-Kuan Yeh, Jianshu Chen, Chengzhu Yu, Dong Yu

Figure 1 for Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching
Figure 2 for Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching
Figure 3 for Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching
Figure 4 for Unsupervised Speech Recognition via Segmental Empirical Output Distribution Matching
Viaarxiv icon

Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging

Aug 07, 2019
Binh Nguyen, Vu Bao Hung Nguyen, Hien Nguyen, Pham Ngoc Phuong, The-Loc Nguyen, Quoc Truong Do, Luong Chi Mai

Figure 1 for Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging
Figure 2 for Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging
Figure 3 for Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging
Figure 4 for Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging
Viaarxiv icon

Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition

Nov 12, 2018
Raden Mu'az Mun'im, Nakamasa Inoue, Koichi Shinoda

Figure 1 for Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Figure 2 for Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Figure 3 for Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Figure 4 for Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Viaarxiv icon