Alert button

"speech recognition": models, code, and papers
Alert button

Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss

Aug 11, 2023
Mohammad Soleymanpour, Mahmoud Al Ismail, Fahimeh Bahmaninezhad, Kshitiz Kumar, Jian Wu

Figure 1 for Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss
Figure 2 for Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss
Figure 3 for Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss
Figure 4 for Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss
Viaarxiv icon

Towards spoken dialect identification of Irish

Add code
Bookmark button
Alert button
Jul 14, 2023
Liam Lonergan, Mengjie Qian, Neasa Ní Chiaráin, Christer Gobl, Ailbhe Ní Chasaide

Figure 1 for Towards spoken dialect identification of Irish
Figure 2 for Towards spoken dialect identification of Irish
Figure 3 for Towards spoken dialect identification of Irish
Figure 4 for Towards spoken dialect identification of Irish
Viaarxiv icon

Turning Whisper into Real-Time Transcription System

Add code
Bookmark button
Alert button
Jul 27, 2023
Dominik Macháček, Raj Dabre, Ondřej Bojar

Figure 1 for Turning Whisper into Real-Time Transcription System
Figure 2 for Turning Whisper into Real-Time Transcription System
Figure 3 for Turning Whisper into Real-Time Transcription System
Figure 4 for Turning Whisper into Real-Time Transcription System
Viaarxiv icon

ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Oct 24, 2022
Sanchit Gandhi, Patrick von Platen, Alexander M. Rush

Figure 1 for ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition
Figure 2 for ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition
Figure 3 for ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition
Figure 4 for ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition
Viaarxiv icon

Comparative Analysis of the wav2vec 2.0 Feature Extractor

Add code
Bookmark button
Alert button
Aug 08, 2023
Peter Vieting, Ralf Schlüter, Hermann Ney

Figure 1 for Comparative Analysis of the wav2vec 2.0 Feature Extractor
Figure 2 for Comparative Analysis of the wav2vec 2.0 Feature Extractor
Figure 3 for Comparative Analysis of the wav2vec 2.0 Feature Extractor
Figure 4 for Comparative Analysis of the wav2vec 2.0 Feature Extractor
Viaarxiv icon

On Monotonic Aggregation for Open-domain QA

Add code
Bookmark button
Alert button
Aug 08, 2023
Sang-eun Han, Yeonseok Jeong, Seung-won Hwang, Kyungjae Lee

Figure 1 for On Monotonic Aggregation for Open-domain QA
Figure 2 for On Monotonic Aggregation for Open-domain QA
Figure 3 for On Monotonic Aggregation for Open-domain QA
Figure 4 for On Monotonic Aggregation for Open-domain QA
Viaarxiv icon

Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings

Jul 31, 2023
Manuel Sam Ribeiro, Giulia Comini, Jaime Lorenzo-Trueba

Figure 1 for Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings
Figure 2 for Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings
Figure 3 for Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings
Figure 4 for Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings
Viaarxiv icon

OxfordVGG Submission to the EGO4D AV Transcription Challenge

Add code
Bookmark button
Alert button
Jul 18, 2023
Jaesung Huh, Max Bain, Andrew Zisserman

Figure 1 for OxfordVGG Submission to the EGO4D AV Transcription Challenge
Figure 2 for OxfordVGG Submission to the EGO4D AV Transcription Challenge
Viaarxiv icon

Latent Phrase Matching for Dysarthric Speech

Jun 08, 2023
Colin Lea, Dianna Yee, Jaya Narain, Zifang Huang, Lauren Tooley, Jeffrey P. Bigham, Leah Findlater

Figure 1 for Latent Phrase Matching for Dysarthric Speech
Figure 2 for Latent Phrase Matching for Dysarthric Speech
Figure 3 for Latent Phrase Matching for Dysarthric Speech
Figure 4 for Latent Phrase Matching for Dysarthric Speech
Viaarxiv icon

Accented Speech Recognition under the Indian context

Sep 11, 2022
Ankit Grover

Figure 1 for Accented Speech Recognition under the Indian context
Figure 2 for Accented Speech Recognition under the Indian context
Figure 3 for Accented Speech Recognition under the Indian context
Figure 4 for Accented Speech Recognition under the Indian context
Viaarxiv icon