Alert button

"speech recognition": models, code, and papers
Alert button

Self-Supervised Speech Representation Learning: A Review

Add code
Bookmark button
Alert button
May 21, 2022
Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe

Figure 1 for Self-Supervised Speech Representation Learning: A Review
Figure 2 for Self-Supervised Speech Representation Learning: A Review
Figure 3 for Self-Supervised Speech Representation Learning: A Review
Figure 4 for Self-Supervised Speech Representation Learning: A Review
Viaarxiv icon

Distilling the Knowledge of BERT for CTC-based ASR

Sep 05, 2022
Hayato Futami, Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara

Figure 1 for Distilling the Knowledge of BERT for CTC-based ASR
Figure 2 for Distilling the Knowledge of BERT for CTC-based ASR
Figure 3 for Distilling the Knowledge of BERT for CTC-based ASR
Figure 4 for Distilling the Knowledge of BERT for CTC-based ASR
Viaarxiv icon

LRS3-TED: a large-scale dataset for visual speech recognition

Oct 28, 2018
Triantafyllos Afouras, Joon Son Chung, Andrew Zisserman

Figure 1 for LRS3-TED: a large-scale dataset for visual speech recognition
Viaarxiv icon

Espresso: A Fast End-to-end Neural Speech Recognition Toolkit

Add code
Bookmark button
Alert button
Sep 18, 2019
Yiming Wang, Tongfei Chen, Hainan Xu, Shuoyang Ding, Hang Lv, Yiwen Shao, Nanyun Peng, Lei Xie, Shinji Watanabe, Sanjeev Khudanpur

Figure 1 for Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Figure 2 for Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Figure 3 for Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Figure 4 for Espresso: A Fast End-to-end Neural Speech Recognition Toolkit
Viaarxiv icon

Arabic Speech Recognition System using CMU-Sphinx4

Apr 17, 2007
H. Satori, M. Harti, N. Chenfour

Figure 1 for Arabic Speech Recognition System using CMU-Sphinx4
Figure 2 for Arabic Speech Recognition System using CMU-Sphinx4
Figure 3 for Arabic Speech Recognition System using CMU-Sphinx4
Figure 4 for Arabic Speech Recognition System using CMU-Sphinx4
Viaarxiv icon

Techniques for Vocabulary Expansion in Hybrid Speech Recognition Systems

Mar 19, 2020
Nikolay Malkovsky, Vladimir Bataev, Dmitrii Sviridkin, Natalia Kizhaeva, Aleksandr Laptev, Ildar Valiev, Oleg Petrov

Figure 1 for Techniques for Vocabulary Expansion in Hybrid Speech Recognition Systems
Figure 2 for Techniques for Vocabulary Expansion in Hybrid Speech Recognition Systems
Viaarxiv icon

Endpoint Detection for Streaming End-to-End Multi-talker ASR

Add code
Bookmark button
Alert button
Jan 24, 2022
Liang Lu, Jinyu Li, Yifan Gong

Figure 1 for Endpoint Detection for Streaming End-to-End Multi-talker ASR
Figure 2 for Endpoint Detection for Streaming End-to-End Multi-talker ASR
Figure 3 for Endpoint Detection for Streaming End-to-End Multi-talker ASR
Figure 4 for Endpoint Detection for Streaming End-to-End Multi-talker ASR
Viaarxiv icon

Snow Mountain: Dataset of Audio Recordings of The Bible in Low Resource Languages

Add code
Bookmark button
Alert button
Jun 01, 2022
Kavitha Raju, Anjaly V, Ryan Lish, Joel Mathew

Figure 1 for Snow Mountain: Dataset of Audio Recordings of The Bible in Low Resource Languages
Figure 2 for Snow Mountain: Dataset of Audio Recordings of The Bible in Low Resource Languages
Figure 3 for Snow Mountain: Dataset of Audio Recordings of The Bible in Low Resource Languages
Figure 4 for Snow Mountain: Dataset of Audio Recordings of The Bible in Low Resource Languages
Viaarxiv icon

Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition

Sep 02, 2020
Wei Li, James Qin, Chung-Cheng Chiu, Ruoming Pang, Yanzhang He

Figure 1 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Figure 2 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Figure 3 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Figure 4 for Parallel Rescoring with Transformer for Streaming On-Device Speech Recognition
Viaarxiv icon

A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation

Sep 14, 2022
Tom O'Malley, Arun Narayanan, Quan Wang

Figure 1 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Figure 2 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Figure 3 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Figure 4 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Viaarxiv icon