"speech recognition": models, code, and papers

RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis

Jun 15, 2021
Rohola Zandie, Mohammad H. Mahoor, Julia Madsen, Eshrat S. Emamian

Radically Old Way of Computing Spectra: Applications in End-to-End ASR

Apr 02, 2021
Samik Sadhu, Hynek Hermansky

Tiny Transducer: A Highly-efficient Speech Recognition Model on Edge Devices

Feb 07, 2021
Yuekai Zhang, Sining Sun, Long Ma

Keyword Transformer: A Self-Attention Model for Keyword Spotting

Apr 01, 2021
Axel Berg, Mark O'Connor, Miguel Tairum Cruz

Low Resource German ASR with Untranscribed Data Spoken by Non-native Children -- INTERSPEECH 2021 Shared Task SPAPL System

Jun 18, 2021
Jinhan Wang, Yunzheng Zhu, Ruchao Fan, Wei Chu, Abeer Alwan

Dynamic Gradient Aggregation for Federated Domain Adaptation

Jun 14, 2021
Dimitrios Dimitriadis, Kenichi Kumatani, Robert Gmyr, Yashesh Gaur, Sefik Emre Eskimez

Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling

Oct 12, 2020
Jiahui Yu, Wei Han, Anmol Gulati, Chung-Cheng Chiu, Bo Li, Tara N. Sainath, Yonghui Wu, Ruoming Pang

The IWSLT 2021 BUT Speech Translation Systems

Jul 13, 2021
Hari Krishna Vydana, Martin Karafiát, Lukáš Burget, "Honza" Černocký

Bangla Natural Language Processing: A Comprehensive Review of Classical, Machine Learning, and Deep Learning Based Methods

May 31, 2021
Ovishake Sen, Mohtasim Fuad, MD. Nazrul Islam, Jakaria Rabbi, MD. Kamrul Hasan, Awal Ahmed Fime, Md. Tahmid Hasan Fuad, Delowar Sikder, MD. Akil Raihan Iftee

MediaSpeech: Multilanguage ASR Benchmark and Dataset

Mar 30, 2021
Rostislav Kolobov, Olga Okhapkina, Olga Omelchishina, Andrey Platunov, Roman Bedyakin, Vyacheslav Moshkin, Dmitry Menshikov, Nikolay Mikhaylovskiy
