Alert button

"speech recognition": models, code, and papers
Alert button

ImportantAug: a data augmentation agent for speech

Add code
Bookmark button
Alert button
Dec 14, 2021
Viet Anh Trinh, Hassan Salami Kavaki, Michael I Mandel

Figure 1 for ImportantAug: a data augmentation agent for speech
Figure 2 for ImportantAug: a data augmentation agent for speech
Figure 3 for ImportantAug: a data augmentation agent for speech
Figure 4 for ImportantAug: a data augmentation agent for speech
Viaarxiv icon

Computer-Generated Music for Tabletop Role-Playing Games

Add code
Bookmark button
Alert button
Aug 16, 2020
Lucas N. Ferreira, Levi H. S. Lelis, Jim Whitehead

Viaarxiv icon

Towards the evaluation of simultaneous speech translation from a communicative perspective

Mar 15, 2021
claudio Fantinuoli, Bianca Prandi

Figure 1 for Towards the evaluation of simultaneous speech translation from a communicative perspective
Figure 2 for Towards the evaluation of simultaneous speech translation from a communicative perspective
Figure 3 for Towards the evaluation of simultaneous speech translation from a communicative perspective
Viaarxiv icon

Quantization and Deployment of Deep Neural Networks on Microcontrollers

May 27, 2021
Pierre-Emmanuel Novac, Ghouthi Boukli Hacene, Alain Pegatoquet, Benoît Miramond, Vincent Gripon

Figure 1 for Quantization and Deployment of Deep Neural Networks on Microcontrollers
Figure 2 for Quantization and Deployment of Deep Neural Networks on Microcontrollers
Figure 3 for Quantization and Deployment of Deep Neural Networks on Microcontrollers
Figure 4 for Quantization and Deployment of Deep Neural Networks on Microcontrollers
Viaarxiv icon

A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training

Mar 12, 2021
Adnan Haider, Chao Zhang, Florian L. Kreyssig, Philip C. Woodland

Figure 1 for A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training
Figure 2 for A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training
Figure 3 for A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training
Figure 4 for A Distributed Optimisation Framework Combining Natural Gradient with Hessian-Free for Discriminative Sequence Training
Viaarxiv icon

Focus on the present: a regularization method for the ASR source-target attention layer

Nov 02, 2020
Nanxin Chen, Piotr Żelasko, Jesús Villalba, Najim Dehak

Figure 1 for Focus on the present: a regularization method for the ASR source-target attention layer
Figure 2 for Focus on the present: a regularization method for the ASR source-target attention layer
Figure 3 for Focus on the present: a regularization method for the ASR source-target attention layer
Figure 4 for Focus on the present: a regularization method for the ASR source-target attention layer
Viaarxiv icon

Training Speech Enhancement Systems with Noisy Speech Datasets

May 26, 2021
Koichi Saito, Stefan Uhlich, Giorgio Fabbro, Yuki Mitsufuji

Figure 1 for Training Speech Enhancement Systems with Noisy Speech Datasets
Figure 2 for Training Speech Enhancement Systems with Noisy Speech Datasets
Figure 3 for Training Speech Enhancement Systems with Noisy Speech Datasets
Figure 4 for Training Speech Enhancement Systems with Noisy Speech Datasets
Viaarxiv icon

Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling

Add code
Bookmark button
Alert button
Oct 08, 2020
Jonathan Shen, Ye Jia, Mike Chrzanowski, Yu Zhang, Isaac Elias, Heiga Zen, Yonghui Wu

Figure 1 for Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Figure 2 for Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Figure 3 for Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Figure 4 for Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
Viaarxiv icon

Voice based self help System: User Experience Vs Accuracy

Apr 07, 2015
Sunil Kumar Kopparapu

Figure 1 for Voice based self help System: User Experience Vs Accuracy
Viaarxiv icon

DEVI: Open-source Human-Robot Interface for Interactive Receptionist Systems

Jan 02, 2021
Ramesha Karunasena, Piumi Sandarenu, Madushi Pinto, Achala Athukorala, Ranga Rodrigo, Peshala Jayasekara

Figure 1 for DEVI: Open-source Human-Robot Interface for Interactive Receptionist Systems
Figure 2 for DEVI: Open-source Human-Robot Interface for Interactive Receptionist Systems
Figure 3 for DEVI: Open-source Human-Robot Interface for Interactive Receptionist Systems
Figure 4 for DEVI: Open-source Human-Robot Interface for Interactive Receptionist Systems
Viaarxiv icon