Alert button

"speech": models, code, and papers
Alert button

Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 16, 2022
Atsumoto Ohashi, Ryuichiro Higashinaka

Figure 1 for Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning
Figure 2 for Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning
Figure 3 for Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning
Figure 4 for Adaptive Natural Language Generation for Task-oriented Dialogue via Reinforcement Learning
Viaarxiv icon

Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation

Aug 04, 2021
Seongmin Park, Dongchan Shin, Sangyoun Paik, Subong Choi, Alena Kazakova, Jihwa Lee

Figure 1 for Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation
Figure 2 for Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation
Figure 3 for Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation
Figure 4 for Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation
Viaarxiv icon

Digital Einstein Experience: Fast Text-to-Speech for Conversational AI

Jul 21, 2021
Joanna Rownicka, Kilian Sprenkamp, Antonio Tripiana, Volodymyr Gromoglasov, Timo P Kunz

Figure 1 for Digital Einstein Experience: Fast Text-to-Speech for Conversational AI
Figure 2 for Digital Einstein Experience: Fast Text-to-Speech for Conversational AI
Viaarxiv icon

Towards the evaluation of simultaneous speech translation from a communicative perspective

Mar 15, 2021
claudio Fantinuoli, Bianca Prandi

Figure 1 for Towards the evaluation of simultaneous speech translation from a communicative perspective
Figure 2 for Towards the evaluation of simultaneous speech translation from a communicative perspective
Figure 3 for Towards the evaluation of simultaneous speech translation from a communicative perspective
Viaarxiv icon

Deep Multi-Frame MVDR Filtering for Binaural Noise Reduction

Add code
Bookmark button
Alert button
May 18, 2022
Marvin Tammen, Simon Doclo

Figure 1 for Deep Multi-Frame MVDR Filtering for Binaural Noise Reduction
Figure 2 for Deep Multi-Frame MVDR Filtering for Binaural Noise Reduction
Viaarxiv icon

Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages

Add code
Bookmark button
Alert button
Aug 26, 2022
Kaushal Santosh Bhogale, Abhigyan Raman, Tahir Javed, Sumanth Doddapaneni, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra

Figure 1 for Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages
Figure 2 for Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages
Figure 3 for Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages
Figure 4 for Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages
Viaarxiv icon

End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks

Oct 07, 2021
Navin Raj Prabhu, Guillaume Carbajal, Nale Lehmann-Willenbrock, Timo Gerkmann

Figure 1 for End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks
Figure 2 for End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks
Figure 3 for End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks
Figure 4 for End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks
Viaarxiv icon

Noise Classification Aided Attention-Based Neural Network for Monaural Speech Enhancement

Add code
Bookmark button
Alert button
May 31, 2021
Lu Ma, Song Yang, Yaguang Gong, Zhongqin Wu

Figure 1 for Noise Classification Aided Attention-Based Neural Network for Monaural Speech Enhancement
Figure 2 for Noise Classification Aided Attention-Based Neural Network for Monaural Speech Enhancement
Figure 3 for Noise Classification Aided Attention-Based Neural Network for Monaural Speech Enhancement
Figure 4 for Noise Classification Aided Attention-Based Neural Network for Monaural Speech Enhancement
Viaarxiv icon

ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection

Add code
Bookmark button
Alert button
Sep 01, 2021
Junichi Yamagishi, Xin Wang, Massimiliano Todisco, Md Sahidullah, Jose Patino, Andreas Nautsch, Xuechen Liu, Kong Aik Lee, Tomi Kinnunen, Nicholas Evans, Héctor Delgado

Figure 1 for ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Figure 2 for ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Figure 3 for ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Figure 4 for ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Viaarxiv icon

Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks

Add code
Bookmark button
Alert button
Sep 22, 2022
Cassia Valentini-Botinhao, Manuel Sam Ribeiro, Oliver Watts, Korin Richmond, Gustav Eje Henter

Figure 1 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Figure 2 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Figure 3 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Figure 4 for Predicting pairwise preferences between TTS audio stimuli using parallel ratings data and anti-symmetric twin neural networks
Viaarxiv icon