"speech": models, code, and papers

Automatic Heteronym Resolution Pipeline Using RAD-TTS Aligners

Feb 28, 2023
Jocelyn Huang, Evelina Bakhturina, Oktai Tatanov

Preregistered protocol for: Articulatory changes in speech following treatment for oral or oropharyngeal cancer: a systematic review

Sep 14, 2022
Thomas B. Tienkamp, Teja Rebernik, Defne Abur, Rob J. J. H. van Son, Sebastiaan A. H. J. de Visscher, Max J. H. Witjes, Martijn Wieling

Incremental Speech Synthesis For Speech-To-Speech Translation

Oct 15, 2021
Danni Liu, Changhan Wang, Hongyu Gong, Xutai Ma, Yun Tang, Juan Pino

Chain-based Discriminative Autoencoders for Speech Recognition

Mar 28, 2022
Hung-Shin Lee, Pin-Tuan Huang, Yao-Fei Cheng, Hsin-Min Wang

Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech Recognition

Oct 27, 2022
Steven Vander Eeckt, Hugo Van hamme

Wideband Audio Waveform Evaluation Networks: Efficient, Accurate Estimation of Speech Qualities

Jun 27, 2022
Andrew Catellier, Stephen Voran

Contrastive Representation Learning for Acoustic Parameter Estimation

Feb 22, 2023
Philipp Götz, Cagdas Tuna, Andreas Walther, Emanuël A. P. Habets

Bayesian Neural Network Language Modeling for Speech Recognition

Aug 28, 2022
Boyang Xue, Shoukang Hu, Junhao Xu, Mengzhe Geng, Xunying Liu, Helen Meng

Fast-U2++: Fast and Accurate End-to-End Speech Recognition in Joint CTC/Attention Frames

Nov 02, 2022
Chengdong Liang, Xiao-Lei Zhang, BinBin Zhang, Di Wu, Shengqiang Li, Xingchen Song, Zhendong Peng, Fuping Pan

WavThruVec: Latent speech representation as intermediate features for neural speech synthesis

Mar 31, 2022
Hubert Siuzdak, Piotr Dura, Pol van Rijn, Nori Jacoby
