Alert button

"speech": models, code, and papers
Alert button

A Novel Decision Tree for Depression Recognition in Speech

Feb 22, 2020
Zhenyu Liu, Dongyu Wang, Lan Zhang, Bin Hu

Figure 1 for A Novel Decision Tree for Depression Recognition in Speech
Figure 2 for A Novel Decision Tree for Depression Recognition in Speech
Figure 3 for A Novel Decision Tree for Depression Recognition in Speech
Figure 4 for A Novel Decision Tree for Depression Recognition in Speech
Viaarxiv icon

A Comparison of Discrete Latent Variable Models for Speech Representation Learning

Oct 24, 2020
Henry Zhou, Alexei Baevski, Michael Auli

Figure 1 for A Comparison of Discrete Latent Variable Models for Speech Representation Learning
Figure 2 for A Comparison of Discrete Latent Variable Models for Speech Representation Learning
Figure 3 for A Comparison of Discrete Latent Variable Models for Speech Representation Learning
Figure 4 for A Comparison of Discrete Latent Variable Models for Speech Representation Learning
Viaarxiv icon

Who Needs Words? Lexicon-Free Speech Recognition

Add code
Bookmark button
Alert button
Apr 09, 2019
Tatiana Likhomanenko, Gabriel Synnaeve, Ronan Collobert

Figure 1 for Who Needs Words? Lexicon-Free Speech Recognition
Figure 2 for Who Needs Words? Lexicon-Free Speech Recognition
Figure 3 for Who Needs Words? Lexicon-Free Speech Recognition
Figure 4 for Who Needs Words? Lexicon-Free Speech Recognition
Viaarxiv icon

Joint AEC AND Beamforming with Double-Talk Detection using RNN-Transformer

Add code
Bookmark button
Alert button
Nov 09, 2021
Vinay Kothapally, Yong Xu, Meng Yu, Shi-Xiong Zhang, Dong Yu

Figure 1 for Joint AEC AND Beamforming with Double-Talk Detection using RNN-Transformer
Figure 2 for Joint AEC AND Beamforming with Double-Talk Detection using RNN-Transformer
Viaarxiv icon

A Review of Language and Speech Features for Cognitive-Linguistic Assessment

Jun 04, 2019
Rohit Voleti, Julie M. Liss, Visar Berisha

Figure 1 for A Review of Language and Speech Features for Cognitive-Linguistic Assessment
Figure 2 for A Review of Language and Speech Features for Cognitive-Linguistic Assessment
Figure 3 for A Review of Language and Speech Features for Cognitive-Linguistic Assessment
Figure 4 for A Review of Language and Speech Features for Cognitive-Linguistic Assessment
Viaarxiv icon

On Prosody Modeling for ASR+TTS based Voice Conversion

Add code
Bookmark button
Alert button
Jul 20, 2021
Wen-Chin Huang, Tomoki Hayashi, Xinjian Li, Shinji Watanabe, Tomoki Toda

Figure 1 for On Prosody Modeling for ASR+TTS based Voice Conversion
Figure 2 for On Prosody Modeling for ASR+TTS based Voice Conversion
Figure 3 for On Prosody Modeling for ASR+TTS based Voice Conversion
Figure 4 for On Prosody Modeling for ASR+TTS based Voice Conversion
Viaarxiv icon

Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings

Add code
Bookmark button
Alert button
Mar 28, 2022
Niko Brümmer, Albert Swart, Ladislav Mošner, Anna Silnova, Oldřich Plchot, Themos Stafylakis, Lukáš Burget

Figure 1 for Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings
Viaarxiv icon

Generalization Ability of MOS Prediction Networks

Add code
Bookmark button
Alert button
Oct 18, 2021
Erica Cooper, Wen-Chin Huang, Tomoki Toda, Junichi Yamagishi

Figure 1 for Generalization Ability of MOS Prediction Networks
Figure 2 for Generalization Ability of MOS Prediction Networks
Figure 3 for Generalization Ability of MOS Prediction Networks
Figure 4 for Generalization Ability of MOS Prediction Networks
Viaarxiv icon

Assessing Progress of Parkinson s Disease Using Acoustic Analysis of Phonation

Mar 17, 2022
Jiri Mekyska, Zoltan Galaz, Zdenek Mzourek, Zdenek Smekal, Irena Rektorova, Ilona Eliasova, Milena Kostalova, Martina Mrackova, Dagmar Berankov, Marcos Faundez-Zanuy, Karmele Lopez-de-Ipiña, Jesus B. Alonso-Hernandez

Figure 1 for Assessing Progress of Parkinson s Disease Using Acoustic Analysis of Phonation
Figure 2 for Assessing Progress of Parkinson s Disease Using Acoustic Analysis of Phonation
Figure 3 for Assessing Progress of Parkinson s Disease Using Acoustic Analysis of Phonation
Figure 4 for Assessing Progress of Parkinson s Disease Using Acoustic Analysis of Phonation
Viaarxiv icon

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Add code
Bookmark button
Alert button
Mar 21, 2022
Quan Wang, Yang Yu, Jason Pelecanos, Yiling Huang, Ignacio Lopez Moreno

Figure 1 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 2 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 3 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 4 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Viaarxiv icon