Alert button

"speech": models, code, and papers
Alert button

Understanding Audio Features via Trainable Basis Functions

Add code
Bookmark button
Alert button
Apr 25, 2022
Kwan Yee Heung, Kin Wai Cheuk, Dorien Herremans

Figure 1 for Understanding Audio Features via Trainable Basis Functions
Figure 2 for Understanding Audio Features via Trainable Basis Functions
Figure 3 for Understanding Audio Features via Trainable Basis Functions
Figure 4 for Understanding Audio Features via Trainable Basis Functions
Viaarxiv icon

Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings

Add code
Bookmark button
Alert button
Mar 28, 2022
Niko Brümmer, Albert Swart, Ladislav Mošner, Anna Silnova, Oldřich Plchot, Themos Stafylakis, Lukáš Burget

Figure 1 for Probabilistic Spherical Discriminant Analysis: An Alternative to PLDA for length-normalized embeddings
Viaarxiv icon

Assessing Progress of Parkinson s Disease Using Acoustic Analysis of Phonation

Mar 17, 2022
Jiri Mekyska, Zoltan Galaz, Zdenek Mzourek, Zdenek Smekal, Irena Rektorova, Ilona Eliasova, Milena Kostalova, Martina Mrackova, Dagmar Berankov, Marcos Faundez-Zanuy, Karmele Lopez-de-Ipiña, Jesus B. Alonso-Hernandez

Figure 1 for Assessing Progress of Parkinson s Disease Using Acoustic Analysis of Phonation
Figure 2 for Assessing Progress of Parkinson s Disease Using Acoustic Analysis of Phonation
Figure 3 for Assessing Progress of Parkinson s Disease Using Acoustic Analysis of Phonation
Figure 4 for Assessing Progress of Parkinson s Disease Using Acoustic Analysis of Phonation
Viaarxiv icon

Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational Autoencoders

Add code
Bookmark button
Alert button
Nov 10, 2019
Mostafa Sadeghi, Xavier Alameda-Pineda

Figure 1 for Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational Autoencoders
Viaarxiv icon

Learning to detect dysarthria from raw speech

Add code
Bookmark button
Alert button
Nov 27, 2018
Juliette Millet, Neil Zeghidour

Figure 1 for Learning to detect dysarthria from raw speech
Figure 2 for Learning to detect dysarthria from raw speech
Figure 3 for Learning to detect dysarthria from raw speech
Figure 4 for Learning to detect dysarthria from raw speech
Viaarxiv icon

Audio-Visual Synchronisation in the wild

Dec 08, 2021
Honglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman

Figure 1 for Audio-Visual Synchronisation in the wild
Figure 2 for Audio-Visual Synchronisation in the wild
Figure 3 for Audio-Visual Synchronisation in the wild
Figure 4 for Audio-Visual Synchronisation in the wild
Viaarxiv icon

Sequence-To-Sequence Voice Conversion using F0 and Time Conditioning and Adversarial Learning

Oct 07, 2021
Frederik Bous, Laurent Benaroya, Nicolas Obin, Axel Roebel

Figure 1 for Sequence-To-Sequence Voice Conversion using F0 and Time Conditioning and Adversarial Learning
Figure 2 for Sequence-To-Sequence Voice Conversion using F0 and Time Conditioning and Adversarial Learning
Viaarxiv icon

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

Jun 04, 2021
Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong

Figure 1 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 2 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 3 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Viaarxiv icon

New Insights on Target Speaker Extraction

Feb 01, 2022
Mohamed Elminshawi, Wolfgang Mack, Soumitro Chakrabarty, Emanuël A. P. Habets

Figure 1 for New Insights on Target Speaker Extraction
Figure 2 for New Insights on Target Speaker Extraction
Figure 3 for New Insights on Target Speaker Extraction
Figure 4 for New Insights on Target Speaker Extraction
Viaarxiv icon

SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems

Jul 21, 2020
Hadi Abdullah, Kevin Warren, Vincent Bindschaedler, Nicolas Papernot, Patrick Traynor

Figure 1 for SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems
Figure 2 for SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems
Figure 3 for SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems
Figure 4 for SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems
Viaarxiv icon