Alert button

"speech": models, code, and papers
Alert button

Disfluencies and Human Speech Transcription Errors

Add code
Bookmark button
Alert button
Apr 08, 2019
Vicky Zayats, Trang Tran, Richard Wright, Courtney Mansfield, Mari Ostendorf

Figure 1 for Disfluencies and Human Speech Transcription Errors
Figure 2 for Disfluencies and Human Speech Transcription Errors
Figure 3 for Disfluencies and Human Speech Transcription Errors
Figure 4 for Disfluencies and Human Speech Transcription Errors
Viaarxiv icon

Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis

Add code
Bookmark button
Alert button
Feb 28, 2020
Jennifer Williams, Joanna Rownicka, Pilar Oplustil, Simon King

Figure 1 for Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis
Figure 2 for Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis
Figure 3 for Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis
Figure 4 for Comparison of Speech Representations for Automatic Quality Estimation in Multi-Speaker Text-to-Speech Synthesis
Viaarxiv icon

Transferring neural speech waveform synthesizers to musical instrument sounds generation

Add code
Bookmark button
Alert button
Oct 27, 2019
Yi Zhao, Xin Wang, Lauri Juvela, Junichi Yamagishi

Figure 1 for Transferring neural speech waveform synthesizers to musical instrument sounds generation
Figure 2 for Transferring neural speech waveform synthesizers to musical instrument sounds generation
Figure 3 for Transferring neural speech waveform synthesizers to musical instrument sounds generation
Figure 4 for Transferring neural speech waveform synthesizers to musical instrument sounds generation
Viaarxiv icon

PriMock57: A Dataset Of Primary Care Mock Consultations

Add code
Bookmark button
Alert button
Apr 01, 2022
Alex Papadopoulos Korfiatis, Francesco Moramarco, Radmila Sarac, Aleksandar Savkov

Figure 1 for PriMock57: A Dataset Of Primary Care Mock Consultations
Figure 2 for PriMock57: A Dataset Of Primary Care Mock Consultations
Figure 3 for PriMock57: A Dataset Of Primary Care Mock Consultations
Figure 4 for PriMock57: A Dataset Of Primary Care Mock Consultations
Viaarxiv icon

Cross Lingual Cross Corpus Speech Emotion Recognition

Mar 18, 2020
Shivali Goel, Homayoon Beigi

Figure 1 for Cross Lingual Cross Corpus Speech Emotion Recognition
Figure 2 for Cross Lingual Cross Corpus Speech Emotion Recognition
Figure 3 for Cross Lingual Cross Corpus Speech Emotion Recognition
Figure 4 for Cross Lingual Cross Corpus Speech Emotion Recognition
Viaarxiv icon

Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers

Nov 26, 2019
Ya Zhao, Rui Xu, Xinchao Wang, Peng Hou, Haihong Tang, Mingli Song

Figure 1 for Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers
Figure 2 for Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers
Figure 3 for Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers
Figure 4 for Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers
Viaarxiv icon

Multi-speaker Text-to-speech Synthesis Using Deep Gaussian Processes

Add code
Bookmark button
Alert button
Aug 07, 2020
Kentaro Mitsui, Tomoki Koriyama, Hiroshi Saruwatari

Viaarxiv icon

Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data

Add code
Bookmark button
Alert button
Oct 14, 2021
Haitong Zhang, Yue Lin

Figure 1 for Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data
Figure 2 for Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data
Figure 3 for Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data
Figure 4 for Improve Cross-lingual Voice Cloning Using Low-quality Code-switched Data
Viaarxiv icon

Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation

Add code
Bookmark button
Alert button
Jun 09, 2020
Changhan Wang, Juan Pino, Jiatao Gu

Figure 1 for Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation
Figure 2 for Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation
Figure 3 for Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation
Figure 4 for Improving Cross-Lingual Transfer Learning for End-to-End Speech Recognition with Speech Translation
Viaarxiv icon

Word Discovery in Visually Grounded, Self-Supervised Speech Models

Add code
Bookmark button
Alert button
Mar 28, 2022
Puyuan Peng, David Harwath

Figure 1 for Word Discovery in Visually Grounded, Self-Supervised Speech Models
Figure 2 for Word Discovery in Visually Grounded, Self-Supervised Speech Models
Figure 3 for Word Discovery in Visually Grounded, Self-Supervised Speech Models
Figure 4 for Word Discovery in Visually Grounded, Self-Supervised Speech Models
Viaarxiv icon