Alert button

"speech": models, code, and papers
Alert button

WaveNODE: A Continuous Normalizing Flow for Speech Synthesis

Add code
Bookmark button
Alert button
Jun 09, 2020
Hyeongju Kim, Hyeonseung Lee, Woo Hyun Kang, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim

Figure 1 for WaveNODE: A Continuous Normalizing Flow for Speech Synthesis
Figure 2 for WaveNODE: A Continuous Normalizing Flow for Speech Synthesis
Figure 3 for WaveNODE: A Continuous Normalizing Flow for Speech Synthesis
Figure 4 for WaveNODE: A Continuous Normalizing Flow for Speech Synthesis
Viaarxiv icon

Audio-Based Deep Learning Frameworks for Detecting COVID-19

Feb 10, 2022
Dat Ngo, Lam Pham, Truong Hoang, Sefki Kolozali, Delaram Jarchi

Figure 1 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Figure 2 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Figure 3 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Figure 4 for Audio-Based Deep Learning Frameworks for Detecting COVID-19
Viaarxiv icon

wav2letter++: The Fastest Open-source Speech Recognition System

Add code
Bookmark button
Alert button
Dec 18, 2018
Vineel Pratap, Awni Hannun, Qiantong Xu, Jeff Cai, Jacob Kahn, Gabriel Synnaeve, Vitaliy Liptchinsky, Ronan Collobert

Figure 1 for wav2letter++: The Fastest Open-source Speech Recognition System
Figure 2 for wav2letter++: The Fastest Open-source Speech Recognition System
Figure 3 for wav2letter++: The Fastest Open-source Speech Recognition System
Figure 4 for wav2letter++: The Fastest Open-source Speech Recognition System
Viaarxiv icon

Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion

Add code
Bookmark button
Alert button
Sep 08, 2021
Yi-Syuan Liou, Wen-Chin Huang, Ming-Chi Yen, Shu-Wei Tsai, Yu-Huai Peng, Tomoki Toda, Yu Tsao, Hsin-Min Wang

Figure 1 for Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion
Figure 2 for Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion
Figure 3 for Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion
Figure 4 for Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion
Viaarxiv icon

Streamable Neural Audio Synthesis With Non-Causal Convolutions

Add code
Bookmark button
Alert button
Apr 14, 2022
Antoine Caillon, Philippe Esling

Figure 1 for Streamable Neural Audio Synthesis With Non-Causal Convolutions
Figure 2 for Streamable Neural Audio Synthesis With Non-Causal Convolutions
Figure 3 for Streamable Neural Audio Synthesis With Non-Causal Convolutions
Figure 4 for Streamable Neural Audio Synthesis With Non-Causal Convolutions
Viaarxiv icon

End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation

Add code
Bookmark button
Alert button
Nov 26, 2019
Yi Luo, Zhuo Chen, Nima Mesgarani, Takuya Yoshioka

Figure 1 for End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
Figure 2 for End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
Figure 3 for End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation
Viaarxiv icon

Real Time Speech Enhancement in the Waveform Domain

Add code
Bookmark button
Alert button
Jun 23, 2020
Alexandre Defossez, Gabriel Synnaeve, Yossi Adi

Figure 1 for Real Time Speech Enhancement in the Waveform Domain
Figure 2 for Real Time Speech Enhancement in the Waveform Domain
Figure 3 for Real Time Speech Enhancement in the Waveform Domain
Figure 4 for Real Time Speech Enhancement in the Waveform Domain
Viaarxiv icon

The Zero Resource Speech Challenge 2019: TTS without T

Apr 25, 2019
Ewan Dunbar, Robin Algayres, Julien Karadayi, Mathieu Bernard, Juan Benjumea, Xuan-Nga Cao, Lucie Miskic, Charlotte Dugrain, Lucas Ondel, Alan W. Black, Laurent Besacier, Sakriani Sakti, Emmanuel Dupoux

Figure 1 for The Zero Resource Speech Challenge 2019: TTS without T
Figure 2 for The Zero Resource Speech Challenge 2019: TTS without T
Figure 3 for The Zero Resource Speech Challenge 2019: TTS without T
Figure 4 for The Zero Resource Speech Challenge 2019: TTS without T
Viaarxiv icon

Combining Multiple Views for Visual Speech Recognition

Jun 28, 2018
Marina Zimmermann, Mostafa Mehdipour Ghazi, Hazım Kemal Ekenel, Jean-Philippe Thiran

Figure 1 for Combining Multiple Views for Visual Speech Recognition
Figure 2 for Combining Multiple Views for Visual Speech Recognition
Figure 3 for Combining Multiple Views for Visual Speech Recognition
Viaarxiv icon