Alert button

"speech": models, code, and papers
Alert button

Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis

Add code
Bookmark button
Alert button
Jun 21, 2021
Jian Cong, Shan Yang, Lei Xie, Dan Su

Figure 1 for Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis
Figure 2 for Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis
Figure 3 for Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis
Figure 4 for Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis
Viaarxiv icon

MeWEHV: Mel and Wave Embeddings for Human Voice Tasks

Sep 28, 2022
Andrés Vasco-Carofilis, Laura Fernández-Robles, Enrique Alegre, Eduardo Fidalgo

Figure 1 for MeWEHV: Mel and Wave Embeddings for Human Voice Tasks
Figure 2 for MeWEHV: Mel and Wave Embeddings for Human Voice Tasks
Figure 3 for MeWEHV: Mel and Wave Embeddings for Human Voice Tasks
Figure 4 for MeWEHV: Mel and Wave Embeddings for Human Voice Tasks
Viaarxiv icon

DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement

Add code
Bookmark button
Alert button
Feb 16, 2022
Guochen Yu, Andong Li, Hui Wang, Yutian Wang, Yuxuan Ke, Chengshi Zheng

Figure 1 for DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Figure 2 for DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Figure 3 for DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Figure 4 for DBT-Net: Dual-branch federative magnitude and phase estimation with attention-in-attention transformer for monaural speech enhancement
Viaarxiv icon

Romanian Speech Recognition Experiments from the ROBIN Project

Add code
Bookmark button
Alert button
Nov 23, 2021
Andrei-Marius Avram, Vasile Păiş, Dan Tufiş

Figure 1 for Romanian Speech Recognition Experiments from the ROBIN Project
Figure 2 for Romanian Speech Recognition Experiments from the ROBIN Project
Figure 3 for Romanian Speech Recognition Experiments from the ROBIN Project
Viaarxiv icon

Analysis and Tuning of a Voice Assistant System for Dysfluent Speech

Jun 18, 2021
Vikramjit Mitra, Zifang Huang, Colin Lea, Lauren Tooley, Sarah Wu, Darren Botten, Ashwini Palekar, Shrinath Thelapurath, Panayiotis Georgiou, Sachin Kajarekar, Jefferey Bigham

Figure 1 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 2 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 3 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Figure 4 for Analysis and Tuning of a Voice Assistant System for Dysfluent Speech
Viaarxiv icon

Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR

Nov 12, 2021
Ondrej Klejch, Electra Wallington, Peter Bell

Figure 1 for Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR
Figure 2 for Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR
Figure 3 for Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR
Figure 4 for Deciphering Speech: a Zero-Resource Approach to Cross-Lingual Transfer in ASR
Viaarxiv icon

Accented Speech Recognition Inspired by Human Perception

Apr 09, 2021
Xiangyun Chu, Elizabeth Combs, Amber Wang, Michael Picheny

Figure 1 for Accented Speech Recognition Inspired by Human Perception
Figure 2 for Accented Speech Recognition Inspired by Human Perception
Figure 3 for Accented Speech Recognition Inspired by Human Perception
Figure 4 for Accented Speech Recognition Inspired by Human Perception
Viaarxiv icon

Distilling a Pretrained Language Model to a Multilingual ASR Model

Add code
Bookmark button
Alert button
Jun 25, 2022
Kwanghee Choi, Hyung-Min Park

Figure 1 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 2 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 3 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Figure 4 for Distilling a Pretrained Language Model to a Multilingual ASR Model
Viaarxiv icon

An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition

Apr 19, 2022
Niko Moritz, Frank Seide, Duc Le, Jay Mahadeokar, Christian Fuegen

Figure 1 for An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition
Figure 2 for An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition
Figure 3 for An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition
Figure 4 for An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition
Viaarxiv icon

From Spelling to Grammar: A New Framework for Chinese Grammatical Error Correction

Add code
Bookmark button
Alert button
Nov 03, 2022
Xiuyu Wu, Yunfang Wu

Figure 1 for From Spelling to Grammar: A New Framework for Chinese Grammatical Error Correction
Figure 2 for From Spelling to Grammar: A New Framework for Chinese Grammatical Error Correction
Figure 3 for From Spelling to Grammar: A New Framework for Chinese Grammatical Error Correction
Figure 4 for From Spelling to Grammar: A New Framework for Chinese Grammatical Error Correction
Viaarxiv icon