Alert button

"speech": models, code, and papers
Alert button

Text-free non-parallel many-to-many voice conversion using normalising flows

Mar 15, 2022
Thomas Merritt, Abdelhamid Ezzerg, Piotr Biliński, Magdalena Proszewska, Kamil Pokora, Roberto Barra-Chicote, Daniel Korzekwa

Figure 1 for Text-free non-parallel many-to-many voice conversion using normalising flows
Figure 2 for Text-free non-parallel many-to-many voice conversion using normalising flows
Figure 3 for Text-free non-parallel many-to-many voice conversion using normalising flows
Figure 4 for Text-free non-parallel many-to-many voice conversion using normalising flows
Viaarxiv icon

What shall we do with an hour of data? Speech recognition for the un- and under-served languages of Common Voice

Add code
Bookmark button
Alert button
May 10, 2021
Francis M. Tyers, Josh Meyer

Figure 1 for What shall we do with an hour of data? Speech recognition for the un- and under-served languages of Common Voice
Figure 2 for What shall we do with an hour of data? Speech recognition for the un- and under-served languages of Common Voice
Figure 3 for What shall we do with an hour of data? Speech recognition for the un- and under-served languages of Common Voice
Figure 4 for What shall we do with an hour of data? Speech recognition for the un- and under-served languages of Common Voice
Viaarxiv icon

Learning to Understand Child-directed and Adult-directed Speech

Add code
Bookmark button
Alert button
May 06, 2020
Lieke Gelderloos, Grzegorz Chrupała, Afra Alishahi

Figure 1 for Learning to Understand Child-directed and Adult-directed Speech
Figure 2 for Learning to Understand Child-directed and Adult-directed Speech
Figure 3 for Learning to Understand Child-directed and Adult-directed Speech
Figure 4 for Learning to Understand Child-directed and Adult-directed Speech
Viaarxiv icon

Detecting English Speech in the Air Traffic Control Voice Communication

Add code
Bookmark button
Alert button
Apr 06, 2021
Igor Szoke, Santosh Kesiraju, Ondrej Novotny, Martin Kocour, Karel Vesely, Jan "Honza" Cernocky

Figure 1 for Detecting English Speech in the Air Traffic Control Voice Communication
Figure 2 for Detecting English Speech in the Air Traffic Control Voice Communication
Figure 3 for Detecting English Speech in the Air Traffic Control Voice Communication
Figure 4 for Detecting English Speech in the Air Traffic Control Voice Communication
Viaarxiv icon

Heterogeneous Target Speech Separation

Add code
Bookmark button
Alert button
Apr 07, 2022
Efthymios Tzinis, Gordon Wichern, Aswin Subramanian, Paris Smaragdis, Jonathan Le Roux

Figure 1 for Heterogeneous Target Speech Separation
Figure 2 for Heterogeneous Target Speech Separation
Figure 3 for Heterogeneous Target Speech Separation
Figure 4 for Heterogeneous Target Speech Separation
Viaarxiv icon

Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning

Add code
Bookmark button
Alert button
Feb 10, 2021
Giuseppe Ruggiero, Enrico Zovato, Luigi Di Caro, Vincent Pollet

Figure 1 for Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning
Figure 2 for Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning
Figure 3 for Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning
Figure 4 for Voice Cloning: a Multi-Speaker Text-to-Speech Synthesis Approach based on Transfer Learning
Viaarxiv icon

WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models

Add code
Bookmark button
Alert button
Apr 14, 2022
Heting Gao, Junrui Ni, Kaizhi Qian, Yang Zhang, Shiyu Chang, Mark Hasegawa-Johnson

Figure 1 for WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models
Figure 2 for WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models
Figure 3 for WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models
Figure 4 for WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models
Viaarxiv icon

Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement

Add code
Bookmark button
Alert button
Nov 05, 2021
Guochen Yu, Andong Li, Yutian Wang, Yinuo Guo, Hui Wang, Chengshi Zheng

Figure 1 for Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement
Figure 2 for Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement
Figure 3 for Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement
Figure 4 for Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement
Viaarxiv icon

Speaker-Independent Microphone Identification in Noisy Conditions

Jun 23, 2022
Antonio Giganti, Luca Cuccovillo, Paolo Bestagini, Patrick Aichroth, Stefano Tubaro

Figure 1 for Speaker-Independent Microphone Identification in Noisy Conditions
Figure 2 for Speaker-Independent Microphone Identification in Noisy Conditions
Figure 3 for Speaker-Independent Microphone Identification in Noisy Conditions
Figure 4 for Speaker-Independent Microphone Identification in Noisy Conditions
Viaarxiv icon

Foster Strengths and Circumvent Weaknesses: a Speech Enhancement Framework with Two-branch Collaborative Learning

Oct 12, 2021
Wenxin Tai, Jiajia Li, Yixiang Wang, Tian Lan, Qiao Liu

Figure 1 for Foster Strengths and Circumvent Weaknesses: a Speech Enhancement Framework with Two-branch Collaborative Learning
Figure 2 for Foster Strengths and Circumvent Weaknesses: a Speech Enhancement Framework with Two-branch Collaborative Learning
Figure 3 for Foster Strengths and Circumvent Weaknesses: a Speech Enhancement Framework with Two-branch Collaborative Learning
Figure 4 for Foster Strengths and Circumvent Weaknesses: a Speech Enhancement Framework with Two-branch Collaborative Learning
Viaarxiv icon