Alert button

"speech": models, code, and papers
Alert button

CycleGAN-Based Unpaired Speech Dereverberation

Add code
Bookmark button
Alert button
Mar 29, 2022
Hannah Muckenhirn, Aleksandr Safin, Hakan Erdogan, Felix de Chaumont Quitry, Marco Tagliasacchi, Scott Wisdom, John R. Hershey

Figure 1 for CycleGAN-Based Unpaired Speech Dereverberation
Figure 2 for CycleGAN-Based Unpaired Speech Dereverberation
Figure 3 for CycleGAN-Based Unpaired Speech Dereverberation
Viaarxiv icon

Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques

Add code
Bookmark button
Alert button
Jan 26, 2022
Tu Anh Dinh, Danni Liu, Jan Niehues

Figure 1 for Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques
Figure 2 for Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques
Figure 3 for Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques
Figure 4 for Tackling data scarcity in speech translation using zero-shot multilingual machine translation techniques
Viaarxiv icon

Speech Decomposition Based on a Hybrid Speech Model and Optimal Segmentation

May 04, 2021
Alfredo Esquivel Jaramillo, Jesper Kjær Nielsen, Mads Græsbøll Christensen

Figure 1 for Speech Decomposition Based on a Hybrid Speech Model and Optimal Segmentation
Figure 2 for Speech Decomposition Based on a Hybrid Speech Model and Optimal Segmentation
Figure 3 for Speech Decomposition Based on a Hybrid Speech Model and Optimal Segmentation
Viaarxiv icon

Improved Meta Learning for Low Resource Speech Recognition

May 11, 2022
Satwinder Singh, Ruili Wang, Feng Hou

Figure 1 for Improved Meta Learning for Low Resource Speech Recognition
Figure 2 for Improved Meta Learning for Low Resource Speech Recognition
Figure 3 for Improved Meta Learning for Low Resource Speech Recognition
Figure 4 for Improved Meta Learning for Low Resource Speech Recognition
Viaarxiv icon

Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing

Add code
Bookmark button
Alert button
Dec 23, 2022
William Brannon, Yogesh Virkar, Brian Thompson

Figure 1 for Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing
Figure 2 for Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing
Figure 3 for Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing
Figure 4 for Dubbing in Practice: A Large Scale Study of Human Localization With Insights for Automatic Dubbing
Viaarxiv icon

Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis

Add code
Bookmark button
Alert button
Apr 03, 2022
Yixuan Zhou, Changhe Song, Xiang Li, Luwen Zhang, Zhiyong Wu, Yanyao Bian, Dan Su, Helen Meng

Figure 1 for Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
Figure 2 for Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
Figure 3 for Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
Figure 4 for Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis
Viaarxiv icon

Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization

Add code
Bookmark button
Alert button
Oct 07, 2022
Shota Horiguchi, Yuki Takashima, Shinji Watanabe, Paola Garcia

Figure 1 for Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization
Figure 2 for Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization
Figure 3 for Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization
Figure 4 for Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization
Viaarxiv icon

Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention

May 03, 2022
Xinmeng Xu, Rongzhi Gu, Yuexian Zou

Figure 1 for Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention
Figure 2 for Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention
Figure 3 for Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention
Figure 4 for Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention
Viaarxiv icon

Are disentangled representations all you need to build speaker anonymization systems?

Add code
Bookmark button
Alert button
Aug 24, 2022
Pierre Champion, Denis Jouvet, Anthony Larcher

Figure 1 for Are disentangled representations all you need to build speaker anonymization systems?
Figure 2 for Are disentangled representations all you need to build speaker anonymization systems?
Viaarxiv icon

Selecting and combining complementary feature representations and classifiers for hate speech detection

Add code
Bookmark button
Alert button
Jan 18, 2022
Rafael M. O. Cruz, Woshington V. de Sousa, George D. C. Cavalcanti

Figure 1 for Selecting and combining complementary feature representations and classifiers for hate speech detection
Figure 2 for Selecting and combining complementary feature representations and classifiers for hate speech detection
Figure 3 for Selecting and combining complementary feature representations and classifiers for hate speech detection
Figure 4 for Selecting and combining complementary feature representations and classifiers for hate speech detection
Viaarxiv icon