Alert button

"speech": models, code, and papers
Alert button

Masked Autoencoders that Listen

Add code
Bookmark button
Alert button
Jul 13, 2022
Po-Yao, Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer

Figure 1 for Masked Autoencoders that Listen
Figure 2 for Masked Autoencoders that Listen
Figure 3 for Masked Autoencoders that Listen
Figure 4 for Masked Autoencoders that Listen
Viaarxiv icon

SLNSpeech: solving extended speech separation problem by the help of sign language

Jul 21, 2020
Jiasong Wu, Taotao Li, Youyong Kong, Guanyu Yang, Lotfi Senhadji, Huazhong Shu

Figure 1 for SLNSpeech: solving extended speech separation problem by the help of sign language
Figure 2 for SLNSpeech: solving extended speech separation problem by the help of sign language
Figure 3 for SLNSpeech: solving extended speech separation problem by the help of sign language
Figure 4 for SLNSpeech: solving extended speech separation problem by the help of sign language
Viaarxiv icon

Vietnamese Capitalization and Punctuation Recovery Models

Add code
Bookmark button
Alert button
Jul 04, 2022
Hoang Thi Thu Uyen, Nguyen Anh Tu, Ta Duc Huy

Figure 1 for Vietnamese Capitalization and Punctuation Recovery Models
Figure 2 for Vietnamese Capitalization and Punctuation Recovery Models
Figure 3 for Vietnamese Capitalization and Punctuation Recovery Models
Viaarxiv icon

Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models

May 15, 2019
Ahmed Hussen Abdelaziz, Barry-John Theobald, Justin Binder, Gabriele Fanelli, Paul Dixon, Nicholas Apostoloff, Thibaut Weise, Sachin Kajareker

Figure 1 for Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models
Figure 2 for Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models
Figure 3 for Speaker-Independent Speech-Driven Visual Speech Synthesis using Domain-Adapted Acoustic Models
Viaarxiv icon

Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection

Add code
Bookmark button
Alert button
May 23, 2022
Peilin Zhou, Dading Chong, Helin Wang, Qingcheng Zeng

Figure 1 for Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection
Figure 2 for Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection
Figure 3 for Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection
Figure 4 for Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection
Viaarxiv icon

GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis

Dec 03, 2020
Aolan Sun, Jianzong Wang, Ning Cheng, Huayi Peng, Zhen Zeng, Lingwei Kong, Jing Xiao

Figure 1 for GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis
Figure 2 for GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis
Figure 3 for GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis
Figure 4 for GraphPB: Graphical Representations of Prosody Boundary in Speech Synthesis
Viaarxiv icon

Robust Speaker Recognition with Transformers Using wav2vec 2.0

Add code
Bookmark button
Alert button
Mar 28, 2022
Sergey Novoselov, Galina Lavrentyeva, Anastasia Avdeeva, Vladimir Volokhov, Aleksei Gusev

Figure 1 for Robust Speaker Recognition with Transformers Using wav2vec 2.0
Figure 2 for Robust Speaker Recognition with Transformers Using wav2vec 2.0
Figure 3 for Robust Speaker Recognition with Transformers Using wav2vec 2.0
Figure 4 for Robust Speaker Recognition with Transformers Using wav2vec 2.0
Viaarxiv icon

TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices

Aug 16, 2020
Alexander Wong, Mahmoud Famouri, Maya Pavlova, Siddharth Surana

Figure 1 for TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
Figure 2 for TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
Figure 3 for TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
Viaarxiv icon

Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem

Add code
Bookmark button
Alert button
Dec 17, 2021
Jing Shi, Xuankai Chang, Tomoki Hayashi, Yen-Ju Lu, Shinji Watanabe, Bo Xu

Figure 1 for Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Figure 2 for Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Figure 3 for Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Figure 4 for Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem
Viaarxiv icon

Learning Robust and Multilingual Speech Representations

Add code
Bookmark button
Alert button
Jan 29, 2020
Kazuya Kawakami, Luyu Wang, Chris Dyer, Phil Blunsom, Aaron van den Oord

Figure 1 for Learning Robust and Multilingual Speech Representations
Figure 2 for Learning Robust and Multilingual Speech Representations
Figure 3 for Learning Robust and Multilingual Speech Representations
Figure 4 for Learning Robust and Multilingual Speech Representations
Viaarxiv icon