Alert button

"speech": models, code, and papers
Alert button

Speech-language Pre-training for End-to-end Spoken Language Understanding

Feb 11, 2021
Yao Qian, Ximo Bian, Yu Shi, Naoyuki Kanda, Leo Shen, Zhen Xiao, Michael Zeng

Figure 1 for Speech-language Pre-training for End-to-end Spoken Language Understanding
Figure 2 for Speech-language Pre-training for End-to-end Spoken Language Understanding
Figure 3 for Speech-language Pre-training for End-to-end Spoken Language Understanding
Figure 4 for Speech-language Pre-training for End-to-end Spoken Language Understanding
Viaarxiv icon

Diff-TTS: A Denoising Diffusion Model for Text-to-Speech

Add code
Bookmark button
Alert button
Apr 03, 2021
Myeonghun Jeong, Hyeongju Kim, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim

Figure 1 for Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Figure 2 for Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Figure 3 for Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Figure 4 for Diff-TTS: A Denoising Diffusion Model for Text-to-Speech
Viaarxiv icon

Bayesian Recurrent Units and the Forward-Backward Algorithm

Add code
Bookmark button
Alert button
Jul 21, 2022
Alexandre Bittar, Philip N. Garner

Figure 1 for Bayesian Recurrent Units and the Forward-Backward Algorithm
Figure 2 for Bayesian Recurrent Units and the Forward-Backward Algorithm
Figure 3 for Bayesian Recurrent Units and the Forward-Backward Algorithm
Viaarxiv icon

Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion

Add code
Bookmark button
Alert button
Nov 13, 2021
Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda

Figure 1 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Figure 2 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Figure 3 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Figure 4 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Viaarxiv icon

Direct multimodal few-shot learning of speech and images

Add code
Bookmark button
Alert button
Dec 10, 2020
Leanne Nortje, Herman Kamper

Figure 1 for Direct multimodal few-shot learning of speech and images
Figure 2 for Direct multimodal few-shot learning of speech and images
Figure 3 for Direct multimodal few-shot learning of speech and images
Figure 4 for Direct multimodal few-shot learning of speech and images
Viaarxiv icon

Language model fusion for streaming end to end speech recognition

Apr 09, 2021
Rodrigo Cabrera, Xiaofeng Liu, Mohammadreza Ghodsi, Zebulun Matteson, Eugene Weinstein, Anjuli Kannan

Figure 1 for Language model fusion for streaming end to end speech recognition
Figure 2 for Language model fusion for streaming end to end speech recognition
Figure 3 for Language model fusion for streaming end to end speech recognition
Figure 4 for Language model fusion for streaming end to end speech recognition
Viaarxiv icon

INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing

Add code
Bookmark button
Alert button
Apr 02, 2021
Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang

Figure 1 for INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Figure 2 for INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Figure 3 for INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing
Viaarxiv icon

Thai Wav2Vec2.0 with CommonVoice V8

Add code
Bookmark button
Alert button
Aug 09, 2022
Wannaphong Phatthiyaphaibun, Chompakorn Chaksangchaichot, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong

Figure 1 for Thai Wav2Vec2.0 with CommonVoice V8
Figure 2 for Thai Wav2Vec2.0 with CommonVoice V8
Viaarxiv icon

Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks

Jun 24, 2022
Ahmet M. Elbir, Wei Shi, Kumar Vijay Mishra, Anastasios K. Papazafeiropoulos, Symeon Chatzinotas

Figure 1 for Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks
Figure 2 for Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks
Figure 3 for Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks
Figure 4 for Implicit Channel Learning for Machine Learning Applications in 6G Wireless Networks
Viaarxiv icon

Learning to Count Words in Fluent Speech enables Online Speech Recognition

Add code
Bookmark button
Alert button
Jun 11, 2020
George Sterpu, Christian Saam, Naomi Harte

Figure 1 for Learning to Count Words in Fluent Speech enables Online Speech Recognition
Figure 2 for Learning to Count Words in Fluent Speech enables Online Speech Recognition
Figure 3 for Learning to Count Words in Fluent Speech enables Online Speech Recognition
Figure 4 for Learning to Count Words in Fluent Speech enables Online Speech Recognition
Viaarxiv icon