Alert button

"speech": models, code, and papers
Alert button

Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition

Jul 13, 2022
Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro

Figure 1 for Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition
Figure 2 for Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition
Figure 3 for Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition
Figure 4 for Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition
Viaarxiv icon

Multilingual Simultaneous Speech Translation

Mar 29, 2022
Shashank Subramanya, Jan Niehues

Figure 1 for Multilingual Simultaneous Speech Translation
Figure 2 for Multilingual Simultaneous Speech Translation
Figure 3 for Multilingual Simultaneous Speech Translation
Figure 4 for Multilingual Simultaneous Speech Translation
Viaarxiv icon

Freeform Body Motion Generation from Speech

Add code
Bookmark button
Alert button
Mar 04, 2022
Jing Xu, Wei Zhang, Yalong Bai, Qibin Sun, Tao Mei

Figure 1 for Freeform Body Motion Generation from Speech
Figure 2 for Freeform Body Motion Generation from Speech
Figure 3 for Freeform Body Motion Generation from Speech
Figure 4 for Freeform Body Motion Generation from Speech
Viaarxiv icon

Hate Speech and Offensive Language Detection in Bengali

Add code
Bookmark button
Alert button
Oct 07, 2022
Mithun Das, Somnath Banerjee, Punyajoy Saha, Animesh Mukherjee

Figure 1 for Hate Speech and Offensive Language Detection in Bengali
Figure 2 for Hate Speech and Offensive Language Detection in Bengali
Figure 3 for Hate Speech and Offensive Language Detection in Bengali
Figure 4 for Hate Speech and Offensive Language Detection in Bengali
Viaarxiv icon

Incremental Speech Synthesis For Speech-To-Speech Translation

Add code
Bookmark button
Alert button
Oct 15, 2021
Danni Liu, Changhan Wang, Hongyu Gong, Xutai Ma, Yun Tang, Juan Pino

Figure 1 for Incremental Speech Synthesis For Speech-To-Speech Translation
Figure 2 for Incremental Speech Synthesis For Speech-To-Speech Translation
Figure 3 for Incremental Speech Synthesis For Speech-To-Speech Translation
Figure 4 for Incremental Speech Synthesis For Speech-To-Speech Translation
Viaarxiv icon

ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement

Jun 27, 2022
Ishan Chatterjee, Maruchi Kim, Vivek Jayaram, Shyamnath Gollakota, Ira Kemelmacher-Shlizerman, Shwetak Patel, Steven M. Seitz

Figure 1 for ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement
Figure 2 for ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement
Figure 3 for ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement
Figure 4 for ClearBuds: Wireless Binaural Earbuds for Learning-Based Speech Enhancement
Viaarxiv icon

Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation

Add code
Bookmark button
Alert button
Mar 01, 2023
Jean-Marie Lemercier, Julian Tobergte, Timo Gerkmann

Figure 1 for Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation
Figure 2 for Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation
Figure 3 for Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation
Figure 4 for Extending DNN-based Multiplicative Masking to Deep Subband Filtering for Improved Dereverberation
Viaarxiv icon

Controlling High-Dimensional Data With Sparse Input

Add code
Bookmark button
Alert button
Mar 14, 2023
Dan Andrei Iliescu, Devang Savita Ram Mohan, Tian Huey Teh, Zack Hodari

Figure 1 for Controlling High-Dimensional Data With Sparse Input
Figure 2 for Controlling High-Dimensional Data With Sparse Input
Figure 3 for Controlling High-Dimensional Data With Sparse Input
Figure 4 for Controlling High-Dimensional Data With Sparse Input
Viaarxiv icon

A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement

Add code
Bookmark button
Alert button
Jun 22, 2022
Or Tal, Moshe Mandel, Felix Kreuk, Yossi Adi

Figure 1 for A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement
Figure 2 for A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement
Figure 3 for A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement
Figure 4 for A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement
Viaarxiv icon

Chain-based Discriminative Autoencoders for Speech Recognition

Add code
Bookmark button
Alert button
Mar 28, 2022
Hung-Shin Lee, Pin-Tuan Huang, Yao-Fei Cheng, Hsin-Min Wang

Figure 1 for Chain-based Discriminative Autoencoders for Speech Recognition
Figure 2 for Chain-based Discriminative Autoencoders for Speech Recognition
Figure 3 for Chain-based Discriminative Autoencoders for Speech Recognition
Viaarxiv icon