Alert button

"speech": models, code, and papers
Alert button

Investigating self-supervised learning for speech enhancement and separation

Mar 15, 2022
Zili Huang, Shinji Watanabe, Shu-wen Yang, Paola Garcia, Sanjeev Khudanpur

Figure 1 for Investigating self-supervised learning for speech enhancement and separation
Figure 2 for Investigating self-supervised learning for speech enhancement and separation
Figure 3 for Investigating self-supervised learning for speech enhancement and separation
Figure 4 for Investigating self-supervised learning for speech enhancement and separation
Viaarxiv icon

Audio-visual multi-channel speech separation, dereverberation and recognition

Apr 08, 2022
Guinan Li, Jianwei Yu, Jiajun Deng, Xunying Liu, Helen Meng

Figure 1 for Audio-visual multi-channel speech separation, dereverberation and recognition
Figure 2 for Audio-visual multi-channel speech separation, dereverberation and recognition
Figure 3 for Audio-visual multi-channel speech separation, dereverberation and recognition
Figure 4 for Audio-visual multi-channel speech separation, dereverberation and recognition
Viaarxiv icon

A Composite T60 Regression and Classification Approach for Speech Dereverberation

Feb 09, 2023
Yuying Li, Yuchen Liu, Donald S. Williamson

Figure 1 for A Composite T60 Regression and Classification Approach for Speech Dereverberation
Figure 2 for A Composite T60 Regression and Classification Approach for Speech Dereverberation
Figure 3 for A Composite T60 Regression and Classification Approach for Speech Dereverberation
Figure 4 for A Composite T60 Regression and Classification Approach for Speech Dereverberation
Viaarxiv icon

TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments

Add code
Bookmark button
Alert button
Feb 14, 2023
Changye Li, Trevor Cohen, Martin Michalowski, Serguei Pakhomov

Figure 1 for TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments
Figure 2 for TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments
Figure 3 for TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments
Figure 4 for TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments
Viaarxiv icon

FedNST: Federated Noisy Student Training for Automatic Speech Recognition

Jun 06, 2022
Haaris Mehmood, Agnieszka Dobrowolska, Karthikeyan Saravanan, Mete Ozay

Figure 1 for FedNST: Federated Noisy Student Training for Automatic Speech Recognition
Figure 2 for FedNST: Federated Noisy Student Training for Automatic Speech Recognition
Figure 3 for FedNST: Federated Noisy Student Training for Automatic Speech Recognition
Figure 4 for FedNST: Federated Noisy Student Training for Automatic Speech Recognition
Viaarxiv icon

DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon

Add code
Bookmark button
Alert button
Jun 22, 2022
Robin Algayres, Tristan Ricoul, Julien Karadayi, Hugo Laurençon, Salah Zaiem, Abdelrahman Mohamed, Benoît Sagot, Emmanuel Dupoux

Figure 1 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Figure 2 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Figure 3 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Figure 4 for DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon
Viaarxiv icon

The Norwegian Parliamentary Speech Corpus

Add code
Bookmark button
Alert button
Jan 26, 2022
Per Erik Solberg, Pablo Ortiz

Figure 1 for The Norwegian Parliamentary Speech Corpus
Figure 2 for The Norwegian Parliamentary Speech Corpus
Figure 3 for The Norwegian Parliamentary Speech Corpus
Viaarxiv icon

Privacy-Preserving Speech Representation Learning using Vector Quantization

Mar 15, 2022
Pierre Champion, Denis Jouvet, Anthony Larcher

Figure 1 for Privacy-Preserving Speech Representation Learning using Vector Quantization
Figure 2 for Privacy-Preserving Speech Representation Learning using Vector Quantization
Figure 3 for Privacy-Preserving Speech Representation Learning using Vector Quantization
Figure 4 for Privacy-Preserving Speech Representation Learning using Vector Quantization
Viaarxiv icon

Variable-rate hierarchical CPC leads to acoustic unit discovery in speech

Add code
Bookmark button
Alert button
Jun 07, 2022
Santiago Cuervo, Adrian Łańcucki, Ricard Marxer, Paweł Rychlikowski, Jan Chorowski

Figure 1 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Figure 2 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Figure 3 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Figure 4 for Variable-rate hierarchical CPC leads to acoustic unit discovery in speech
Viaarxiv icon

Efficiency 360: Efficient Vision Transformers

Add code
Bookmark button
Alert button
Feb 23, 2023
Badri N. Patro, Vijay Srinivas Agneeswaran

Figure 1 for Efficiency 360: Efficient Vision Transformers
Figure 2 for Efficiency 360: Efficient Vision Transformers
Figure 3 for Efficiency 360: Efficient Vision Transformers
Figure 4 for Efficiency 360: Efficient Vision Transformers
Viaarxiv icon