"speech": models, code, and papers

Quantifying the Dialect Gap and its Correlates Across Languages

Oct 23, 2023
Anjali Kantharuban, Ivan Vulić, Anna Korhonen

Delayed Memory Unit: Modelling Temporal Dependency Through Delay Gate

Oct 23, 2023
Pengfei Sun, Jibin Wu, Malu Zhang, Paul Devos, Dick Botteldooren

InterroLang: Exploring NLP Models and Datasets through Dialogue-based Explanations

Oct 23, 2023
Nils Feldhus, Qianli Wang, Tatiana Anikina, Sahil Chopra, Cennet Oguz, Sebastian Möller

UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network

Oct 04, 2023
Siddhant Arora, Hayato Futami, Jee-weon Jung, Yifan Peng, Roshan Sharma, Yosuke Kashiwagi, Emiru Tsunoo, Shinji Watanabe

Improving severity preservation of healthy-to-pathological voice conversion with global style tokens

Oct 04, 2023
Bence Mark Halpern, Wen-Chin Huang, Lester Phillip Violeta, R. J. J. H. van Son, Tomoki Toda

Classification of Dysarthria based on the Levels of Severity. A Systematic Review

Oct 11, 2023
Afnan Al-Ali, Somaya Al-Maadeed, Moutaz Saleh, Rani Chinnappa Naidu, Zachariah C Alex, Prakash Ramachandran, Rajeev Khoodeeram, Rajesh Kumar M

GASS: Generalizing Audio Source Separation with Large-scale Data

Sep 29, 2023
Jordi Pons, Xiaoyu Liu, Santiago Pascual, Joan Serrà

End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis

Oct 16, 2023
Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent

Alzheimer's Disease Detection from Spontaneous Speech and Text: A review

Jul 19, 2023
Vrindha M. K., Geethu V., Anurenjan P. R., Deepak S., Sreeni K. G.

Audio-Visual Speech Enhancement Using Self-supervised Learning to Improve Speech Intelligibility in Cochlear Implant Simulations

Jul 15, 2023
Richard Lee Lai, Jen-Cheng Hou, Mandar Gogate, Kia Dashtipour, Amir Hussain, Yu Tsao
