"speech": models, code, and papers

Lisan: Yemeni, Iraqi, Libyan, and Sudanese Arabic Dialect Corpora with Morphological Annotations

Dec 13, 2022
Mustafa Jarrar, Fadi A Zaraket, Tymaa Hammouda, Daanish Masood Alavi, Martin Waahlisch

FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection

Oct 18, 2021
Zhenyu Zhang, Yewei Gu, Xiaowei Yi, Xianfeng Zhao

MAST: Multiscale Audio Spectrogram Transformers

Nov 02, 2022
Sreyan Ghosh, Ashish Seth, S. Umesh, Dinesh Manocha

Controllable Multichannel Speech Dereverberation based on Deep Neural Networks

Oct 16, 2021
Ziteng Wang, Yueyue Na, Biao Tian, Qiang Fu

Annealing Double-Head: An Architecture for Online Calibration of Deep Neural Networks

Dec 27, 2022
Erdong Guo, David Draper, Maria De Iorio

Towards a Perceptual Model for Estimating the Quality of Visual Speech

Mar 24, 2022
Zakaria Aldeneh, Masha Fedzechkina, Skyler Seto, Katherine Metcalf, Miguel Sarabia, Nicholas Apostoloff, Barry-John Theobald

Interpretable Acoustic Representation Learning on Breathing and Speech Signals for COVID-19 Detection

Jun 27, 2022
Debottam Dutta, Debarpan Bhattacharya, Sriram Ganapathy, Amir H. Poorjam, Deepak Mittal, Maneesh Singh

Counter Hate Speech in Social Media: A Survey

Feb 21, 2022
Dana Alsagheer, Hadi Mansourifar, Weidong Shi

Multi-Dialect Arabic Speech Recognition

Dec 25, 2021
Abbas Raza Ali

Fast-Slow Transformer for Visually Grounding Speech

Sep 16, 2021
Puyuan Peng, David Harwath
