Alert button

"speech": models, code, and papers
Alert button

Speaker Normalization for Self-supervised Speech Emotion Recognition

Feb 02, 2022
Itai Gat, Hagai Aronowitz, Weizhong Zhu, Edmilson Morais, Ron Hoory

Figure 1 for Speaker Normalization for Self-supervised Speech Emotion Recognition
Figure 2 for Speaker Normalization for Self-supervised Speech Emotion Recognition
Figure 3 for Speaker Normalization for Self-supervised Speech Emotion Recognition
Viaarxiv icon

Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection

Add code
Bookmark button
Alert button
May 06, 2022
Esma Balkir, Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko

Figure 1 for Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Figure 2 for Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Figure 3 for Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Figure 4 for Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection
Viaarxiv icon

TFCN: Temporal-Frequential Convolutional Network for Single-Channel Speech Enhancement

Jan 03, 2022
Xupeng Jia, Dongmei Li

Figure 1 for TFCN: Temporal-Frequential Convolutional Network for Single-Channel Speech Enhancement
Figure 2 for TFCN: Temporal-Frequential Convolutional Network for Single-Channel Speech Enhancement
Figure 3 for TFCN: Temporal-Frequential Convolutional Network for Single-Channel Speech Enhancement
Figure 4 for TFCN: Temporal-Frequential Convolutional Network for Single-Channel Speech Enhancement
Viaarxiv icon

Towards Intelligibility-Oriented Audio-Visual Speech Enhancement

Nov 18, 2021
Tassadaq Hussain, Mandar Gogate, Kia Dashtipour, Amir Hussain

Figure 1 for Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
Figure 2 for Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
Figure 3 for Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
Figure 4 for Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
Viaarxiv icon

Predicting Affective Vocal Bursts with Finetuned wav2vec 2.0

Add code
Bookmark button
Alert button
Sep 27, 2022
Bagus Tris Atmaja, Akira Sasou

Figure 1 for Predicting Affective Vocal Bursts with Finetuned wav2vec 2.0
Figure 2 for Predicting Affective Vocal Bursts with Finetuned wav2vec 2.0
Figure 3 for Predicting Affective Vocal Bursts with Finetuned wav2vec 2.0
Figure 4 for Predicting Affective Vocal Bursts with Finetuned wav2vec 2.0
Viaarxiv icon

Large-Scale Hate Speech Detection with Cross-Domain Transfer

Add code
Bookmark button
Alert button
Mar 02, 2022
Cagri Toraman, Furkan Şahinuç, Eyup Halit Yılmaz

Figure 1 for Large-Scale Hate Speech Detection with Cross-Domain Transfer
Figure 2 for Large-Scale Hate Speech Detection with Cross-Domain Transfer
Figure 3 for Large-Scale Hate Speech Detection with Cross-Domain Transfer
Figure 4 for Large-Scale Hate Speech Detection with Cross-Domain Transfer
Viaarxiv icon

Code Switched and Code Mixed Speech Recognition for Indic languages

Add code
Bookmark button
Alert button
Mar 30, 2022
Harveen Singh Chadha, Priyanshi Shah, Ankur Dhuriya, Neeraj Chhimwal, Anirudh Gupta, Vivek Raghavan

Figure 1 for Code Switched and Code Mixed Speech Recognition for Indic languages
Figure 2 for Code Switched and Code Mixed Speech Recognition for Indic languages
Figure 3 for Code Switched and Code Mixed Speech Recognition for Indic languages
Figure 4 for Code Switched and Code Mixed Speech Recognition for Indic languages
Viaarxiv icon

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

Add code
Bookmark button
Alert button
Feb 09, 2022
Ioannis Tsiamas, Gerard I. Gállego, José A. R. Fonollosa, Marta R. Costa-jussà

Figure 1 for SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Figure 2 for SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Figure 3 for SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Figure 4 for SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Viaarxiv icon

Probing Statistical Representations For End-To-End ASR

Nov 03, 2022
Anna Ollerenshaw, Md Asif Jalal, Thomas Hain

Figure 1 for Probing Statistical Representations For End-To-End ASR
Figure 2 for Probing Statistical Representations For End-To-End ASR
Figure 3 for Probing Statistical Representations For End-To-End ASR
Figure 4 for Probing Statistical Representations For End-To-End ASR
Viaarxiv icon

Conversational Speech Separation: an Evaluation Study for Streaming Applications

May 31, 2022
Giovanni Morrone, Samuele Cornell, Enrico Zovato, Alessio Brutti, Stefano Squartini

Figure 1 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 2 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 3 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Figure 4 for Conversational Speech Separation: an Evaluation Study for Streaming Applications
Viaarxiv icon