"speech": models, code, and papers

Uncovering Political Hate Speech During Indian Election Campaign: A New Low-Resource Dataset and Baselines

Jun 27, 2023
Farhan Ahmad Jafri, Mohammad Aman Siddiqui, Surendrabikram Thapa, Kritesh Rauniyar, Usman Naseem, Imran Razzak

Via arXiv

Fake the Real: Backdoor Attack on Deep Speech Classification via Voice Conversion

Jun 28, 2023
Zhe Ye, Terui Mao, Li Dong, Diqun Yan

Via arXiv

DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech

Jun 25, 2023
Sen Liu, Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu

Via arXiv

Adapting Text-based Dialogue State Tracker for Spoken Dialogues

Aug 30, 2023
Jaeseok Yoon, Seunghyun Hwang, Ran Han, Jeonguk Bang, Kee-Eung Kim

Via arXiv

Employing Real Training Data for Deep Noise Suppression

Sep 05, 2023
Ziyi Xu, Marvin Sach, Jan Pirklbauer, Tim Fingscheidt

Via arXiv

The Art of Embedding Fusion: Optimizing Hate Speech Detection

Jun 26, 2023
Mohammad Aflah Khan, Neemesh Yadav, Mohit Jain, Sanyam Goyal

Via arXiv

Simultaneously Learning Speaker's Direction and Head Orientation from Binaural Recordings

Sep 26, 2023
Harshvardhan Takawale, Nirupam Roy

Via arXiv

A multi-modal approach for identifying schizophrenia using cross-modal attention

Sep 26, 2023
Gowtham Premananth, Yashish M. Siriwardena, Philip Resnik, Carol Espy-Wilson

Via arXiv

Big model only for hard audios: Sample dependent Whisper model selection for efficient inferences

Sep 22, 2023
Hugo Malard, Salah Zaiem, Robin Algayres

Via arXiv

Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation

May 23, 2023
Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

Via arXiv