"speech": models, code, and papers

Automatic Translation of Hate Speech to Non-hate Speech in Social Media Texts

Jun 02, 2023
Yevhen Kostiuk, Atnafu Lambebo Tonja, Grigori Sidorov, Olga Kolesnikova

Multichannel Voice Trigger Detection Based on Transform-average-concatenate

Sep 27, 2023
Takuya Higuchi, Avamarie Brueggeman, Masood Delfarah, Stephen Shum

KIT's Multilingual Speech Translation System for IWSLT 2023

Jun 15, 2023
Danni Liu, Thai Binh Nguyen, Sai Koneru, Enes Yavuz Ugan, Ngoc-Quan Pham, Tuan-Nam Nguyen, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues

AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation

May 24, 2023
Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao

Multilingual context-based pronunciation learning for Text-to-Speech

Jul 31, 2023
Giulia Comini, Manuel Sam Ribeiro, Fan Yang, Heereen Shim, Jaime Lorenzo-Trueba

Empirical Interpretation of the Relationship Between Speech Acoustic Context and Emotion Recognition

Jun 30, 2023
Anna Ollerenshaw, Md Asif Jalal, Rosanna Milner, Thomas Hain

Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition

Aug 12, 2023
Han Zhu, Dongji Gao, Gaofeng Cheng, Daniel Povey, Pengyuan Zhang, Yonghong Yan

Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction

Jun 28, 2023
Aoqi Guo, Junnan Wu, Peng Gao, Wenbo Zhu, Qinwen Guo, Dazhi Gao, Yujun Wang

Replay to Remember: Continual Layer-Specific Fine-tuning for German Speech Recognition

Jul 14, 2023
Theresa Pekarek Rosin, Stefan Wermter

AudioSR: Versatile Audio Super-resolution at Scale

Sep 13, 2023
Haohe Liu, Ke Chen, Qiao Tian, Wenwu Wang, Mark D. Plumbley
