Alert button

"speech": models, code, and papers
Alert button

Rethinking and Improving Multi-task Learning for End-to-end Speech Translation

Nov 07, 2023
Yuhao Zhang, Chen Xu, Bei Li, Hao Chen, Tong Xiao, Chunliang Zhang, Jingbo Zhu

Viaarxiv icon

1-step Speech Processing and Understanding Using CTC Loss

Nov 08, 2023
Karan Singla, Shahab Jalavand, Yeon-Jun Kim, Antonio Moreno Daniel, Srinivas Bangalore, Andrej Ljolje, Ben Stern

Viaarxiv icon

Automatic Textual Normalization for Hate Speech Detection

Nov 15, 2023
Anh Thi-Hoang Nguyen, Dung Ha Nguyen, Nguyet Thi Nguyen, Khanh Thanh-Duy Ho, Kiet Van Nguyen

Viaarxiv icon

Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study

Nov 16, 2023
Maike Züfle, Verna Dankers, Ivan Titov

Viaarxiv icon

End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation

Nov 01, 2023
Juan Zuluaga-Gomez, Zhaocheng Huang, Xing Niu, Rohit Paturi, Sundararajan Srinivasan, Prashant Mathur, Brian Thompson, Marcello Federico

Viaarxiv icon

Towards Real-World Streaming Speech Translation for Code-Switched Speech

Oct 19, 2023
Belen Alastruey, Matthias Sperber, Christian Gollan, Dominic Telaar, Tim Ng, Aashish Agargwal

Viaarxiv icon

External Knowledge Augmented Polyphone Disambiguation Using Large Language Model

Dec 19, 2023
Chen Li

Viaarxiv icon

Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction

Oct 30, 2023
Zexu Pan, Gordon Wichern, Yoshiki Masuyama, Francois G. Germain, Sameer Khurana, Chiori Hori, Jonathan Le Roux

Viaarxiv icon

Seeing Through the Conversation: Audio-Visual Speech Separation based on Diffusion Model

Oct 30, 2023
Suyeon Lee, Chaeyoung Jung, Youngjoon Jang, Jaehun Kim, Joon Son Chung

Viaarxiv icon

LLMs and Finetuning: Benchmarking cross-domain performance for hate speech detection

Oct 29, 2023
Ahmad Nasir, Aadish Sharma, Kokil Jaidka

Viaarxiv icon