Alert button

"speech": models, code, and papers
Alert button

Listening Between the Lines: Synthetic Speech Detection Disregarding Verbal Content

Feb 08, 2024
Davide Salvi, Temesgen Semu Balcha, Paolo Bestagini, Stefano Tubaro

Viaarxiv icon

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Feb 15, 2024
Mateusz Łajszczak, Guillermo Cámbara, Yang Li, Fatih Beyhan, Arent van Korlaar, Fan Yang, Arnaud Joly, Álvaro Martín-Cortinas, Ammar Abbas, Adam Michalski, Alexis Moinet, Sri Karlapati, Ewa Muszyńska, Haohan Guo, Bartosz Putrycz, Soledad López Gambino, Kayeon Yoo, Elena Sokolova, Thomas Drugman

Viaarxiv icon

Persian Speech Emotion Recognition by Fine-Tuning Transformers

Feb 11, 2024
Minoo Shayaninasab, Bagher Babaali

Viaarxiv icon

Subjective $\textit{Isms}$? On the Danger of Conflating Hate and Offence in Abusive Language Detection

Mar 04, 2024
Amanda Cercas Curry, Gavin Abercrombie, Zeerak Talat

Viaarxiv icon

Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications

Mar 11, 2024
Can Cui, Imran Ahamad Sheikh, Mostafa Sadeghi, Emmanuel Vincent

Figure 1 for Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications
Figure 2 for Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications
Figure 3 for Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications
Figure 4 for Improving Speaker Assignment in Speaker-Attributed ASR for Real Meeting Applications
Viaarxiv icon

When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection

Feb 17, 2024
Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps

Viaarxiv icon

TweetInfo: An Interactive System to Mitigate Online Harm

Add code
Bookmark button
Alert button
Mar 03, 2024
Gautam Kishore Shahi

Figure 1 for TweetInfo: An Interactive System to Mitigate Online Harm
Figure 2 for TweetInfo: An Interactive System to Mitigate Online Harm
Viaarxiv icon

Syllable based DNN-HMM Cantonese Speech to Text System

Feb 13, 2024
Timothy Wong, Claire Li, Sam Lam, Billy Chiu, Qin Lu, Minglei Li, Dan Xiong, Roy Shing Yu, Vincent T. Y. Ng

Viaarxiv icon

Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment

Mar 10, 2024
Yusuke Yasuda, Tomoki Toda

Figure 1 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 2 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 3 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 4 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Viaarxiv icon

Exploratory Data Analysis on Code-mixed Misogynistic Comments

Mar 09, 2024
Sargam Yadav, Abhishek Kaushik, Kevin McDaid

Figure 1 for Exploratory Data Analysis on Code-mixed Misogynistic Comments
Figure 2 for Exploratory Data Analysis on Code-mixed Misogynistic Comments
Figure 3 for Exploratory Data Analysis on Code-mixed Misogynistic Comments
Figure 4 for Exploratory Data Analysis on Code-mixed Misogynistic Comments
Viaarxiv icon