Alert button

"speech": models, code, and papers
Alert button

When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection

Feb 17, 2024
Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps

Viaarxiv icon

TweetInfo: An Interactive System to Mitigate Online Harm

Add code
Bookmark button
Alert button
Mar 03, 2024
Gautam Kishore Shahi

Figure 1 for TweetInfo: An Interactive System to Mitigate Online Harm
Figure 2 for TweetInfo: An Interactive System to Mitigate Online Harm
Viaarxiv icon

Syllable based DNN-HMM Cantonese Speech to Text System

Feb 13, 2024
Timothy Wong, Claire Li, Sam Lam, Billy Chiu, Qin Lu, Minglei Li, Dan Xiong, Roy Shing Yu, Vincent T. Y. Ng

Viaarxiv icon

Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment

Mar 10, 2024
Yusuke Yasuda, Tomoki Toda

Figure 1 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 2 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 3 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 4 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Viaarxiv icon

Exploratory Data Analysis on Code-mixed Misogynistic Comments

Mar 09, 2024
Sargam Yadav, Abhishek Kaushik, Kevin McDaid

Figure 1 for Exploratory Data Analysis on Code-mixed Misogynistic Comments
Figure 2 for Exploratory Data Analysis on Code-mixed Misogynistic Comments
Figure 3 for Exploratory Data Analysis on Code-mixed Misogynistic Comments
Figure 4 for Exploratory Data Analysis on Code-mixed Misogynistic Comments
Viaarxiv icon

A Rational Analysis of the Speech-to-Song Illusion

Feb 10, 2024
Raja Marjieh, Pol van Rijn, Ilia Sucholutsky, Harin Lee, Thomas L. Griffiths, Nori Jacoby

Viaarxiv icon

BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data

Feb 12, 2024
Mateusz Łajszczak, Guillermo Cámbara, Yang Li, Fatih Beyhan, Arent van Korlaar, Fan Yang, Arnaud Joly, Álvaro Martín-Cortinas, Ammar Abbas, Adam Michalski, Alexis Moinet, Sri Karlapati, Ewa Muszyńska, Haohan Guo, Bartosz Putrycz, Soledad López Gambino, Kayeon Yoo, Elena Sokolova, Thomas Drugman

Viaarxiv icon

Efficient Post-Training Augmentation for Adaptive Inference in Heterogeneous and Distributed IoT Environments

Mar 12, 2024
Max Sponner, Lorenzo Servadei, Bernd Waschneck, Robert Wille, Akash Kumar

Figure 1 for Efficient Post-Training Augmentation for Adaptive Inference in Heterogeneous and Distributed IoT Environments
Figure 2 for Efficient Post-Training Augmentation for Adaptive Inference in Heterogeneous and Distributed IoT Environments
Figure 3 for Efficient Post-Training Augmentation for Adaptive Inference in Heterogeneous and Distributed IoT Environments
Figure 4 for Efficient Post-Training Augmentation for Adaptive Inference in Heterogeneous and Distributed IoT Environments
Viaarxiv icon

Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

Feb 12, 2024
Naoyuki Kanda, Xiaofei Wang, Sefik Emre Eskimez, Manthan Thakker, Hemin Yang, Zirun Zhu, Min Tang, Canrun Li, Steven Tsai, Zhen Xiao, Yufei Xia, Jinzhu Li, Yanqing Liu, Sheng Zhao, Michael Zeng

Viaarxiv icon

VLSP 2023 -- LTER: A Summary of the Challenge on Legal Textual Entailment Recognition

Mar 06, 2024
Vu Tran, Ha-Thanh Nguyen, Trung Vo, Son T. Luu, Hoang-Anh Dang, Ngoc-Cam Le, Thi-Thuy Le, Minh-Tien Nguyen, Truong-Son Nguyen, Le-Minh Nguyen

Figure 1 for VLSP 2023 -- LTER: A Summary of the Challenge on Legal Textual Entailment Recognition
Figure 2 for VLSP 2023 -- LTER: A Summary of the Challenge on Legal Textual Entailment Recognition
Figure 3 for VLSP 2023 -- LTER: A Summary of the Challenge on Legal Textual Entailment Recognition
Figure 4 for VLSP 2023 -- LTER: A Summary of the Challenge on Legal Textual Entailment Recognition
Viaarxiv icon