"speech": models, code, and papers

Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study

Nov 16, 2023
Maike Züfle, Verna Dankers, Ivan Titov

Path Signature Representation of Patient-Clinician Interactions as a Predictor for Neuropsychological Tests Outcomes in Children: A Proof of Concept

Dec 12, 2023
Giulio Falcioni, Alexandra Georgescu, Emilia Molimpakis, Lev Gottlieb, Taylor Kuhn, Stefano Goria

Accented Speech Recognition With Accent-specific Codebooks

Oct 25, 2023
Darshan Prabhu, Preethi Jyothi, Sriram Ganapathy, Vinit Unni

Muted: Multilingual Targeted Offensive Speech Identification and Visualization

Dec 18, 2023
Christoph Tillmann, Aashka Trivedi, Sara Rosenthal, Santosh Borse, Rong Zhang, Avirup Sil, Bishwaranjan Bhattacharjee

VOT: Revolutionizing Speaker Verification with Memory and Attention Mechanisms

Dec 28, 2023
Hongyu Wang, Hui Li, Bo Li

Make BERT-based Chinese Spelling Check Model Enhanced by Layerwise Attention and Gaussian Mixture Model

Dec 27, 2023
Yongchang Cao, Liang He, Zhen Wu, Xinyu Dai

Extending Whisper with prompt tuning to target-speaker ASR

Dec 13, 2023
Hao Ma, Zhiyuan Peng, Mingjie Shao, Jing Li, Ju Liu

Acoustic BPE for Speech Generation with Discrete Tokens

Oct 23, 2023
Feiyu Shen, Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu

Enhancing Consistency in Multimodal Dialogue System Using LLM with Dialogue Scenario

Dec 20, 2023
Hiroki Onozeki, Zhiyang Qi, Kazuma Akiyama, Ryutaro Asahara, Takumasa Kaneko, Michimasa Inaba

Audio-visual fine-tuning of audio-only ASR models

Dec 14, 2023
Avner May, Dmitriy Serdyuk, Ankit Parag Shah, Otavio Braga, Olivier Siohan
