Alert button

"speech": models, code, and papers
Alert button

Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models

Dec 20, 2023
Atsunori Ogawa, Naohiro Tawara, Marc Delcroix, Shoko Araki

Viaarxiv icon

The Art of Deception: Robust Backdoor Attack using Dynamic Stacking of Triggers

Jan 03, 2024
Orson Mengara

Viaarxiv icon

Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue

Dec 23, 2023
Guan-Ting Lin, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yile Gu, Shalini Ghosh, Andreas Stolcke, Hung-yi Lee, Ivan Bulyko

Viaarxiv icon

Pre-trained Speech Processing Models Contain Human-Like Biases that Propagate to Speech Emotion Recognition

Add code
Bookmark button
Alert button
Oct 29, 2023
Isaac Slaughter, Craig Greenberg, Reva Schwartz, Aylin Caliskan

Viaarxiv icon

Overview Of The 2023 Icassp Sp Clarity Challenge: Speech Enhancement For Hearing Aids

Nov 24, 2023
Trevor J. Cox, Jon Barker, Will Bailey, Simone Graetzer, Michael A. Akeroyd, John F. Culling, Graham Naylor

Viaarxiv icon

Prompt-driven Target Speech Diarization

Add code
Bookmark button
Alert button
Oct 23, 2023
Yidi Jiang, Zhengyang Chen, Ruijie Tao, Liqun Deng, Yanmin Qian, Haizhou Li

Viaarxiv icon

Self-Supervised Adaptive AV Fusion Module for Pre-Trained ASR Models

Dec 21, 2023
Christopher Simic, Tobias Bocklet

Viaarxiv icon

COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning

Nov 03, 2023
Jing Pan, Jian Wu, Yashesh Gaur, Sunit Sivasankaran, Zhuo Chen, Shujie Liu, Jinyu Li

Viaarxiv icon

Normalization of Lithuanian Text Using Regular Expressions

Jan 01, 2024
Pijus Kasparaitis

Viaarxiv icon

TeLeS: Temporal Lexeme Similarity Score to Estimate Confidence in End-to-End ASR

Jan 06, 2024
Nagarathna Ravi, Thishyan Raj T, Vipul Arora

Viaarxiv icon