"speech": models, code, and papers

AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition

Jan 18, 2024
Ju Lin, Niko Moritz, Yiteng Huang, Ruiming Xie, Ming Sun, Christian Fuegen, Frank Seide

Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality

Feb 14, 2024
Christian Marinoni, Riccardo Fosco Gramaccioni, Changan Chen, Aurelio Uncini, Danilo Comminiello

Detecting Post-Stroke Aphasia Via Brain Responses to Speech in a Deep Learning Framework

Jan 17, 2024
Pieter De Clercq, Corentin Puffay, Jill Kries, Hugo Van Hamme, Maaike Vandermosten, Tom Francart, Jonas Vanthornhout

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension

Feb 12, 2024
Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou

A Two-Stage Framework in Cross-Spectrum Domain for Real-Time Speech Enhancement

Jan 19, 2024
Yuewei Zhang, Huanbin Zou, Jie Zhu

Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective

Jan 16, 2024
Alexander H. Liu, Sung-Lin Yeh, James Glass

Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion Recognition

Jan 23, 2024
Yan Zhao, Jincen Wang, Cheng Lu, Sunan Li, Björn Schuller, Yuan Zong, Wenming Zheng

StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion

Feb 07, 2024
Zhichao Wang, Yuanzhe Chen, Xinsheng Wang, Zhuo Chen, Lei Xie, Yuping Wang, Yuxuan Wang

Punctuation Restoration Improves Structure Understanding without Supervision

Feb 13, 2024
Junghyun Min, Minho Lee, Woochul Lee, Yeonsoo Lee

Recent Advances in Hate Speech Moderation: Multimodality and the Role of Large Models

Jan 30, 2024
Ming Shan Hee, Shivam Sharma, Rui Cao, Palash Nandi, Preslav Nakov, Tanmoy Chakraborty, Roy Ka-Wei Lee

Viaarxiv icon