Alert button

"speech": models, code, and papers
Alert button

MetaHate: A Dataset for Unifying Efforts on Hate Speech Detection

Add code
Bookmark button
Alert button
Jan 12, 2024
Paloma Piot, Patricia Martín-Rodilla, Javier Parapar

Viaarxiv icon

Retrieval Augmented End-to-End Spoken Dialog Models

Feb 02, 2024
Mingqiu Wang, Izhak Shafran, Hagen Soltau, Wei Han, Yuan Cao, Dian Yu, Laurent El Shafey

Viaarxiv icon

Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval

Jan 16, 2024
Yimin Deng, Huaizhen Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang

Viaarxiv icon

StreamVoice: Streamable Context-Aware Language Modeling for Real-time Zero-Shot Voice Conversion

Feb 07, 2024
Zhichao Wang, Yuanzhe Chen, Xinsheng Wang, Zhuo Chen, Lei Xie, Yuping Wang, Yuxuan Wang

Viaarxiv icon

AFaCTA: Assisting the Annotation of Factual Claim Detection with Reliable LLM Annotators

Feb 16, 2024
Jingwei Ni, Minjing Shi, Dominik Stammbach, Mrinmaya Sachan, Elliott Ash, Markus Leippold

Viaarxiv icon

Cross-Speaker Encoding Network for Multi-Talker Speech Recognition

Jan 08, 2024
Jiawen Kang, Lingwei Meng, Mingyu Cui, Haohan Guo, Xixin Wu, Xunying Liu, Helen Meng

Viaarxiv icon

Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition

Jan 19, 2024
Ismail Rasim Ulgen, Zongyang Du, Carlos Busso, Berrak Sisman

Viaarxiv icon

Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction

Jan 12, 2024
Ye-Xin Lu, Yang Ai, Hui-Peng Du, Zhen-Hua Ling

Viaarxiv icon

Zero Resource Cross-Lingual Part Of Speech Tagging

Jan 11, 2024
Sahil Chopra

Viaarxiv icon

Decoding of Selective Attention to Speech From Ear-EEG Recordings

Jan 10, 2024
Mike Thornton, Danilo Mandic, Tobias Reichenbach

Viaarxiv icon