Alert button

"speech": models, code, and papers
Alert button

PAM: Prompting Audio-Language Models for Audio Quality Assessment

Add code
Bookmark button
Alert button
Feb 01, 2024
Soham Deshmukh, Dareen Alharthi, Benjamin Elizalde, Hannes Gamper, Mahmoud Al Ismail, Rita Singh, Bhiksha Raj, Huaming Wang

Viaarxiv icon

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Jan 26, 2024
Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran

Viaarxiv icon

Masked Audio Modeling with CLAP and Multi-Objective Learning

Jan 29, 2024
Yifei Xin, Xiulian Peng, Yan Lu

Viaarxiv icon

BAE-Net: A Low complexity and high fidelity Bandwidth-Adaptive neural network for speech super-resolution

Dec 21, 2023
Guochen Yu, Xiguang Zheng, Nan Li, Runqiang Han, Chengshi Zheng, Chen Zhang, Chao Zhou, Qi Huang, Bing Yu

Viaarxiv icon

Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation

Add code
Bookmark button
Alert button
Dec 18, 2023
Hui Fu, Zeqing Wang, Ke Gong, Keze Wang, Tianshui Chen, Haojie Li, Haifeng Zeng, Wenxiong Kang

Viaarxiv icon

Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization

Jan 23, 2024
Prachi Singh, Sriram Ganapathy

Viaarxiv icon

Attention-Guided Adaptation for Code-Switching Speech Recognition

Dec 14, 2023
Bobbi Aditya, Mahdin Rohmatillah, Liang-Hsuan Tai, Jen-Tzung Chien

Figure 1 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Figure 2 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Figure 3 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Figure 4 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Viaarxiv icon

Performance Assessment of ChatGPT vs Bard in Detecting Alzheimer's Dementia

Jan 30, 2024
Balamurali B T, Jer-Ming Chen

Viaarxiv icon

Embedding-based search in JetBrains IDEs

Add code
Bookmark button
Alert button
Jan 26, 2024
Evgeny Abramov, Nikolai Palchikov

Viaarxiv icon

A RAG-based Question Answering System Proposal for Understanding Islam: MufassirQAS LLM

Jan 31, 2024
Ahmet Yusuf Alan, Enis Karaarslan, Ömer Aydin

Viaarxiv icon