"speech": models, code, and papers

Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion

Jan 26, 2024
Jinhan Wang, Long Chen, Aparna Khare, Anirudh Raju, Pranav Dheram, Di He, Minhua Wu, Andreas Stolcke, Venkatesh Ravichandran

Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models

Jan 03, 2024
Rita Frieske, Bertram E. Shi

Consistency Based Unsupervised Self-training For ASR Personalisation

Jan 22, 2024
Jisi Zhang, Vandana Rajan, Haaris Mehmood, David Tuckey, Pablo Peso Parada, Md Asif Jalal, Karthikeyan Saravanan, Gil Ho Lee, Jungin Lee, Seokyeong Jung

BANSpEmo: A Bangla Emotional Speech Recognition Dataset

Dec 21, 2023
Md Gulzar Hussain, Mahmuda Rahman, Babe Sultana, Ye Shiren

Embedding-based search in JetBrains IDEs

Jan 26, 2024
Evgeny Abramov, Nikolai Palchikov

Exploring data augmentation in bias mitigation against non-native-accented speech

Dec 24, 2023
Yuanyuan Zhang, Aaricia Herygers, Tanvina Patel, Zhengjun Yue, Odette Scharenborg

Quantifying Stereotypes in Language

Jan 28, 2024
Yang Liu

A RAG-based Question Answering System Proposal for Understanding Islam: MufassirQAS LLM

Feb 01, 2024
Ahmet Yusuf Alan, Enis Karaarslan, Ömer Aydin

Overlap-aware End-to-End Supervised Hierarchical Graph Clustering for Speaker Diarization

Jan 23, 2024
Prachi Singh, Sriram Ganapathy

A Proactive and Dual Prevention Mechanism against Illegal Song Covers empowered by Singing Voice Conversion

Jan 30, 2024
Guangke Chen, Yedi Zhang, Fu Song, Ting Wang, Xiaoning Du, Yang Liu
