Spoken


Multilingual Gloss-free Sign Language Translation: Towards Building a Sign Language Foundation Model

Add code
May 30, 2025
Viaarxiv icon

Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization

Add code
May 30, 2025
Viaarxiv icon

Voice Conversion Improves Cross-Domain Robustness for Spoken Arabic Dialect Identification

Add code
May 30, 2025
Viaarxiv icon

Spoken Language Modeling with Duration-Penalized Self-Supervised Units

Add code
May 29, 2025
Viaarxiv icon

LLM-Synth4KWS: Scalable Automatic Generation and Synthesis of Confusable Data for Custom Keyword Spotting

Add code
May 29, 2025
Viaarxiv icon

Spoken question answering for visual queries

Add code
May 29, 2025
Viaarxiv icon

In-context Language Learning for Endangered Languages in Speech Recognition

Add code
May 28, 2025
Viaarxiv icon

Counting trees: A treebank-driven exploration of syntactic variation in speech and writing across languages

Add code
May 28, 2025
Viaarxiv icon

Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge

Add code
May 28, 2025
Viaarxiv icon

StressTest: Can YOUR Speech LM Handle the Stress?

Add code
May 28, 2025
Viaarxiv icon