Picture for Kwanghee Choi

Kwanghee Choi

Prosodic ABX: A Language-Agnostic Method for Measuring Prosodic Contrast in Speech Representations

Add code
Apr 02, 2026
Viaarxiv icon

An Empirical Recipe for Universal Phone Recognition

Add code
Mar 30, 2026
Viaarxiv icon

Adapting Self-Supervised Speech Representations for Cross-lingual Dysarthria Detection in Parkinson's Disease

Add code
Mar 23, 2026
Viaarxiv icon

Self-Supervised Speech Models Encode Phonetic Context via Position-dependent Orthogonal Subspaces

Add code
Mar 13, 2026
Viaarxiv icon

The CMU-AIST submission for the ICME 2025 Audio Encoder Challenge

Add code
Jan 22, 2026
Viaarxiv icon

PRiSM: Benchmarking Phone Realization in Speech Models

Add code
Jan 20, 2026
Viaarxiv icon

Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration

Add code
Jan 06, 2026
Viaarxiv icon

CS-FLEURS: A Massively Multilingual and Code-Switched Speech Dataset

Add code
Sep 17, 2025
Viaarxiv icon

OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder

Add code
Jul 18, 2025
Figure 1 for OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder
Figure 2 for OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder
Figure 3 for OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder
Figure 4 for OpenBEATs: A Fully Open-Source General-Purpose Audio Encoder
Viaarxiv icon

Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages

Add code
May 20, 2025
Viaarxiv icon