Thanathai Lertpetchpun

Trade-offs Between Capacity and Robustness in Neural Audio Codecs for Adversarially Robust Speech Recognition

Mar 10, 2026

Targeted Speaker Poisoning Framework in Zero-Shot Text-to-Speech

Mar 08, 2026

Quantifying Speaker Embedding Phonological Rule Interactions in Accented Speech Synthesis

Jan 20, 2026

Amplifying Artifacts with Speech Enhancement in Voice Anti-spoofing

Jun 13, 2025

Developing a High-performance Framework for Speech Emotion Recognition in Naturalistic Conditions Challenge for Emotional Attribute Prediction

Jun 12, 2025

Developing a Top-tier Framework in Naturalistic Conditions Challenge for Categorized Emotion Prediction: From Speech Foundation Models and Learning Objective to Data Augmentation and Engineering Choices

May 28, 2025

Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits

May 20, 2025