Picture for Berlin Chen

Berlin Chen

Beyond Modality Limitations: A Unified MLLM Approach to Automated Speaking Assessment with Effective Curriculum Learning

Add code
Aug 18, 2025
Viaarxiv icon

Revealing the Role of Audio Channels in ASR Performance Degradation

Add code
Aug 12, 2025
Viaarxiv icon

QAMRO: Quality-aware Adaptive Margin Ranking Optimization for Human-aligned Assessment of Audio Generation Systems

Add code
Aug 12, 2025
Viaarxiv icon

JCAPT: A Joint Modeling Approach for CAPT

Add code
Jun 24, 2025
Viaarxiv icon

The NTNU System at the S&I Challenge 2025 SLA Open Track

Add code
Jun 05, 2025
Viaarxiv icon

Acoustically Precise Hesitation Tagging Is Essential for End-to-End Verbatim Transcription Systems

Add code
Jun 04, 2025
Viaarxiv icon

A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions

Add code
Jun 04, 2025
Viaarxiv icon

Long-Context State-Space Video World Models

Add code
May 26, 2025
Viaarxiv icon

Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decoupled Cross-entropy Loss

Add code
Feb 11, 2025
Viaarxiv icon

Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection

Add code
Nov 26, 2024
Viaarxiv icon