Picture for Berlin Chen

Berlin Chen

The NTNU System at the S&I Challenge 2025 SLA Open Track

Add code
Jun 05, 2025
Viaarxiv icon

Acoustically Precise Hesitation Tagging Is Essential for End-to-End Verbatim Transcription Systems

Add code
Jun 04, 2025
Viaarxiv icon

A Novel Data Augmentation Approach for Automatic Speaking Assessment on Opinion Expressions

Add code
Jun 04, 2025
Viaarxiv icon

Long-Context State-Space Video World Models

Add code
May 26, 2025
Viaarxiv icon

Towards Efficient and Multifaceted Computer-assisted Pronunciation Training Leveraging Hierarchical Selective State Space Model and Decoupled Cross-entropy Loss

Add code
Feb 11, 2025
Viaarxiv icon

Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection

Add code
Nov 26, 2024
Viaarxiv icon

A Novel LLM-based Two-stage Summarization Approach for Long Dialogues

Add code
Oct 09, 2024
Figure 1 for A Novel LLM-based Two-stage Summarization Approach for Long Dialogues
Figure 2 for A Novel LLM-based Two-stage Summarization Approach for Long Dialogues
Figure 3 for A Novel LLM-based Two-stage Summarization Approach for Long Dialogues
Figure 4 for A Novel LLM-based Two-stage Summarization Approach for Long Dialogues
Viaarxiv icon

Zero-Shot Text-to-Speech as Golden Speech Generator: A Systematic Framework and its Applicability in Automatic Pronunciation Assessment

Add code
Sep 11, 2024
Viaarxiv icon

Automated Speaking Assessment of Conversation Tests with Novel Graph-based Modeling on Spoken Response Coherence

Add code
Sep 11, 2024
Viaarxiv icon

An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition

Add code
Sep 10, 2024
Figure 1 for An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
Figure 2 for An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
Figure 3 for An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
Figure 4 for An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition
Viaarxiv icon