Picture for Shinji Watanabe

Shinji Watanabe

CLSP

Streaming Endpointer for Spoken Dialogue using Neural Audio Codecs and Label-Delayed Training

Add code
Jun 08, 2025
Viaarxiv icon

Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC

Add code
May 30, 2025
Viaarxiv icon

Explainable Depression Detection using Masked Hard Instance Mining

Add code
May 30, 2025
Viaarxiv icon

ARECHO: Autoregressive Evaluation via Chain-Based Hypothesis Optimization for Speech Multi-Metric Estimation

Add code
May 30, 2025
Viaarxiv icon

Interspeech 2025 URGENT Speech Enhancement Challenge

Add code
May 29, 2025
Viaarxiv icon

Uni-VERSA: Versatile Speech Assessment with a Unified Network

Add code
May 27, 2025
Viaarxiv icon

Context-Driven Dynamic Pruning for Large Speech Foundation Models

Add code
May 24, 2025
Viaarxiv icon

Differentiable K-means for Fully-optimized Discrete Token-based ASR

Add code
May 22, 2025
Viaarxiv icon

BLAB: Brutally Long Audio Bench

Add code
May 05, 2025
Viaarxiv icon

On The Landscape of Spoken Language Models: A Comprehensive Survey

Add code
Apr 11, 2025
Viaarxiv icon