Picture for Jiatong Shi

Jiatong Shi

Improving Speech Enhancement with Multi-Metric Supervision from Learned Quality Assessment

Add code
Jun 13, 2025
Viaarxiv icon

Discrete Audio Tokens: More Than a Survey!

Add code
Jun 12, 2025
Viaarxiv icon

DiscoSum: Discourse-aware News Summarization

Add code
Jun 07, 2025
Viaarxiv icon

ARECHO: Autoregressive Evaluation via Chain-Based Hypothesis Optimization for Speech Multi-Metric Estimation

Add code
May 30, 2025
Viaarxiv icon

Uni-VERSA: Versatile Speech Assessment with a Unified Network

Add code
May 27, 2025
Viaarxiv icon

Aligning Text-to-Music Evaluation with Human Preferences

Add code
Mar 20, 2025
Viaarxiv icon

ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems

Add code
Mar 11, 2025
Viaarxiv icon

Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM

Add code
Feb 24, 2025
Viaarxiv icon

ESPnet-SpeechLM: An Open Speech Language Model Toolkit

Add code
Feb 21, 2025
Viaarxiv icon

VERSA: A Versatile Evaluation Toolkit for Speech, Audio, and Music

Add code
Dec 23, 2024
Viaarxiv icon