Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:LLM-Based Multi-Reference Evaluation for Efficient and Robust Assessment of Phrase Break Annotations

Jun 19, 2026

Younghan Park, Hoyeon Lee, Hawon Jeong, Jong-Hwan Kim

Share this with someone who'll enjoy it:

Abstract:Reliable evaluation of phrase break annotations is crucial, as subtle variations in prosodic boundaries directly affect the clarity and naturalness of speech. However, existing approaches exhibit major limitations: single-reference evaluation assumes a unique gold phrasing for an utterance despite multiple valid phrasings, while human judgment, though flexible, is labor-intensive and unscalable. To address these, we propose LLM-based Multi-Reference Evaluation (LMRE) for phrase break annotations that models the one-to-many nature of prosodic phrasing and generates multiple valid phrasings from minimal demonstrations. On a Korean testbed of 1,356 annotations covering five strategies, LMRE shows stronger alignment with human judgment than single-reference evaluation in both acceptance behavior and score correlation. Our findings demonstrate that LMRE effectively achieves both scalability and multi-reference support, highlighting the potential of LLMs for evaluation in the speech domain.

* Accepted at Interspeech 2026

View paper on

Share this with someone who'll enjoy it:

Title:LLM-Based Multi-Reference Evaluation for Efficient and Robust Assessment of Phrase Break Annotations

Paper and Code