Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Vania Dimitrova

School of Computing, University of Leeds, UK

Score Before You Speak: Improving Persona Consistency in Dialogue Generation using Response Quality Scores

Aug 09, 2025

Arpita Saggar, Jonathan C. Darling, Vania Dimitrova, Duygu Sarikaya, David C. Hogg

Figure 1 for Score Before You Speak: Improving Persona Consistency in Dialogue Generation using Response Quality Scores

Figure 2 for Score Before You Speak: Improving Persona Consistency in Dialogue Generation using Response Quality Scores

Figure 3 for Score Before You Speak: Improving Persona Consistency in Dialogue Generation using Response Quality Scores

Figure 4 for Score Before You Speak: Improving Persona Consistency in Dialogue Generation using Response Quality Scores

Abstract:Persona-based dialogue generation is an important milestone towards building conversational artificial intelligence. Despite the ever-improving capabilities of large language models (LLMs), effectively integrating persona fidelity in conversations remains challenging due to the limited diversity in existing dialogue data. We propose a novel framework SBS (Score-Before-Speaking), which outperforms previous methods and yields improvements for both million and billion-parameter models. Unlike previous methods, SBS unifies the learning of responses and their relative quality into a single step. The key innovation is to train a dialogue model to correlate augmented responses with a quality score during training and then leverage this knowledge at inference. We use noun-based substitution for augmentation and semantic similarity-based scores as a proxy for response quality. Through extensive experiments with benchmark datasets (PERSONA-CHAT and ConvAI2), we show that score-conditioned training allows existing models to better capture a spectrum of persona-consistent dialogues. Our ablation studies also demonstrate that including scores in the input prompt during training is superior to conventional training setups. Code and further details are available at https://arpita2512.github.io/score_before_you_speak

* Camera-Ready version for ECAI 2025. 8 pages

Via

Access Paper or Ask Questions

Weakly Supervised Text Classification on Free Text Comments in Patient-Reported Outcome Measures

Aug 11, 2023

Anna-Grace Linton, Vania Dimitrova, Amy Downing, Richard Wagland, Adam Glaser

Figure 1 for Weakly Supervised Text Classification on Free Text Comments in Patient-Reported Outcome Measures

Figure 2 for Weakly Supervised Text Classification on Free Text Comments in Patient-Reported Outcome Measures

Figure 3 for Weakly Supervised Text Classification on Free Text Comments in Patient-Reported Outcome Measures

Figure 4 for Weakly Supervised Text Classification on Free Text Comments in Patient-Reported Outcome Measures

Abstract:Free text comments (FTC) in patient-reported outcome measures (PROMs) data are typically analysed using manual methods, such as content analysis, which is labour-intensive and time-consuming. Machine learning analysis methods are largely unsupervised, necessitating post-analysis interpretation. Weakly supervised text classification (WSTC) can be a valuable method of analysis to classify domain-specific text data in which there is limited labelled data. In this paper, we apply five WSTC techniques to FTC in PROMs data to identify health-related quality of life (HRQoL) themes reported by colorectal cancer patients. The WSTC methods label all the themes mentioned in the FTC. The results showed moderate performance on the PROMs data, mainly due to the precision of the models, and variation between themes. Evaluation of the classification performance illustrated the potential and limitations of keyword based WSTC to label PROMs FTC when labelled data is limited.

* Accepted and presented at Health Text Analytics conference 2023 (UK)

Via

Access Paper or Ask Questions