Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations

May 20, 2025

Li Li, Peilin Cai, Ryan A. Rossi, Franck Dernoncourt, Branislav Kveton, Junda Wu, Tong Yu, Linxin Song, Tiankai Yang, Yuehan Qin(+14 more)

Figure 1 for A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations

Figure 2 for A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations

Figure 3 for A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations

Figure 4 for A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations

Share this with someone who'll enjoy it:

Abstract:We present PersonaConvBench, a large-scale benchmark for evaluating personalized reasoning and generation in multi-turn conversations with large language models (LLMs). Unlike existing work that focuses on either personalization or conversational structure in isolation, PersonaConvBench integrates both, offering three core tasks: sentence classification, impact regression, and user-centric text generation across ten diverse Reddit-based domains. This design enables systematic analysis of how personalized conversational context shapes LLM outputs in realistic multi-user scenarios. We benchmark several commercial and open-source LLMs under a unified prompting setup and observe that incorporating personalized history yields substantial performance improvements, including a 198 percent relative gain over the best non-conversational baseline in sentiment classification. By releasing PersonaConvBench with evaluations and code, we aim to support research on LLMs that adapt to individual styles, track long-term context, and produce contextually rich, engaging responses.

View paper on

Share this with someone who'll enjoy it:

Title:A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations

Paper and Code