Dialogue Evaluation


The Silicon Mirror: Dynamic Behavioral Gating for Anti-Sycophancy in LLM Agents

Add code
Apr 02, 2026
Viaarxiv icon

Care-Conditioned Neuromodulation for Autonomy-Preserving Supportive Dialogue Agents

Add code
Apr 02, 2026
Viaarxiv icon

PsychAgent: An Experience-Driven Lifelong Learning Agent for Self-Evolving Psychological Counselor

Add code
Apr 02, 2026
Viaarxiv icon

Multi-Layered Memory Architectures for LLM Agents: An Experimental Evaluation of Long-Term Context Retention

Add code
Mar 31, 2026
Viaarxiv icon

Beyond Idealized Patients: Evaluating LLMs under Challenging Patient Behaviors in Medical Consultations

Add code
Mar 31, 2026
Viaarxiv icon

Impact of enriched meaning representations for language generation in dialogue tasks: A comprehensive exploration of the relevance of tasks, corpora and metrics

Add code
Mar 31, 2026
Viaarxiv icon

Benchmarking Interaction, Beyond Policy: a Reproducible Benchmark for Collaborative Instance Object Navigation

Add code
Mar 31, 2026
Viaarxiv icon

CounselReflect: A Toolkit for Auditing Mental-Health Dialogues

Add code
Mar 31, 2026
Viaarxiv icon

A Safety-Aware Role-Orchestrated Multi-Agent LLM Framework for Behavioral Health Communication Simulation

Add code
Mar 31, 2026
Viaarxiv icon

DongYuan: An LLM-Based Framework for Integrative Chinese and Western Medicine Spleen-Stomach Disorders Diagnosis

Add code
Mar 30, 2026
Viaarxiv icon