Dialogue Management


PERMA: Benchmarking Personalized Memory Agents via Event-Driven Preference and Realistic Task Environments

Add code
Mar 24, 2026
Viaarxiv icon

The Validity Gap in Health AI Evaluation: A Cross-Sectional Analysis of Benchmark Composition

Add code
Mar 18, 2026
Viaarxiv icon

SoulX-Duplug: Plug-and-Play Streaming State Prediction Module for Realtime Full-Duplex Speech Conversation

Add code
Mar 16, 2026
Viaarxiv icon

SuperLocalMemory V3: Information-Geometric Foundations for Zero-LLM Enterprise Agent Memory

Add code
Mar 15, 2026
Viaarxiv icon

GCAgent: Enhancing Group Chat Communication through Dialogue Agents System

Add code
Mar 05, 2026
Viaarxiv icon

StyleBench: Evaluating Speech Language Models on Conversational Speaking Style Control

Add code
Mar 08, 2026
Viaarxiv icon

MT-PingEval: Evaluating Multi-Turn Collaboration with Private Information Games

Add code
Feb 27, 2026
Viaarxiv icon

From Dialogue to Execution: Mixture-of-Agents Assisted Interactive Planning for Behavior Tree-Based Long-Horizon Robot Execution

Add code
Mar 01, 2026
Viaarxiv icon

HyMem: Hybrid Memory Architecture with Dynamic Retrieval Scheduling

Add code
Feb 15, 2026
Viaarxiv icon

TraceMem: Weaving Narrative Memory Schemata from User Conversational Traces

Add code
Feb 10, 2026
Viaarxiv icon