chatbots


MPCI-Bench: A Benchmark for Multimodal Pairwise Contextual Integrity Evaluation of Language Model Agents

Add code
Jan 13, 2026
Viaarxiv icon

An Under-Explored Application for Explainable Multimodal Misogyny Detection in code-mixed Hindi-English

Add code
Jan 13, 2026
Viaarxiv icon

Do You Understand How I Feel?: Towards Verified Empathy in Therapy Chatbots

Add code
Jan 13, 2026
Viaarxiv icon

Emotional Support Evaluation Framework via Controllable and Diverse Seeker Simulator

Add code
Jan 12, 2026
Viaarxiv icon

Labels have Human Values: Value Calibration of Subjective Tasks

Add code
Jan 10, 2026
Viaarxiv icon

The Echo Chamber Multi-Turn LLM Jailbreak

Add code
Jan 09, 2026
Viaarxiv icon

Router-Suggest: Dynamic Routing for Multimodal Auto-Completion in Visually-Grounded Dialogs

Add code
Jan 09, 2026
Viaarxiv icon

Can AI Chatbots Provide Coaching in Engineering? Beyond Information Processing Toward Mastery

Add code
Jan 07, 2026
Viaarxiv icon

Do Chatbot LLMs Talk Too Much? The YapBench Benchmark

Add code
Jan 02, 2026
Viaarxiv icon

C2PO: Diagnosing and Disentangling Bias Shortcuts in LLMs

Add code
Dec 29, 2025
Viaarxiv icon