chatbots


C2PO: Diagnosing and Disentangling Bias Shortcuts in LLMs

Dec 29, 2025

Authors Should Label Their Own Documents

Dec 27, 2025

Adversarial Training for Failure-Sensitive User Simulation in Mental Health Dialogue Optimization

Dec 23, 2025

Evaluating the Challenges of LLMs in Real-world Medical Follow-up: A Comparative Study and An Optimized Framework

Dec 22, 2025

Conscious Data Contribution via Community-Driven Chain-of-Thought Distillation

Dec 20, 2025

Subjective Question Generation and Answer Evaluation using NLP

Dec 19, 2025

ShareChat: A Dataset of Chatbot Conversations in the Wild

Dec 19, 2025

Towards Explainable Conversational AI for Early Diagnosis with Large Language Models

Dec 19, 2025

Needle in the Web: A Benchmark for Retrieving Targeted Web Pages in the Wild

Dec 18, 2025

A Women's Health Benchmark for Large Language Models

Dec 18, 2025