chatbots


A Women's Health Benchmark for Large Language Models

Add code
Dec 18, 2025
Viaarxiv icon

Needle in the Web: A Benchmark for Retrieving Targeted Web Pages in the Wild

Add code
Dec 18, 2025
Viaarxiv icon

Authors Should Annotate

Add code
Dec 15, 2025
Viaarxiv icon

ORIBA: Exploring LLM-Driven Role-Play Chatbot as a Creativity Support Tool for Original Character Artists

Add code
Dec 14, 2025
Viaarxiv icon

Systematization of Knowledge: Security and Safety in the Model Context Protocol Ecosystem

Add code
Dec 13, 2025
Viaarxiv icon

Causal Judge Evaluation: Calibrated Surrogate Metrics for LLM Systems

Add code
Dec 11, 2025
Viaarxiv icon

Decoding Student Minds: Leveraging Conversational Agents for Psychological and Learning Analysis

Add code
Dec 11, 2025
Viaarxiv icon

PolyLingua: Margin-based Inter-class Transformer for Robust Cross-domain Language Detection

Add code
Dec 10, 2025
Viaarxiv icon

PoultryTalk: A Multi-modal Retrieval-Augmented Generation (RAG) System for Intelligent Poultry Management and Decision Support

Add code
Dec 08, 2025
Viaarxiv icon

Privacy Challenges and Solutions in Retrieval-Augmented Generation-Enhanced LLMs for Healthcare Chatbots: A Review of Applications, Risks, and Future Directions

Add code
Nov 17, 2025
Viaarxiv icon