chatbots


The Leaderboard Illusion

Add code
Apr 29, 2025
Viaarxiv icon

Chatbot Arena Meets Nuggets: Towards Explanations and Diagnostics in the Evaluation of LLM Responses

Add code
Apr 28, 2025
Viaarxiv icon

Sample-Efficient Language Model for Hinglish Conversational AI

Add code
Apr 27, 2025
Viaarxiv icon

AI Chatbots for Mental Health: Values and Harms from Lived Experiences of Depression

Add code
Apr 26, 2025
Viaarxiv icon

Scaling Laws For Scalable Oversight

Add code
Apr 25, 2025
Viaarxiv icon

Tempo: Application-aware LLM Serving with Mixed SLO Requirements

Add code
Apr 24, 2025
Figure 1 for Tempo: Application-aware LLM Serving with Mixed SLO Requirements
Figure 2 for Tempo: Application-aware LLM Serving with Mixed SLO Requirements
Figure 3 for Tempo: Application-aware LLM Serving with Mixed SLO Requirements
Figure 4 for Tempo: Application-aware LLM Serving with Mixed SLO Requirements
Viaarxiv icon

Context-Enhanced Contrastive Search for Improved LLM Text Generation

Add code
Apr 22, 2025
Viaarxiv icon

Know Me, Respond to Me: Benchmarking LLMs for Dynamic User Profiling and Personalized Responses at Scale

Add code
Apr 19, 2025
Viaarxiv icon

Interpersonal Theory of Suicide as a Lens to Examine Suicidal Ideation in Online Spaces

Add code
Apr 17, 2025
Viaarxiv icon

Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions

Add code
Apr 15, 2025
Figure 1 for Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions
Figure 2 for Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions
Figure 3 for Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions
Figure 4 for Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions
Viaarxiv icon