chatbots


SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development

Add code
May 22, 2025
Viaarxiv icon

PersonaBOT: Bringing Customer Personas to Life with LLMs and RAG

Add code
May 22, 2025
Viaarxiv icon

X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs

Add code
May 22, 2025
Viaarxiv icon

EnSToM: Enhancing Dialogue Systems with Entropy-Scaled Steering Vectors for Topic Maintenance

Add code
May 22, 2025
Viaarxiv icon

Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses

Add code
May 21, 2025
Figure 1 for Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses
Figure 2 for Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses
Figure 3 for Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses
Figure 4 for Alignment Under Pressure: The Case for Informed Adversaries When Evaluating LLM Defenses
Viaarxiv icon

AI vs. Human Judgment of Content Moderation: LLM-as-a-Judge and Ethics-Based Response Refusals

Add code
May 21, 2025
Viaarxiv icon

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

Beyond Words: Multimodal LLM Knows When to Speak

Add code
May 20, 2025
Viaarxiv icon

Evaluating the efficacy of LLM Safety Solutions : The Palit Benchmark Dataset

Add code
May 20, 2025
Viaarxiv icon

Evaluatiing the efficacy of LLM Safety Solutions : The Palit Benchmark Dataset

Add code
May 19, 2025
Viaarxiv icon