Picture for Dilek Hakkani-Tür

Dilek Hakkani-Tür

EJ

Language Specific Knowledge: Do Models Know Better in X than in English?

Add code
May 21, 2025
Viaarxiv icon

Must Read: A Systematic Survey of Computational Persuasion

Add code
May 12, 2025
Viaarxiv icon

PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents

Add code
May 02, 2025
Viaarxiv icon

TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons

Add code
Apr 28, 2025
Viaarxiv icon

ToolRL: Reward is All Tool Learning Needs

Add code
Apr 16, 2025
Viaarxiv icon

YourBench: Easy Custom Evaluation Sets for Everyone

Add code
Apr 02, 2025
Viaarxiv icon

Persuade Me if You Can: A Framework for Evaluating Persuasion Effectiveness and Susceptibility Among Large Language Models

Add code
Mar 03, 2025
Viaarxiv icon

SMART: Self-Aware Agent for Tool Overuse Mitigation

Add code
Feb 17, 2025
Figure 1 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 2 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 3 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 4 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Viaarxiv icon

Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model

Add code
Feb 12, 2025
Viaarxiv icon

Beyond Sample-Level Feedback: Using Reference-Level Feedback to Guide Data Synthesis

Add code
Feb 06, 2025
Viaarxiv icon