Picture for Gokhan Tur

Gokhan Tur

Bilkent University, Ankara, Turkey

TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons

Add code
Apr 28, 2025
Figure 1 for TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons
Figure 2 for TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons
Figure 3 for TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons
Figure 4 for TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons
Viaarxiv icon

ToolRL: Reward is All Tool Learning Needs

Add code
Apr 16, 2025
Figure 1 for ToolRL: Reward is All Tool Learning Needs
Figure 2 for ToolRL: Reward is All Tool Learning Needs
Figure 3 for ToolRL: Reward is All Tool Learning Needs
Figure 4 for ToolRL: Reward is All Tool Learning Needs
Viaarxiv icon

YourBench: Easy Custom Evaluation Sets for Everyone

Add code
Apr 02, 2025
Figure 1 for YourBench: Easy Custom Evaluation Sets for Everyone
Figure 2 for YourBench: Easy Custom Evaluation Sets for Everyone
Figure 3 for YourBench: Easy Custom Evaluation Sets for Everyone
Figure 4 for YourBench: Easy Custom Evaluation Sets for Everyone
Viaarxiv icon

Persuade Me if You Can: A Framework for Evaluating Persuasion Effectiveness and Susceptibility Among Large Language Models

Add code
Mar 03, 2025
Figure 1 for Persuade Me if You Can: A Framework for Evaluating Persuasion Effectiveness and Susceptibility Among Large Language Models
Figure 2 for Persuade Me if You Can: A Framework for Evaluating Persuasion Effectiveness and Susceptibility Among Large Language Models
Figure 3 for Persuade Me if You Can: A Framework for Evaluating Persuasion Effectiveness and Susceptibility Among Large Language Models
Figure 4 for Persuade Me if You Can: A Framework for Evaluating Persuasion Effectiveness and Susceptibility Among Large Language Models
Viaarxiv icon

SMART: Self-Aware Agent for Tool Overuse Mitigation

Add code
Feb 17, 2025
Figure 1 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 2 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 3 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Figure 4 for SMART: Self-Aware Agent for Tool Overuse Mitigation
Viaarxiv icon

Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model

Add code
Feb 12, 2025
Figure 1 for Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model
Figure 2 for Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model
Figure 3 for Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model
Figure 4 for Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model
Viaarxiv icon

Towards Preventing Overreliance on Task-Oriented Conversational AI Through Accountability Modeling

Add code
Jan 17, 2025
Viaarxiv icon

Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems

Add code
Nov 15, 2024
Viaarxiv icon

ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents

Add code
Nov 01, 2024
Figure 1 for ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents
Figure 2 for ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents
Figure 3 for ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents
Figure 4 for ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents
Viaarxiv icon

Simulating User Agents for Embodied Conversational-AI

Add code
Oct 31, 2024
Figure 1 for Simulating User Agents for Embodied Conversational-AI
Figure 2 for Simulating User Agents for Embodied Conversational-AI
Figure 3 for Simulating User Agents for Embodied Conversational-AI
Figure 4 for Simulating User Agents for Embodied Conversational-AI
Viaarxiv icon