Abstract: Large Language Models (LLMs) demonstrate strong reasoning and generation abilities, yet their behavior in multi-turn tasks often lacks reliability and verifiability. We present a task completion framework that enables LLM-based agents to act under explicit behavioral guidance in environments formalized in reinforcement-learning terms, with defined observation, action, and reward signals. The framework integrates three components: a lightweight task profiler that selects reasoning and generation strategies, a reasoning module that learns verifiable observation-action mappings, and a generation module that enforces constraint-compliant outputs through validation or deterministic synthesis. We show that as the agent interacts with the environment, these components co-evolve, yielding trustworthy behavior.
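The following is a minimal sketch of how the three components described above could interact in an agent loop. All names (TaskProfiler, ReasoningModule, GenerationModule, ToyEnv) and the gym-style reset()/step() interface are illustrative assumptions, not the paper's actual API.

```python
class TaskProfiler:
    """Selects a reasoning/generation strategy from a task description (hypothetical heuristic)."""
    def select_strategy(self, task_description: str) -> str:
        # Route constrained tasks to deterministic synthesis, others to validated generation.
        return "synthesis" if "constraint" in task_description else "validate"

class ReasoningModule:
    """Learns a verifiable observation -> action mapping."""
    def __init__(self):
        self.policy = {}  # observation -> action table, refined over episodes
    def act(self, observation):
        return self.policy.get(observation, "explore")
    def update(self, observation, action, reward):
        if reward > 0:
            self.policy[observation] = action  # keep actions that paid off

class GenerationModule:
    """Enforces constraint-compliant outputs via validation or deterministic synthesis."""
    def generate(self, action, strategy: str) -> str:
        draft = f"execute:{action}"  # deterministic synthesis of the output
        if strategy == "validate" and not self.is_valid(draft):
            raise ValueError("output violates task constraints")
        return draft
    @staticmethod
    def is_valid(output: str) -> bool:
        return output.startswith("execute:")  # placeholder constraint check

def run_episode(env, task_description: str, steps: int = 10):
    profiler, reasoner, generator = TaskProfiler(), ReasoningModule(), GenerationModule()
    strategy = profiler.select_strategy(task_description)
    obs = env.reset()
    for _ in range(steps):
        action = reasoner.act(obs)
        output = generator.generate(action, strategy)
        next_obs, reward, done = env.step(output)
        reasoner.update(obs, action, reward)  # components co-evolve with experience
        obs = next_obs
        if done:
            break

class ToyEnv:
    """A trivial environment with a gym-like reset/step interface, for illustration only."""
    def reset(self):
        return "start"
    def step(self, output):
        reward = 1 if output.startswith("execute:") else -1
        return "start", reward, reward > 0  # observation, reward, done

if __name__ == "__main__":
    run_episode(ToyEnv(), "task with constraint")
```

The point of the sketch is the feedback cycle: the profiler fixes a strategy up front, while the reasoning module's observation-action table and the generation module's constraint checks are exercised and refined on every environment step.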
Abstract: In autonomous driving tasks, scene understanding is the first step towards predicting the future behavior of the surrounding traffic participants. Yet how to represent a given scene and extract its features remain open research questions. In this study, we propose a novel text-based representation of traffic scenes and process it with a pre-trained language encoder. First, we show that text-based representations, combined with classical rasterized image representations, lead to descriptive scene embeddings. Second, we benchmark our predictions on the nuScenes dataset and show significant improvements over baselines. Third, we show in an ablation study that a joint encoder of text and rasterized images outperforms the individual encoders, confirming that the two representations have complementary strengths.
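Below is a minimal sketch of a joint text-plus-raster scene encoder in the spirit of the ablation described above. The concrete choices (DistilBERT as the pre-trained language encoder, a small CNN over rasterized bird's-eye-view images, concatenation followed by a linear fusion layer) are assumptions for illustration, not necessarily the paper's configuration.

```python
import torch
import torch.nn as nn
from transformers import AutoModel, AutoTokenizer

class JointSceneEncoder(nn.Module):
    """Fuses a pre-trained text encoder with a small CNN over rasterized scene images."""
    def __init__(self, text_model_name="distilbert-base-uncased", embed_dim=256):
        super().__init__()
        self.tokenizer = AutoTokenizer.from_pretrained(text_model_name)
        self.text_encoder = AutoModel.from_pretrained(text_model_name)
        self.raster_encoder = nn.Sequential(  # small CNN over RGB rasters
            nn.Conv2d(3, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        text_dim = self.text_encoder.config.hidden_size
        self.fuse = nn.Linear(text_dim + 64, embed_dim)  # joint scene embedding

    def forward(self, scene_texts, raster_images):
        tokens = self.tokenizer(scene_texts, return_tensors="pt",
                                padding=True, truncation=True)
        # First-token pooling of the language encoder's last hidden state.
        text_feat = self.text_encoder(**tokens).last_hidden_state[:, 0]
        img_feat = self.raster_encoder(raster_images)
        return self.fuse(torch.cat([text_feat, img_feat], dim=-1))

# Usage with a hypothetical textual scene description and a 224x224 RGB raster.
encoder = JointSceneEncoder()
emb = encoder(["ego vehicle approaching intersection; two pedestrians crossing"],
              torch.randn(1, 3, 224, 224))
print(emb.shape)  # torch.Size([1, 256])
```

Concatenation is the simplest fusion choice; it leaves each branch free to specialize, which matches the abstract's claim that the text and raster representations contribute complementary information to the scene embedding.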