Picture for Philipp Sadler

Philipp Sadler

Playpen: An Environment for Exploring Learning Through Conversational Interaction

Add code
Apr 11, 2025
Viaarxiv icon

The Unreasonable Ineffectiveness of Nucleus Sampling on Mitigating Text Memorization

Add code
Aug 29, 2024
Figure 1 for The Unreasonable Ineffectiveness of Nucleus Sampling on Mitigating Text Memorization
Figure 2 for The Unreasonable Ineffectiveness of Nucleus Sampling on Mitigating Text Memorization
Figure 3 for The Unreasonable Ineffectiveness of Nucleus Sampling on Mitigating Text Memorization
Figure 4 for The Unreasonable Ineffectiveness of Nucleus Sampling on Mitigating Text Memorization
Viaarxiv icon

clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark and Underlying Flexible Framework for LLMs as Multi-Action Agents

Add code
May 31, 2024
Figure 1 for clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark and Underlying Flexible Framework for LLMs as Multi-Action Agents
Figure 2 for clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark and Underlying Flexible Framework for LLMs as Multi-Action Agents
Figure 3 for clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark and Underlying Flexible Framework for LLMs as Multi-Action Agents
Figure 4 for clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark and Underlying Flexible Framework for LLMs as Multi-Action Agents
Viaarxiv icon

Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies

Add code
Mar 26, 2024
Figure 1 for Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies
Figure 2 for Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies
Figure 3 for Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies
Figure 4 for Sharing the Cost of Success: A Game for Evaluating and Learning Collaborative Multi-Agent Instruction Giving and Following Policies
Viaarxiv icon

Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game

Add code
Feb 07, 2024
Figure 1 for Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game
Figure 2 for Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game
Figure 3 for Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game
Figure 4 for Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game
Viaarxiv icon

Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from Examples

Add code
May 24, 2023
Figure 1 for Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from Examples
Figure 2 for Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from Examples
Figure 3 for Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from Examples
Figure 4 for Pento-DIARef: A Diagnostic Dataset for Learning the Incremental Algorithm for Referring Expression Generation from Examples
Viaarxiv icon

Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers

Add code
May 22, 2023
Figure 1 for Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers
Figure 2 for Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers
Figure 3 for Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers
Figure 4 for Yes, this Way! Learning to Ground Referring Expressions into Actions with Intra-episodic Feedback from Supportive Teachers
Viaarxiv icon

clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents

Add code
May 22, 2023
Figure 1 for clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents
Figure 2 for clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents
Figure 3 for clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents
Figure 4 for clembench: Using Game Play to Evaluate Chat-Optimized Language Models as Conversational Agents
Viaarxiv icon

Spatial Attention as an Interface for Image Captioning Models

Add code
Sep 29, 2020
Figure 1 for Spatial Attention as an Interface for Image Captioning Models
Figure 2 for Spatial Attention as an Interface for Image Captioning Models
Figure 3 for Spatial Attention as an Interface for Image Captioning Models
Figure 4 for Spatial Attention as an Interface for Image Captioning Models
Viaarxiv icon

Can Neural Image Captioning be Controlled via Forced Attention?

Add code
Nov 10, 2019
Figure 1 for Can Neural Image Captioning be Controlled via Forced Attention?
Figure 2 for Can Neural Image Captioning be Controlled via Forced Attention?
Figure 3 for Can Neural Image Captioning be Controlled via Forced Attention?
Figure 4 for Can Neural Image Captioning be Controlled via Forced Attention?
Viaarxiv icon