Picture for Chris Callison-Burch

Chris Callison-Burch

Shammie

You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling

Add code
Sep 11, 2024
Figure 1 for You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling
Figure 2 for You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling
Figure 3 for You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling
Figure 4 for You Have Thirteen Hours in Which to Solve the Labyrinth: Enhancing AI Game Masters with Function Calling
Viaarxiv icon

ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems

Add code
Aug 05, 2024
Figure 1 for ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems
Figure 2 for ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems
Figure 3 for ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems
Figure 4 for ReDel: A Toolkit for LLM-Powered Recursive Multi-Agent Systems
Viaarxiv icon

TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings

Add code
Jun 21, 2024
Figure 1 for TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings
Figure 2 for TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings
Figure 3 for TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings
Figure 4 for TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings
Viaarxiv icon

Learning Translations via Matrix Completion

Add code
Jun 19, 2024
Figure 1 for Learning Translations via Matrix Completion
Figure 2 for Learning Translations via Matrix Completion
Figure 3 for Learning Translations via Matrix Completion
Figure 4 for Learning Translations via Matrix Completion
Viaarxiv icon

PaCE: Parsimonious Concept Engineering for Large Language Models

Add code
Jun 06, 2024
Figure 1 for PaCE: Parsimonious Concept Engineering for Large Language Models
Figure 2 for PaCE: Parsimonious Concept Engineering for Large Language Models
Figure 3 for PaCE: Parsimonious Concept Engineering for Large Language Models
Figure 4 for PaCE: Parsimonious Concept Engineering for Large Language Models
Viaarxiv icon

Large Language Models Can Self-Improve At Web Agent Tasks

Add code
May 30, 2024
Figure 1 for Large Language Models Can Self-Improve At Web Agent Tasks
Figure 2 for Large Language Models Can Self-Improve At Web Agent Tasks
Figure 3 for Large Language Models Can Self-Improve At Web Agent Tasks
Figure 4 for Large Language Models Can Self-Improve At Web Agent Tasks
Viaarxiv icon

PDDLEGO: Iterative Planning in Textual Environments

Add code
May 30, 2024
Viaarxiv icon

Evaluating Vision-Language Models on Bistable Images

Add code
May 29, 2024
Figure 1 for Evaluating Vision-Language Models on Bistable Images
Figure 2 for Evaluating Vision-Language Models on Bistable Images
Figure 3 for Evaluating Vision-Language Models on Bistable Images
Figure 4 for Evaluating Vision-Language Models on Bistable Images
Viaarxiv icon

A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis

Add code
May 23, 2024
Figure 1 for A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis
Figure 2 for A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis
Figure 3 for A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis
Figure 4 for A Textbook Remedy for Domain Shifts: Knowledge Priors for Medical Image Analysis
Viaarxiv icon

RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors

Add code
May 13, 2024
Figure 1 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 2 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 3 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Figure 4 for RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Viaarxiv icon