Picture for Megan Kinniment

Megan Kinniment

Evaluating Language-Model Agents on Realistic Autonomous Tasks

Add code
Jan 04, 2024
Viaarxiv icon