Picture for Brian Goodrich

Brian Goodrich

Evaluating Language-Model Agents on Realistic Autonomous Tasks

Add code
Jan 04, 2024
Viaarxiv icon