Picture for Frank F. Xu

Frank F. Xu

CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation

Add code
Jan 28, 2025
Viaarxiv icon

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

Add code
Dec 18, 2024
Viaarxiv icon

The BrowserGym Ecosystem for Web Agent Research

Add code
Dec 10, 2024
Figure 1 for The BrowserGym Ecosystem for Web Agent Research
Figure 2 for The BrowserGym Ecosystem for Web Agent Research
Figure 3 for The BrowserGym Ecosystem for Web Agent Research
Figure 4 for The BrowserGym Ecosystem for Web Agent Research
Viaarxiv icon

OpenDevin: An Open Platform for AI Software Developers as Generalist Agents

Add code
Jul 23, 2024
Figure 1 for OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Figure 2 for OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Figure 3 for OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Figure 4 for OpenDevin: An Open Platform for AI Software Developers as Generalist Agents
Viaarxiv icon

CodeRAG-Bench: Can Retrieval Augment Code Generation?

Add code
Jun 20, 2024
Figure 1 for CodeRAG-Bench: Can Retrieval Augment Code Generation?
Figure 2 for CodeRAG-Bench: Can Retrieval Augment Code Generation?
Figure 3 for CodeRAG-Bench: Can Retrieval Augment Code Generation?
Figure 4 for CodeRAG-Bench: Can Retrieval Augment Code Generation?
Viaarxiv icon

WebArena: A Realistic Web Environment for Building Autonomous Agents

Add code
Jul 25, 2023
Figure 1 for WebArena: A Realistic Web Environment for Building Autonomous Agents
Figure 2 for WebArena: A Realistic Web Environment for Building Autonomous Agents
Figure 3 for WebArena: A Realistic Web Environment for Building Autonomous Agents
Figure 4 for WebArena: A Realistic Web Environment for Building Autonomous Agents
Viaarxiv icon

Hierarchical Prompting Assists Large Language Model on Web Navigation

Add code
May 23, 2023
Viaarxiv icon

Active Retrieval Augmented Generation

Add code
May 11, 2023
Figure 1 for Active Retrieval Augmented Generation
Figure 2 for Active Retrieval Augmented Generation
Figure 3 for Active Retrieval Augmented Generation
Figure 4 for Active Retrieval Augmented Generation
Viaarxiv icon

Why do Nearest Neighbor Language Models Work?

Add code
Jan 17, 2023
Viaarxiv icon

DocCoder: Generating Code by Retrieving and Reading Docs

Add code
Jul 13, 2022
Figure 1 for DocCoder: Generating Code by Retrieving and Reading Docs
Figure 2 for DocCoder: Generating Code by Retrieving and Reading Docs
Figure 3 for DocCoder: Generating Code by Retrieving and Reading Docs
Figure 4 for DocCoder: Generating Code by Retrieving and Reading Docs
Viaarxiv icon