Zora Zhiruo Wang

How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations

Oct 26, 2025

OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety

Jul 08, 2025

Inducing Programmatic Skills for Agentic Tasks

Apr 09, 2025

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

Apr 09, 2025

Benchmarking Failures in Tool-Augmented Language Models

Mar 18, 2025

CowPilot: A Framework for Autonomous and Human-Agent Collaborative Web Navigation

Jan 28, 2025

AutoPresent: Designing Structured Visuals from Scratch

Jan 01, 2025

Agent Workflow Memory

Sep 11, 2024

ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness?

Jul 19, 2024

CodeRAG-Bench: Can Retrieval Augment Code Generation?

Jun 20, 2024