Yuanli Wang

Terminal-Bench: Benchmarking Agents on Hard, Realistic Tasks in Command Line Interfaces

Jan 17, 2026
Via arXiv

WebGraphEval: Multi-Turn Trajectory Evaluation for Web Agents using Graph Representation

Oct 22, 2025
Via arXiv

Efficient Data Distribution Estimation for Accelerated Federated Learning

Jun 03, 2024
Via arXiv

Trace and Edit Relation Associations in GPT

Dec 30, 2023
Via arXiv