Picture for Yanyi Shang

Yanyi Shang

WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces

Add code
Mar 05, 2026
Viaarxiv icon

WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents

Add code
Mar 05, 2026
Viaarxiv icon

VenusBench-GD: A Comprehensive Multi-Platform GUI Benchmark for Diverse Grounding Tasks

Add code
Dec 18, 2025
Figure 1 for VenusBench-GD: A Comprehensive Multi-Platform GUI Benchmark for Diverse Grounding Tasks
Figure 2 for VenusBench-GD: A Comprehensive Multi-Platform GUI Benchmark for Diverse Grounding Tasks
Figure 3 for VenusBench-GD: A Comprehensive Multi-Platform GUI Benchmark for Diverse Grounding Tasks
Figure 4 for VenusBench-GD: A Comprehensive Multi-Platform GUI Benchmark for Diverse Grounding Tasks
Viaarxiv icon

WebCanvas: Benchmarking Web Agents in Online Environments

Add code
Jun 18, 2024
Viaarxiv icon