Yangyue Wang

GUI-Perturbed: Domain Randomization Reveals Systematic Brittleness in GUI Grounding Models

Apr 15, 2026
Via arXiv

Benchmarking the Generality of Vision-Language-Action Models

Dec 12, 2025
Via arXiv

MultiNet: An Open-Source Software Toolkit & Benchmark Suite for the Evaluation and Adaptation of Multimodal Action Models

Jun 10, 2025
Via arXiv

Benchmarking Vision, Language, & Action Models in Procedurally Generated, Open Ended Action Environments

May 08, 2025
Via arXiv