Alert button

VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks

Jan 24, 2024
Jing Yu Koh, Robert Lo, Lawrence Jang, Vikram Duvvur, Ming Chong Lim, Po-Yu Huang, Graham Neubig, Shuyan Zhou, Ruslan Salakhutdinov, Daniel Fried

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: