WebArena: A Realistic Web Environment for Building Autonomous Agents
Source: arXiv
Authors: Shuyan Zhou, Frank F. Xu, Hao Zhu, Xuhui Zhou, Robert Lo, Abishek Sridhar, Xianyi Cheng, Tianyue Ou, et al.
Date: 2023-07-25
Primary category: cs.AI
All categories: cs.AI, cs.CL, cs.LG
Abstract
WebArena builds realistic, reproducible web environments across multiple domains so that agents can be tested on long-horizon internet tasks with functional-correctness evaluation, meaning a task counts as solved only if its intended outcome is actually achieved. It is a foundational executable benchmark for web agents because it moves beyond toy sites and exposes the gap between browsing that looks plausible and actually finishing a task.
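To make the idea of functional-correctness evaluation concrete, here is a minimal sketch in Python. It is not WebArena's actual evaluator or API: the `Task` record, the `is_functionally_correct` and `success_rate` helpers, and the state dictionaries are all hypothetical names and data invented for illustration. The point it shows is that the check inspects only the end state of the environment, not the sequence of actions the agent took.

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    """Hypothetical task record: an intent plus the end state that counts as success."""
    intent: str
    expected_state: dict = field(default_factory=dict)


def is_functionally_correct(task: Task, final_state: dict) -> bool:
    """Return True if every expected key/value pair appears in the final state.

    The agent's action sequence is deliberately not inspected: only the
    outcome matters.
    """
    return all(
        key in final_state and final_state[key] == value
        for key, value in task.expected_state.items()
    )


def success_rate(results: list[bool]) -> float:
    """Fraction of tasks completed; 0.0 for an empty list."""
    return sum(results) / len(results) if results else 0.0


# Illustrative task and states (assumed, not taken from the benchmark).
task = Task(
    intent="Set the order's shipping status to 'shipped'",
    expected_state={"order_42_status": "shipped"},
)

# Two hypothetical runs: one finishes the task, one only browses plausibly.
finished = {"order_42_status": "shipped", "pages_visited": 7}
browsed_only = {"order_42_status": "pending", "pages_visited": 12}

outcomes = [
    is_functionally_correct(task, finished),
    is_functionally_correct(task, browsed_only),
]
print(outcomes, success_rate(outcomes))
```

In this sketch the second run visits more pages yet still fails, which is the kind of gap between plausible browsing and task completion that an outcome-based check is meant to surface.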