WebArena
Overview
WebArena is a realistic web environment for autonomous agents built from functional websites across multiple domains. It focuses on long-horizon web tasks with functional-correctness evaluation rather than miniature scripted browsing demos.
Why it matters
It matters because it helped shift web-agent research from toy environments toward worlds that are realistic enough to expose serious planning and action problems.
Distinctive trait
Its distinctive trait is realism with reproducibility: the sites behave like real web systems, but evaluation remains controlled enough to compare agents honestly.
Relationships
Read WebArena with browsergym, visualwebarena, workarena, and the browser-gym section of rl-gyms-and-executable-environments-for-ai-harnesses.