WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?
Source: arXiv
Authors: Alexandre Drouin, Maxime Gasse, Massimo Caccia, Issam H. Laradji, Manuel Del Verme, Tom Marty, Léo Boisvert, Megh Thakkar, et al.
Date: 2024-03-12
Primary category: cs.LG
All categories: cs.LG, cs.AI
Abstract
WorkArena is a benchmark of realistic enterprise knowledge-work tasks carried out in the browser, and it introduces BrowserGym as the environment in which web agents are run and evaluated. It is especially useful for harness design because it centers on routine office work rather than contrived, puzzle-style web tasks.