ComputerRL: Scaling End-to-End Online Reinforcement Learning for Computer Use Agents

Source: arXiv Authors: Hanyu Lai, Xiao Liu, Yanxiao Zhao, Han Xu, Hanchen Zhang, Bohao Jing, Yanyu Ren, Shuntian Yao, et al. Date: 2025-08-19 Primary category: cs.AI All categories: cs.AI

Abstract

ComputerRL provides distributed RL infrastructure for large-scale computer-use training and couples API and GUI actions in one environment. It matters because it addresses the ugly but decisive practical issue: you do not really have a gym until thousands of environments can run without collapsing in melodrama.