Atropos

Overview

Atropos is the reinforcement-learning (RL) environment and rollout substrate that Hermes uses for multi-turn tool-calling tasks. Within the Hermes stack, Atropos supplies the BaseEnv contract, server management, scheduling, logging, and the serve, process, and evaluate modes. Hermes layers agent-loop orchestration on top and runs reward verification in the same sandbox.
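To make that division of labor concrete, here is a minimal Python sketch of an environment that owns the task, the tools, and the reward check, driven by a separate agent loop. Every name in it (ToyToolEnv, run_episode, scripted_policy, the action dictionaries) is hypothetical and chosen for illustration. It does not reproduce the Atropos BaseEnv interface or any Hermes code, and it leaves out server management, scheduling, and logging.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical names throughout: this is NOT the Atropos BaseEnv API,
# only a toy illustration of an environment contract plus an agent loop.


@dataclass
class Turn:
    role: str      # "assistant" or "tool"
    content: str


@dataclass
class Trajectory:
    task: str
    turns: List[Turn] = field(default_factory=list)
    reward: float = 0.0


class ToyToolEnv:
    """Environment side: owns the task, the tools, and the reward check."""

    def __init__(self, task: str, expected: str,
                 tools: Dict[str, Callable[[str], str]]):
        self.task = task
        self.expected = expected
        self.tools = tools

    def call_tool(self, name: str, arg: str) -> str:
        # Unknown tools return an error string instead of raising,
        # so the episode can continue.
        if name not in self.tools:
            return f"error: unknown tool {name!r}"
        return self.tools[name](arg)

    def verify(self, final_answer: str) -> float:
        # Reward verification: exact match against the expected answer.
        return 1.0 if final_answer.strip() == self.expected else 0.0


def run_episode(env: ToyToolEnv, policy, max_turns: int = 4) -> Trajectory:
    """Agent-loop side: alternate policy actions and tool results."""
    traj = Trajectory(task=env.task)
    for _ in range(max_turns):
        action = policy(env.task, traj.turns)
        traj.turns.append(Turn("assistant", str(action)))
        if action["type"] == "final":
            traj.reward = env.verify(action["answer"])
            break
        result = env.call_tool(action["tool"], action["arg"])
        traj.turns.append(Turn("tool", result))
    return traj


def scripted_policy(task: str, turns: List[Turn]) -> dict:
    """Stand-in for a model: call one tool, then answer with its output."""
    if not turns:
        return {"type": "tool_call", "tool": "add", "arg": "2 3"}
    return {"type": "final", "answer": turns[-1].content}


def add_tool(arg: str) -> str:
    a, b = arg.split()
    return str(int(a) + int(b))


env = ToyToolEnv(task="What is 2 + 3?", expected="5", tools={"add": add_tool})
trajectory = run_episode(env, scripted_policy)
```

In this toy run the trajectory holds three turns (a tool call, the tool result, and the final answer), and verify scores the final answer. A real setup would put a model behind the policy and a sandboxed tool runtime behind call_tool.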

Why it matters

Atropos matters because it shows what a harness-native environment API looks like when the agent is not completing a single prompt but acting through tools across multiple turns.

Distinctive trait

Its distinctive trait is scope. In Hermes, Atropos is neither a bare trainer nor a mere leaderboard wrapper: it is an environment and runtime contract, and its training hookup comes through the broader Tinker-Atropos stack.

Relationships

Read Atropos alongside hermes-agent, agentgym, and rl-gyms-and-executable-environments-for-ai-harnesses. For the fit analysis, see another-harness-and-atropos. For the local stepping-stone prototypes, see another-harness-work-item-closure-environment and another-harness-evaluator-discipline-environment.