JudgeFlow

Overview

JudgeFlow adds a judge module that assigns responsibility scores to workflow blocks so repair can target the part that actually failed. It tries to close the gap between “the workflow went badly” and “here is the piece worth mutating.”

Why it matters

It matters because serious control planes need diagnosis, not only generation. Promotion requires knowing which block deserves blame and which does not.

Distinctive trait

Its distinctive trait is block-level responsibility assignment over workflow traces, which turns vague failure into actionable repair signals.

Relationships

Read JudgeFlow with self-evolving-workflows, evaluation-and-review-loops, and RobustFlow. It also pairs naturally with AFlow when search needs a sharper evaluator lane.