Moldable Operations Studio Architecture Spec

Goal

Specify a harness architecture in which the CLI remains an exact textual kernel, but the larger system becomes a moldable operations studio: one durable control plane, many projections, explicit provenance, and first-class support for autonomous and human-supervised work.

The concrete typed companion to this page is moldable-operations-studio-schema-pass. The speculative design companion is sci-fi-audit-for-moldable-operations-studio. The real-research grounding pass is grounding-moldable-operations-studio-ideas-in-real-research. The screen-model companion is moldable-operations-studio-wireframes.

Non-goals

Replacing code with a toy canvas.
Forcing every task through one graphical control shape.
Turning the control plane into a giant eventually-consistent fever dream.

Design axioms

The transcript is a projection, not the source of truth.
All surfaces are clients of one shared state model.
Causality matters more than wall-clock order.
Partial failure must remain legible.
Human approval is a typed work primitive.
Speculative work is normal and should be represented explicitly.
A surface earns its existence by making a real operational question cheaper to answer.

Core state model

1. Durable event log

The studio stores append-only events for user actions, agent actions, tool calls, approvals, rejections, view changes, failures, suspicions, merges, and checkpoint creation.

Every event should include:

event id
logical timestamp
causal parents or frontier
actor identity
view epoch
affected objects
secrecy/integrity labels where relevant

This imports the lessons of vector-clock, virtual-synchrony, and jif.

2. Control objects

The minimum durable object family should be:

work_item
branch
checkpoint
approval
artifact
trace
view_definition
coalition
authority_budget
session_token

The current transcript thread is then only one possible rendering over these objects.

3. Consistent cuts

A checkpoint is not merely a label on recent history. It should identify a consistent cut over the relevant state so replay, audit, and review do not begin from an incoherent mixture. This is the Chandy–Lamport import.

4. Tentative versus committed state

Objects and events should distinguish:

tentative
committed
compensated
superseded
abandoned

This is the optimistic-replication / Bayou lesson. Branches are normal, and the studio should never confuse speculative work with promoted state.

Consistency model

1. Selective linearizability

Only a small set of control-plane operations need linearizable semantics:

approve / reject
grant / revoke authority budget
promote branch
publish view definition
change policy
finalize checkpoint

Everything else may use weaker but explicit semantics.

2. Session guarantees across surfaces

Humans and agents should receive at least:

read-your-writes
monotonic reads
writes-follow-reads
monotonic writes

A person who approves in the app should not open the CLI and be told, with a straight face, that the past is still pending.

3. View epochs

Live collaborations, subscriptions, and coalitions should be versioned by explicit membership epochs. This makes failover, reassignment, and watcher changes renderable rather than mystical.

4. Suspicion model

Runtime status should distinguish:

healthy
delayed
suspected failed
unreachable
confirmed failed
recovered

Decisions taken under suspicion should record that fact in provenance.

Authority model

1. Escrowed rights

Delegated autonomy should use bounded budgets rather than universal central gates. Budgets may cover:

token spend
API calls
deployment attempts
file-edit scope
branch creation rights
approval routing rights

2. Labeled artifacts

Artifacts, traces, and prompts should carry information-flow labels describing who may read, derive from, or act on them. This is finer-grained than generic workspace roles.

3. Self-certifying references

Remote artifacts and capabilities should prefer cryptographically meaningful handles when crossing trust boundaries.

Interaction model

Required projections

The first serious studio should support at least these synchronized views:

CLI transcript view
- exact text stream and tool I/O
Work graph view
- items, dependencies, branches, approvals, coalitions
Queue / inbox view
- pending approvals, stalled work, escalations, human asks
Grid / fleet view
- runs by state across many branches or agents
Evidence / trace view
- logs, tool outputs, runtime spans, diff lineage, queryable replay
Spatial exploration canvas
- loose arrangement of evidence before formalization
Moldable inspector view
- generated pane specialized for the current object or question

Each view is a view_definition plus a materialized projection over the same state.

Protocol model

1. Typed handoffs

Approvals, tool calls, reviewer requests, and escalation flows should be expressed as typed protocol steps, not free-text suggestion clouds.

2. Session-typed boundaries

Where a protocol is stable enough, the studio should check conformance: who may send next, what object shape is expected, and what counts as completion.

3. Human override

Humans may intervene by issuing commands that become ordinary durable control-plane events. Override should not mean bypassing history.

Replay and debugging

1. Queryable provenance

The replay system should answer:

what caused this state?
what was concurrent but independent?
when was this object first suspected bad?
what tentative work later became committed?
what changed between checkpoint A and B?

2. Counterfactual support

Where feasible, replay should support branch-local counterfactuals: inspect what the system would have looked like had a branch been promoted or an approval been denied.

Suggested implementation order

Build the durable event log with causal metadata.
Add consistent-cut checkpoints and replay queries.
Separate linearizable control operations from weaker projection updates.
Add session guarantees across CLI/app/web surfaces.
Add work graph, queue, and grid projections.
Add approval nodes and escrowed authority budgets.
Add moldable inspectors.
Add spatial exploration canvas and direct-manipulation affordances.

Acceptance criteria

A first usable moldable operations studio should make the following statements true:

The same work can be inspected coherently from CLI, graph, and queue views.
A checkpoint can be reopened as a consistent cut.
Parallel edits can be recognized as concurrent rather than merely conflicting.
Approval and promotion events are durable, auditable control-plane facts.
Humans and agents see monotonic state across surfaces.
Partial failure is represented explicitly rather than buried in retry folklore.
Delegated autonomy can operate inside bounded budgets.

Main warning

Do not build the projections first and invent the semantics later. The old distributed-systems literature is mostly a reminder that nice surfaces become nonsense quickly when the underlying control plane cannot distinguish causality, membership, suspicion, tentative state, and committed state.

Read this with moldable-operations-studio-wireframes, grounding-moldable-operations-studio-ideas-in-real-research, moldable-operations-studio-schema-pass, sci-fi-audit-for-moldable-operations-studio, legacy-distributed-systems-ideas-for-moldable-operations-studio, web-patterns-for-non-linear-harness-interfaces, non-linear-interface-options-for-next-harness, new-harness-design-notes, work-management-primitives, partial-order-trace-semantics, and harness-engineering.

Agent Harness Wiki

Browse

Moldable Operations Studio Architecture Spec

Goal

Non-goals

Design axioms

Core state model

1. Durable event log

2. Control objects

3. Consistent cuts

4. Tentative versus committed state

Consistency model

1. Selective linearizability

2. Session guarantees across surfaces

3. View epochs

4. Suspicion model

Authority model

1. Escrowed rights

2. Labeled artifacts

3. Self-certifying references

Interaction model

Required projections

Protocol model

1. Typed handoffs

2. Session-typed boundaries

3. Human override

Replay and debugging

1. Queryable provenance

2. Counterfactual support

Suggested implementation order

Acceptance criteria

Main warning

Graph View

Table of Contents

Backlinks

Agent Harness Wiki

Browse

Moldable Operations Studio Architecture Spec

Goal

Non-goals

Design axioms

Core state model

1. Durable event log

2. Control objects

3. Consistent cuts

4. Tentative versus committed state

Consistency model

1. Selective linearizability

2. Session guarantees across surfaces

3. View epochs

4. Suspicion model

Authority model

1. Escrowed rights

2. Labeled artifacts

3. Self-certifying references

Interaction model

Required projections

Protocol model

1. Typed handoffs

2. Session-typed boundaries

3. Human override

Replay and debugging

1. Queryable provenance

2. Counterfactual support

Suggested implementation order

Acceptance criteria

Main warning

Related pages

Graph View

Table of Contents

Backlinks