# Wiki Log

Chronological record of all wiki actions. Append-only. Format:

`## [YYYY-MM-DD] action | subject`

Actions: ingest, update, query, lint, create, archive, delete

When this file exceeds 500 entries, rotate: rename to log-YYYY.md, start fresh.
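The header format and rotation rule can be checked mechanically. The following is a minimal sketch, assuming the log is read as a list of lines. The helper names (`parse_header`, `needs_rotation`, `rotated_name`) are hypothetical and not part of any existing wiki script, and the `## ` prefix is treated as optional so bare headers also parse.

```python
import re
from datetime import date

# Actions listed in the log's format description.
ACTIONS = {"ingest", "update", "query", "lint", "create", "archive", "delete"}
# Assumption: one header per entry, with an optional "## " prefix.
HEADER_RE = re.compile(r"^(?:## )?\[(\d{4}-\d{2}-\d{2})\] (\w+) \| (.+)$")
ROTATE_AFTER = 500  # threshold stated in the format description


def parse_header(line):
    """Return (date, action, subject), or None if the line is not an entry header."""
    match = HEADER_RE.match(line.strip())
    if match is None or match.group(2) not in ACTIONS:
        return None
    return date.fromisoformat(match.group(1)), match.group(2), match.group(3)


def needs_rotation(lines):
    """True once the log holds more than ROTATE_AFTER entries."""
    return sum(parse_header(line) is not None for line in lines) > ROTATE_AFTER


def rotated_name(year):
    """Archive name following the log-YYYY.md rule (which year to use is an assumption)."""
    return f"log-{year}.md"
```

Counting parsed headers rather than raw lines keeps bullet-heavy entries from triggering rotation early.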
## [2026-04-07] create | Wiki initialized
- Domain: Agent harnesses — software infrastructure for reliable autonomous coding agents
- Focus: Codex CLI, Claude Code, Hermes Agent, Gas Town/Gas City, and architectural patterns
- Structure created with SCHEMA.md, index.md, log.md, raw/, entities/, concepts/, comparisons/, queries/
## [2026-04-07] ingest | Bulk ingestion: 15 sources
- Sources:
- yegge-welcome-to-gas-town.md
- yegge-future-of-coding-agents.md
- yegge-gas-town-clown-show-to-v1.md
- yegge-birthday-blog.md
- yegge-welcome-to-the-wasteland.md
- yegge-vibe-maintainer.md
- openai-unlocking-codex-harness.md
- openai-harness-engineering.md
- anthropic-effective-harnesses.md
- anthropic-three-agent-harness-infoq.md
- raw/articles/daily-dose-anatomy-agent-harness.md
- raw/articles/philschmid-agent-harness-2026.md
- raw/articles/gupta-2026-is-agent-harnesses.md
- raw/articles/orami-top-ai-agents-2026.md
- newstack-openclaw-vs-hermes.md
- raw/articles/calv-coding-agents-feb-2026.md
- raw/articles/hn-gas-town-decoded.md
- raw/articles/prius-of-gastown.md
- codex-cli-github.md
- hermes-agent-github.md
- Pages created: 12 (6 entities, 5 concepts, 2 comparisons, 1 query)
## [2026-04-07] update | Wiki recovery after Codex-assisted audit
- Spawned parallel Codex collaborators to audit on-disk state, recover a minimal canonical source set, and propose the missing page set.
- Verified the wiki had become partially ingested: index/log referenced pages and sources that did not yet exist on disk.
- Restored 8 missing raw source summaries:
- Created 14 content pages:
- Repaired SCHEMA.md so content-page rules and meta-file exceptions are explicit, and added meta/schema tags to the taxonomy.
- Rebuilt index.md to match the actual on-disk page set: 14 pages.
- Note: the earlier bulk-ingestion entry reflects intended scope at the time and does not exactly match the previously materialized on-disk state; this recovery entry records the repaired truth.
## [2026-04-07] create | safety-and-permissions
- Added safety-and-permissions.md to normalize a central theme already present across Codex, Hermes, and OpenClaw sources.
- Cross-linked the new concept from codex-cli.md, hermes-agent.md, openclaw.md, and agent-harness-anatomy.md.
- Updated index.md to include the new concept and bump total pages from 14 to 15.
## [2026-04-07] create | evaluation-and-review-loops
- Added evaluation-and-review-loops.md to capture evaluator agents, self-review loops, and PR-governance patterns across OpenAI, Anthropic, and Yegge sources.
- Cross-linked the new concept from claude-code.md, harness-engineering.md, and harness-quality-comparison.md.
- Updated index.md to include the new concept and bump total pages from 15 to 16.
## [2026-04-07] ingest | Current-source refresh from parallel research agents
- Spawned three parallel research agents to expand the corpus across OpenAI/Codex, Anthropic/Claude Code, and persistent-orchestrator runtimes.
- Added 9 raw source summaries for newly researched primary sources:
- Updated major pages to reflect current surface models, memory systems, permissions, and orchestration details:
## [2026-04-07] create | codex-app-server
- Added codex-app-server.md because the App Server is now central enough in OpenAI’s public materials to warrant its own page.
- Updated index.md to include the new entity and bump total pages from 16 to 17.
## [2026-04-07] create | harness-decision-matrix
- Added harness-decision-matrix.md to turn the current qualitative corpus into a weighted design-choice table.
- Cross-linked the new comparison from harness-quality-comparison.md and new-harness-design-notes.md so it participates in the existing synthesis graph.
- Updated index.md to include the new comparison and bump total pages from 17 to 18.
## [2026-04-07] create | automation-and-background-work
- Added automation-and-background-work.md to capture the recurring-work pattern now visible across Codex app automations, Claude Code recurring tasks, and Hermes cron jobs.
- Cross-linked the new concept from context-engineering.md, work-management-primitives.md, safety-and-permissions.md, codex-cli.md, claude-code.md, and hermes-agent.md.
- Updated index.md to include the new concept and bump total pages from 18 to 19.
## [2026-04-07] create | instruction-layering
- Added instruction-layering.md to separate durable scoped guidance from transcript memory, using `AGENTS.md`, `CLAUDE.md`, Hermes memory files, and OpenClaw bootstrap files as the main examples.
- Cross-linked the new concept from context-engineering.md, memory-persistence.md, safety-and-permissions.md, codex-cli.md, claude-code.md, hermes-agent.md, and openclaw.md.
- Updated index.md to include the new concept and bump total pages from 19 to 20.
## [2026-04-07] ingest | orchestration and automation docs refresh
- Added 4 raw source summaries for current official docs:
- Updated synthesis pages to reflect the stronger distinction between subagents, session teams, and background work:
## [2026-04-07] create | orchestration-topologies
- Added orchestration-topologies.md to capture the emerging split between inline subagents, separate-session teams, and full swarm/factory coordination.
- Cross-linked the new concept from agent-harness-anatomy.md, automation-and-background-work.md, claude-code.md, harness-architecture-comparison.md, and new-harness-design-notes.md.
- Updated index.md to include the new concept and bump total pages from 21 to 22.
## [2026-04-08] ingest | arXiv round two on formal semantics for agent harnesses
- Added 6 raw paper summaries:
- arxiv-zhang-2024-formal-methods-trustworthy-ai-agents.md
- arxiv-lahiri-2026-intent-formalization.md
- arxiv-conradie-2016-probabilistic-epistemic-updates.md
- arxiv-kishida-2017-categories-for-dynamic-epistemic-logic.md
- arxiv-wang-2026-structural-operational-semantics-true-concurrency.md
- arxiv-edixhoven-2022-branching-pomsets-for-choreographies.md
- Extended SCHEMA.md with new tags for `formal-methods`, `semantics`, `epistemics`, and `concurrency`.
- Created 4 content pages:
- Updated existing synthesis pages to cross-link the new research layer:
- Updated index.md to include the new pages and bump total pages from 22 to 26.
## [2026-04-08] ingest | formal-core cognition research pass
- Added 6 raw paper summaries:
- Created 3 content pages:
- Updated existing synthesis pages to absorb the new formal-core angle:
- Updated index.md to include the new pages and bump total pages from 26 to 29.
## [2026-04-08] ingest | non-linear harness interface research pass
- Added 9 raw paper summaries:
- arxiv-sarkar-2023-code-relevant-ui.md
- arxiv-angert-2023-spellburst.md
- arxiv-rein-2024-live-programmers.md
- arxiv-krause-glau-2023-code-proximal-dynamic-software-visualization.md
- arxiv-kuhn-2010-spatial-software-visualization-ide.md
- arxiv-li-2024-kishu-time-traveling-notebooks.md
- arxiv-fang-2025-code-data-space-versioning.md
- arxiv-chen-2022-nl2interface.md
- arxiv-krause-glau-2024-code-review-software-city.md
- Created 1 content page:
- Updated existing synthesis pages to absorb the new surface/interface angle:
- Updated index.md to include the new query and bump total pages from 29 to 30.
## [2026-04-08] update | add diagrams to core concept pages
- Added Mermaid diagrams to clarify structure and flow on:
- Chose diagrams that mirror existing prose rather than introducing new claims: a harness component map, a work-object lifecycle, a formalization loop, and a topology comparison sketch.
## [2026-04-08] lint | wiki grooming pass
- Audited the corpus against the current schema: index coverage, source existence, tag taxonomy, outbound wikilinks, and modified-page `updated` dates.
- Fixed the taxonomy violation in gas-city-but-its-just-codex.md by replacing ad hoc tags with existing schema tags.
- Bumped stale `updated` dates on the already edited comparison, concept, and query pages so frontmatter matches the current worktree.
- Added `scripts/lint-wiki.sh` and documented it in SCHEMA.md so future grooming does not depend on chat memory.
## [2026-04-09] ingest | non-hierarchical orchestration research pass
- Added 11 raw paper summaries for non-hierarchical coordination patterns and dolphin-sociality inputs:
- smith-1980-contract-net-protocol.md
- gelernter-1985-generative-communication-in-linda.md
- mcmanus-1991-design-and-analysis-tools-for-concurrent-blackboard-systems.md
- olfati-saber-fax-murray-2007-consensus-and-cooperation-in-networked-multi-agent-systems.md
- shehory-kraus-1998-methods-for-task-allocation-via-agent-coalition-formation.md
- dias-zlot-kalra-stentz-2006-market-based-multirobot-coordination.md
- brambilla-ferrante-birattari-dorigo-2013-swarm-robotics-review.md
- lusseau-conradt-2009-unshared-consensus-decisions-dolphins.md
- bruck-2013-decades-long-social-memory-in-bottlenose-dolphins.md
- evans-krzyszczyk-frere-mann-2021-lifetime-stability-of-social-traits-in-bottlenose-dolphins.md
- king-connor-kruetzen-allen-2021-cooperation-based-concept-formation-in-male-bottlenose-dolphins.md
- Created 3 content pages:
- Updated existing synthesis pages so the new material participates in the main design graph:
- Updated index.md to include the new pages and bump total pages from 30 to 33.
## [2026-04-09] ingest | Memento-Skills paper and companion code
- Added 2 raw source summaries:
- Created 1 content page:
- Updated existing synthesis pages so the new system participates in the main graph:
- Extended SCHEMA.md with a `memento-skills` implementation tag.
- Updated index.md to include the new entity and bump total pages from 33 to 34.
## [2026-04-09] ingest | broader web patterns for non-linear harness interfaces
- Added 14 raw source summaries:
- sketch-n-sketch.md
- glamorous-toolkit-moldable-development-environment.md
- arxiv-omar-2018-live-functional-programming-with-typed-holes.md
- arxiv-doderlein-2026-spacetime-programming.md
- pernosco-omniscient-printf-debugging.md
- plutojl-interactive-programming-environment.md
- vistrails-aosa.md
- spatial-hypertext.md
- langgraph-studio-first-agent-ide.md
- temporal-web-ui.md
- windmill-suspend-approval-prompts.md
- trigger-dev-product.md
- airflow-ui-overview.md
- dagster-scaling-dag-visualization.md
- Created 1 content page:
- Updated existing synthesis pages so the new material participates in the main interface graph:
- Updated index.md to include the new query and bump total pages from 34 to 35.
## [2026-04-09] ingest | under-explored coordination strategies arXiv pass
- Added 14 raw paper summaries:
- arxiv-salemi-2025-llm-blackboard-data-discovery.md
- arxiv-nakamura-2025-terrarium-blackboard-multi-agent-safety.md
- arxiv-pugachev-2025-codecrdt-observation-driven-coordination.md
- arxiv-duetting-2023-mechanism-design-large-language-models.md
- arxiv-zhao-2025-llm-auction-generative-auction.md
- arxiv-li-2025-lacp-agent-communication-protocol.md
- arxiv-ehtesham-2025-survey-agent-interoperability-protocols.md
- arxiv-ben-khaled-2026-g2cp-graph-grounded-communication-protocol.md
- arxiv-zou-2025-blocka2a-secure-verifiable-interoperability.md
- arxiv-miculicich-2025-veriguard-verified-code-generation.md
- arxiv-ye-2025-x-mas-heterogeneous-llms.md
- arxiv-yu-2025-dyntaskmas-dynamic-task-graph.md
- arxiv-wang-2024-battleagentbench.md
- arxiv-sun-2025-collab-overcooked.md
- Created 1 content page:
- Updated existing synthesis pages so the new coordination material participates in the main graph:
- Updated index.md to include the new query and bump total pages from 34 to 35.
## [2026-04-09] update | page-count recovery note
- The content-page total is 36, not 35.
- Two separate query-page ingests landed on 2026-04-09 (web-patterns-for-non-linear-harness-interfaces.md and arxiv-under-explored-coordination-strategies.md), so the later page-bump note undercounted by one.
- index.md now records the repaired total while preserving the earlier historical note as written.
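An undercount like the one repaired here can be caught by replaying the page-bump notes in order. The following is a minimal sketch, assuming the notes keep the "total pages from X to Y" phrasing used in this log. `find_count_gaps` is a hypothetical helper, not part of the existing lint script, and other phrasings such as "index total from X to Y" are not matched.

```python
import re

# Matches the "bump total pages from X to Y" phrasing used in index-update notes.
BUMP_RE = re.compile(r"total pages from (\d+) to (\d+)")


def find_count_gaps(lines):
    """Return (line_number, expected_from, actual_from) for each bump note
    that does not start from the total the previous note ended on."""
    gaps = []
    previous_to = None
    for number, line in enumerate(lines, start=1):
        match = BUMP_RE.search(line)
        if match is None:
            continue
        start, end = int(match.group(1)), int(match.group(2))
        if previous_to is not None and start != previous_to:
            gaps.append((number, previous_to, start))
        previous_to = end
    return gaps


notes = [
    "- Updated index.md to include the new entity and bump total pages from 33 to 34.",
    "- Updated index.md to include the new query and bump total pages from 34 to 35.",
    "- Updated index.md to include the new query and bump total pages from 34 to 35.",
]
print(find_count_gaps(notes))  # [(3, 35, 34)]
```

A reported gap only marks where the recorded totals stop chaining. Deciding which total is correct still requires a recount against the pages on disk.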
## [2026-04-09] ingest | legacy distributed-systems ideas and studio architecture spec
- Added 11 raw source summaries:
- Created 2 content pages:
- Updated existing synthesis pages so the new semantics layer participates in the main design graph:
- Updated index.md to include the new queries and bump total pages from 36 to 38.
## [2026-04-09] lint | maintenance pass
- Ran `scripts/lint-wiki.sh` and a structural audit across the content corpus.
- Fixed invalid YAML frontmatter quoting in:
- Repaired inbound navigation for previously orphaned query pages by cross-linking them from:
- Hardened `scripts/lint-wiki.sh` to parse YAML frontmatter, so colon-bearing titles now fail fast instead of slipping past the shell-only checks.
- Re-ran lint: pass, 36 content pages checked, with no orphan content pages in the follow-up structural scan.
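The failure class behind the quoting fix is a plain (unquoted) YAML scalar containing `: `, which a YAML parser rejects. The entry above says the lint script now parses the frontmatter as YAML. The sketch below is not that parser: it is a stdlib-only heuristic that only illustrates which `title` lines are affected. The function name and sample titles are hypothetical.

```python
def unquoted_colon_titles(frontmatter_lines):
    """Return title lines whose value contains ': ' without surrounding quotes.

    Heuristic stand-in for a real YAML parse; it only inspects `title:` keys.
    """
    bad = []
    for line in frontmatter_lines:
        key, sep, value = line.partition(":")
        if not sep or key.strip() != "title":
            continue
        value = value.strip()
        quoted = len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'"
        if ": " in value and not quoted:
            bad.append(line)
    return bad


sample = [
    'title: "Studio: architecture spec"',  # quoted, acceptable
    "title: Studio: architecture spec",    # unquoted colon, invalid YAML
    "updated: 2026-04-09",
]
print(unquoted_colon_titles(sample))  # ['title: Studio: architecture spec']
```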
## [2026-04-09] update | staged lint polish
- Adjusted `scripts/lint-wiki.sh` so the `updated:` check inspects staged modified content pages when a commit is in flight, rather than only unstaged worktree edits.
- Limited that `updated:` rule to modified tracked pages (`--diff-filter=M`), so newly introduced historical pages can keep their original creation/update dates.
- Guarded the git-diff branch so staged-tree audits outside a git checkout do not emit misleading repo warnings.
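The staged check described in this entry can be sketched in Python. This is an illustration under stated assumptions, not the contents of `scripts/lint-wiki.sh`. The function names are hypothetical, pages are assumed to be `.md` files, and the rule is assumed to compare each page's `updated:` value with the current date.

```python
import subprocess


def staged_modified_pages(repo_dir="."):
    """List staged Markdown files that were modified (not newly added).

    Returns an empty list outside a git checkout, or when git is missing,
    instead of raising or printing repo warnings.
    """
    command = ["git", "diff", "--cached", "--name-only", "--diff-filter=M"]
    try:
        result = subprocess.run(
            command, cwd=repo_dir, capture_output=True, text=True, check=True
        )
    except (subprocess.CalledProcessError, FileNotFoundError):
        return []
    return [path for path in result.stdout.splitlines() if path.endswith(".md")]


def stale_pages(updated_by_page, today):
    """Return pages whose `updated:` value differs from today's date string."""
    return sorted(page for page, updated in updated_by_page.items() if updated != today)
```

Because `--diff-filter=M` excludes added files, a newly introduced page with a historical `updated:` date never reaches `stale_pages`.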
## [2026-04-09] ingest | schema pass and sci-fi audit for moldable operations studio
- Added 11 raw source summaries:
- Created 2 content pages:
- Updated existing synthesis pages so the new concrete and speculative layers participate in the main design graph:
- Updated index.md to include the new queries and bump total pages from 38 to 40.
- Re-ran lint after the fix: pass.
## [2026-04-09] ingest | grounded research for moldable operations studio ideas
- Added 20 raw source summaries:
- yang-wigdor-2014-panelrama.md
- klokmose-et-al-2015-webstrates.md
- bragdon-et-al-2011-code-space.md
- danielsson-alvinius-larsson-2014-common-operating-picture.md
- nandiganahalli-et-al-2014-mode-confusion-detection.md
- honarmand-torrellas-2014-replay-debugging.md
- ko-myers-2009-java-whyline.md
- stasko-gorg-liu-2008-jigsaw.md
- andrews-north-2012-analysts-workspace.md
- groth-streefkerk-2006-provenance-annotation-visual-exploration.md
- amershi-et-al-2015-modeltracker.md
- wexler-et-al-2019-what-if-tool.md
- malkhi-lamport-zhou-2008-stoppable-paxos.md
- rfc-9420-mls-protocol.md
- rivest-lampson-1996-sdsi.md
- birgisson-et-al-2014-macaroons.md
- efstathopoulos-et-al-2005-asbestos.md
- finkelstein-et-al-1992-viewpoints.md
- foster-et-al-2007-bidirectional-tree-transformations.md
- green-karvounarakis-tannen-2007-provenance-semirings.md
- Created 1 content page:
- Updated existing synthesis pages so the grounded-research layer participates in the main design graph:
- Updated index.md to include the new query and bump total pages from 40 to 41.
## [2026-04-09] ingest | wireframe pass for moldable operations studio
- Added 1 asset:
- raw/assets/moldable-operations-studio-wireframes.excalidraw
- Created 1 content page:
- Updated existing synthesis pages so the wireframe layer participates in the main design graph:
- Updated index.md to include the new query; total pages now 42.
- The content-page total is 44, not 42.
- Concurrent additions landed in the corpus while the wireframe pass was in flight, including self-evolving-workflows.md and arxiv-self-evolving-workflows-for-codex-control-plane.md.
- index.md now records the repaired total while preserving the earlier line-level history.
## [2026-04-09] ingest | self-evolving workflows for gas-city-but-its-just-codex
- Added 16 raw source summaries:
- gas-city-but-its-just-codex-repo-2026-04-09.md
- arxiv-zhang-2024-aflow.md
- arxiv-li-2024-autoflow.md
- arxiv-wang-2024-agent-workflow-memory.md
- arxiv-hu-2024-automated-design-of-agentic-systems.md
- arxiv-wang-2025-evoagentx.md
- arxiv-wang-2026-learning-to-compose-agentic-workflow-generation.md
- arxiv-xu-2026-hyevo.md
- arxiv-zhao-2025-a2flow.md
- arxiv-shen-2026-skillfoundry.md
- arxiv-zhang-2026-evoskills.md
- arxiv-wang-2026-skillx.md
- arxiv-ma-2026-scaling-coding-agents-via-atomic-skills.md
- arxiv-rhodes-2026-compiled-memory.md
- arxiv-shinn-2023-reflexion.md
- arxiv-zhao-2023-expel.md
- Created 2 content pages:
- Updated existing pages to connect the new workflow-evolution layer into the main graph:
- Updated index.md to include the new concept and query and bump total pages from 41 to 43.
- Re-ran lint after the ingest: pass, 43 content pages checked.
## [2026-04-09] ingest | broader arxiv sweep for self-evolving workflows
- Added 19 additional raw source summaries to widen the literature map beyond the first direct-hit pass:
- arxiv-qiao-2024-benchmarking-agentic-workflow-generation.md
- arxiv-zheng-2025-mermaidflow.md
- arxiv-wang-2026-query-level-workflows.md
- arxiv-ma-2026-judgeflow.md
- arxiv-wang-2025-dyflow.md
- arxiv-wang-2025-self-improving-agent-skill-library.md
- arxiv-ye-2026-meta-context-engineering.md
- arxiv-du-2025-bottom-up-skill-evolution.md
- arxiv-zhai-2025-agentevolver.md
- arxiv-qian-2025-metaagent.md
- arxiv-banerjee-2026-severa.md
- arxiv-ye-2025-sop-agent.md
- arxiv-li-2025-sopbench.md
- arxiv-huo-2026-atommem.md
- arxiv-liu-2026-graph-of-skills.md
- arxiv-ni-2026-trace2skill.md
- arxiv-zhang-2026-memskill.md
- arxiv-xia-2026-metaclaw.md
- arxiv-xu-2025-robustflow.md
- Expanded the main synthesis pages so the literature map now includes evaluator loops, robustness, retrieval over large skill libraries, and learnable memory/context routines:
- Re-ran lint after the broader sweep: pass, 43 content pages checked.
## [2026-04-09] ingest | rl gyms and executable environments for ai harnesses
- Added 20 raw source summaries for executable benchmark and training environments:
- arxiv-zhou-2023-webarena.md
- arxiv-chezelles-2024-browsergym-ecosystem.md
- arxiv-koh-2024-visualwebarena.md
- arxiv-drouin-2024-workarena.md
- arxiv-boisvert-2024-workarena-plus-plus.md
- arxiv-pan-2024-webcanvas.md
- arxiv-xie-2024-osworld.md
- arxiv-bonatti-2024-windows-agent-arena.md
- arxiv-trivedi-2024-appworld.md
- arxiv-ma-2024-agentboard.md
- arxiv-xi-2024-agentgym.md
- arxiv-nathani-2025-mlgym.md
- arxiv-pan-2024-swe-gym.md
- arxiv-yao-2024-tau-bench.md
- arxiv-chuang-2026-proxy-state-based-evaluation.md
- arxiv-chen-2025-rl-long-horizon-interactive-llm-agents.md
- arxiv-lai-2025-computerrl.md
- arxiv-mehta-2026-enterprisebench-corecraft.md
- arxiv-mialon-2023-gaia.md
- arxiv-yao-2022-webshop.md
- Created 1 content page:
- Updated existing concept pages so environment substrates connect back into the main harness graph:
- Updated index.md to include the new query and corrected the total page count from 43 to 45 after a recount.
- Re-ran lint after the gym sweep: pass, 45 content pages checked.
## [2026-04-10] ingest | neural-native programming via internal-layer interfaces
- Added 35 raw source notes to resolve the user-provided NNPL draft into explicit bibliography entries:
- neural-native-programming-direct-internal-layer-interfaces-draft-2026-04-10.md
- transformer-circuits-mathematical-framework.md
- logit-lens-lesswrong.md
- anthropic-toy-models-of-superposition.md
- anthropic-monosemantic-features.md
- nnsight-docs.md
- baukit-github.md
- doug-github.md
- arxiv-vaswani-2017-attention-is-all-you-need.md
- geva-2021-transformer-feed-forward-layers-are-key-value-memories.md
- arxiv-belrose-2023-tuned-lens.md
- arxiv-hewitt-2019-structural-probe.md
- arxiv-meng-2022-locating-and-editing-factual-associations-in-gpt.md
- arxiv-turner-2023-steering-language-models-with-activation-engineering.md
- arxiv-panickssery-2023-steering-llama-2-via-contrastive-activation-addition.md
- arxiv-dathathri-2019-pplm.md
- arxiv-wu-2024-pyvene.md
- arxiv-li-2023-inference-time-intervention.md
- arxiv-zou-2023-representation-engineering.md
- arxiv-burns-2022-discovering-latent-knowledge.md
- arxiv-hu-2021-lora.md
- arxiv-reed-2015-neural-programmer-interpreters.md
- arxiv-kusner-2017-grammar-variational-autoencoder.md
- arxiv-van-den-oord-2017-neural-discrete-representation-learning.md
- arxiv-hong-2020-latent-programmer.md
- arxiv-macfarlane-2024-searching-latent-program-spaces.md
- kanerva-2009-hyperdimensional-computing.md
- plate-1995-holographic-reduced-representations.md
- arxiv-gayler-2004-vector-symbolic-architectures.md
- arxiv-weiss-2021-thinking-like-transformers.md
- arxiv-lindner-2023-tracr.md
- arxiv-tomkins-flanagan-2025-differentiable-vector-symbolic-types.md
- arxiv-chen-2021-evaluating-llms-trained-on-code.md
- arxiv-austin-2021-program-synthesis-with-large-language-models.md
- arxiv-hendrycks-2021-measuring-coding-challenge-competence-with-apps.md
- Extended SCHEMA.md with `mechanistic-interpretability` and `program-synthesis` tags so the new material can be filed without abusing older categories.
- Created 2 content pages:
- Updated existing synthesis pages so the new topic participates in the current harness graph:
- Updated navigation/meta files:
## [2026-04-10] lint | neural-native programming ingest maintenance pass
- Ran `scripts/lint-wiki.sh` after the ingest.
- Initial lint failed because several already-modified tracked pages still carried `updated: 2026-04-09` even though the worktree date had advanced.
- Bumped `updated` to 2026-04-10 on the following pages to restore schema consistency:
- Re-ran lint: pass, 47 content pages checked.
## [2026-04-10] create | neural-native programming research program
- Added neural-native-programming-research-program.md as a tighter staged plan with explicit promotion gates, kill criteria, benchmark order, and first-quarter deliverables.
- Cross-linked the new plan page from:
- Updated index.md to include the new query and bump total pages from 47 to 48.
## [2026-04-10] lint | neural-native research-program follow-up
- Re-ran `scripts/lint-wiki.sh` after adding the research-program page.
- Result: pass, 48 content pages checked.
## [2026-04-10] ingest | rl gym entity pages and atropos fit analysis
- Added 2 repo-grounded raw source summaries for local harness integration state:
- Created 21 entity pages for the executable-environment and gym layer:
- agentboard.md
- agentgym.md
- appworld.md
- atropos.md
- browsergym.md
- computer-rl.md
- enterprisebench-corecraft.md
- gaia.md
- mlgym.md
- osworld.md
- proxy-state-based-evaluation.md
- sopbench.md
- swe-gym.md
- tau-bench.md
- visualwebarena.md
- webarena.md
- webcanvas.md
- webshop.md
- windows-agent-arena.md
- workarena.md
- workarena-plus-plus.md
- Created 1 synthesis page:
- Updated the main gym synthesis and surrounding concept pages so the new environment layer participates in the harness graph:
- Recounted content pages and updated index total from 48 to 70.
- Re-ran lint after the entity-page pass: pass, 70 content pages checked.
## [2026-04-10] ingest | concrete another-harness atropos schema
- Added 1 raw source summary for the new repo-native Atropos sidecar design:
- Created 1 synthesis page:
- Updated existing pages so the concrete schema is linked back into the gym and fit-analysis graph:
- Updated index.md to include the new query and bump total pages from 70 to 71.
- Re-ran lint after the schema sync: pass, 71 content pages checked.
## [2026-04-10] update | public repo rename and github pages publishing layer
- Renamed the public GitHub repository from `wiki` to `agent-harness-wiki`.
- Added a lightweight MkDocs publishing layer with:
- mkdocs.yml
- docs-requirements.txt
- scripts/prepare_site_docs.py
- .github/workflows/pages.yml
- README.md updates for local preview and published-site usage
- .gitignore rules for generated site directories and local virtualenvs
- Enabled GitHub Pages in workflow-deployment mode and updated the Pages workflow to current action major versions to avoid stale Node 20 warnings.
- Verified the docs build locally after preparing the generated docs tree.
- Re-ran lint after the publishing-layer pass: pass, 70 content pages checked.
## [2026-04-10] update | another-harness work-item closure environment prototype
- Added 1 raw source summary for the first live executable environment slice in another-harness:
- Created 1 synthesis page:
- Updated existing pages so the executable prototype is linked back into the Atropos fit/schema/gym graph:
- Updated index.md to include the new query and bump total pages from 71 to 72.
- Re-ran lint after the prototype sync: pass.
## [2026-04-10] update | seeded user evolution log
- Added top-level meta file:
- USER_EVOLUTION_LOG.md
- Designed a durable entry format that separates explicit observations from inferred trends, records confidence, and preserves evidence, uncertainty, and what-to-watch-next.
- Seeded initial entries for 2026-04-05 through 2026-04-10 covering:
- tightening acceptance and review standards
- shift from compatibility analysis toward switchable Hermes profiles
- movement toward non-linear / moldable operations studio interfaces
- clearer separation of formulas, skills, memory, and evaluation in self-evolving workflow research
- explicit agency-preserving collaboration norms
## [2026-04-10] ingest | multiplayer harness and p2p network research pass
- Added 7 raw source summaries for the local-first, peer-to-peer, interoperability, and multiplayer-surface layer:
- Created 1 synthesis page:
- Updated existing synthesis pages so the new multiplayer/federated direction is linked into the harness graph:
- Updated index.md to include the new query and bump total pages from 72 to 73.
- Re-ran lint after the research pass: pass, 73 content pages checked.
## [2026-04-10] create | multiplayer harness build strategy
- Created 1 synthesis page answering the implementation-order question directly:
- Updated the multiplayer research synthesis to point at the new implementation-focused page:
- Updated index.md to include the new query and bump total pages from 73 to 74.
- Re-ran lint after the build-strategy pass: pass.
## [2026-04-10] create | multiplayer harness inspection guide
- Created 1 synthesis page for deciding what to inspect first when shaping the design:
- Updated the build-strategy page to point at the new inspection guide:
- Updated index.md to include the new query and bump total pages from 74 to 75.
- Re-ran lint after the inspection-guide pass: pass.
## [2026-04-10] update | another-harness evaluator-discipline environment prototype
- Added 1 raw source summary for the second live executable environment slice in another-harness:
- Created 1 synthesis page:
- Updated existing pages so the evaluator prototype is linked back into the Atropos fit/schema/gym graph:
- Updated index.md to include the new query and bump total pages from 75 to 76.
- Re-ran lint after the evaluator-prototype sync: pass.
## [2026-04-10] ingest | sovereignty and observed-goals deep-dive for multiplayer harnesses
- Added 12 raw source summaries spanning portable attestations, trust management, proof-carrying authorization, provenance, commitments, governance, goal recognition, and socially translucent collaboration:
- w3c-verifiable-credentials-data-model-v2.md
- li-mitchell-winsborough-role-based-trust-management-framework.md
- bauer-schneider-felten-appel-proof-carrying-authorization.md
- claimchain.md
- torres-arias-2019-in-toto.md
- w3c-prov-dm.md
- medina-mora-winograd-flores-flores-action-workflow-approach.md
- fornara-colombetti-commitment-based-agent-communication-language.md
- singh-chopra-computational-governance-violable-contracts.md
- keren-gal-karpas-goal-recognition-design.md
- ramirez-geffner-probabilistic-plan-recognition.md
- erickson-et-al-socially-translucent-systems.md
- Added 2 synthesis pages from the multi-round deep-dive:
- Updated existing synthesis/navigation pages to link the new sovereignty track into the harness graph:
- Recounted content pages and updated index total from 76 to 78.
- Note: the commitment/governance synthesis page was materialized during the deeper research pass itself; this entry records it explicitly so the log matches the on-disk truth.
## [2026-04-10] create | sovereign identity and observed-goals schema pass
- Created 1 concrete schema page layering sovereignty and non-scalar trust into the control plane:
- Updated existing synthesis pages so the new schema pass is linked into the research graph:
- Updated index.md to include the new query and bump total pages from 78 to 79.
- Re-ran lint after the schema pass: pass, 79 content pages checked.
## [2026-04-10] create | node card and minimum adapter contract
- Created 1 concrete federation-boundary page:
- Updated existing synthesis/schema pages so the adapter contract is linked into the multiplayer and sovereignty graph:
- Updated index.md to include the new query and bump total pages from 79 to 80.
- Re-ran lint after the contract pass: pass, 80 content pages checked.
## [2026-04-11] create | prompt optimization research and dspy follow-ups
- Created 1 raw research batch note capturing arXiv/OpenAlex-grounded bibliography and notes:
- Created 1 new entity page for the central framework and 1 synthesis/query page:
- Updated schema and navigation to include DSPy in the implementation taxonomy and add both new pages to the index:
- Intended focus: RL prompt optimization, prompt-program systems, and the early research line following DSPy.
## [2026-04-11] create | prompt optimization timeline and harness lessons
- Created 1 follow-on synthesis page that turns the prompt-optimization survey into a chronological map and an agent-harness design memo:
- Updated related prompt-program pages and navigation so the new synthesis is linked into the graph:
- Updated index total from 83 to 84.
## [2026-04-11] create | prompt-program deployment open questions
- Created 1 new synthesis page focused on deployment-time research gaps for prompt programs and writable instruction artifacts:
- Updated navigation to include the new page in the query index:
- Focus: compile-time vs runtime adaptation, promotion and rollback, safety constraints, drift, human oversight, memory substrates, and long-lived harness packaging.
- Updated index total from 84 to 85.
## [2026-04-11] create | prompt optimization open questions fan-out
- Created 2 raw research batch notes for the fan-out clusters:
- Created 4 query/synthesis pages covering the umbrella map and the three main open-question clusters:
- Updated navigation and page metadata to index the new pages and normalize frontmatter/tags for lint correctness:
- Updated index total from 85 to 88.
## [2026-04-11] update | another-harness run-history decision
- Added 1 raw repository-synthesis note:
- Updated the main Atropos fit synthesis page so it now records the repo’s explicit storage answer:
- Updated index.md so the query summary reflects the new run-history stance while keeping total content pages at 88.
- Captured the repo’s current decision that `state/runs/` remains non-canonical and that historically important attempts should live as derived evidence bundles under `docs/plans/artifacts/` until real multi-attempt pressure says otherwise.
## [2026-04-11] lint | wiki pass after another-harness run-history sync
- Ran `scripts/lint-wiki.sh` after the run-history decision sync.
- Confirmed the updated query/index metadata and the new raw note pass the current wiki structural checks.
## [2026-04-11] create | self-evolving workflow tool coverage
- aflow.md
- agent-workflow-memory.md
- agentevolver.md
- atommem.md
- autoflow.md
- compiled-memory.md
- dyflow.md
- evoskills.md
- expel.md
- graph-of-skills.md
- judgeflow.md
- memskill.md
- mermaidflow.md
- metaagent.md
- metaclaw.md
- reflexion.md
- robustflow.md
- sage.md
- severa.md
- skillfoundry.md
- skillx.md
- sop-agent.md
- trace2skill.md
- worfbench.md
- worfeval.md
- Updated the query page to add the missing raw sources for Learning to Compose and AgentEvolver, and replaced plain-text system names with wikilinks where appropriate:
- Updated the core concept page so its main workflow, evaluation, memory, and skill-system examples now resolve to concrete entity pages:
- Updated index.md to include the new entities and bump total pages from 88 to 113.
[2026-04-11] lint | self-evolving workflow tool coverage pass
- Ran `scripts/lint-wiki.sh` after adding the workflow, memory, skill-library, and self-evolving-agent entity pages.
- Confirmed index coverage and wikilink resolution for the new entity set: pass, 113 content pages checked.
[2026-04-11] create | prompt optimization tool coverage
- Created 5 raw paper notes so the main prompt-program optimizer systems now have first-class source stubs:
- Created 5 entity pages so the prompt-optimization literature has concrete reusable nodes rather than only query-page mentions:
- Updated the central prompt-program synthesis pages so the new systems resolve as concrete entities and the DSPy follow-up line is less foggy:
- Normalized the index total to 119 after adding the new prompt-optimizer entity set and reconciling prompt-query coverage.
[2026-04-11] create | research on the ten prompt-optimization questions
- Added 1 supporting raw research note for extra cross-cutting evaluator / programming-language / release-engineering references:
- Created 1 question-by-question synthesis page covering the ten cross-cutting open questions with anchor papers and concise research reads:
- Updated related prompt-program navigation so the new research map is reachable from the umbrella open-questions page and indexed in the query catalog:
- Updated index total from 118 to 119.
[2026-04-11] create | another-harness model-docs drift checker
- Added 1 raw repository-synthesis note:
- Created 1 query page capturing why the repo’s first Lean-backed drift fence targets the attempt-vs-stream grounding distinction:
- Updated navigation so the new page is indexed alongside the other another-harness slices:
- Updated index total from 119 to 120.
[2026-04-11] lint | wiki pass after model-docs drift sync
- Ran `scripts/lint-wiki.sh` after the 0032 sync.
- Confirmed the new raw note, query page, and index update pass the current wiki structural checks.
[2026-04-11] create | another-harness resume-recover environment
- Added 1 raw repository-synthesis note:
- Created 1 query page capturing the repo’s first executable recovery family:
- Updated navigation so the new page is indexed alongside the other another-harness slices:
- Updated index total from 129 to 130.
[2026-04-11] lint | wiki pass after resume-recover sync
- Ran `scripts/lint-wiki.sh` after the 0036 sync.
- Confirmed the new raw note, query page, and index update pass the current wiki structural checks.
[2026-04-11] create | three more prompt-optimizer branches
- Added 5 raw paper notes for the runtime-adaptation, RL-over-program, and planning/evolution optimizer branches:
- Created 5 entity pages so these optimizer families now have first-class wiki nodes:
- Created 1 synthesis page that turns the newly covered systems into a regime map for harness design:
- Updated the central DSPy and prompt-optimization pages so the new systems resolve as concrete entities and the optimizer-family map is more explicit:
- dspy.md
- rlprompt.md
- prompt-optimization-and-dspy-follow-ups.md
- prompt-optimization-timeline-and-harness-lessons.md
- prompt-program-representation-and-optimizer-open-questions.md
- prompt-program-deployment-open-questions.md
- research-on-open-questions-in-prompt-optimization-and-language-programs.md
- index.md
- Updated index total from 120 to 126.
[2026-04-11] update | quartz publishing migration
- Replaced the GitHub Pages publishing path from MkDocs with Quartz 4 while keeping the repository-root markdown corpus canonical.
- Added `scripts/prepare_quartz_content.py` to materialize the wiki into `.quartz-site/content/`, synthesize titles for root meta pages, and sanitize frontmatter for Quartz consumption.
- Vendored Quartz under `.quartz-site/` from upstream commit `59b5807`, customized the local Quartz config/layout, and switched `.github/workflows/pages.yml` to a Node 22 + Quartz build.
- Verified the Quartz build locally; the explorer/sidebar navigation replaces the brittle top dropdown navbar, and Mermaid diagrams render with built-in expand/zoom controls.
[2026-04-11] create | repo architecture plans from the ten prompt-program questions
- Created 2 repo-grounded plan documents that translate the ten cross-cutting prompt-program questions into concrete architecture stances and sequencing ladders:
- `/Users/ericfode/src/another-harness/docs/plans/2026-04-11-another-harness-prompt-program-architecture-plan.md`
- `/Users/ericfode/src/gas-city-but-its-just-codex/docs/plans/2026-04-11-gascity-prompt-program-architecture-plan.md`
- Added 1 wiki synthesis page summarizing both repo plans and their main divergence in design center:
- Updated prompt-program navigation so the new cross-repo plan page is indexed from the query catalog and linked from the umbrella question map:
- Updated index total from 126 to 127.
[2026-04-11] create | context visualization for harnesses
- Added 1 raw research batch note covering provenance/trust visualization, context-assembly UX, and attention/attribution caveats:
- Created 2 synthesis pages: one for the overall context/provenance/trust surface and one for the narrower attention/attribution question:
- Updated nearby context/provenance pages so the new research is linked into the existing moldable-operations and context-engineering graph:
- Updated index total from 127 to 129.
[2026-04-12] update | repo recovery and scratch hygiene
- Recovered the stranded local wiki additions from the older `spike/quartz-nav-rescue` worktree onto `main`, including the resume-recover and context-visualization query pages, their supporting raw notes, and one additional workflow-visualization research note.
- Updated context-engineering.md and web-patterns-for-non-linear-harness-interfaces.md so the new research is connected into the existing graph, and normalized index.md to 130 pages.
- Added repo ignore rules for local scratch directories and moved the generated Quartz content tree to local git excludes so builds stop cluttering status without hiding the content from Quartz itself.
[2026-04-12] lint | recovery pass
- Ran `scripts/lint-wiki.sh` after the recovery merge and ignore cleanup.
- Result: pass, 130 content pages checked.
[2026-04-14] ingest | mathcode
- Added 1 raw source summary for the MathCode GitHub repository:
- Added 1 entity page for the mathematical coding agent itself:
- Updated formal-methods synthesis pages so MathCode is linked as a concrete harness example:
- Extended the schema tag taxonomy with `mathcode` and updated navigation:
- Updated index total from 130 to 131.
[2026-04-14] lint | mathcode ingest
- Ran `scripts/lint-wiki.sh` after adding MathCode to the wiki.
- Result: pass, 131 content pages checked.
[2026-04-14] update | wiki maintenance pass
- Repaired dangling body links in dyflow.md, how-to-build-a-multiplayer-harness-network.md, and multiplayer-agent-harnesses-and-p2p-networks.md so they now resolve to real notes or plain prose rather than aspirational page names.
- Added inbound graph links so the previously orphaned pages sybil-resistance-and-local-trust.md, another-harness-model-docs-drift-checker.md, another-harness-resume-recover-environment.md, and prompt-program-architecture-plans-for-another-harness-and-gas-city.md are now reattached to nearby synthesis pages.
- Updated touched query/entity pages so their `updated` frontmatter matches the repair pass.
[2026-04-14] lint | wiki maintenance pass
- Ran `scripts/lint-wiki.sh` after the repair pass.
- Ran an additional cross-link audit for broken wikilinks and orphaned content pages.
- Result: pass, 131 content pages checked, 0 broken wikilinks, 0 orphan pages.
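
A minimal sketch of what a cross-link audit like the one above might check. The actual audit script is not recorded in this log, so `audit_links`, the `pages` mapping of slug to markdown body, and the wikilink regex are assumptions for illustration only; a caller would probably exclude meta pages such as index.md from the orphan set.

```python
import re

# Matches [[target]], [[target|Label]], and [[target#heading]] (assumed wikilink syntax).
WIKILINK = re.compile(r"\[\[([^\]|#]+)(?:#[^\]|]*)?(?:\|[^\]]*)?\]\]")


def audit_links(pages: dict[str, str]) -> tuple[dict[str, set[str]], set[str]]:
    """Return (broken link targets per page, orphan slugs).

    `pages` maps a page slug to its markdown body (hypothetical input shape).
    A link is broken when its target slug is not a known page; a page is an
    orphan when no other page links to it.
    """
    inbound = {slug: 0 for slug in pages}
    broken: dict[str, set[str]] = {}
    for slug, body in pages.items():
        for match in WIKILINK.finditer(body):
            target = match.group(1).strip()
            if target in pages:
                if target != slug:  # self-links do not count as inbound
                    inbound[target] += 1
            else:
                broken.setdefault(slug, set()).add(target)
    orphans = {slug for slug, count in inbound.items() if count == 0}
    return broken, orphans
```

A pass in the sense of the result line above would correspond to both returned collections being empty.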
[2026-04-15] lint | pre-commit wiki rerun
- Re-ran `scripts/lint-wiki.sh` on the current wiki worktree before commit and push.
- Normalized `updated` frontmatter on the previously touched query/entity/concept pages from `2026-04-14` to `2026-04-15` so the modified-page freshness rule reflects the actual edit date.
- Result: pass, 131 content pages checked.
[2026-04-15] update | gas-city-but-its-just-codex current-structure deep dive
- Added 1 fresh raw repository snapshot grounded in the live worktree rather than only the last clean commit:
- Rewrote gas-city-but-its-just-codex.md from an early-spike design note into an up-to-date structure tour covering the ledger, formulas, app service, control surfaces, operator daemon, sidecar/UI path, operational center of gravity, and current structural tensions.
- Updated index.md so the query summary and index header date reflect the refreshed deep dive.
[2026-04-15] lint | gas-city-but-its-just-codex deep dive refresh
- Ran `scripts/lint-wiki.sh` after refreshing the raw note, query page, and index/log metadata.
- Also re-checked broken wikilinks and orphaned content pages after the rewrite.
- Result: pass, 131 content pages checked, 0 broken wikilinks, 0 orphan pages.
[2026-04-15] update | gas-city-but-its-just-codex documentation rendering pass
- Added 1 raw documentation-and-operations bundle so the repo’s architecture, usage, evidence, and implementation notes are preserved as a distinct wiki source layer:
- Added 3 focused query pages so the repo’s documentation and interesting bits are rendered as separate, navigable wiki surfaces instead of being trapped in one large catch-all note:
- Updated gas-city-but-its-just-codex.md to point at the new focused subpages and updated index.md to 134 content pages.
[2026-04-15] lint | gas-city documentation rendering pass
- Ran `scripts/lint-wiki.sh` after adding the new raw note, three focused query pages, and the navigation updates.
- Also re-checked broken wikilinks and orphaned content pages after the rendering pass.
- Result: pass, 134 content pages checked, 0 broken wikilinks, 0 orphan pages.
[2026-04-21] update | nightly src projects desk kickoff
- Added 1 raw survey note grounding a public-safe snapshot of the local `/Users/ericfode/src` tree:
- Added 1 new query page for the first nightly project-news brief:
- Added a generated editorial illustration under `queries/news-assets/` and wired it into the new desk page.
- Updated index.md with a new `News` section and a corresponding query listing while bumping the header date/page count.
[2026-04-21] update | nightly journalist loop automation
- Created a nightly Hermes cron job to resurvey `/Users/ericfode/src`, refresh the public-safe project desk, generate fresh editorial art, run wiki lint, commit the new desk update, and push the wiki repository.
- The loop is configured to use 10 survey lanes over the `src/` tree, with recursive subagent fanout requested where the runtime exposes it and a documented parallel-inspection fallback when it does not.
[2026-04-21] lint | nightly src projects desk kickoff
- Ran `scripts/lint-wiki.sh` after adding the raw survey note, the new desk page, the illustration asset, and the index/log updates.
- Result: pass, 136 content pages checked.
[2026-05-01] create | software verification/testing environment project anchor
- Created software-verification-testing-environment-research-program.md as the durable wiki anchor for the new Hermes Kanban research project.
- Created a six-task Kanban DAG for parallel research, synthesis, wiki writing, and final lint repair:
- `t_7a29256d` formal verification foundations
- `t_77b7318b` testing and evidence primitives
- `t_2cf92f9b` agent-facing verifier/testing harness architecture
- `t_8b1eb08c` synthesis fan-in
- `t_78213be9` wiki writing
- `t_176aa99f` final lint gate
- Added inbound links from formal-methods-for-agent-harnesses.md, evaluation-and-review-loops.md, and work-management-primitives.md.
- Updated index.md to 137 content pages.
[2026-05-01] lint | software verification/testing environment project anchor
- Ran `scripts/lint-wiki.sh` after creating the project anchor page and inbound links.
- Result: pass, 137 content pages checked.
[2026-05-01] create | agent-facing-verifier-environment-architecture
- Task: `t_2cf92f9b`, research lane C, agent-facing verifier/test harness architecture.
- Sources consulted: agent-harness-anatomy.md, evaluation-and-review-loops.md, work-management-primitives.md, automation-and-background-work.md, formal-methods-for-agent-harnesses.md, rl-gyms-and-executable-environments-for-ai-harnesses.md, software-verification-testing-environment-research-program.md, another-harness-work-item-closure-environment.md, another-harness-evaluator-discipline-environment.md, another-harness-atropos-environment-schema.md.
- External source ingested: code-tracer-towards-traceable-agent-states.md — traceable agent states, hierarchical trace trees, reflective replay, and failure-onset localization.
- Page created: agent-facing-verifier-environment-architecture.md
- Candidate primitives: specification surface, evidence ledger, promotion gate, regression memory, trace tree node.
- Object model: versioned, hash-addressed append-only ledger and regression memory; mutable promotion gates with versioned transitions; write-once trace forest.
- State machine: proposed → tested → reviewed → proved | rejected | waived → accepted, with role-separation invariants.
- Six open design questions about specification authority, trace compression, formal lane depth, anti-gaming evidence, waiver governance, and regression decay.
- Pages updated: software-verification-testing-environment-research-program.md, formal-methods-for-agent-harnesses.md, evaluation-and-review-loops.md, work-management-primitives.md.
- index.md updated: total pages 137 → 138.
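
The state machine recorded in this entry can be sketched roughly as follows. This is one reading of the arrow notation, not the page's actual object model: it assumes `proved` and `waived` lead to `accepted` while `rejected` is terminal, and it assumes the role-separation invariant means an author may not review or waive their own claim. `Claim`, `TRANSITIONS`, and `advance` are hypothetical names.

```python
from dataclasses import dataclass, field

# Allowed transitions under the assumed reading of
# proposed → tested → reviewed → proved | rejected | waived → accepted.
TRANSITIONS = {
    "proposed": {"tested"},
    "tested": {"reviewed"},
    "reviewed": {"proved", "rejected", "waived"},
    "proved": {"accepted"},
    "waived": {"accepted"},
    "rejected": set(),   # assumed terminal
    "accepted": set(),   # terminal
}


@dataclass
class Claim:
    author: str
    state: str = "proposed"
    history: list = field(default_factory=list)

    def advance(self, new_state: str, actor: str) -> None:
        """Move to new_state, enforcing transition legality and role separation."""
        if new_state not in TRANSITIONS[self.state]:
            raise ValueError(f"illegal transition {self.state} -> {new_state}")
        # Assumed role-separation invariant: no self-review and no self-waiver.
        if new_state in {"reviewed", "waived"} and actor == self.author:
            raise PermissionError("author may not review or waive their own claim")
        self.history.append((self.state, new_state, actor))
        self.state = new_state
```

Keeping `history` as an append-only list of `(from, to, actor)` tuples mirrors the entry's emphasis on versioned transitions, though the real ledger format may differ.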
[2026-05-01] update | agent-facing verifier architecture — t_2cf92f9b repairs
- Fixed broken `[[app-world]]` links (3 occurrences) → `[[appworld]]` in agent-facing-verifier-environment-architecture.md.
- Replaced invalid `[[code-tracer-towards-traceable-agent-states|CodeTracer]]` wikilinks with plain text citations to the raw CodeTracer note.
- Added `[[agent-facing-verifier-environment-architecture]]` inbound links to automation-and-background-work.md and rl-gyms-and-executable-environments-for-ai-harnesses.md.
- Ran `scripts/lint-wiki.sh`: pass, 138 content pages checked.
[2026-05-01] ingest | software verification/testing environment research synthesis
- Created raw sources:
- arxiv-2602.18307-verisoftbench.md
- arxiv-2604.16584-leetproof.md
- arxiv-2604.22601-nl2vc-60.md
- arxiv-2511.00125-infer-dafny-assertions.md
- arxiv-2604.12172-cobalt-tla.md
- arxiv-2604.08633-icepick-glacier.md
- arxiv-2505.13452-autobug.md
- arxiv-2511.14435-watchdogs-oracles.md
- arxiv-2506.18315-property-generated-solver.md
- arxiv-2307.04346-llm-pbt.md
- arxiv-2406.09843-llm-mutation-testing.md
- arxiv-2308.16557-llm-test-generation-mutation.md
- arxiv-2504.05289-flare.md
- arxiv-2310.15991-whitefox.md
- arxiv-2601.12274-hybrid-concolic-llm.md
- arxiv-2504.17542-cottontail.md
- arxiv-2405.00648-drowzee.md
- arxiv-2511.02108-metamorphic-testing-llm-nlp.md
- Created entities:
- Updated concepts:
- Updated queries:
[2026-05-01] lint | verify: lint and repair wiki after verification/testing project update
- Ran `scripts/lint-wiki.sh` and a programmatic deep audit (Python scan of all 140 content pages).
- Discovered: 2 orphan pages, 141 broken aspirational wikilinks across 9 query pages.
- Repairs applied:
- Added inbound links to nightly-src-projects-desk-2026-04-21 from work-management-primitives and to codex-app-server-provider-vs-runtime-bridge from hermes-agent.
- Demoted 141 broken wikilinks to plain text per SCHEMA.md wishful-link rule in:
- sci-fi-audit-for-moldable-operations-studio
- grounding-moldable-operations-studio-ideas-in-real-research
- high-impact-artifacts-for-multiplayer-harness-design
- how-to-build-a-multiplayer-harness-network
- legacy-distributed-systems-ideas-for-moldable-operations-studio
- moldable-operations-studio-architecture-spec
- multiplayer-agent-harnesses-and-p2p-networks
- sovereignty-and-observed-goals-ledgers-for-multiplayer-harnesses
- web-patterns-for-non-linear-harness-interfaces
- Bumped `updated` dates on all modified pages to 2026-05-01.
- Final lint: pass, 140 content pages checked, 0 errors.
[2026-05-02] query | nightly src projects desk
- Created raw survey note nightly-src-projects-desk-2026-05-02.md and public query page nightly-src-projects-desk-2026-05-02.md.
- Generated local editorial SVG illustration at `queries/news-assets/2026-05-02-project-desk-hero.svg` after the configured image backend reported `FAL_KEY` was unavailable; no fake screenshots were used.
- Survey automation status: exactly 10 top-level Hermes survey lanes covered all 27 top-level directories under `/Users/ericfode/src`; each lane reported deeper three-way delegation was available and used. A first all-at-once 10-task batch was rejected by the runtime concurrency cap, so the lanes ran as two batches of five.
- Public-safety filter: project-specific details were held back for 10 surveyed directories because of sensitive identity/reputational material, mixed creative material needing curation, credential/private deployment signals, internal supervisor/evaluator material, scratch wrappers, empty directories, or skeletal non-project evidence.
- Updated index.md News and Queries sections for the new nightly desk page; working-tree page count moved 140 → 141.
- Lint result: initial `scripts/lint-wiki.sh` run failed on 21 pre-existing dirty content pages whose `updated` fields were still 2026-05-01 under a 2026-05-02 clock. After staging only the intended nightly desk content page so the linter used the cached content-page set for modified-page freshness, `scripts/lint-wiki.sh` passed: 141 content pages checked.
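
For orientation, the modified-page freshness rule behind this failure can be approximated as below. This is a sketch under assumptions, not the logic of `scripts/lint-wiki.sh`: it assumes each modified page carries an unquoted `updated: YYYY-MM-DD` frontmatter line and that the rule requires it to equal the current date. `stale_pages` and its input shape are hypothetical.

```python
import re
from datetime import date

# Assumed frontmatter shape: a line such as "updated: 2026-05-02" (unquoted).
UPDATED = re.compile(r"^updated:\s*(\d{4}-\d{2}-\d{2})\s*$", re.MULTILINE)


def stale_pages(modified: dict[str, str], today: date) -> list[str]:
    """Return paths of modified pages whose `updated` field is missing or not today.

    `modified` maps a page path to its full text (hypothetical input shape).
    """
    stale = []
    for path, text in sorted(modified.items()):
        match = UPDATED.search(text)
        if match is None or date.fromisoformat(match.group(1)) != today:
            stale.append(path)
    return stale
```

Under this reading, which pages count as modified decides the outcome, which is consistent with the staged-cache workaround described in the lint result above.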
[2026-05-02] lint | verification/testing project freshness repair
- Re-ran `scripts/lint-wiki.sh` after the Kanban verification/testing project and nightly desk work had both touched the tree under the 2026-05-02 clock.
- Bumped modified content-page `updated:` fields from 2026-05-01 to 2026-05-02 so the live linter no longer depends on staged-cache behavior.
- Preserved Kanban synthesis handoffs as raw transcripts:
- Final live lint: pass, 141 content pages checked.
[2026-05-02] query | specification elaboration naming frame
- Added raw research note specification-elaboration-naming-research.md covering knowledge representation, formal concept analysis, Problem Frames, Moldable Development, and knowledge maps as naming constraints.
- Created specification-elaboration-naming-frame.md.
- Recommendation recorded: replace “spec image” with Intent Lattice as the working product frame, with `intent model`, `intent graph`, `viability critique`, `refinement packet`, and `adapter` as internal vocabulary.
- Updated index.md to include the new query page and bump total pages from 141 to 142.
[2026-05-02] query update | specification elaboration naming frame
- Incorporated the user’s clarification that the project is about making specifications more verifiable, not imaging them.
- Added NASA requirements-verifiability guidance, SpecRL negative-test reward framing, and LTL specification-mining context to specification-elaboration-naming-research.md.
- Revised specification-elaboration-naming-frame.md: Claim Lattice is now the working product/kernel name, and collaborative spec verifiability environment is the category phrase.
- Added the future RL-environment boundary: observation, action, actor, reward, done, and safety invariant.
- Added the human-play boundary: the human is a first-class actor whose refinements, overrides, rejections, and rationale are durable environment transitions.
- Updated index.md to reflect the revised frame.
[2026-05-02] query update | process-first CLI naming
- Incorporated the user’s correction that Claim Lattice names the internal model, not the process, and is therefore a poor CLI name.
- Revised specification-elaboration-naming-frame.md: Spec Gym / `specgym` is now the recommended CLI/product name; Claim Lattice remains the internal model.
- Added command vocabulary: `play`, `score`, `step`, `rollout`, and `export`.
- Added practical npm registry checks in specification-elaboration-naming-research.md: `specgym`, `spec-gym`, `speclab`, `spec-lab`, `proveout`, `spec-assay`, and `spec-trial` returned 404; `specforge`, `assay`, `sharpen`, and `spec-check` already exist.
- Updated index.md to use the `specgym` frame.
[2026-05-02] decision | Spec Gym accepted
- User accepted Spec Gym as the name.
- Updated specification-elaboration-naming-frame.md and specification-elaboration-naming-research.md from recommendation language to accepted-name language.
- Corrected CLI examples to use `--projection` for optional adapters; `ClaimLattice.lean` is a default core artifact rather than a Lean adapter export.
[2026-05-02] implementation | Spec Gym repo vocabulary migration
- Propagated the accepted Spec Gym / `specgym` name into `/Users/ericfode/Documents/New project 4`.
- The top-level repo spec now describes a collaborative spec verifiability environment, not a spec-imaging tool.
- Primary artifacts are `claim-lattice.json`, `viability-critique.md`, `ClaimLattice.lean`, and `refinement-packet.md`.
- The generated claim lattice now includes environment fields for observation, action, actor, reward signals, done states, safety invariant, and score.
- Linear project renamed to Spec Gym; FOD-30 marked done after local verification.
[2026-05-02] ingest | 0xSero self-distillation video
- Resolved the user-supplied X post from `@0xSero` to the embedded YouTube source “Why Self-Distillation Is Taking Over LLM Post-Training (w/ the Researchers Behind It)”.
- Added raw source note 0xsero-self-distillation-video-2026-05-02.md and raw transcript deep-learning-yacine-self-distillation-transcript-2026-04-28.md.
- Created on-policy-self-distillation.md as the durable concept page for SDPO/SDFT-style feedback-conditioned self-distillation.
- Updated memory-persistence.md, self-evolving-workflows.md, evaluation-and-review-loops.md, harness-engineering.md, context-engineering.md, rl-gyms-and-executable-environments-for-ai-harnesses.md, agent-facing-verifier-environment-architecture.md, and openclaw.md.
- Updated index.md to include the new concept page and bump total pages from 142 to 143.
[2026-05-03] query | nightly src projects desk
- Created raw survey note nightly-src-projects-desk-2026-05-03.md and public query page nightly-src-projects-desk-2026-05-03.md.
- Generated editorial illustration `queries/news-assets/2026-05-03-project-desk-hero.svg` locally after the configured image backend reported missing `FAL_KEY`; no fake screenshots were used.
- Survey automation status: exactly 10 top-level Hermes survey lanes covered all 30 top-level directories under `/Users/ericfode/src`; a first 10-lane batch was rejected by the runtime concurrency cap (`max_concurrent_children=6`), so the lanes ran as 6 + 4. Lane summaries reported orchestrator-style project splits and focused purpose/docs, live-work, and safety passes, with one project child handled directly rather than recursively.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for 10 surveyed directories because of sensitive identity/reputational framing, local agent settings, empty/skeletal evidence, credential/private deployment signals, uncommitted internal workbenches, mixed creative material needing curation, hidden evaluator/supervisor surfaces, or prompt-bearing workflow residue.
- Updated index.md News and Queries sections for the new nightly desk page; content-page count moved 143 → 144.
- Lint result: `scripts/lint-wiki.sh` passed: 144 content pages checked.
[2026-05-04] query | LLM-readable spec files
- Created raw source note llm-readable-spec-files-research-2026-05-04.md covering OpenAI prompting/evals guidance, Claude Code memory rules, GitHub Copilot instructions, GitHub Spec Kit, NASA/INCOSE requirements guidance, Microsoft LLM-judge reliability notes, and nl2postcond.
- Created query page llm-readable-spec-files.md.
- Main conclusion: excellent `spec.md` files are compact executable contracts with stable IDs, glossary, scope, non-goals, behavioral requirements, invariants, examples/counterexamples, verification matrix, operational gates, and explicit unresolved questions.
- Updated index.md to include the new query page and bump total pages from 144 to 145.
[2026-05-04] query | spec dataset evolution research project
- Created Kanban board `spec-dataset-evolution` for the public `spec.md` / spec-like corpus project.
- Created raw design note spec-dataset-evolution-kanban-design-2026-05-04.md from three methodology lanes: corpus discovery, longitudinal change analysis, and code/spec connectedness.
- Created query page spec-dataset-evolution-research-project.md.
- Added scratch project brief at `/Users/ericfode/.hermes/gateway-scratch/spec-dataset-evolution/PROJECT_BRIEF.md`.
- Kanban tasks created: `t_39d3ad17` through `t_d29a8854`; anchor task `t_39d3ad17` marked done, with four methodology tasks now ready.
- Updated index.md to include the new query page and set content-page count to 146.
- Lint result: `scripts/lint-wiki.sh` passed: 146 content pages checked.
[2026-05-04] update | spec dataset evolution deep-repo expansion
- User clarified that the project should fan out more and deeply explore whole repositories, not merely design a file-level crawler.
- Added scratch protocol `DEEP_REPO_EXPLORATION_PROTOCOL.md` requiring each scout to inspect 3–5 public repos, including whole-repo structure, spec inventory, git history, code/spec ratios, connectedness, pressure signals, compliance, and structured dossier artifacts.
- Added Kanban tasks `SPEC-REPO-00` through `SPEC-REPO-14`; `SPEC-REPO-00`/`t_2d3d9d2f` was completed as the protocol anchor, and 12 scout tasks plus aggregation/review tasks were created.
- Linked `SPEC-REPO-14`/`t_482593d1` as a parent of `SPEC-DATA-13`/`t_b5c6851b`, so the pilot report cannot complete without the deep-repo evidence review.
- Updated spec-dataset-evolution-research-project.md to record the deep-repo wave and task IDs.
[2026-05-05] update | spec dataset evolution Kanban error repair
- Mirrored scratch repair note into the wiki as spec-dataset-evolution-kanban-error-repair-2026-05-05.md.
- Updated spec-dataset-evolution-research-project.md sources and operational repair notes.
- Recorded root cause: default Kanban workers crash-looped because the globally enabled Basis plugin used top-level `anyOf` in tool parameter schemas rejected by OpenAI Codex.
- Recorded repair: local Basis plugin source commit `0061d32` made the schemas Codex-compatible; `SPEC-DATA-JCODE` completed, `SPEC-DATA-21` was manually closed after verified artifacts, and `SPEC-DATA-22` intentionally blocked on unresolved human/adjudicated audit labels.
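
A pre-flight check for the root cause recorded above could look roughly like this. It is a sketch only: the Basis plugin's real schema layout and the actual repair in commit `0061d32` are not shown in this log, so `find_incompatible_tools`, the `tools` mapping of tool name to JSON-Schema parameter object, and the `forbidden` default are assumptions for illustration.

```python
def find_incompatible_tools(
    tools: dict[str, dict],
    forbidden: tuple[str, ...] = ("anyOf",),
) -> dict[str, list[str]]:
    """Return tools whose parameter schema uses a forbidden keyword at the top level.

    `tools` maps a tool name to its JSON-Schema parameters object
    (hypothetical input shape). Only top-level keys are inspected, so a
    nested `anyOf` inside `properties` is not flagged.
    """
    flagged: dict[str, list[str]] = {}
    for name, schema in tools.items():
        hits = [key for key in forbidden if key in schema]
        if hits:
            flagged[name] = hits
    return flagged
```

Running a check like this before enabling a plugin globally would surface the offending tools by name instead of through a worker crash loop.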
[2026-05-05] update | private spec corpus and wiki-ingest project
- Created and pushed private GitHub repository `ericfode/spec-dataset-evolution-corpus` for the spec dataset pilot corpus.
- The private corpus includes 1,741 artifact occurrence rows, 1,676 private raw-file copies, provenance manifests, dedup outputs, authored deep-dive dossiers, and validation scripts; 65 hard-quarantined raw rows remain metadata-only.
- Created Hermes Kanban board `spec-deep-dive-wiki-ingest` with tasks `SPEC-WIKI-00` through `SPEC-WIKI-06` to ingest public-safe deep dives into the wiki.
- Added spec-deep-dive-wiki-ingest-project and linked it from spec-dataset-evolution-research-project.
[2026-05-05] update | spec deep-dive source map and safety policy
- Expanded spec-deep-dive-wiki-ingest-project with the `SPEC-WIKI-00` source map, downstream task source routing, naming conventions, citation style, excerpt policy, fail-closed public-safety gates, and downstream page acceptance criteria.
- Wrote machine-readable handoff artifact `source-map-public-safety-policy.json` in the project scratch directory for downstream workers.
- Source evidence inspected: private corpus checkout HEAD `4659608`, clean working tree, 207 deep-dive files, 1,741 manifest rows, 55 aggregate repo records, and 55 connectedness feature rows.
- Validation: `scripts/lint-wiki.sh` passed: 147 content pages checked.
[2026-05-05] query | spec deep-dive priority case studies
- Created three public-safe case pages from the private corpus priority dossiers:
- spec-deep-dive-case-jcode: `1jehuang/jcode` as a post-LLM coding-agent harness with distributed spec surfaces and high code/spec connectedness.
- spec-deep-dive-case-droidagent: DroidAgent as agent-generated mobile-GUI behavioral specs connected to traces, reports, and replay scripts.
- spec-deep-dive-case-j8-ambiguity: corrected J8/J8Spec ambiguity trail preserving negative evidence and the pre-AI J8Spec executable-spec control.
- Updated spec-deep-dive-wiki-ingest-project, spec-dataset-evolution-research-project, and index for inbound links and page count.
- Source evidence used: `reports/deep-dives/jcode.md`, `reports/deep-dives/jcode-analysis.json`, `reports/jcode_first_calibration_seed.md`, `reports/deep-dives/droidagent.md`, and `reports/deep-dives/j8-agent.md` from the private corpus checkout at HEAD `4659608`.
- Public-safety boundary: synthesis and metadata only; no raw private corpus bodies or long upstream excerpts copied into the wiki.
- Validation: `scripts/lint-wiki.sh` passed: 153 content pages checked. During lint repair, index was reconciled with sibling cohort pages already present in the shared worktree, and the agent-native cohort frontmatter was normalized to existing wiki source pages.
[2026-05-05] query | spec deep-dive agent-native / Spec Kit / Kiro cohort
- Created spec-deep-dive-cohort-agent-native-spec-kit-kiro from private corpus cohorts `SPEC-REPO-03`, `SPEC-REPO-04`, `SPEC-REPO-08`, `SPEC-REPO-11`, and `SPEC-REPO-20` template-lineage reports.
- Preserved the template-vs-independent-spec distinction: 19 relevant selected dossier occurrences, 15 generated-scaffold rows, 11 edited-project-contract rows, and 14 fail-closed independent-authority weights in the lineage mapping.
- Source evidence used: `reports/deep-dives/SPEC-REPO-03/index.md`, `reports/deep-dives/SPEC-REPO-04/index.md`, `reports/deep-dives/SPEC-REPO-08/index.md`, `reports/deep-dives/SPEC-REPO-11/index.md`, and `reports/deep-dives/SPEC-REPO-20/TEMPLATE_LINEAGE_MODEL.md` from the private corpus checkout at HEAD `4659608`.
- Public-safety boundary: synthesis and metadata only; no raw private corpus bodies or long upstream excerpts copied into the wiki.
[2026-05-05] query | spec deep-dive RFC, ADR, and executable-contract cohort
- Created spec-deep-dive-cohort-rfc-adr-executable-contracts from private corpus cohorts `SPEC-REPO-05`, `SPEC-REPO-06`, `SPEC-REPO-18`, `SPEC-REPO-19`, and the `SPEC-REPO-16` export gate.
- Distinguished RFC/proposal governance records from executable and formal contracts: OpenAPI, Proto, Smithy, AsyncAPI, GraphQL, Thrift, TLA+, and Dafny are modeled as machine-readable or checkable artifacts rather than ordinary Markdown product specs.
- Preserved caveats: ADR discovery is incomplete, `asyncapi/spec` and `smithy-lang/smithy` remain hard-quarantined for raw content, and non-GitHub public forges retain host-specific discovery and license/review limits.
- Updated spec-deep-dive-wiki-ingest-project, spec-dataset-evolution-research-project, and index for cohort cross-links and the shared 153-page count.
- Validation: `scripts/lint-wiki.sh` passed: 153 content pages checked; `git diff --check -- <edited wiki paths>` passed for index, log, spec-dataset-evolution-research-project, spec-deep-dive-wiki-ingest-project, and spec-deep-dive-cohort-rfc-adr-executable-contracts.
[2026-05-05] query | spec deep-dive exact spec.md and standards cohort
- Created spec-deep-dive-cohort-exact-spec-md-and-standards from private corpus cohorts
SPEC-REPO-01,SPEC-REPO-07, and related standards records inSPEC-REPO-18andSPEC-REPO-19. - Distinguished exact lower-case
spec.mdas a discovery handle from repository authority shape: OCI runtime/distribution, Compose Spec, HCL, and Cloud Foundry Service Broker distribute authority across prose, schemas, conformance tests, implementation code, profile material, release tags, and governance. - Mature baseline evidence from OCI runtime, OpenTelemetry, Protocol Buffers, and gRPC was summarized as metadata and synthesis only; no private raw corpus bodies or long upstream excerpts were copied into the wiki.
- Updated index and reused existing cross-links from spec-deep-dive-wiki-ingest-project and spec-dataset-evolution-research-project so the new cohort page participates in the shared ingest graph.
- Validation: scripts/lint-wiki.sh passed: 153 content pages checked.
[2026-05-05] query | spec deep-dive aggregate index
- Created spec-deep-dive-index as the public-safe aggregate index for the private spec corpus deep-dive wave.
- Cross-linked the aggregate index from spec-dataset-evolution-research-project, spec-deep-dive-wiki-ingest-project, index, and the priority case/cohort pages.
- Preserved the private corpus as a source pointer only: aggregate counts, corpus-relative source paths, repo URLs, connectedness/compliance summaries, and caveats are public; raw private corpus bodies remain unpublished.
- Validation: scripts/lint-wiki.sh passed: 154 content pages checked.
[2026-05-05] query | hidden-state JEPA reasoning shortcuts
- Created raw literature packet hidden-state-jepa-reasoning-shortcut-literature-2026-05-05.md from direct arXiv Atom metadata, Hugging Face model API checks, existing neural-native-programming notes, and on-policy-self-distillation.
- Created query page hidden-state-jepa-reasoning-shortcuts answering whether Gemma 4/open models can learn a JEPA-like pre-thinking to post-thinking latent shortcut.
- Confirmed via Hugging Face API that public Gemma 4 entries exist for E2B, E4B, 26B-A4B, and 31B variants; the answer recommends E2B/E4B or Qwen3-4B for first experiments.
- Raised the wiki standard in SCHEMA.md so substantive paper/literature summaries now require at least one diagram plus a source/quality/caveat table.
- Updated index to list the new query page and bump content-page count from 154 to 155.
[2026-05-06] query | nightly src projects desk
- Created raw survey note nightly-src-projects-desk-2026-05-06.md and query page nightly-src-projects-desk-2026-05-06 for the nightly local-projects desk.
- Published verdict: Basis/spec-code grounding and test-writing environments lead tonight, with tinygrad/Gemma benches, handterm, Dungeon Steward, FACEMUSIC, orchestration side rooms, and privacy-gated corpus work summarized only where inspectable evidence supported public claims.
- Generated editorial illustration queries/news-assets/2026-05-06-project-desk-hero.svg locally after the configured image backend reported missing FAL_KEY; no fake screenshots were used.
- Survey automation status: exactly 10 top-level Hermes survey lanes covered all 38 top-level directories under /Users/ericfode/src; the lanes ran as 6 + 4, and all lane summaries reported successful recursive 3-way delegation for purpose/docs, live-work evidence, and safety/public-summary review.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for 13 surveyed directories because of local hidden settings, sensitive social/reputational framing, empty or skeletal evidence, private corpus contents, local deployment/model-runner configuration, raw logs/prompts/trajectories, evaluator-like materials, generated artifacts, or creative material needing human curation.
- Updated index.md News and Queries sections for the new nightly desk page; content-page count moved 155 → 156.
- Lint result: scripts/lint-wiki.sh passed: 156 content pages checked.
[2026-05-06] update | 0xSero self-distillation post recheck
- Rechecked the user-supplied canonical X/Twitter URL https://twitter.com/0xSero/status/2050692154524156374 against public oEmbed metadata.
- Updated the existing raw source note 0xsero-self-distillation-video-2026-05-02.md rather than creating a duplicate page; the post had already been ingested on 2026-05-02 and linked to on-policy-self-distillation.
- No index-page count change: this was a raw-source metadata update, not a new content page.
[2026-05-06] ingest | The Last Harness You’ll Ever Build
- Resolved the user-supplied screenshot to arXiv:2604.21003v3, arxiv-seong-2026-last-harness-youll-ever-build.md, by grounding metadata through the arXiv Atom API and section text through the arXiv HTML page after web_extract reported Firecrawl was not configured.
- Created last-harness-youll-ever-build as a source-quality-aware entity page with a Mermaid architecture sketch and evidence/caveat table.
- Updated harness-engineering and self-evolving-workflows so the new paper is linked from the existing harness-engineering graph rather than sitting as a decorative orphan.
- Updated index.md to list the new entity page; content-page count moved 156 → 157.
- Validation: scripts/lint-wiki.sh passed: 157 content pages checked.
[2026-05-06] ingest | Agentic-imodels
- Resolved the user-supplied arXiv URL https://arxiv.org/abs/2605.03808 to arXiv:2605.03808v1, arxiv-singh-2026-agentic-imodels.md, by grounding metadata through the arXiv Atom API and source text through the arXiv e-print package after web_extract reported Firecrawl was not configured and arXiv HTML returned 404.
- Created agentic-imodels as a source-quality-aware entity page with an autoresearch-loop diagram, evidence/caveat table, BLADE result summary, and reward-hacking boundary.
- Updated context-engineering, evaluation-and-review-loops, and self-evolving-workflows so agent-facing simulatability tests are linked into the harness graph.
- Updated index.md to list the new entity page; content-page count moved 157 → 158.
- Validation: scripts/lint-wiki.sh passed: 158 content pages checked.
[2026-05-07] query | nightly src projects desk
- Created raw survey note nightly-src-projects-desk-2026-05-07.md and query page nightly-src-projects-desk-2026-05-07 for the nightly local-projects desk.
- Published verdict: spec-code grounding, test-writing environments, and tinygrad-gemma lead tonight, with terminal/game/interface craft projects, NNPL benches, and orchestration side rooms summarized only where inspectable evidence supported public-safe claims.
- Generated editorial illustration queries/news-assets/2026-05-07-project-desk-hero.svg locally after the configured image backend reported missing FAL_KEY; no fake screenshots were used, and the SVG parsed as valid XML.
- Survey automation status: exactly 10 top-level Hermes survey lanes ran as 3 + 3 + 3 + 1, and all 10 lane summaries reported successful recursive 3-way delegation for purpose/docs, live-work evidence, and safety/public-summary review. A controller post-dispatch audit covered two hidden top-level directories as a read-only supplement so the total lane count remained exactly ten; total coverage was 38 top-level directories.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local settings, one sensitive social-claim notebook, empty or skeletal directories, local deployment/model-runner folders, private corpus contents, internal workflow/assistant configuration, scratch/meta workspaces, raw logs/prompts/trajectories, evaluator-like payloads, benchmark outputs, model/checkpoint artifacts, and creative material needing human curation.
- Updated index.md News and Queries sections for the new nightly desk page; content-page count moved 158 → 159.
- Lint result: scripts/lint-wiki.sh passed: 159 content pages checked.
[2026-05-08] query | nightly src projects desk
- Created raw survey note nightly-src-projects-desk-2026-05-08.md and query page nightly-src-projects-desk-2026-05-08 for the nightly local-projects desk.
- Published verdict: spec-code grounding and test-writing environments still lead, with tinygrad-gemma, NNPL benches, handterm, Dungeon Steward, privacy-sensitive interface/simulation work, and orchestration side rooms summarized only where inspectable evidence supported public-safe claims.
- Generated editorial illustration queries/news-assets/2026-05-08-project-desk-hero.svg locally after the configured image backend reported missing FAL_KEY; no fake screenshots were used, and the SVG parsed as valid XML.
- Survey automation status: exactly 10 top-level Hermes survey lanes ran as 3 + 3 + 3 + 1 across all 38 top-level directories under the local src tree, including hidden directories. All 10 lane summaries reported successful recursive 3-way delegation for purpose/docs, live-work evidence, and safety/public-summary review.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local settings, a sensitive social-claim notebook, empty or skeletal directories, local deployment/model-runner folders, private corpus bodies, internal workflow/assistant configuration, scratch/meta workspaces, generated artifacts, prompt/log/trajectory materials, evaluator-like payloads, benchmark raw outputs, model/checkpoint artifacts, privacy-sensitive capture data, and creative material needing human curation.
- Updated index.md News and Queries sections for the new nightly desk page; content-page count moved 159 → 160.
- Lint result: scripts/lint-wiki.sh passed after the page, index, and log edits: 160 content pages checked.
[2026-05-09] query | nightly src projects desk
- Created raw survey note nightly-src-projects-desk-2026-05-09.md and query page nightly-src-projects-desk-2026-05-09 for the nightly local-projects desk.
- Published verdict: spec-code grounding and test-generation environments still lead, with tinygrad-gemma, Gemma/tinygrad optimization benches, fresh kettlebellsim simulation work, Dungeon Steward, handterm, privacy-sensitive interface work, NNPL benches, and orchestration side rooms summarized only where inspectable evidence supported public-safe claims.
- Generated editorial illustration queries/news-assets/2026-05-09-project-desk-hero.svg locally as symbolic SVG art; no fake screenshots were used.
- Survey automation status: exactly 10 top-level Hermes survey lanes ran as 3 + 3 + 3 + 1 across all 38 top-level directories under the local src tree, including hidden directories. All 10 lane summaries reported three-way delegation for purpose/docs, live-work evidence, and safety/public-summary review, plus a further three-way leaf recursion where the runtime allowed it; further recursion ended at leaf workers because of the configured depth cap.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local settings, a sensitive social-claim wiki, empty or skeletal directories, local deployment/model-runner folders, private corpus bodies, prompt/agent/skill instruction bodies, scratch/meta workspaces, generated media, raw logs/prompts/trajectories, evaluator-like payloads, hidden references/oracles, benchmark raw outputs, model/checkpoint artifacts, privacy-sensitive capture data, creative canon drafts, and local service configuration.
- Updated index.md News and Queries sections for the new nightly desk page; content-page count moved 160 → 161.
- Lint result: scripts/lint-wiki.sh passed after the page, index, and log edits: 161 content pages checked.
[2026-05-10] query | nightly src projects desk
- Created raw survey note nightly-src-projects-desk-2026-05-10.md and query page nightly-src-projects-desk-2026-05-10 for the nightly local-projects desk.
- Published verdict: harness/control-plane work, spec-code reducers, test-generation environments, tinygrad-gemma, NNPL boundary experiments, kettlebellsim, handterm, and selected side rooms lead tonight only where inspectable evidence supported public-safe claims.
- Generated editorial illustration queries/news-assets/2026-05-10-project-desk-hero.svg locally as symbolic SVG art; the SVG parsed as valid XML and no fake screenshots were used.
- Survey automation status: exactly 10 top-level Hermes survey lane identities ran in one batch across all 38 top-level directories under the local src tree, including hidden directories. All 10 lane summaries reported three-way delegation for purpose/docs, live-work evidence, and safety/public-summary review, plus one further three-way leaf recursion; further recursion ended at the configured depth cap. A controller read-only audit corrected a lane spelling slip for the actual nnpl-* directories while preserving the top-level lane count of exactly 10.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local settings, hidden-only or empty directories, one sensitive social-claim note set, local deployment/model-runner folders, private corpus bodies, prompt/agent/skill instruction bodies, scratch/meta workspaces, generated media, raw logs/prompts/trajectories, evaluator-like payloads, hidden references/oracles, benchmark raw outputs, model/checkpoint artifacts, biometric/capture data, creative story/canon drafts, service configuration, and cache/build/vendor directories.
- Updated index.md News and Queries sections for the new nightly desk page; content-page count moved 161 → 162.
- Lint result: scripts/lint-wiki.sh passed after the page, index, and log edits: 162 content pages checked.
[2026-05-10] query | Basis project corner
- Created raw public-safe evidence note basis-project-deep-dive-2026-05-10.md from local inspection of basis, basis-hermes, and basis-jcode.
- Created basis-project-index, basis-architecture-and-plans, basis-experiment-status, and basis-source-basis-and-safety-gate as a dedicated wiki corner for Basis plans, running experiments, status, and publication boundaries.
- Recorded verified status: core basis clean at a5544e0, tests and compile pass but formatter gate fails on lib/basis/run/server.ex; basis-hermes clean at 0061d32 with Python and JS gates passing; basis-jcode at 4b1e621, ahead 10 and dirty, with 26 reducer tests passing and a loopback dashboard for .basis/self-convergence.
- Public-safety boundary: .basis run bodies, prompts, NDJSON streams, packet bodies, validation bodies, logs, dashboard payloads, and private corpus bodies remain unpublished; wiki pages use synthesis, counts, safe filenames, test results, caveats, and plans only.
[2026-05-11] query | nightly src projects desk
- Created raw survey note nightly-src-projects-desk-2026-05-11.md and query page nightly-src-projects-desk-2026-05-11 for the nightly local-projects desk.
- Published verdict: test-generation environments, Basis/spec-code work, gemma-dungeon, tinygrad-gemma, NNPL boundary experiments, kettlebellsim, handterm, and selected orchestration side rooms lead tonight only where inspectable evidence supported public-safe claims.
- Generated editorial illustration queries/news-assets/2026-05-11-project-desk-hero.svg locally as symbolic SVG art; the SVG parsed as valid XML and no fake screenshots were used.
- Survey automation status: exactly 10 top-level Hermes survey lane identities ran in one batch across all 39 top-level directories under the local src tree, including hidden directories. All 10 lane summaries reported three-way delegation for purpose/docs/manifests, live-work evidence, and public-safety review, plus one further three-way leaf recursion; further recursion ended at leaf checks/depth limits. A controller read-only audit re-enumerated the tree and found no missing top-level directory.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local settings, hidden-only or empty directories, one sensitive social-claim notebook, local deployment/model-runner folders, private corpus bodies, prompt/agent/skill instruction bodies, scratch/meta workspaces, generated media, raw logs/prompts/trajectories, evaluator-like payloads, hidden references/oracles, benchmark raw outputs, model/checkpoint artifacts, biometric/capture data, creative story/canon drafts, service configuration, raw test/counterexample bodies, cache/build/vendor directories, and too-skeletal placeholders.
- Updated index.md News and Queries sections for the new nightly desk page; content-page count moved 166 → 167.
- Lint result: scripts/lint-wiki.sh passed after the page, index, and log edits: 167 content pages checked.
[2026-05-11] update | website navigation and missing page routing
- Added project-list and project-update-log as curated public-safe landing pages for active project lines and personal project update logs.
- Added root website doors projects.md and news.md, and updated index.md with a Start Here block plus a shorter News section.
- Path-qualified ambiguous nightly desk links so query pages resolve to queries/* instead of colliding with raw source notes.
- Added Quartz trailing-slash redirect aliases and a cleaner Explorer configuration so Basis pages, news, and projects are easier to reach from the published site.
[2026-05-11] update | Basis Reduce and Imagine wiki recovery
- Recovered Codex/worktree provenance for basis-reduce-workbench and basis-imagine-workbench from safe branch metadata, filenames, test outcomes, aggregate experiment reports, and generated UI artifacts.
- Created public-safe raw evidence note basis-reduce-imagine-codex-log-sift-2026-05-11.md.
- Added four recovered/generated UI assets under queries/news-assets/ for Basis.Reduce and Basis.Imagine, each labeled as an artifact rather than accepted runtime state.
- Updated basis-project-index, basis-experiment-status, basis-source-basis-and-safety-gate, project-list, projects.md, news.md, and index.md so Basis.Reduce, Basis.Imagine, projects, and news have obvious entry points.
- Public-safety boundary: raw Codex JSONL bodies, app-server turn bodies, prompts, packet bodies, dashboard state payloads, hidden evaluator/oracle bodies, and private candidate dumps remain unpublished.
[2026-05-13] query | nightly src projects desk
- Created raw survey note nightly-src-projects-desk-2026-05-13.md and query page nightly-src-projects-desk-2026-05-13 for the nightly local-projects desk.
- Published verdict: testing-rl and tinygrad-gemma lead tonight, with handterm and Dungeon Steward as clean craft/game leads; Basis/Steward, NNPL, gemma-dungeon, openai-symphony, and kettlebellsim are research-bench or side-room material.
- Generated editorial illustration queries/news-assets/2026-05-13-project-desk-hero.svg locally as symbolic SVG art; the SVG parsed as valid XML and no fake screenshots were used.
- Survey automation status: exactly 10 top-level Hermes survey lane identities ran in one batch across all 41 top-level directories under the local src tree, including hidden directories. All 10 lanes spawned three read-only subteams for purpose/docs/manifests, live-work evidence, and public-safety review. Subteams generally recursed once more into leaf probes; two minor leaf-shape exceptions were recorded in the raw note. A controller audit found 41 unique assignments, no missing directories, no extras, and no duplicates.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local settings, internal security-scan artifacts, hidden-only or empty directories, one sensitive social-claim notebook, local deployment/model-runner folders, private corpus bodies, prompt/agent/skill instruction bodies, scratch/meta workspaces, generated media, raw logs/prompts/trajectories, evaluator-like payloads, hidden references/oracles, benchmark raw outputs, model/checkpoint artifacts, biometric/capture data, creative story/canon drafts, service configuration, raw test/counterexample bodies, cache/build/vendor directories, and too-skeletal placeholders.
- Updated index.md News and Queries sections for the new nightly desk page; content-page count moved 171 → 172. Existing untracked .hermes/ material in the wiki worktree remained untouched and uncommitted.
- Lint result: scripts/lint-wiki.sh passed after the new page, index, and log edits: 172 content pages checked.
[2026-05-13] update | East Bay property monitor website publishing
- Added the focused East Bay Catholic property monitor to the Quartz publishing content tree and linked it from index.md Start Here navigation.
- Published target page: East Bay Catholic Property Sales Monitor under east-bay-catholic-property-monitor/queries/.
- Preserved the main harness-wiki content count at 172; the property monitor remains a focused sub-vault rather than part of the main harness taxonomy.
[2026-05-14] query | nightly src projects desk
- Created raw survey note and query page nightly-src-projects-desk-2026-05-14 for the nightly local-projects desk.
- Published verdict: gemma-dungeon is tonight’s same-night lead, textual-world-model is active benchmark-first research-loop evidence, testing-rl and tinygrad-gemma remain the stable benches, and Basis/Steward/Symphony/Gas-City work stays architecture/provenance-level under the safety filter.
- Generated editorial illustration queries/news-assets/2026-05-14-project-desk-hero.svg as deterministic symbolic SVG after rejecting a raster draft with text artifacts; no fake screenshots were used, and the SVG parsed as valid XML.
- Survey automation status: exactly 10 top-level Hermes survey lane identities ran in one batch across all 41 top-level directories under the local src tree, including hidden directories. All 10 lanes reported three read-only subteams for purpose/docs/manifests, live-work evidence, and public-safety review, plus one further three-way leaf recursion where delegation was exposed. A controller audit found 41 unique assignments, no missing directories, no extras, and no duplicates.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local settings, internal security-scan artifacts, empty or hidden-only directories, one protected-class-sensitive social-claim notebook, local deployment/model-runner folders, private corpus bodies, prompt/agent/skill instruction bodies, scratch/meta workspaces, generated media, raw logs/prompts/trajectories, evaluator/oracle payloads, benchmark raw outputs, model/checkpoint artifacts, biometric/capture data, creative/canon drafts, service configuration, raw test/counterexample bodies, cache/build/vendor directories, and too-skeletal placeholders.
- Updated index.md News and Queries sections for the new nightly desk page; content-page count moved 172 → 173. Existing untracked .hermes/ material in the wiki worktree remained untouched and uncommitted.
- Lint result: bash scripts/lint-wiki.sh passed after the new page, index, and log edits: 173 content pages checked.
[2026-05-16] query | nightly src projects desk
- Created raw survey note and query page nightly-src-projects-desk-2026-05-16 for the nightly local-projects desk.
- Published verdict: jepa-poker is tonight’s fresh public-safe lead, with gemma-dungeon, testing-rl, and tinygrad-gemma as continuing benches; is-it-formal and basis-hermes give the formal/spec-provenance corner clean surfaces; handterm, Dungeon Steward, and kettlebellsim remain craft/game/simulation side rooms.
- Generated editorial illustration queries/news-assets/2026-05-16-project-desk-hero.svg as deterministic symbolic SVG after rejecting a generated raster draft with text artifacts; no fake screenshots were used, and the SVG parsed as valid XML.
- Survey automation status: exactly 10 top-level Hermes survey lane identities ran in one batch across all 43 top-level directories under the local src tree, including hidden directories. All 10 lanes reported three read-only subteams for purpose/docs/manifests, live-work evidence, and public-safety review, plus one further three-way leaf recursion. A controller audit found 43 assigned directories, 43 unique assignments, no missing directories, no extras, and no duplicates.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local settings, security-scan artifacts, empty or hidden-only directories, provocative/protected-class-sensitive social-claim material, local deployment/model-runner folders, private corpus bodies, prompt/agent/skill instruction bodies, scratch/meta workspaces, generated media, raw logs/prompts/trajectories, evaluator/oracle payloads, benchmark raw outputs, model/checkpoint artifacts, biometric/capture data, creative/canon drafts, service configuration, raw test/counterexample bodies, cache/build/vendor directories, and too-skeletal placeholders.
- Updated index.md News and Queries sections for the new nightly desk page; content-page count moved 173 → 174. Existing unrelated East Bay property-monitor working-tree changes and wiki .hermes/ material remained untouched and uncommitted.
- Lint result: bash scripts/lint-wiki.sh passed after the new page, index, and log edits: 174 content pages checked.
[2026-05-24] query | nightly src projects desk
- Created raw survey note and query page nightly-src-projects-desk-2026-05-24 for the nightly local-projects desk.
- Published verdict: unconventional-jepa-lab is tonight’s clean public lead; word-games and jepa-lang make the JEPA/world-model line newly legible; testing-rl, tinygrad-gemma, and gemma-dungeon remain the sturdy continuing benches; Basis/Hermes, Lean/formality, and Steward keep the formal/spec/provenance corner coherent.
- Generated editorial illustration queries/news-assets/2026-05-24-project-desk-hero.svg as deterministic symbolic SVG; no fake screenshots were used, and the SVG parsed as valid XML.
- Survey automation status: exactly 10 top-level Hermes survey lane identities ran as two batches of five across all 47 top-level directories under the local src tree, including hidden directories. All 10 lanes reported three read-only subteams for purpose/docs/manifests, live-work evidence, and public-safety review, plus one further three-way leaf recursion. A controller audit found 47 assigned directories, 47 unique assignments, no missing directories, no extras, and no duplicates.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local settings, security-scan artifacts, empty or hidden-only directories, provocative/protected-class-sensitive social-claim material, local deployment/model-runner folders, private corpus bodies, prompt/agent/skill instruction bodies, scratch/meta workspaces, generated media, raw logs/prompts/trajectories, evaluator/oracle payloads, benchmark raw outputs, model/checkpoint artifacts, biometric/capture data, creative/canon drafts, service configuration, raw test/counterexample bodies, dirty patch/reject variants, cache/build/vendor directories, and too-skeletal placeholders.
- Updated index.md News and Queries sections for the new nightly desk page; content-page count moved 174 → 175. Pre-existing uncommitted 2026-05-16 nightly desk files and index/log lines were in the same news-desk family and were retained for a coherent index; unrelated East Bay property-monitor working-tree changes and wiki .hermes/ material remained untouched and uncommitted.
- Lint result: bash scripts/lint-wiki.sh passed after the new page, index, and log edits: 175 content pages checked.
[2026-05-27] query | nightly src projects desk
- Created raw survey note and query page nightly-src-projects-desk-2026-05-27 for the nightly local-projects desk.
- Published verdict: gemma-dungeon, jepa-lang, and jepa-poker are tonight’s clearest technical leads; testing-rl remains the sturdy verifier bench; deer-flow, openai-symphony, Gas City/Basis/Steward, parenting-bookshelf-compass, handterm, and selected side rooms were summarized only where inspectable evidence supported public-safe claims.
- Generated editorial illustration queries/news-assets/2026-05-27-project-desk-hero.svg as deterministic symbolic SVG after rejecting raster drafts with text artifacts; no fake screenshots were used, and the SVG parsed as valid XML.
- Survey automation status: exactly 10 top-level Hermes survey lane identities ran in one batch across all 50 top-level directories under the local src tree, including hidden directories. All 10 lanes reported three read-only subteams for purpose/docs/manifests, live-work evidence, and public-safety review, plus one further three-way leaf probe. A controller audit found 50 assigned directories, 50 unique assignments, no missing directories, no extras, and no duplicates.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local settings, security/dependency scan artifacts, empty or skeletal directories, provocative/protected-class-sensitive social-claim material, local deployment/model-runner folders, private corpus bodies, prompt/agent/skill instruction bodies, scratch/meta workspaces, generated media, raw logs/prompts/trajectories, evaluator/oracle payloads, raw benchmark outputs, model/checkpoint artifacts, biometric/capture data, creative/canon/world-packet drafts, service configuration, raw test/counterexample bodies, dirty patch/reject variants, cache/build/vendor directories, and too-skeletal placeholders.
- Updated index.md News and Queries sections for the new nightly desk page; committed content-page count moved 175 → 176. Pre-existing uncommitted source-essence, East Bay property-monitor, and wiki .hermes/ working-tree material remained untouched and uncommitted.
- Lint result: bash scripts/lint-wiki.sh passed in the working tree before staging: 213 content pages checked; committed-tree verification passed after commit: 176 content pages checked.
[2026-05-29] query | nightly src projects desk
- Created raw survey note and query page nightly-src-projects-desk-2026-05-29 for the nightly local-projects desk.
- Published verdict: gas-city-but-its-just-codex is tonight’s clearest orchestration/control-plane lead; cardgame1 and gemma-dungeon carry the game/research craft story; testing-rl remains the verifier bench; kettlebellsim is the clean simulation-validation lead; unconventional-jepa-lab, jepa-lang, textual-world-model, selected NNPL projects, parenting-bookshelf-compass, and handterm were summarized only where inspectable evidence supported public-safe claims.
- Generated editorial illustration queries/news-assets/2026-05-29-project-desk-hero.svg as deterministic symbolic SVG after rejecting two generated raster drafts with text artifacts; no fake screenshots were used, and the SVG parsed as valid XML.
- Survey automation status: exactly 10 top-level Hermes survey lane identities ran in one batch across all 50 top-level directories under the local src tree, including hidden directories. All 10 lanes reported three read-only subteams for purpose/docs/manifests, live-work evidence, and public-safety review, plus one further three-way leaf probe. A controller audit found 50 assigned directories, 50 unique assignments, no missing directories, no extras, and no duplicates.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local assistant/settings directories, security/dependency scan artifacts, empty or skeletal directories, provocative/protected-class-sensitive social-claim material, local deployment/model-runner folders, private corpus bodies, prompt/agent/skill instruction bodies, scratch/meta workspaces, generated media, raw logs/prompts/trajectories, evaluator/oracle payloads, raw benchmark outputs, model/checkpoint artifacts, biometric/capture data, creative/canon/world-packet drafts, service configuration, raw test/counterexample bodies, dirty patch/reject variants, cache/build/vendor directories, local .env-style material, and too-skeletal placeholders.
- Updated index News and Queries sections for the new nightly desk page. Working-tree content-page count moved 213 → 214; the isolated committed-tree update moves 176 → 177 because pre-existing uncommitted source-essence, East Bay property-monitor, and wiki .hermes/ material remained untouched and uncommitted.
- Lint result: bash scripts/lint-wiki.sh passed in the working tree before staging: 214 content pages checked; committed-tree verification passed after commit: 177 content pages checked.
[2026-05-31] query | nightly src projects desk
- Created raw survey note and query page nightly-src-projects-desk-2026-05-31 for the nightly local-projects desk.
- Published verdict: `gemma-dungeon` is tonight’s clean same-night lead; `cardgame1`, `testing-rl`, `testing-rl-hermes`, and `kettlebellsim` are the strongest public-safe working set; Basis/Hermes, `another-harness`, `is-it-formal`, `openai-symphony`, `steward`, JEPA/NNPL/tinygrad research benches, `handterm`, and `parenting-bookshelf-compass` were summarized only where inspectable evidence supported public-safe claims.
- Generated editorial illustration `queries/news-assets/2026-05-31-project-desk-hero.svg` as deterministic symbolic SVG after the image backend reported missing `FAL_KEY`; no fake screenshots were used, and the SVG parsed as valid XML.
- Survey automation status: exactly 10 top-level Hermes survey lane identities ran in one batch across all 50 top-level directories under the local src tree, including hidden directories. All 10 lanes reported three read-only subteams for purpose/docs/manifests, live-work evidence, and public-safety review, plus one further three-way leaf probe. A controller audit found 50 assigned directories, 50 unique assignments, no missing directories, no extras, and no duplicates.
- Public-safety filter: project-specific public detail was held back or reduced to category-only mention for hidden local assistant/settings directories, security/dependency scan artifacts, empty or skeletal directories, provocative/protected-class-sensitive social-claim material, local deployment/model-runner folders, private corpus bodies, prompt/agent/skill instruction bodies, scratch/meta workspaces, generated media, raw logs/prompts/trajectories, evaluator/oracle payloads, raw benchmark outputs, model/checkpoint artifacts, biometric/capture data, creative/canon/world-packet drafts, service configuration, raw test/counterexample bodies, dirty patch/reject variants, cache/build/vendor directories, local `.env`-style material, and too-skeletal placeholders.
- Updated index News and Queries sections for the new nightly desk page. Working-tree content-page count moved 214 → 215; the isolated committed-tree update moves 177 → 178 because pre-existing uncommitted source-essence, East Bay property-monitor, and wiki `.hermes/` material remained untouched and uncommitted.
- Lint result: `bash scripts/lint-wiki.sh` passed in the working tree before staging: 215 content pages checked; committed-tree verification passed after staging: 178 content pages checked.