Nightly Src Projects Desk Raw Survey (2026-05-11)
This raw note preserves the public-safe basis for the 2026-05-11 nightly src/ projects desk. It summarizes inspectable repository evidence only: README/docs, manifests, branch and commit metadata, status summaries, safe filenames, mtimes, tests, plans, and visible checked-in artifacts. It does not publish secret-bearing files, local settings, raw prompts/logs/trajectories, private corpus bodies, evaluator payloads, raw benchmark outputs, checkpoint/model artifacts, biometric/capture data, generated media bodies, or sensitive/provocative material.
Where a directory is local-only, sensitive, private-corpus-backed, artifact-heavy, or too skeletal, this note uses category-level wording. The source tree is not a confessional booth; it is a substrate for evidence.
Survey scope and method
- Survey root:
/Users/ericfode/src. - Survey timestamp: 2026-05-11.
- Full top-level directory count: 39, including hidden directories.
- Execution shape: exactly 10 top-level Hermes survey lane identities, dispatched as one batch of 10.
- Lane recursion: all 10 lane summaries reported
delegate_taskavailability. Each lane spawned a three-way survey team for purpose/docs/manifests, live-work evidence, and public-safety eligibility; each subteam reported one further three-way leaf recursion. Further recursion ended at leaf checks/depth limits. - Controller audit: after the lane summaries returned, a read-only controller audit re-enumerated all 39 top-level directories and spot-checked git state/HEAD/status or non-git top-level shape. No missing directory was found.
- Evidence allowed: README/docs/plans, manifests, branch/status/log metadata, safe modified/untracked filenames, mtimes, tests, checked-in reports, and visible artifacts.
- Evidence excluded: secret contents,
.envcontents, hidden local settings, raw prompts/logs/trajectories, hidden evaluator/supervisor payloads, private corpus bodies, explicit/provocative/unsafe material, raw benchmark outputs, checkpoints/model artifacts, biometric/capture data, generated media bodies, and directories too skeletal for a responsible public claim. - Illustration: generated locally as symbolic SVG editorial art at
queries/news-assets/2026-05-11-project-desk-hero.svg; it is not a screenshot.
Ten survey lanes
- Hidden local assistant settings; hidden tinygrad research checkout; uncommitted harness workspace; one sensitive social-claim notebook held back by category.
basis;basis-hermes;basis-jcode;cardgame1/ Dungeon Steward.- Empty
creative;deer-flow; privacy-sensitiveFACEMUSIC;gas-city-but-its-just-codex. gemma-dungeon;gemma4-tinygrad-opt;handterm;hoid.is-codex-better;is-it-formal;justfooln;kettlebellsim.- skeletal Kimi settings area; local
langfuse; local Hermes model runtime; scratchmeta-hermesworkspace. nnpl-external-latent-bus;nnpl-shared-bus;nnpl-typed-boundary-ir;openai-symphony.- empty
overengineeredlife;silly-pi-stuff; privatespec-dataset-evolution-corpus; nested internal skill workspace. steward;testing-rl;testing-rl-hermes; emptytinygrad.tinygrad-gemma; emptytinygrad-gemma-gemini;tinygrad-gemma-kimi.
Public-safe lead candidates
Test-writing, verifier, and evaluation environments
testing-rlevidence: git repo onmaster, HEAD46bbb48from 2026-05-10, ahead of origin by 1 and dirty with modified README/docs/scripts/tests plus untracked local-corpus test material. Safe evidence includes README, pyproject CLI entries,SPEC.md, workflow docs, artifact schemas, environment contract, non-cheating writer docs, counterfactual/verifier/history docs, Lean material, adapter docs, dashboard pages, and 19 test files by filename. Safe summary: an RL-style environment for training/evaluating agents that write high-value software tests against bounded workspaces and hidden reference/replay evidence. Raw reward feeds, local/private corpus details, dashboard payloads, benchmark bodies,.hermes/.codexinternals, logs/prompts/trajectories remain withheld.testing-rl-hermesevidence: clean local git repo onmain, HEAD6cbca51from 2026-05-02, no remote observed. Safe evidence includesMASTER_PLAN.md, pyproject console script, test-generation RL environment docs, benchmark/data strategy, verifier-training/history-fixture docs, reports by filename, source, and 3 test files. Safe summary: artifact-first prototype environment for test-generation agents with supervisor-held references/mutants and deterministic grading concepts. Raw fixture/oracle/mutant/report bodies remain withheld.is-it-formalevidence: no-commit git repo with Lean/Lake scaffold,README.md,IsItFormalsources, JSON examples by filename, and Python grader tooling. Safe summary: small Lean/Python scaffold for classifying how formal a claim is. It lacks a license and committed history, so this remains prototype copy.
Basis, Steward, and spec-code grounding
basisevidence: Elixir/Mix git repo onmain, HEADa5544e0from 2026-05-07, tracking origin, with an untracked reducer experiments directory. Safe evidence includesspec.md, Mix metadata, reducer and implementation-imaginer component specs, docs, and tests. Safe summary: draft Elixir/BEAM system for reducing prose/spec artifacts into structured, provenance-backed specification state.basis-hermesevidence: clean Python/Hermes plugin repo onmain, HEAD0061d32from 2026-05-05, with README,plugin.yaml, pyproject, dashboard manifest, reducer/validator source, CLI/tool handlers, and tests. Safe summary: Hermes-native wrapper exposing deterministic Basis reducer and packet-validator surfaces.basis-jcodeevidence: git repo onmain, HEAD4b1e621from 2026-05-05, ahead of origin by 10 and dirty with tracked deletions in reducer examples/UI. Safe summary: category-level Jcode-native reducer/control-plane variant for ledgers, validation, worker packets, and dashboard projections. Raw.basisruns, prompts, streams, validation bodies, worker packets, run graphs, and output artifacts are withheld.stewardevidence: design-stage git repo onmain, HEADba88837from 2026-05-05, dirty with modified design docs and untracked service-vision/ADR/schema/query-contract material. README explicitly frames the repo as ideation/design only. Safe summary: design-stage semantic/provenance service concept over specs, code, Git history, agent work, reasoning, and verification; not an implemented product.- The private spec corpus was surveyed only as category-level evidence of a gated research corpus; raw copied artifacts and compliance/scan payloads remain private.
Gemma, tinygrad, symbolic game state, and NNPL benches
gemma-dungeonevidence: clean git repo onmain, HEAD1ebd8a8from 2026-05-11. Safe evidence includes README, pyproject, docs/specs, schemas, tests, CLI/package surfaces, world-model/action-head/replay/policy-eval/runtime/web-viewer tests by filename. Safe summary: embedding-native, symbolically audited roguelike research workspace using explicit game state, legal-action scoring, replay/schema contracts, and Gemma/tinygrad policy experiments. Replay payloads, exports, prompt/logit artifacts, datasets, and internal plans are withheld.tinygrad-gemmaevidence: git repo onmain, ahead of origin by 93, tracked tree clean with many untracked local artifacts. Safe evidence includes README, pyproject packagetinygrad-gemma, CLI/chat entry points, docs/configs/benchmarks/scripts/tests, CI workflow, and recent 2026-05-06/07 worker-round commits. Safe summary: native tinygrad Gemma 4 implementation with local checkpoint loading, tokenizer and multimodal support, KV-cache generation, CLI/chat, training/checkpoint helpers, quantization surfaces, and tests. Raw checkpoints, benchmark logs, performance claims, and untracked artifact bodies are withheld.gemma4-tinygrad-optandtinygrad-gemma-kimiare category-level optimization sandboxes. The former lacks a top-level git repo/README; the latter is dirty onopt/attentionwith modified core/benchmark files and raw results/patch artifacts. Summarize them as Gemma/tinygrad optimization work only; do not publish benchmark payloads or patch-race artifacts.nnpl-external-latent-busevidence: non-git Python/Numpy prototype with README, project brief, pyproject, docs, source, artifacts by filename, and 52 test files. Safe summary: external/internal latent-bus architecture for option-preserving planning and bridge-dependence probes.nnpl-typed-boundary-irevidence: non-git Python/tinygrad prototype with README, project brief, pyproject, docs, data/readme, source, results by filename, and 37 test files. Safe summary: typed IR boundaries for validated planning artifacts, legality, auditability, deterministic rendering, and failure localization.nnpl-shared-busrecords useful negative/limited shared-bus experiment evidence but is kept category-level because run/checkpoint/trace/eval artifact categories dominate the visible surface.
Harness/control-plane and orchestration side rooms
gas-city-but-its-just-codexevidence: dirty git repo oncodex/native-codex-ui, HEAD198aefcfrom 2026-04-21, with README, Rust workspace, workflow-ledger specs, templates/schemas, MCP/gRPC/app-server surfaces, operator tooling, docs/scripts, tests, and Lean/formal material. Safe summary: category-level Codex-native durable workflow/control-plane research. Runtime state, transcripts, context boards, benchmark payloads, databases, workflow IDs, logs, and live operator state remain withheld.another-harnessevidence: no-commit git repo onmainwith hundreds of untracked entries, Lean/Lake metadata, docs, tests, tools, benchmarks, and plugins. Safe summary: early Codex/Hermes harness and Lean formalization workspace. No maturity or release claim is justified.openai-symphonyevidence: dirty Elixir/Phoenix repo onmain, HEAD58cf97dfrom 2026-04-27, with README/SPEC, Elixir manifest/docs, LiveView/API/dashboard/logging/token-accounting material, tests, and modified app-server/orchestrator/status files. Safe summary: engineering-preview orchestration service for issue-tracker-driven isolated coding-agent runs. Logs, workflow/prompt bodies, hidden tooling, and local runtime details are withheld.deer-flowevidence: public LangGraph/LangChain-style agent harness checkout with backend/frontend/Docker/docs/tests, dirty local nginx config and.floxstate. Safe summary: public super-agent harness checkout; local config remains private.is-codex-betterevidence: no-commit draft repo with README/docs/plugins/install scripts/state procedure material. Safe summary: category-level draft Codex/Hermes harness-extension repo; profile/session/procedure internals remain withheld.
Simulation, terminal, interface, and craft work
kettlebellsimevidence: clean git repo oncodex/reward-audit-and-swing-training, ahead of origin by 36, HEAD1d973defrom 2026-05-09. Safe evidence includes package metadata, planning/gate docs, bounded Modal/Isaac wrapper acceptance docs, scripts, configs, recipes, skills, and 97 test files by filename. Safe summary: simulation-first kettlebell swing biomechanics/path-signature toolkit with local deterministic planar gates and permission-gated remote Isaac/Modal probes. Logs, trajectories, rollouts, generated media, run artifacts, checkpoints, and service/account details remain withheld.handtermevidence: clean Rust git repo onmaster, HEAD977e709from 2026-04-19, with README, Cargo workspace, MIT license, optimization docs, CI, tests, and recent graphics/kitty-upload refactors. Safe summary: Wayland-native Rust terminal emulator focused on low-latency, resource-efficient multi-window operation.FACEMUSICwas surveyed as a privacy-sensitive face-controlled music prototype with web/iOS/Rust/ML components. Because the domain is biometric-adjacent and the tree is dirty/untracked, only category-level mention is appropriate.hoidwas surveyed as a structured world-packet / creative world-studio prototype with active Phoenix work, but creative corpus/story/world/music/comic bodies, prompt/transcript/event data, generated media, and secret/env-bearing categories keep it category-only.cardgame1/ Dungeon Steward has real Godot project and test/design evidence, but generated-art, prompt, model/checkpoint, ignored env/session-log, and simulation artifact surfaces keep this run at category level.
Held back from project-specific public detail
The survey fully held back, or reduced to category-only mention, hidden local settings, hidden-only or empty directories, one sensitive social-claim notebook, local deployment/model-runner folders, private corpus bodies, prompt/agent/skill instruction bodies, scratch/meta workspaces, generated media, raw logs/prompts/trajectories, evaluator-like payloads, hidden references/oracles, benchmark raw outputs, model/checkpoint artifacts, biometric/capture data, creative story/canon drafts, service configuration, raw test/counterexample bodies, cache/build/vendor directories, and all too-skeletal placeholders.
Editorial synthesis
The publishable movement clusters around six themes:
- test-generation and verifier environments are the strongest live-work signal tonight, with
testing-rlmoving on May 10 andtesting-rl-hermespreserving the smaller prototype lineage; - specification work is spreading from Basis packets into Steward-style durable provenance services;
- Gemma/tinygrad work now includes both model-runtime benches and a symbolic roguelike environment that can expose policy/action-head claims through schemas and tests;
- NNPL remains useful when it preserves negative results and typed-boundary claims rather than merely promising latent magic;
- orchestration repos are rich but often dirty, internal, or artifact-heavy, so public copy should emphasize architecture and withhold run state;
- craft projects remain publishable when they bring ordinary proofs of life: README, license, manifests, tests, clean git state. The Cargo manifest remains a modest but dignified epistemology.
A public note can say that much. It should not say more merely because the filesystem was candid.