AI Agent Operational Governance
Defines AI agent operational governance: the layer of authority, evidence, tools, traces, costs, and human gates that makes agents usable in real work.
Essays and memos about software, AI, product judgment, personal systems, and the work behind public artifacts.
Choose the route that matches your current problem, then read the connected pieces in order.
Defines AI agent operational governance: the layer of authority, evidence, tools, traces, costs, and human gates that makes agents usable in real work.
Explains why chat logs are not a durable knowledge base and how Obsidian, local Markdown, Codex, project directories, and publication nodes form a federated system.
Explains why Codex and Claude are entry points while the durable SAH core is the MCP server, project workspace, bridge trace, and SketchUp bridge.
Argues that an issue should be an executable contract for AI work, with scope, context, gates, evidence, and release boundaries instead of a loose todo.
Defines a project-specific AI delivery pipeline: AI acts as a worker while the project owns task intake, context, gates, evidence, and release boundaries.
Explains why AI tools for designers should behave as durable workbenches, not just chatbots, image generators, or automatic drafting plugins.
An open-source project connecting agent CLIs, an MCP server, a SketchUp Ruby bridge, structured design models, and runtime skills into a verifiable design workflow.
Connects trajectory evaluation, goal-level cost, semantic recovery, and confidence calibration into a practical evaluation frame for agent workflows.
Defines AI agent operational governance: the layer of authority, evidence, tools, traces, costs, and human gates that makes agents usable in real work.
Explains residual drift in long-running agent sessions and why serious workflows need commitment tracking, not just memory or contradiction checks.
Shows how evidence-carrying actions and source authority rules prevent fluent text from overriding structured data, tool output, or source evidence.
Explains why agent actions need runtime authority checks and autonomy gates, instead of relying only on plan-time approval or a confident task plan.
Treats MCP servers, plugins, tool descriptions, and dependencies as part of the agent control surface that needs governance and supply-chain review.
A companion essay on why AI's near-term impact may first appear as compressed execution work, narrower entry-level paths, and weaker labor leverage.
A personal essay on why a real issue-driven AI delivery workflow changed my timeline for AI's impact on ordinary work from eight to ten years to three to five.
Defines what the central knowledge base, project directories, and publication node each own, and why a single vault template is not enough for fact governance.
Explains why chat logs are not a durable knowledge base and how Obsidian, local Markdown, Codex, project directories, and publication nodes form a federated system.
Explains Obsidian's real role in an AI knowledge base: local Markdown, links, backlinks, and human review, not an automatic truth machine.
Summarizes the system boundaries and failure modes: privacy, project truth sources, premature automation, site truth drift, and AI output mistaken for evidence.
Explains why project AI should not read the whole central knowledge base by default and how local facts, narrow interfaces, and selective memory reduce risk.
Explains why AI tools for designers should behave as durable workbenches, not just chatbots, image generators, or automatic drafting plugins.
Explains why SketchUp execution should be planned as a bridge trace before calling the Ruby bridge, so agent actions remain inspectable and repairable.
Shows how an agent CLI, MCP server, SketchUp Ruby bridge, runtime skills, and a structured design model turn design intent into verifiable project state.
Explains why Codex and similar AI coding tools should remain bounded workers, not long-running schedulers or release controllers.
Many AI task failures do not happen because the model cannot modify code. They happen because the model reads the wrong context.
Explains why the SketchUp scene is an execution view while the structured design model is better for agent reasoning, validation, repair, and replay.
Discusses how AI moves designer value from low-level execution toward goals, constraints, feedback, and version judgment.
A product and workflow essay arguing that AI should reduce low-level drafting work so designers can focus on intent, judgment, constraints, and tradeoffs.
Explains why AI delivery must include verifiable proof: tests, logs, screenshots, risk notes, and a review path, not only a claim that work is done.
Explains why AI pull requests should include reviewable evidence bundles, not just code changes and a claim that tests pass.
Shows how repeated AI mistakes should become project memory, updated rules, and regression checks so the delivery pipeline improves after each failure.
Explains why AI design collaboration needs project workspaces, structured truth, and resumable project state instead of chat history alone.
Shows how GitHub labels, branches, and comments can become an AI engineering state machine that keeps issue repair work observable and controllable.
Explains why an agent harness needs headless smoke, live bridge checks, release smoke, and installed package verification.
Shows where humans should stand in an AI delivery pipeline: requirements, risk boundaries, release decisions, rollback choices, and final acceptance.
Explains why real projects should put AI work in isolated branches or workspaces, then move changes through explicit gates before they reach the main codebase.
Argues that an issue should be an executable contract for AI work, with scope, context, gates, evidence, and release boundaries instead of a loose todo.
Explains how the MCP server, execution trace, JSON-RPC, Ruby bridge, and feedback loop turn natural language into executable operations.
Explains how to safely extract public methodology, templates, and toy examples from private production AI workflow experience.
Explains why a merged PR is not the end of AI engineering work; release verification, rollout checks, and post-merge evidence still define done.
Defines a project-specific AI delivery pipeline: AI acts as a worker while the project owns task intake, context, gates, evidence, and release boundaries.
Explains why plan import should preserve source evidence and generate repairable working truth instead of promising perfect recognition.
Explains the product boundary between product runtime skills, project dynamic skills, and maintainer development skills.
Explains why a SketchUp agent needs a semantic component registry that exposes placement intent, constraints, and reuse rules instead of only searching 3D model assets.
Explains why Codex and Claude are entry points while the durable SAH core is the MCP server, project workspace, bridge trace, and SketchUp bridge.
The most common risk in AI-assisted development is not that the model cannot write code. It is that the model starts writing code too early.
Explains why AI repair work should move through triage, analysis, implementation, evidence, and review gates instead of jumping straight to code.
Explains how screenshots, renderings, and top views should become structured repair actions instead of another pile of untracked opinions.
Uses SketchUp Agent Harness to explain why AI design tools need an editable, verifiable, repairable source of truth instead of only generating finished-looking output.
The operating rule for this site: publish the canonical version here, then adapt it outward to social platforms.
A note on separating raw material from publishable work when Codex is part of the workflow.