Decision-Theory

The Gate That Said No

Whether to adopt a tool you built is itself a decision under uncertainty, so it deserves a decision rule, not a leaderboard. I built a pre-committed, frozen-blind adoption gate over a posterior on the utility difference between two ways of answering questions about my own documents: a model that always answers, and Bayesian machinery that abstains when it is not sure. The gate refused adoption despite a clearly positive mean, because the choice hinges on a number I cannot honestly introspect: how much a confident wrong answer actually costs me. Then I built the abstaining machine and ran it, and it answered almost nothing, withholding even the values it had read faithfully. Not a failure of the decision math but its honest consequence. The silence pulled apart into three separable beliefs (can I find the fact, did I read it faithfully, is it still true), only one of which is my utility talking, and a wrong answer turned out to be, more often than not, a faithful read of a fact that has since gone stale. The gate moves only as the missing number is earned from behaviour, never from a hand on the prior.

essaysbayesianaidecision-theory

Make Your OpenClaw Agent Cheaper, and Measure It Yourself

credence-pi is an OpenClaw plugin plus a local daemon that learns your agent and acts at two points by expected utility: it routes each turn to the cheapest model whose expected accuracy justifies its cost, and it governs each tool call your agent proposes, blocking the wasted ones and flagging injected exfiltration as a confirmation. Routing is on by default; shadow mode reports what it would save and block on your own sessions, with its own false-block rate, before it enforces anything. Early-stage research that wants the community's help; local-first, installable today.

essaysbayesianaidecision-theory

Make Your OpenClaw Cheaper and Harder to Fool

credence-pi is an OpenClaw plugin plus a local daemon that learns your agent's behaviour and governs its tool calls by expected utility: it blocks the calls your agent wastes, flags injected exfiltration as a confirmation at 0.94 precision on a public benchmark, and makes ask-or-proceed decisions a fixed rule provably can't reproduce. Research-stage, local-first, installable today.

essaysbayesianaidecision-theory

What a Regex Can't Do

The governor from 'The Brain is Opaque to the Body' now ships as an OpenClaw plugin: wasted tool calls blocked at precision and recall 1.0 on real sessions, injected exfiltration surfaced as a confirmation at 0.94 precision. And an argument: several of these behaviours are beyond any hand-tuned heuristic, because matching them re-derives Bayesian decision theory.

essaysbayesianaidecision-theory

The Brain is Opaque to the Body

A first-pass body-brain architecture for governing a coding agent's tool_call hook with a Bayesian decision-theoretic brain. The wire schema is fixed by what Pass 1 ships; Pass 2 swaps the posterior representation without disturbing it. Plus what credence-lint and a precedent system caught at the eleventh hour.

essaysbayesianaidecision-theory