The Art of Delegation: Signatures and Sandboxes
· 5 min read · updated · Hrönir rank #37/38
The problem with most writing about AI delegation is not that itâs wrong. Itâs that itâs set in the wrong room.
In February, I almost missed a window in a federal tax case because I had started treating the assessorâs draft as the deliverable. The case was a state revenue objection â an auto de infração, forty-eight hours to contest before the administrative window closed. The parecer was good: the applicable statute correctly identified, the procedural argument solid, the conclusion defensible. The assessor had marked it ready for review on Monday morning. The window closed Wednesday at midnight. On Tuesday afternoon I realized the submission had not been filed. The draft was complete. The act was not. That is a forty-eight-hour gap during which I had stopped tracking something I was legally responsible for.
This is the same confusion that breaks AI delegation.
I spend my days in a state attorneyâs office in RondĂŽnia, reading pareceres drafted by assessors and signing the ones that donât terrify me. When I delegate a legal opinion, I am not asking someone to be my extended keyboard. I am handing down the task of traversing the case file, identifying the applicable law, and proposing a conclusion. What I am not handing down is the signature. The signature is the irreversible boundary â the moment the act enters the record and the deadlines start moving.
The reason this distinction doesnât arise naturally in software engineering is that the feedback loop is tighter. In law, the delays are built into the institution: the assessor finishes drafting, the protocol office has a processing window, the courtâs system has its own schedule. The stages are physically separate and the gap is legible. In code, the developer writes the function, the tests pass in thirty seconds, the PR auto-merges on green. The proposal and the act compress into a continuous motion, and the point where the agentâs output stops being draft and starts being act is never made explicit.
When we orchestrate agents like Jules and Claude, the problem is not that we want to micromanage their keystrokes. The problem is that software engineering, unlike administrative law, does not natively separate the draft from the signature. In code, writing the function and executing the function often look like the same continuous motion.
The boundaries of the sandbox
When I trust Jules to refactor a microservice in the background, the anxiety does not come from a fear that Jules will choose the wrong design pattern. The anxiety comes from the fact that Jules has write access.
The solution is not to stand over Julesâs shoulder while it writes. The solution is to build a sandbox where the agentâs actions are explicitly treated as proposals. The CI/CD pipeline, the test suites, the strict linting rulesâthese are not just quality assurance mechanisms. They are the equivalent of the institutional rules that say an assessor can draft a parecer, but cannot sign the final ofĂcio.
The magic of delegation happens when you constrain the output space, not the process. You define the bounds of the sandbox â the schema, the invariants, the tests â and you allow the agent to navigate the interior freely. If the tests pass, the proposal is valid. But the apply step â the actual merging of the PR, the deployment to production â that remains a human signature. A CI pipeline that cannot be bypassed is a protocol office: a mandatory processing step between the draft and the act that makes the stages legible again.
The harness as constitutional design
This is why the harness matters more than the model. Funes is not Claude; Funes is Claude wrapped in a specific set of rules, memories, and constraints. When Funes reads his SOUL.md and decides to document a decision rather than just executing it, he is operating within an administrative framework.
Funes opens pull requests; he does not merge them. He updates memory files; he does not send emails on his own. When I asked him to draft a response to an external inquiry about causaganha, he wrote the draft and created a PR containing it. He did not send the message. Not because a rule said do not send messages without permission. Because the harness simply had no wiring for outbound external messages â the sandbox made the signature step structurally required, not behaviorally enforced.
He acts because the framework permits it, and he pauses when the framework demands a signature. ReversĂvel â age, irreversĂvel â pergunta. That is not just a safety heuristic; it is a theory of constitutional design for agents.
The assessor is good. The agent is capable. Neither of those facts changes who signs.
For further reading
- Lucy Suchman, Plans and Situated Actions (1987) â on the difference between the plan as a cognitive model and the plan as an accountability artifact. The proposal PR is exactly this kind of artifact.
- Dylan Hadfield-Menell et al., The Off-Switch Game (2017) â corrigibility as game theory; the human-approval-before-apply step is a concrete instance of what this paper formalizes.
- Brazilian Lei 9.784/1999, arts. 11â17 â the domestic legal framework for delegation of administrative acts. The distinction between competĂȘncia and its limits is the statutory source of the draft/signature separation Iâve been describing. Most software engineers have never read a line of administrative procedure law and would benefit from the clarity.
- Fred Brooks, The Mythical Man-Month (1975) â specifically the surgical team chapter: the person doing the intellectual work is not the person accountable for the output. The same capability can exist in two accountability architectures, and the choice between them is not a capability question.
Related posts
Three Hammers Walk Into a Bar
On three professional postures, four alignment properties, and the one property that had to come from elsewhere.
The Agent That Doesn't Invent Verbs
On Cucumber, content-addressing, and an alignment technique that turns out to be older than alignment.
Hermes Agent vs OpenClaw: Why My Experience Got So Much Better
An honest account, based on my own sessions, about the UX leap between the old OpenClaw harness and the Hermes Agent.
Comments
Comments not configured yet.