Agent-Driven Policy

June 16, 2026
blog image

A legislator is not, fundamentally, a holder of opinions. A legislator is a cognitive engine asked to convert the chaos of a society into a small number of binding, enforceable, legitimate rules—and the engine is catastrophically underpowered. The defining failure of modern government is not corruption or cowardice; it is throughput. A single human mind, backed by a thinning staff, cannot read a four-thousand-page omnibus, scan fifty jurisdictions for what already worked, weigh how severe a problem truly is, estimate whether it can be moved at all, ground a decision in the research, and predict how millions of people will respond. So each of those faculties gets outsourced to whoever arrives with the answer pre-chewed—and the only actors who can afford to pre-chew it are the best-funded interests. The agent does not replace the legislator. It rebuilds the missing faculties one by one.

The first faculty is Comprehension: the ability to see how the system actually works—what the existing law already says, where it contradicts itself, which statutes are dead, who really benefits. Today this faculty barely exists; legislators vote on text they have not read and cannot, structurally, find time to read. An agent reads all of it, continuously, and turns the opaque corpus of accumulated law into a queryable map.

The second faculty is Significance: the discipline of deciding which problems are even worth a law. Legislative attention is the scarcest resource in a republic, and it is allocated by noise—by whichever crisis trends, whichever lobby shouts loudest. An agent can triage a thousand candidate problems by reach, severity, and reversibility, turning a politics of reaction into a politics of deliberate prioritisation.

The third faculty is Tractability: the sober estimate of how hard a problem is to actually move. Most political energy is spent on problems that look urgent but are structurally immovable, while tractable wins go unnoticed. An agent can model expected effect size against implementation difficulty, separating the problems a law can solve from the ones it will only perform solving.

The fourth faculty is Diffusion: the capacity to learn from everyone who already tried. The fifty states and the hundred-ninety countries are a vast, running experiment, and almost none of that evidence reaches the drafter in time. An agent mines the entire global record of policy—what spread, what worked, what backfired—and delivers proven templates instead of blank pages.

The fifth faculty is Evidence: the loyalty to what the research actually shows rather than what the talking point asserts. The evidence base is enormous and growing, and it is almost entirely unscanned by the people writing law. An agent grounds every claim in the studies, the trials, and the data—and, critically, supplies that grounding without a client behind it.

The sixth faculty is Simulation: the power to test a law before it is binding. We ship software behind a staging environment and a rollback button; we ship law to a continent on a floor vote and a hope. An agent war-games legislation against a synthetic population, surfacing the second- and third-order effects—the cobra-breeders, the gaming, the perverse incentives—in silico, before they hit reality.

The seventh faculty is Composition: the act of turning settled intent into precise statutory text. This is the one task already visibly migrating to machines, from a city ordinance drafted by a chatbot to a national drafting assistant trained on a million sections of law. Done well, it collapses the cost of writing good law; done carelessly, it floods the system with bad law faster than ever.

The eighth faculty is Constituent Sensing: the ability to hear what the public actually needs, directly and at scale, rather than through the filter of whoever can manufacture the loudest voice. Today a representative’s sense of the public is a handful of town halls and a flood of form letters; when millions of comments arrive, the genuine signal drowns. An agent listens to all of it, strips out the astroturf, and renders the real distribution of need.

The ninth faculty is Deliberation: the discipline of forcing a proposal to survive its strongest objections before it becomes law. Legislatures vote under time pressure and tribal reflex, rarely steelmanning the other side or naming who pays. An agent cross-examines every bill—generating the best case against, the trade-offs, and the role-reversal test—so the decision rests on public reasons, not on whoever held the floor.

The tenth faculty is Oversight: the loop that learns whether a law actually worked. Most legislation is passed once and never revisited, accumulating as dead statute no one tests. An agent measures every law against its own stated goals, flags failure early, and triggers the revision or repeal that turns lawmaking from a one-way act into a system that learns.

This article is a field guide to the ten capabilities of the Augmented Legislator—the Legislative Intelligence Stack, where Constituent Sensing brackets the front of the cycle and Oversight the back, with Deliberation standing between knowing and writing. Each capability is treated identically: a precise Definition, its Place in lawmaking in five aspects, the twelve principles that make it powerful, the three patterns by which it operates, the key mechanisms with real working examples, the way agents change the game, the four principles of that shift, and the honest advantages and disadvantages. The article closes with a phased Action plan for building the Stack inside a real legislature without surrendering the one thing that must remain human: the vote.

Summary

1) Comprehension

What it is — The faculty of seeing the existing system as it really is: the full corpus of law, its contradictions, its dead letters, its true beneficiaries.
How it works — Continuous reading and structural mapping of statutes, precedents, and proposed text into a queryable model.
Why it matters — You cannot reform a system you cannot see; comprehension is the precondition for every other faculty.
Failure mode — Voting blind: passing text no human has read or understood, captured by whoever summarises it.

2) Significance

What it is — The triage faculty: deciding which problems are meaningful enough to deserve scarce legislative attention.
How it works — Scoring candidate problems by reach, severity, urgency, and reversibility into an explicit priority order.
Why it matters — Attention is the binding constraint of a republic; misallocating it wastes the whole machine.
Failure mode — Government by trending crisis: loud problems crowd out large ones.

3) Tractability

What it is — The realism faculty: estimating how hard a problem is to actually move with a law.
How it works — Modelling expected effect size against implementation difficulty, cost, and resistance.
Why it matters — Effort spent on immovable problems is the largest hidden waste in politics.
Failure mode — Performative legislation: passing laws that look like solutions but cannot bite.

4) Diffusion

What it is — The learning faculty: mining other jurisdictions for policies that already worked.
How it works — Scanning the global record of adoption, outcomes, and failures to surface proven templates.
Why it matters — Most problems have been solved somewhere; reinvention is pure waste.
Failure mode — Parochial blindness: drafting from scratch while the answer sits in another statehouse.

5) Evidence

What it is — The grounding faculty: tying decisions to what research and data actually show.
How it works — Retrieving, weighing, and citing studies, trials, and evaluations against each claim.
Why it matters — Without evidence, law is narrative; with it, law can be corrected.
Failure mode — Lobbyist epistemics: the best-funded interest supplies the “facts.”

6) Simulation

What it is — The foresight faculty: testing a law against a model of the world before it is binding.
How it works — War-gaming policy on synthetic populations and economic models to expose second-order effects.
Why it matters — Unintended consequences are where good intentions go to die.
Failure mode — Shipping to 330 million people with zero unit tests.

7) Composition

What it is — The drafting faculty: converting settled intent into precise, conflict-free statutory text.
How it works — Generating and red-lining legal language grounded in the existing corpus.
Why it matters — The gap between intent and text is where loopholes and litigation live.
Failure mode — Legislative spam: cheap drafting that floods the system with volume, not law.

8) Constituent Sensing

What it is — The input faculty: hearing what citizens actually need, at scale, beneath the manufactured noise.
How it works — Collecting, deduplicating, and classifying public input while filtering astroturf and fraud.
Why it matters — A representative who cannot hear the represented governs blind to them.
Failure mode — Mistaking the loudest manufactured campaign for the public will.

9) Deliberation

What it is — The reasoning faculty: stress-testing a decision against its strongest objections.
How it works — Generating the opposing case, the trade-offs, and the role-reversal test.
Why it matters — A law unexamined by its best critics is a law waiting to fail.
Failure mode — Tribal reflex: passing on “our side” rather than on public reasons.

10) Oversight

What it is — The feedback faculty: learning whether a law actually worked after passage.
How it works — Measuring real outcomes against stated goals and triggering revision or repeal.
Why it matters — Without a feedback loop, laws accumulate as dead, unexamined sediment.
Failure mode — Ghost laws: passed once and never revisited.


The Capabilities

1) Comprehension

Definition

Comprehension is the faculty of accurately seeing the system a legislator proposes to change—the full body of existing law, its internal contradictions, its obsolete provisions, and its real-world beneficiaries—before touching it.

It functions as the legislature’s situational awareness layer: the precondition that makes every downstream faculty possible, because no problem can be triaged, no law simulated, and no text drafted against a system that is invisible to the person governing it.

Place in lawmaking: 5 aspects

  1. The precondition for legitimacy

    • A vote on unread text is a vote without consent of the mind that casts it.

    • Comprehension is what converts a signature into an actual decision.

  2. The map of the existing corpus

    • Statute is accreted over centuries; no single mind holds it.

    • Comprehension turns that sediment into a navigable structure.

  3. The contradiction detector

    • New law collides with old law in ways drafters rarely foresee.

    • Comprehension surfaces conflicts before they become litigation.

  4. The dead-letter finder

    • Much law is obsolete, redundant, or never enforced.

    • Comprehension distinguishes living rules from fossils.

  5. The beneficiary lens

    • Every rule moves value to someone; the question is whom.

    • Comprehension makes the distributional reality legible.

Why it works: 12 principles

  1. Externalised memory — it stores the corpus outside any single overloaded staff.

  2. Structural reading — it maps relationships (this section amends that one), not just words.

  3. Completeness — it reads all of the text, not the fraction a human samples.

  4. Cross-reference — it links proposed text to every statute it touches.

  5. Provenance — it traces where language came from and who supplied it.

  6. Comparability — it sets current law beside the proposed change, clause by clause.

  7. Continuity — it persists across electoral cycles, immune to staff turnover.

  8. Speed — it reads in minutes what once took staff weeks.

  9. Searchability — any clause becomes retrievable on demand.

  10. Version awareness — it tracks how text mutated across drafts and amendments.

  11. Scale-invariance — a thousand-page bill is no harder to read than a one-pager.

  12. Neutrality — it gives every provision equal attention, not selective focus.

Three major patterns of how it works

  1. Ingest → structure → query

    • Ingest the raw corpus and the proposed text

    • Structure it into linked clauses, definitions, and cross-references

    • Expose it to natural-language interrogation

  2. Compare → flag → explain

    • Compare new language against existing law

    • Flag conflicts, redundancies, and dead letters

    • Explain each flag in plain language

  3. Trace → attribute → expose

    • Trace clauses to their textual origin

    • Attribute them to a source (agency, interest, model bill)

    • Expose the provenance to the legislator and the public

Key mechanisms and how they work

A. Statutory research and pruning systems

  • Models that read the entire code and locate the relevant, redundant, or obsolete law.

  • Example: Stanford’s RegLab built a statutory-research system that identified relevant law with 94–99% reliability; deployed with the San Francisco City Attorney, it produced an ordinance cutting more than a third of the city’s mandated reports.

B. Code-scale deregulation analysis

  • Running an entire administrative code through analysis to flag what is unnecessary.

  • Example: Ohio ran its roughly fifteen-million-word administrative code through an AI analysis that flagged two million words and some 900 rules for removal, putting the state on track to cut nearly a third of the code.

C. Provenance and model-legislation detection

  • Computational comparison that reveals who actually wrote a bill.

  • Example: The “Copy, Paste, Legislate” investigation analysed nearly a million state bills and found more than 10,000 copied almost verbatim from interest-group “model legislation,” over 2,000 of which became law.

How AI changes the game: definition

AI turns comprehension from a sampling problem into a total-coverage problem—reading the whole corpus, mapping its structure, and answering questions about it in real time—while shifting the risk from “we missed something” to “we over-trusted the summary.”

In short: the legislator can finally read everything.

Four principles of how AI changes the game

  1. From sampling to totality — from reading a fraction of the text to processing all of it.

  2. From text to structure — from prose pages to a linked, queryable graph of law.

  3. From periodic to continuous — from a one-time read to an always-current model of the corpus.

  4. From opaque to attributed — from anonymous clauses to traceable provenance.

Advantages and disadvantages

Advantages

  1. Ends the absurdity of voting on unread text.

  2. Surfaces conflicts and dead letters before they cause harm.

  3. Exposes hidden authorship and beneficiaries.

  4. Gives a small office the reading capacity of a large institution.

Disadvantages

  1. A confident, wrong summary is more dangerous than an honest gap—automation bias is real.

  2. Whoever tunes the comprehension model shapes what the legislator “sees.”

  3. Structural maps can flatten the deliberate ambiguity that law sometimes needs.

  4. Total legibility of the corpus is also a tool for those who would exploit it.


2) Significance

Definition

Significance is the faculty of deciding which problems are meaningful enough to warrant scarce legislative attention—weighing how many are affected, how severe the harm, how urgent the timing, and how reversible the damage.

It functions as the legislature’s triage layer: the discipline that allocates the single most constrained resource in a republic—the finite attention of its lawmakers—toward the problems that actually matter rather than the ones that merely shout.

Place in lawmaking: 5 aspects

  1. The attention allocator

    • There are always more problems than legislative slots.

    • Significance decides what gets a hearing and what does not.

  2. The severity weigher

    • Not all harms are equal; some are catastrophic, some cosmetic.

    • Significance ranks by magnitude, not volume of complaint.

  3. The reach estimator

    • A problem affecting millions differs from one affecting hundreds.

    • Significance scales attention to population touched.

  4. The reversibility filter

    • Irreversible harms deserve priority over recoverable ones.

    • Significance privileges the problems that cannot wait.

  5. The agenda guard

    • Agendas are captured by whoever manufactures urgency.

    • Significance defends the agenda against manufactured noise.

Why it works: 12 principles

  1. Comparability — it puts dissimilar harms on a common scale.

  2. Proportionality — it matches attention to magnitude.

  3. Explicitness — it makes the priority order visible and defensible.

  4. Resistance to noise — it discounts volume in favour of severity.

  5. Forward weighting — it privileges the irreversible and the compounding.

  6. Coverage — it scans the whole problem space, not the trending slice.

  7. Auditability — it leaves a record of why a problem was prioritised.

  8. Multi-dimensionality — it weighs reach, severity, urgency, and reversibility together.

  9. Counterfactual framing — it asks what happens if nothing is done at all.

  10. Stakeholder breadth — it counts the silent affected, not only the vocal.

  11. Recurrence sensitivity — it flags chronic problems that never spike but never resolve.

  12. Revisability — priorities update as conditions change.

Three major patterns of how it works

  1. Scan → score → rank

    • Scan the full landscape of candidate problems

    • Score each by reach, severity, urgency, reversibility

    • Rank into an explicit priority order

  2. Aggregate → weight → triage

    • Aggregate signals of harm across data sources

    • Weight by magnitude and population

    • Triage into act / monitor / ignore

  3. Compare → justify → publish

    • Compare a problem against the current agenda

    • Justify its place with explicit criteria

    • Publish the reasoning for scrutiny

Key mechanisms and how they work

A. Severity thresholds and common currencies

  • Institutions already triage life-and-death allocation with explicit severity metrics.

  • Example: The UK’s NICE allocates health spending against an explicit cost-per-quality-adjusted-life-year threshold, with a formal “severity modifier” that raises the bar a society will pay for the most severe conditions—a working machine for ranking meaningfulness.

B. Evaluation coverage as a significance signal

  • Knowing which programs are unexamined reveals where attention is missing.

  • Example: Reformers behind the U.S. evidence-based-policy movement estimate that only a small fraction of public spending is rigorously evaluated, and propose setting aside as little as 1% of program funds for evaluation—evidence that significance is currently unmeasured.

C. The ghost-law problem

  • Laws passed and never revisited are significance failures by default.

  • Example: Scoping reviews of ex-post legislative evaluation find that the societal impact of most laws is rarely measured after passage, leaving “ghost laws” on the books with no one asking whether they still matter.

How AI changes the game: definition

AI turns significance from an implicit, noise-driven reflex into an explicit, continuous triage—scoring a thousand candidate problems by reach and severity in the time a staffer reads one lobbyist memo—while raising the danger that whatever the model fails to count becomes invisible.

In short: prioritisation becomes deliberate, not reactive.

Four principles of how AI changes the game

  1. From loudest to largest — from the problem that trends to the problem that matters.

  2. From episodic to continuous — from crisis-driven attention to standing triage.

  3. From implicit to explicit — from gut ranking to a defensible, published score.

  4. From narrow to comprehensive — from the visible slice to the whole problem space.

Advantages and disadvantages

Advantages

  1. Protects the agenda from manufactured urgency.

  2. Surfaces large, quiet problems that never trend.

  3. Makes prioritisation transparent and contestable.

  4. Aligns scarce attention with actual magnitude of harm.

Disadvantages

  1. What the model cannot quantify, it may silently de-prioritise.

  2. Severity scoring embeds contestable value judgments as if neutral.

  3. A triage metric, once public, becomes a target to be gamed.

  4. Quantified significance can crowd out legitimate moral salience that resists numbers.


3) Tractability

Definition

Tractability is the faculty of estimating how hard a problem is to actually move—how large an effect a law can realistically produce, against how much cost, complexity, and resistance it must overcome.

It functions as the legislature’s realism layer: the discipline that separates problems a law can genuinely solve from problems a law can only perform solving, redirecting effort from the immovable to the achievable.

Place in lawmaking: 5 aspects

  1. The effect-size estimator

    • Some interventions move the needle; many do not.

    • Tractability forecasts the realistic magnitude of impact.

  2. The difficulty appraiser

    • Implementation, enforcement, and compliance all cost.

    • Tractability prices the friction of making a law bite.

  3. The resistance map

    • Every law meets opposition proportional to whose value it moves.

    • Tractability anticipates where the law will be fought.

  4. The leverage finder

    • Small, well-placed changes can outperform sweeping ones.

    • Tractability locates the high-leverage intervention point.

  5. The futility filter

    • Some problems are structurally beyond a single statute.

    • Tractability flags where law is the wrong instrument.

Why it works: 12 principles

  1. Expected value — it weighs impact by probability of success, not hope.

  2. Cost realism — it counts implementation and enforcement, not just intent.

  3. Resistance modelling — it forecasts opposition and capture.

  4. Leverage focus — it seeks the minimal change with maximal effect.

  5. Mechanism clarity — it demands a causal story for why a law would work.

  6. Boundary honesty — it admits where law cannot reach.

  7. Comparability — it ranks interventions by achievability, not ambition.

  8. Path dependence — it accounts for what current structures actually permit.

  9. Time horizon — it distinguishes quick wins from slow burns.

  10. Reversibility of the fix — it favours interventions that can be undone if wrong.

  11. Enforcement realism — it weighs whether a rule can actually be policed.

  12. Coalition feasibility — it estimates whether the votes and allies exist to pass it.

Three major patterns of how it works

  1. Model → estimate → discount

    • Model the causal mechanism

    • Estimate the raw effect size

    • Discount by implementation difficulty and resistance

  2. Decompose → locate → target

    • Decompose a problem into movable and immovable parts

    • Locate the high-leverage component

    • Target the intervention there

  3. Forecast → stress → revise

    • Forecast the expected outcome

    • Stress it against opposition and evasion

    • Revise the ambition to match what can bite

Key mechanisms and how they work

A. Effect-size evidence from trials

  • A growing body of randomised trials gives realistic priors on how much an intervention moves.

  • Example: The development-economics network J-PAL has run nearly a thousand randomised controlled trials across more than eighty countries, producing concrete effect sizes that tell a drafter whether a given lever historically moved the outcome at all.

B. Calibrated, low-cost interventions

  • Cheap, well-targeted changes can have outsized, measurable effects.

  • Example: The UK’s behavioural-insights work found that a single rewritten tax-reminder letter—telling recipients most neighbours had already paid—was estimated to raise tens of millions a year, a high-tractability win invisible to grand legislation.

C. The futility signal from backfires

  • History records interventions whose tractability was misjudged and which moved the problem the wrong way.

  • Example: Research on “three-strikes” sentencing found it flattened the penalty gradient so severely that eligible offenders became measurably more likely to commit violent crimes—an immovable problem made worse by a law that looked decisive.

How AI changes the game: definition

AI turns tractability from a gut feel into a modelled estimate—pulling real effect sizes from the global trial record and weighing them against implementation friction—while risking false precision that dresses guesswork as forecast.

In short: ambition gets calibrated to what can actually move.

Four principles of how AI changes the game

  1. From hope to expected value — from “this should work” to “this historically moved X.”

  2. From intent to friction — from the goal to the real cost of enforcing it.

  3. From sweeping to leveraged — from grand gestures to minimal high-impact changes.

  4. From certainty to calibrated doubt — from false confidence to honest probability.

Advantages and disadvantages

Advantages

  1. Redirects effort from immovable problems to achievable ones.

  2. Grounds ambition in real historical effect sizes.

  3. Exposes the implementation friction politicians routinely ignore.

  4. Surfaces cheap, high-leverage interventions that never make headlines.

Disadvantages

  1. Effect sizes from one context transfer imperfectly to another.

  2. Quantified tractability can bias toward the easily measured and against the structurally important.

  3. A low-tractability score can become an excuse for inaction on hard, vital problems.

  4. Modelled forecasts carry false precision that invites over-trust.


4) Diffusion

Definition

Diffusion is the faculty of learning from every jurisdiction that already faced a problem—mining the fifty states and the hundred-ninety countries for the policies that spread, the ones that worked, and the ones that backfired.