June 5, 2026

Passed every test. Still took down prod.

aitypescriptsoftware-engineeringstate-managementfeature-flagstypesformal-methodspostgresformal-verificationrails

The combination no test ever ran

On August 1, 2012, Knight Capital pushed a deploy to eight servers. One of them didn't take the new code. That single mismatch — eight servers, new code or old, 256 possible configurations and exactly one of them ever tested — woke a decade-dead code path in production and lost the firm $440 million in 45 minutes.

256 possible configurations. They'd tested exactly one. $440 million, gone in 45 minutes.

No single line of that code was wrong. The bug was a state nobody designed, a combination no test ever ran — and you've shipped the small version of it. The spinner keeps spinning over content that's already on screen. The error toast slides in over a page that loaded fine. The empty state flashes for a frame before the data it already had paints.

Three fields behind it — isLoading, isError, and data that's either there or not. Eight combinations between them; four mean something — idle, loading, loaded, failed. The other four are the bug: the spinner still turning while the data sits right there, loading and errored at once, the render that's somehow none of them. Your code has to survive all eight, because the type says all eight exist.

That shape has a name: it's useQuery's old return type — and the footgun was common enough that React Query itself replaced those booleans with a single status field (pending | error | success).

State explosion

useQuery · 3 fetch booleans

loaderrdata

idle

loaderrdata

failed

loaderrdata

loading + errored

loaderrdata

loaded

loaderrdata

spinner over data

loaderrdata

errored, but data?

loaderrdata

all three at once

8 representable4 valid4 nonsense

Those three fields are tangled — most of the eight states are nonsense you can delete outright. Feature flags are the opposite: independent by design, so every combination is a real timeline your code runs in production. Open your dashboard, count the flags older than six months you "meant to clean up." I'll wait.

Now multiply them. Each flag forks the timeline — on or off. One flag, two timelines. Two, four. Three, eight. It doubles every single time.

Find your team's number — 20flags1,048,576timelines — and every one of those is a path your code can take in production.

Nobody tests a million timelines. QA signs off on the dozen you flip on purpose; the suite goes green. The rest just exist — in production, never enumerated by anyone, waiting for the right user to load the right page in the right order. That's Knight Capital's 256 configurations, one layer up — the same trap, now with your feature flags instead of eight servers.

And it rarely looks like a bug in the code — no crash, no stack trace, no failing test. It passed everything you wrote; it broke the one combination you didn't. It looks like "customer X is seeing the wrong price." You chase it as a race condition until someone pulls the flag history and sees this exact three-flag combo has never happened in the system's life, never will again after Friday's release, and is breaking precisely one invoice right now. Nobody designed that state. Every flag did what it said on the toggle. The combination was a timeline no human ever imagined.

Nobody told me this in school. That bug you couldn't reproduce usually isn't a logic error — it's a state you forgot existed, a branch of the timeline nobody meant to grow.

And it isn't just flags. One invisible thing explodes the state space in a way nothing else does — and you've spent more of your career than you'd like debugging what it causes: the NullPointerException, the undefined is not a function, the one check you forgot on the one path that mattered.

It's null. Its own inventor, Tony Hoare, calls it his billion-dollar mistake — he slipped it into a type system in 1965 "simply because it was so easy to implement," and spent the next fifty years watching it cause "innumerable errors, vulnerabilities, and system crashes."

Here's why it's the worst offender. A boolean you add on purpose, and you can see it coming. null the language adds for you — quietly, to almost every field at once, turning each T into T | null. It's a flag stapled to every value you have: it doubles that field's states, and multiplies across all of them. A form with 15 optional fields is 32,768 presence-and-absence states before you've written a line of logic. (Your largest form — you know the one.) A user model with isAdmin, isVerified, isBanned, isDeleted has 16 states, about four of which make sense — and at least one database row somewhere has all four set to true. Out-of-order webhooks, a retry firing through a half-finished write, a message that arrives twice or never — same shape every time. The set of states your code can be in dwarfs the set it actually handles.

And you can't test your way out — the math is against you. Knight was the cheap version; a 2015 study of production failures in major distributed systems found that about a quarter trace to configuration combinations nobody tested — the same trap at the infrastructure layer. The state space expands by default; your ability to verify it grows linearly at best. The gap widens every time someone ships a feature.

TL;DR — the two cheapest cuts. Most prod bugs aren't bad lines; they're states nobody enumerated. You can't test your way out — the combinations explode — but you can shrink the state space so the bad states can't exist:

Discriminated union — a type that's always exactly one of a few shapes, never a mix; impossible states won't even compile.

Database constraint — enforce what the type can't, where nothing can bypass it.

Start today: take one boolean-soup model and collapse it into a union. For covering arrays, model checking, and why the spec is now cheap to write — Part 2.

Now the AI is branching it too

An LLM writes a growing share of your code now, faster than review can keep up. Generation scales; careful reading doesn't. The branches multiply faster than anyone can check them by hand, so I've stopped betting on the check. You can make the check cheaper — tighten the feedback loop so mistakes surface fast, turn your conventions into lint rules the CI enforces — and you should. But the cheapest branch to review is the one that can't exist. So prune the branches that should never exist — make them impossible to write down. A branch the AI can't spell is a bug it can't ship.

Pruning means the bad branch can't be spelled

Almost everything a type allows is invalid. The legal states are a small island; what the type actually permits is an ocean. It took me embarrassingly long to see my own defensive code for what it was — the guard clauses, the "this shouldn't happen" comments, the same validation in five different places. All of it babysitting combinations that should never have been representable in the first place.

The fix is to shrink the ocean until it matches the island. Two essays taught me the moves, from different angles:

Make illegal states unrepresentable is Wlaschin's: design the shape so bad states have no spelling. If a state can't be constructed, it can't be passed in, returned, or stored. Parse, don't validate is King's: check incoming data once at the boundary, hand back a type with the invariant baked in, and the compiler carries it from there.

Both cut at the door. The toolkit runs deeper from there.

A huge circle packed with bare dead twigs wraps a small, clearly bounded circle of living leaves at its center; the blob prunes the dead twigs with scissors beside a tidy pile of cut deadwood — almost everything you can spell is invalid, only the small inner set is valid.

Kill the booleans

The classic example. Nobody designs this. Three boolean lifecycle fields accrete one feature at a time: isDraft this quarter, isArchived two quarters later, each added by someone who never looked at the other two (and who has since moved to another team):

interface Order {
  isDraft: boolean
  isPublished: boolean
  isArchived: boolean
}

Three booleans. Eight possible states, and ✗ nonsense. The other five — published and draft at once, archived and draft and published — are nonsense your code still has to handle, because the type says they exist.

You don't have to swear off booleans. The test I use is Matt Pocock's: bad booleans store state; good booleans are derived from it. A stored isPublished you set by hand is the disease; const isPublished = status === "published" is fine: it's computed, so it can't contradict anything.

The fix is a discriminated union — a type that says "this value is exactly one of these shapes, never a mix." Every major language has one; what changes is how hard the compiler makes it to get wrong. (No Sorbet in your Rails project? The DB constraint section is the version that enforces the same rule on any stack.)

// TypeScript
type Order =
  | { status: "draft"; content: string }
  | { status: "published"; content: string; publishedAt: Date }
  | { status: "archived"; content: string; archivedAt: Date }

Eight states → three, and the nulls went with them. The boolean version needed a nullable publishedAt: a real date on published orders, null on drafts, and nothing stopping a draft from carrying a stray timestamp or a published order from carrying none. The union deletes the field from every variant that shouldn't have it, so publishedAt exists only on a Published order. No nullable column to forget, no illegal combination, no billion-dollar mistake left to check for. And "published and draft at once" never had a shape to live in. (No union handy? The one-level-down version is a language that makes null opt-in — Rust's Option, Kotlin's ?, TypeScript's strictNullChecks — so absence becomes a case the compiler forces you to name.)

TypeScript, Python, and Rust make this a compile-time hard stop. Go and C# enforce the structure but only refuse an illegal value at runtime; there's no exhaustiveness proof at compile time. That difference matters when the union grows:

It's the same cut for the spinner from the top: model the fetch as one of idle | loading | error | data and "loading and errored at once" has no shape to live in — not caught at runtime, just unbuildable.

And when you switch on this, the compiler makes you handle every case:

// TypeScript — miss a case and it won't compile
function render(order: Order) {
  switch (order.status) {
    case "draft":     return renderDraft(order.content)
    case "published": return renderPublished(order.content, order.publishedAt)
    case "archived":  return renderArchived(order.content, order.archivedAt)
  }
}

Add a fourth status and this stops compiling until you handle it. Rust enforces the same exhaustiveness as a hard error; Python's assert_never and Sorbet's T.absurd catch it at type-check time; Go and C# only warn or throw at runtime (neither proves a class hierarchy exhaustive).

And the same idea works at the database level. SQL is weaker than a type system, but it's enforced on every write, by every client, forever:

▶The same rule as a Postgres table

-- PostgreSQL — the database refuses to store illegal states
CREATE TABLE orders (
  id           UUID PRIMARY KEY,
  status       TEXT NOT NULL CHECK (status IN ('draft', 'published', 'archived')),
  content      TEXT NOT NULL,
  published_at TIMESTAMPTZ,
  archived_at  TIMESTAMPTZ,
 
  -- Same invariant as the discriminated union, at the persistence layer.
  -- Archived drops `published_at` because archive is reachable from draft
  -- (never published) — so the union can't carry it on `archived` either; the
  -- two layers agree. If archive only ever followed publish, keep it on both.
  CHECK (
    (status = 'draft'     AND published_at IS NULL     AND archived_at IS NULL) OR
    (status = 'published' AND published_at IS NOT NULL AND archived_at IS NULL) OR
    (status = 'archived'  AND published_at IS NULL     AND archived_at IS NOT NULL)
  )
);

Your TypeScript types stop at the network boundary. Your database doesn't, and neither does CHECK. Even if some service in another language forgets the rule, the constraint catches it. We'll come back to this.

Add a fourth status? The compiler points at every place that needs updating, and the code won't build until you've handled them all.

A union collapses one field's states. But some rules span two independent fields — a voided invoice whose payment_status is somehow still succeeded — and no single type can spell that combination away, because each field is legal on its own. That needs a cut that sees both fields at once.

The cut no app can bypass

Every cut so far lives in your code, which means it stops the moment data crosses a boundary you don't own: the JSON the API just received, the row another service wrote, the message off Kafka. Your Order type guarantees nothing about any of them.

There's a word for what all these cuts enforce: an invariant, the one sentence your data must never contradict. "A deleted row never comes back." "A refunded order can't reopen." Bugs are violated invariants. The space the fence needs to protect can be astronomical; the fence itself stays small — a 2021 proof of Paxos searched billions of candidate invariants and found the rule that pins it fits in a handful of terms. The only question is where you enforce it, from weakest to strongest:

Four barriers of rising strength — a paper note, a low fence, a taller fence, a brick wall — a dead twig blows past the first three and shatters on the wall.

▶The full spectrum, as a table

Mechanism	When checked	What happens on violation
Comments / docs	Never	Nothing
Runtime assert	At call site, sometimes	Crash, hopefully in dev
Tests	At CI time	Build fails (for cases you wrote)
Linters	At lint time	PR fails (for patterns you encoded)
TypeScript types	At compile time, erased at runtime	Build fails, but can be bypassed with `as any` and stops at the network boundary
Strong types (Rust, Haskell)	At compile time, harder to bypass	Build fails earlier and `unsafe` is opt-in rather than the escape hatch
Schema validation (Zod, Pydantic) — a library that checks incoming data against a declared shape	At system boundary, at runtime	Reject input; on success, the type carries the invariant forward
Database constraints	At every write, by every client, forever	INSERT/UPDATE rejected — the only layer no application can bypass

Most teams underuse the database here. I did too, for years, and it's probably the strongest invariant layer you have.

At a brick doorway labelled "DB", the blob snips the bare dead twigs off a branch with scissors so they drop into a pile outside, then sends the clean leafy branch through the door — the database only takes a state once the illegal parts are cut off, and no app can bypass the cut.

Each constraint deletes a whole class of bad state. And unlike your types, they hold on every write, from every client, forever, no matter which service forgot:

NOT NULL eliminates a state.
UNIQUE eliminates a class of duplicate states.
FOREIGN KEY eliminates orphan-reference states.
CHECK (status IN ('active', 'done', 'archived')) collapses an unbounded text field to three legal values.

▶Why CHECK and not a native ENUM?

Evolving the set stays an ordinary migration. A Postgres ENUM only lets you ADD VALUE; dropping or reordering one means recreating the whole type and every column built on it. text + CHECK is the trade most teams at scale make: GitLab's schema alone carries 35 enum-whitelist checks doing exactly an ENUM's job.

And the teams at serious scale already live this way:

GitLab's schema declares 2,419 CHECK constraints — 82 of them exclusive-arc checks ("exactly one of these columns is set," the database half of a discriminated union).

It's institutionalized, too: a first-class migration helper, add_multi_column_not_null_constraint, makes pushing "exactly one of these is set" into the schema a one-liner any engineer reaches for. That's what a codebase looks like once it takes the database seriously as an enforcement layer.

The part that got under my skin once I started counting: your model already has a spec for which combinations are legal. It's just split across two layers that don't talk to each other.

One side is the pile of conditional validations every non-trivial model accumulates (validates … if:, validate, conditional callbacks). GitLab's app/models carries 173 of them, Mastodon 72. That's literally a decision table — the predicates are the parameters, the rules fire on combinations nobody enumerates. But those validations drift: they're code that has to be remembered, and update_column / upsert_all / insert_all skip them entirely.

The other side is the CHECKs and partial indexes above, which can't drift, because the constraint is the enforcement. So the legal state space ends up half-declared in a layer that rots and half-declared in a layer that can't. The bugs live in the gap between them.

▶Two more constraints worth knowing: partial unique indexes and EXCLUDE

A partial unique index is the idiomatic way to enforce state-machine cardinality at the persistence layer:

CREATE UNIQUE INDEX one_active_subscription_per_customer
  ON subscriptions (customer_id)
  WHERE status = 'active';

After this, the database itself refuses to ever store two simultaneously active subscriptions for the same customer. No race condition, no application bug, no service in another language can produce that state. This isn't a clever trick either: GitLab's schema carries 163 of these state-gated partial unique indexes (WHERE status = ...), enforcing "at most one row in this state" across the codebase.

An EXCLUDE constraint generalizes the same idea to ranges and overlaps. Imagine room bookings:

-- the `=` operator class needs the btree_gist extension:
CREATE EXTENSION IF NOT EXISTS btree_gist;
 
ALTER TABLE bookings ADD CONSTRAINT no_overlap
  EXCLUDE USING gist (
    room_id WITH =,
    during  WITH &&
  );

Every booking system reimplements "no double-booked rooms" in application code, badly, with race conditions the database just refused to permit.

And if you want to know what a missing constraint costs, the cleanest case is Robinhood, 2020–21. The app showed customers false negative balances and fired 84,100 erroneous margin calls off them, drawing FINRA's record $70 million penalty for the "significant harm" it caused. Knight was the combinatorial kind — 256 configs, one tested. Robinhood is the missing-invariant kind: "never show a user a balance the ledger doesn't support" is a rule, and no layer of the stack enforced it.

That's the cut a type can't make: it held even when every layer of application code — the validation, the guard, the recompute — was wrong. Schema validation at the boundary helps; a database constraint helps more, because it doesn't trust the application to be right.

So I've stopped trying to pick the one right mechanism. Stack them. Each layer in that table cuts the state space a little; together they pin the system down to something close to "only valid states are reachable."

The cut at the door

A blob standing in a doorway like a gatekeeper: a chaotic pile of tangled dead branches outside, a tidy living leafy tree inside — parse at the door, trust the type within.

It's what Alexis King named parse, don't validate: check once at the boundary and return a type with the invariant built in, not a boolean everything downstream has to re-check:

▶Parsing vs. validating, in code

// Validation: check, hope, repeat everywhere
function isValidOrder(input: unknown): boolean {
  /* ... */
}
function processOrder(input: unknown) {
  if (!isValidOrder(input)) throw new Error("bad")
  // input is still `unknown`. You'll check again. And again.
}
 
// Parsing: check once, type carries it forever
const result = OrderSchema.safeParse(rawJson)
if (result.success) {
  const order: Order = result.data // ← invariant now lives in the type
  processOrder(order) // No checks downstream. Compiler enforces.
}

Zod, Pydantic, io-ts, Valibot — same trick: a runtime invariant goes in, a type-level invariant comes out. Pay the cost once, at the door; the compiler enforces it for the rest of the program's life.

Start cutting

Two cuts. Make one today.

Today. Find one model with three or more status booleans, or a status string with no CHECK behind it. Replace the booleans with a discriminated union, or add the constraint. One bad state that can no longer be spelled. It's the cheapest cut there is, and you'll feel it.

This week. Take the cross-field invariant your app assumes but never enforces — "a voided invoice is never paid," "an active subscription has a customer" — and push it into the database: a CHECK, a partial unique index, an EXCLUDE. The cut no service can route around.

These two cuts delete most of the bad states outright. For the combinations that survive — the space too large to type away, the sequence nobody can enumerate, the 3am bug that passed every test — that's the verification layer: Part 2 — covering arrays, model checking, and the bug that survives every test.

Prune the timeline. One cut at a time.

References

▶Sources & further reading

Alexis King, Parse, Don't Validate (2019)
Yaron Minsky, Effective ML (Jane Street, Effective ML talks at CUFP mid-2000s and the "Effective ML Revisited" blog post) — the OCaml-community version of "make illegal states unrepresentable," predating the F# write-up by a decade
Scott Wlaschin, Making Illegal States Unrepresentable (F# for Fun and Profit) — the popular F#/TypeScript-era restatement
Chris Krycho, Making Illegal States Unrepresentable in TypeScript
David Harel, Statecharts: A Visual Formalism for Complex Systems (Science of Computer Programming, 1987) — the source XState's hierarchical and parallel states descend from
Tony Hoare, Null References: The Billion Dollar Mistake (QCon London 2009) — the inventor of the null reference on why every nullable field is a state you didn't mean to add
Doug Seven, Knight Capital — A DevOps Cautionary Tale
SEC, In the Matter of Knight Capital Americas LLC (Order 34-70694, 2013) — primary source for the $440M trading loss and the $12M penalty
CNBC, Robinhood to pay $70 million for outages and misleading customers (2021) — the FINRA order behind the missing-invariant example
Tianyin Xu et al., Hey, You Have Given Me Too Many Knobs! (FSE 2015) — misconfiguration as a leading cause of production outages
Ben Moseley, Peter Marks, Out of the Tar Pit (2006) — the canonical argument that state is the primary source of complexity; "for every single bit of state that we add, we double the total number of possible states"

5 июня 2026 г.

Прошёл все тесты. И всё равно уронил прод.

aitypescriptsoftware-engineeringstate-managementfeature-flagstypesformal-methodspostgresformal-verificationrails

Комбинация, которую не гонял ни один тест

1 августа 2012 года Knight Capital раскатила деплой на восемь серверов. Один из них новый код так и не подхватил. Одно это несовпадение — восемь серверов, новый код или старый, 256 возможных конфигураций, и ровно одна из них хоть когда-нибудь проверенная — разбудило в проде ветку кода, десять лет как мёртвую, и стоило фирме 440 миллионов долларов за 45 минут.

256 возможных конфигураций. Проверили ровно одну. 440 миллионов долларов — за 45 минут.

Сама по себе ни одна строчка того кода не была кривой. Баг был состоянием, которого никто не закладывал, комбинацией, которую не гонял ни один тест, — а версию поменьше ты уже выкатывал. Спиннер всё крутится поверх контента, который и так уже на экране. Тост с ошибкой выезжает над страницей, которая прекрасно загрузилась. Пустое состояние мелькает на кадр, прежде чем отрисуются данные, которые уже были на руках.

За всем этим — три поля: isLoading, isError и data, которое либо есть, либо нет. Между ними восемь комбинаций; четыре что-то значат — idle, loading, loaded, failed. Остальные четыре — баг: спиннер всё крутится, хотя данные уже вот они; загрузка и ошибка разом; рендер, который непонятно вообще в каком состоянии. Код обязан пережить все восемь — потому что тип утверждает, что все восемь существуют.

У этой формы есть имя: это старый возвращаемый тип useQuery. На эти грабли наступали так часто, что React Query взял и заменил булевы на единственное поле status (pending | error | success).

Взрыв состояний

useQuery · 3 булевых поля

loaderrdata

idle

loaderrdata

failed

loaderrdata

загрузка + ошибка

loaderrdata

loaded

loaderrdata

спиннер поверх данных

loaderrdata

ошибка, но данные?

loaderrdata

все три разом

8 представимых4 валидных4 бессмыслица

Эти три поля сцеплены друг с другом, и большинство из восьми состояний — чистая бессмыслица, которую можно выкинуть подчистую. С фича-флагами всё наоборот: они независимы по задумке, поэтому каждая комбинация — это настоящий таймлайн, в котором твой код крутится в проде. Открой дашборд и посчитай флаги старше полугода, которые ты «всё собирался прибрать». Подожду.

А теперь перемножь их. Каждый флаг ветвит таймлайн — включён или выключен. Один флаг — два таймлайна. Два — четыре. Три — восемь. И так удваивается каждый раз.

Найди число своей команды — 20флагов1 048 576таймлайнов — и каждый из них путь, по которому твой код может пойти в проде.

Никто не тестирует миллион таймлайнов. QA подписывает ту дюжину, что ты включаешь осознанно, — тесты горят зелёным. Остальные просто есть: в проде, никем не перечисленные, ждут нужного пользователя, который откроет нужную страницу в нужном порядке. Это те самые 256 конфигураций Knight Capital, только слоем выше, — та же ловушка, теперь с твоими фича-флагами вместо восьми серверов.

И на баг в коде это редко похоже — ни краша, ни стектрейса, ни красного теста. Прошло всё, что ты написал; сломалась ровно та комбинация, которую ты не написал. Выглядит это как «клиент X видит не ту цену». Ты гоняешься за ним как за состоянием гонки, пока кто-нибудь не поднимет историю флагов и не увидит: вот эта конкретная тройка флагов не сходилась за всю жизнь системы, после пятничного релиза не сойдётся больше никогда — и прямо сейчас ломает ровно один инвойс. Этого состояния никто не закладывал. Каждый флаг делал ровно то, что обещал на тумблере. А вот комбинация оказалась таймлайном, которого не представлял себе ни один человек.

В универе мне про это не рассказывали. Тот баг, который ты не смог воспроизвести, — это обычно не ошибка в логике, а состояние, о котором ты забыл, ветка таймлайна, которую никто не собирался отращивать.

И дело не только во флагах. Есть одна невидимая штука, которая раздувает пространство состояний как ничто другое, — и ты потратил на её отладку больше времени в карьере, чем хотел бы: тот самый NullPointerException, то самое undefined is not a function, та единственная проверка, которую ты забыл на единственном пути, где она была важна.

Это null. Его же изобретатель, Тони Хоар, называет его своей ошибкой на миллиард долларов: он всунул его в систему типов в 1965-м «просто потому, что это было легко реализовать» — и следующие полвека смотрел, как тот порождает «бесчисленные ошибки, уязвимости и падения систем».

Вот почему он худший из всех. Булев флаг ты добавляешь осознанно и видишь его заранее. А null язык добавляет за тебя — тихо, почти к каждому полю сразу, превращая любой T в T | null. Это флаг, пришпиленный к каждому твоему значению: он удваивает состояния этого поля и умножается на все остальные. Форма с 15 необязательными полями — это 32 768 комбинаций «заполнено/пусто». (Твоя самая большая форма — ты её знаешь.) Модель пользователя с isAdmin, isVerified, isBanned, isDeleted — это 16 состояний, из которых осмысленны от силы четыре — и как минимум в одной базе где-то все четыре выставлены в true. Вебхуки, пришедшие не по порядку; ретрай поверх недописанной записи; сообщение, которое приходит дважды или не приходит вовсе, — и каждый раз форма одна и та же. Множество состояний, в которых код может оказаться, в разы больше того, которое он на самом деле обрабатывает.

И тестами отсюда не выберешься — математика против тебя. Knight — это ещё дешёвый случай; исследование 2015 года по продакшен-авариям в крупных распределённых системах показало, что примерно четверть уходит корнями в комбинации конфигурационных параметров, которые никто не тестировал, — та же ловушка, только на уровне инфраструктуры. Пространство состояний разрастается само собой; твоя способность его проверять растёт в лучшем случае линейно. И разрыв растёт каждый раз, когда кто-то выкатывает фичу.

TL;DR — две самые дешёвые подрезки. Большинство прод-багов — не кривые строчки, а состояния, которых никто не перечислил. Тестами отсюда не выберешься — комбинации взрываются, — но можно ужать пространство состояний так, чтобы плохих состояний просто не было:

Размеченное объединение — тип, который всегда ровно одна из нескольких форм, а не их смесь: невозможные состояния даже не скомпилируются.

Ограничение в базе — обеспечь то, что не может тип, там, где это не обойти.

Начни сегодня: возьми одну модель из булевой каши и схлопни её в объединение. Про покрывающие массивы, проверку моделей и почему спеку теперь дёшево писать — Часть 2.

Теперь ветки плодит ещё и ИИ

Всё большую долю твоего кода теперь пишет LLM, и пишет быстрее, чем поспевает ревью. Генерация масштабируется, вдумчивое чтение — нет. Ветки плодятся быстрее, чем кто-нибудь успеет проверить их руками, так что я перестал ставить на проверку. Проверку можно удешевить — затянуть петлю обратной связи, чтобы ошибки всплывали быстро, превратить свои договорённости в линт-правила, которые гоняет CI, — и это стоит сделать. Но самая дешёвая ветка для ревью — та, которой не может быть. Так что подрежь те ветки, которых вообще не должно быть, — сделай так, чтобы их нельзя было даже записать. Ветка, которую ИИ не может записать, — это баг, который он не может зашипить.

Подрезать — значит сделать плохую ветку невыразимой

Почти всё, что допускает тип, — недопустимо. Легальные состояния — маленький островок; то, что тип на самом деле разрешает, — целый океан. Мне до неловкого долго не доходило, чем на самом деле был мой собственный защитный код: гарды, комментарии «такого быть не должно», одна и та же валидация в пяти разных местах. Всё это нянчило комбинации, которые вообще не должны были быть представимы.

Лекарство — ужать океан, пока он не сойдётся с островком. Приёмам меня научили два эссе, каждое со своей стороны:

Сделай недопустимые состояния непредставимыми — Влашин: спроектируй саму форму так, чтобы у плохих состояний просто не было написания. Если состояние нельзя собрать, его нельзя ни передать, ни вернуть, ни сохранить. Парси, а не валидируй — Кинг: проверь входящие данные один раз, на границе, и верни тип со встроенным инвариантом, а дальше его несёт компилятор.

Оба подрезают на входе. А оттуда набор инструментов уходит глубже.

Огромный круг, плотно забитый голыми сухими ветками, окружает маленький чётко очерченный круг с живой листвой в центре; блоб подрезает сухие ветки ножницами, рядом аккуратная горка срезанного сушняка: почти всё, что можно записать, недопустимо, и валидна лишь малая внутренняя часть.

Убей булевы флаги

Хрестоматийный пример. Никто не проектирует такое нарочно. Три булевых поля жизненного цикла нарастают по одному за раз: isDraft в этом квартале, isArchived через два, и каждое добавляет тот, кто не глядел на два других (и кто с тех пор перешёл в другую команду):

interface Order {
  isDraft: boolean
  isPublished: boolean
  isArchived: boolean
}

Три булевых флага. Восемь возможных состояний, и ✗ бессмыслица. Остальные пять — опубликован и черновик разом, архив и черновик и опубликован — бессмыслица, с которой код всё равно обязан возиться, потому что тип утверждает, что они существуют.

Отказываться от булевых совсем не обязательно. Тест, которым пользуюсь я, — Мэтта Покока: плохие булевы хранят состояние; хорошие — выводятся из него. Хранимый isPublished, который ты проставляешь руками, — это болезнь; а const isPublished = status === "published" — нормально: он вычисляемый, а значит ничему противоречить не может.

Лекарство — размеченное объединение: тип, который говорит «это значение — ровно одна из этих форм и никогда не смесь». Оно есть в каждом крупном языке; разнится лишь то, насколько трудно компилятор даёт в нём ошибиться. (Нет Sorbet в твоём Rails-проекте? Раздел про ограничения базы ниже — это та же подрезка, которая работает на любом стеке.)

// TypeScript
type Order =
  | { status: "draft"; content: string }
  | { status: "published"; content: string; publishedAt: Date }
  | { status: "archived"; content: string; archivedAt: Date }

Восемь состояний → три, и null ушли вместе с ними. В булевой версии publishedAt приходилось делать nullable: настоящая дата у опубликованных заказов, null у черновиков — и ничто не мешало черновику таскать шальной таймстамп, а опубликованному заказу остаться вовсе без него. Объединение выкидывает поле из каждой формы, которой оно не положено, так что publishedAt живёт только у Published. Ни nullable-колонки, про которую забудешь, ни недопустимой комбинации, ни ошибки на миллиард, которую надо проверять. А «опубликован-и-черновик разом» никогда и не имел формы, в которой мог бы жить. (Нет под рукой объединения? Версия на уровень ниже — язык, где null включается явно: Option в Rust, ? в Kotlin, strictNullChecks в TypeScript, — и отсутствие значения становится случаем, который компилятор заставляет назвать.)

TypeScript, Python и Rust делают это жёстким стопом на этапе компиляции. Go и C# обеспечивают структуру, но недопустимое значение отвергают лишь в рантайме; доказательства исчерпываемости на компиляции нет. И эта разница важна, когда объединение растёт:

Та же подрезка — и для того спиннера из начала: опиши загрузку как одно из idle | loading | error | data — и состоянию «грузится и упало разом» просто негде жить: не ловится в рантайме, а вообще не собирается.

А когда ты делаешь по нему switch, компилятор заставляет разобрать каждый случай:

// TypeScript — пропусти случай, и оно не скомпилируется
function render(order: Order) {
  switch (order.status) {
    case "draft":     return renderDraft(order.content)
    case "published": return renderPublished(order.content, order.publishedAt)
    case "archived":  return renderArchived(order.content, order.archivedAt)
  }
}

Добавь четвёртый статус — и оно перестанет компилироваться, пока ты его не разберёшь. Rust обеспечивает ту же исчерпываемость жёсткой ошибкой компиляции; assert_never в Python и T.absurd в Sorbet ловят это на этапе проверки типов; Go и C# лишь предупреждают или кидают исключение в рантайме (исчерпываемость иерархии классов не доказывает ни один из них).

И та же идея работает на уровне базы. SQL слабее системы типов, зато применяется на каждой записи, каждым клиентом и навсегда:

▶То же правило как таблица Postgres

-- PostgreSQL — база отказывается хранить недопустимые состояния
CREATE TABLE orders (
  id           UUID PRIMARY KEY,
  status       TEXT NOT NULL CHECK (status IN ('draft', 'published', 'archived')),
  content      TEXT NOT NULL,
  published_at TIMESTAMPTZ,
  archived_at  TIMESTAMPTZ,
 
  -- Тот же инвариант, что и в размеченном объединении, но на уровне хранения.
  -- archived отбрасывает `published_at`, потому что архивировать можно из draft
  -- (который не публиковался) — поэтому и объединение не несёт его на `archived`;
  -- оба слоя согласованы. Если бы архив всегда шёл после публикации — держи на обоих.
  CHECK (
    (status = 'draft'     AND published_at IS NULL     AND archived_at IS NULL) OR
    (status = 'published' AND published_at IS NOT NULL AND archived_at IS NULL) OR
    (status = 'archived'  AND published_at IS NULL     AND archived_at IS NOT NULL)
  )
);

Твои TypeScript-типы кончаются на сетевой границе. А база — нет, и CHECK тоже. Даже если какой-нибудь сервис на другом языке забудет правило, ограничение его поймает. К этому мы ещё вернёмся.

Добавляешь четвёртый статус? Компилятор ткнёт в каждое место, которое надо поправить, — и код не соберётся, пока не разберёшь их все.

Объединение схлопывает состояния одного поля. Но некоторые правила связывают два независимых поля — инвойс со status: voided, у которого payment_status почему-то всё ещё succeeded, — и ни один отдельный тип не сделает такую комбинацию непредставимой, потому что каждое поле по отдельности легально. Тут нужна подрезка, которая видит оба поля сразу.

Подрезка, которую не обойдёт ни одно приложение

Все подрезки до сих пор живут в твоём коде, а значит, кончаются ровно в тот миг, когда данные пересекают границу, которой ты не владеешь: JSON, который только что принял API, строка, записанная другим сервисом, сообщение из Kafka. Про них твой тип Order не гарантирует ничего.

Тому, что обеспечивают все эти подрезки, есть название — инвариант: та единственная фраза, которой твои данные не должны противоречить. «Удалённая строка не возвращается». «Возвращённый заказ не открывается заново». Баги — это нарушенные инварианты. Пространство, которое забору нужно оборонять, может быть астрономическим, а сам забор при этом остаётся крошечным: доказательство безопасности Paxos 2021 года перелопатило миллиарды кандидатов в инварианты и нашло, что правило, которое всё пришпиливает, умещается в горстку термов. Вопрос только в том, где ты его обеспечиваешь, от самого слабого к самому сильному:

Четыре барьера нарастающей силы — бумажка, низкий забор, забор повыше, кирпичная стена — сухая ветка пролетает первые три и разбивается о стену.

▶Весь спектр — таблицей

Механизм	Когда проверяется	Что при нарушении
Комментарии / доки	Никогда	Ничего
Runtime-assert	В точке вызова, иногда	Падение, в идеале в деве
Тесты	На CI	Сборка падает (для кейсов, что ты написал)
Линтеры	На линте	PR падает (для паттернов, что ты закодировал)
Типы TypeScript	На компиляции, стираются в рантайме	Сборка падает, но обходится через `as any` и заканчивается на сетевой границе
Сильные типы (Rust, Haskell)	На компиляции, обойти труднее	Сборка падает раньше, а `unsafe` — это opt-in, а не запасной выход
Валидация схемы (Zod, Pydantic) — библиотека, что проверяет входящие данные по объявленной форме	На границе системы, в рантайме	Отклонить вход; при успехе тип несёт инвариант дальше
Ограничения базы данных	На каждой записи, каждым клиентом, навсегда	INSERT/UPDATE отклонён — единственный слой, который не обойдёт ни одно приложение

Большинство команд недоиспользует базу как слой инвариантов. Я и сам так годами. А это, наверное, самый мощный слой, какой у тебя есть.

Слева блоб ножницами срезает с длинной ветки голые сухие сучья — они падают в кучу на землю, — а её облиственный конец справа уходит в тёмный кирпичный проём с табличкой «DB»: база принимает состояние, только когда недопустимые части срезаны, и эту подрезку не обойдёт ни одно приложение.

Каждое ограничение убирает целый класс плохих состояний. И, в отличие от твоих типов, оно держит на каждой записи, от каждого клиента, навсегда, неважно, какой сервис что забыл:

NOT NULL устраняет состояние.
UNIQUE устраняет класс дубликатных состояний.
FOREIGN KEY устраняет состояния с висячими ссылками.
CHECK (status IN ('active', 'done', 'archived')) схлопывает неограниченное текстовое поле до трёх легальных значений.

▶Почему CHECK, а не нативный ENUM?

Менять набор значений остаётся обычной миграцией. Postgres ENUM умеет только ADD VALUE; а удалить или переставить значение значит пересоздать весь тип и каждую колонку на нём. text + CHECK — компромисс, на который идёт большинство команд на масштабе: одна только схема GitLab несёт 35 enum-whitelist-проверок, делающих ровно ту же работу, что и ENUM.

И команды на серьёзном масштабе уже так живут:

Схема GitLab объявляет 2 419 ограничений CHECK — из них 82 проверки исключающей дизъюнкции («ровно одна из этих колонок задана» — половина размеченного объединения на стороне базы).

И это поставлено на поток: штатный хелпер миграций add_multi_column_not_null_constraint превращает «ровно одна из этих задана» в однострочник, к которому тянется любой инженер. Вот так и выглядит кодовая база, когда она всерьёз держит базу за слой обеспечения инвариантов.

А вот что зацепило меня, когда я начал считать: у твоей модели уже есть спека того, какие комбинации легальны. Просто она размазана по двум слоям, которые между собой не разговаривают.

Один слой — это куча условных валидаций, которые накапливает любая нетривиальная модель (validates … if:, validate, условные коллбэки). В app/models у GitLab их 173, у Mastodon — 72. И это буквально таблица решений: предикаты — это параметры, а правила срабатывают на комбинациях, которые никто не перечисляет. Вот только эти валидации расходятся с реальностью: это код, про который надо помнить, а update_column / upsert_all / insert_all обходят их целиком.

Другой слой — это CHECK и частичные индексы выше, которые разойтись не могут, потому что ограничение и есть обеспечение. Получается, легальное пространство состояний наполовину объявлено в слое, который гниёт, и наполовину — в слое, который гнить не может. А баги живут в зазоре между ними.

▶Ещё два инструмента: частичные уникальные индексы и EXCLUDE

Частичный уникальный индекс — идиоматичный способ обеспечить кардинальность конечного автомата на уровне хранения:

CREATE UNIQUE INDEX one_active_subscription_per_customer
  ON subscriptions (customer_id)
  WHERE status = 'active';

После этого сама база наотрез откажется когда-либо хранить две одновременно активные подписки для одного клиента. Ни гонка, ни прикладной баг, ни сервис на другом языке это состояние уже не создадут. И это тоже не хитрый трюк: схема GitLab несёт 163 таких частичных уникальных индекса с привязкой к состоянию (WHERE status = ...), которые по всей кодовой базе держат «не больше одной строки в этом состоянии».

Ограничение EXCLUDE обобщает ту же идею на диапазоны и пересечения. Представь бронирование переговорок:

-- классу операторов `=` нужно расширение btree_gist:
CREATE EXTENSION IF NOT EXISTS btree_gist;
 
ALTER TABLE bookings ADD CONSTRAINT no_overlap
  EXCLUDE USING gist (
    room_id WITH =,
    during  WITH &&
  );

Каждая система бронирования заново изобретает «никаких дважды забронированных переговорок» в прикладном коде — плохо, с гонками, которые база просто отказалась бы допустить.

А если хочешь понять, во что обходится отсутствующее ограничение, самый чистый случай — Robinhood, 2020–21. Приложение показывало клиентам ложные отрицательные балансы и наставило по ним 84 100 ошибочных маржин-коллов, схлопотав рекордный штраф FINRA в $70 млн за причинённый «значительный вред». Knight был из комбинаторных — 256 конфигураций, проверена одна. Robinhood — из тех, где не хватает инварианта: «никогда не показывай пользователю баланс, который не подтверждён реестром» — это правило, и не обеспечивал его ни один слой стека.

Вот подрезка, которую типом не сделаешь: она держала даже тогда, когда весь прикладной код — валидация, гард, пересчёт — врал. Валидация схемы на границе помогает; ограничение базы помогает сильнее, потому что не полагается на то, что приложение право.

Поэтому я перестал выбирать какой-то один правильный механизм. Складывай их стопкой. Каждый слой в той таблице срезает пространство состояний понемногу; вместе они пришпиливают систему к чему-то близкому к «достижимы только валидные состояния».

Подрезка на входе

Блоб стоит в дверном проёме как привратник: снаружи — хаотичная куча спутанных сухих веток, внутри — аккуратное живое дерево с листвой: парси на входе и дальше доверяй типу.

Это то, что Алексис Кинг назвала «парси, а не валидируй»: проверь один раз на границе и верни тип со встроенным инвариантом, а не булево, которое всё, что ниже по коду, обязано перепроверять заново:

▶Парсинг против валидации, в коде

// Валидация: проверь, понадейся, повтори везде
function isValidOrder(input: unknown): boolean {
  /* ... */
}
function processOrder(input: unknown) {
  if (!isValidOrder(input)) throw new Error("bad")
  // input всё ещё `unknown`. Будешь проверять снова. И снова.
}
 
// Парсинг: проверь один раз, дальше инвариант несёт тип
const result = OrderSchema.safeParse(rawJson)
if (result.success) {
  const order: Order = result.data // ← инвариант теперь живёт в типе
  processOrder(order) // Ниже — ни одной проверки. За это отвечает компилятор.
}

Zod, Pydantic, io-ts, Valibot — всё это один и тот же трюк: на входе рантайм-инвариант, на выходе инвариант уровня типов. Платишь цену один раз, на входе, — а дальше компилятор обеспечивает его до конца жизни программы.

Начни подрезать

Две подрезки. Одну сделай сегодня.

Сегодня. Найди одну модель с тремя или больше статусными булевыми полями или со строкой status без CHECK за ней. Замени булевы поля размеченным объединением или добавь ограничение. Одно плохое состояние, которое больше нельзя записать. Это самая дешёвая подрезка из всех, и ты её почувствуешь.

На этой неделе. Возьми кросс-полевой инвариант, который твоё приложение подразумевает, но нигде не обеспечивает — «отменённый инвойс никогда не оплачен», «у активной подписки есть клиент», — и затолкай его в базу: CHECK, частичный уникальный индекс, EXCLUDE. Подрезка, которую не объедет ни один сервис.

Эти две подрезки разом удаляют большинство плохих состояний. А для тех комбинаций, что уцелели, — пространства, слишком большого, чтобы вытеснить его типами; последовательности, которую никто не перечислит; бага в 3 ночи, что прошёл все тесты, — есть слой верификации: Часть 2 — покрывающие массивы, проверка моделей и баг, что переживает любой тест.

Подрежь таймлайн. По одной подрезке за раз.

Источники

▶Источники и что почитать

Алексис Кинг, Parse, Don't Validate (2019)
Ярон Мински, Effective ML (Jane Street, доклады Effective ML на CUFP середины 2000-х и пост «Effective ML Revisited») — версия «сделай недопустимые состояния непредставимыми» от сообщества OCaml, на десятилетие раньше F#-разбора
Скотт Влашин, Making Illegal States Unrepresentable (F# for Fun and Profit) — популярный пересказ эпохи F#/TypeScript
Крис Крайко, Making Illegal States Unrepresentable in TypeScript
Дэвид Харел, Statecharts: A Visual Formalism for Complex Systems (Science of Computer Programming, 1987) — источник иерархических и параллельных состояний, от которых происходит XState
Тони Хоар, Null References: The Billion Dollar Mistake (QCon London 2009) — изобретатель null-ссылки о том, почему каждое nullable-поле — это состояние, которое ты не собирался добавлять
Даг Севен, Knight Capital — A DevOps Cautionary Tale
SEC, In the Matter of Knight Capital Americas LLC (Order 34-70694, 2013) — первоисточник про убыток в $440 млн и штраф в $12 млн
CNBC, Robinhood to pay $70 million for outages and misleading customers (2021) — приказ FINRA за примером с отсутствующим инвариантом
Тяньинь Сюй и др., Hey, You Have Given Me Too Many Knobs! (FSE 2015) — неправильная конфигурация как ведущая причина продакшен-аварий
Бен Мозли, Питер Маркс, Out of the Tar Pit (2006) — канонический довод, что состояние — главный источник сложности; «каждый добавленный бит состояния удваивает число возможных состояний»