Brackets Are a Bad Way to Regulate

Hide

Continuous distributions are everywhere - for virtually everything we care about, a little more is a little better (or worse), and a lot more is a lot better (or worse). This presents a problem - we need to create rules that reasonably and fairly apply across these continuums, where the degree to which a thing possesses a trait makes a difference to the reasonable treatment of it.

Going 1m/h over the limit, and going 150 in a 40 zone are both “speeding”, yet we must punish these things differently.

The default solution for almost all regulations is to slice these continuous distributions into chunks, and treat the chunks as basically equivalent phenomena - squishing a continuous distribution into five or so blocks, and manually writing rules to apply uniformly within the blocks.

Examples include:

Speed limits
Tax brackets
Sentencing thresholds
Overtime thresholds
Pension eligibility

This is a very bad system.

Brackets are fundamentally inefficient

For any bracket over a continuous distribution, the upper section of the bracket has more of the trait than the bottom section.

As a result, for any incentive or punishment applying uniformly across a bracket, the ends of the bracket will be disproportionately affected. This introduces inefficiency and incentivises clustering near the edges of the bracket.

Businesses exploit the inefficiency of brackets all the time. Telcos may charge you per minute of usage, such that the second you have called for 1:00:01, you’ve paid for 2 minutes of phone time. More blatantly, hotels will often bill you for an entire extra night if you check out a minute late. Airlines will gleefully charge you double for a bag that’s a gram over the 20kg cutoff. Software subscriptions charge you monthly rather than by the second, and will push you towards even longer, yearly intervals.

Discontinuous thresholds of reward and punishment imposed on continuous distributions almost always incentivise moving to the the top of a given bracket. Once you’re in the bracket, you’re paying the cost of entry, and should rush to the top of the bracket to get your money’s worth.

In 18th century England, there were basically two brackets for criminal justice: perfectly legal, and egregious mortal crime. This forced a choice:

Minor crimes go unpunished
All crimes are punished mildly
All crimes are punished severely

They went for option 3.

At the time, stealing 12 pence (~$40 today) worth of goods, cutting down a tree, and destroying a fish pond were all capital crimes. This incentivised severe escalation. The moment you’ve been caught pickpocketing, you’re going to hang, and you can only hang once. So, the marginal cost of murdering witnesses to your crime was effectively zero. This led not only to escalation of crimes, but also extremely inconsistent enforcement of these penalties. Juries were known to engage in “pious perjury” - lying about the value of goods stolen to avoid the capital threshold, and otherwise excusing behaviour that, by the letter of the law, should have been a killing offense.

The peak of this “bloody code” lasted over a century, from ~1688-1823, before the list of capital offenses was eventually reduced from 225, to 5.

Simplicity trades off against efficiency

The larger the size of the brackets, the greater the inefficiency. This creates a tradeoff - fewer brackets are simpler to understand and administrate, but smaller brackets mean less of a qualitative difference between the top and bottom of each segment.

In practice, virtually all regulations opt for a single-digit number of subdivisions for the entire population.

Given the immense variation present within large populations, there is almost always a relevant qualitative difference within the brackets. A man weighing 230lbs, and a man weighing 630lbs are both in the top quintile of body weight, and are both “obese”^[1], yet experience vastly different treatment and outcomes.

The obvious solution is to add more brackets to account for these differences. But this means more manual rulesetting, and often requires dozens of brackets to adequately smooth out qualitative differences within groups.

For low variance distributions, a few brackets may do the trick. For high variance distributions, your options are many brackets, or obscenely inefficient incentives.

Formulas completely eliminate bracketing inefficiency

We have possessed the mathematical technology to completely eliminate this gratuitous inefficiency for millenia. Behold: the function.

A function is the ideal treatment of continuous distributions, and can accommodate any shape of reward, punishment, or compensation you desire.

If more X is better than less X, you can either:

Create a few brackets and incentivise the bare minimum to qualify for the bracket
Create many brackets to partially solve the first problem, and invest time manually writing rules for 100 brackets, producing opaque, lengthy regulations.
Express the relationship between X and goodness as a function, and solve both problems completely.

For example, US federal drug trafficking laws impose the following mandatory minimums for cocaine possession:

Under 500g — no mandatory minimum, judge has discretion

500g to 4,999g — mandatory 5 years, maximum 40

5kg+ — mandatory 10 years, maximum life

Crossing these arbitrary thresholds by trivial amounts produces a massive, discontinuous change in mandatory punishment. Once you’re past a threshold, massive increases within the threshold produce negligible differences. Under this system, a trafficker has no marginal incentive, once they’ve got 500g of inventory, to avoid adding an extra 4.499kg to their stock^[2].

With a formula, you can still hand out 10-year mandatory minimums to 5kg traffickers, and 5-year minimums to 0.5kg traffickers, while disincentivising additional possession between those ranges.

For example: Mandatory minimum sentence = 0.80 * (grams-500)^0.30 gives us a very reasonable looking prescription for every stage^[3].

With this system, there is no amount of contraband you can possess for which additional contraband goes unpunished.

And, because these are minimums rather than exact prescriptions, this preserves the ability for judiciary discretion in exactly the same way brackets do.

What about administrative convenience?

Administrative convenience is valuable, and most regulations are a pragmatic compromise between efficacy and usability. This may produce concerns like:

People won’t remember the formulas

Nobody remembers the brackets either, and you can publish bracketed tables that show example values just like they do now, with intervals as large or small as you like. For example, the (rounded) trafficking formula outputs represented as a table is as follows:

Quantity	Minimum Sentence
1kg	5.2yr
2kg	7.2yr
3kg	8.4yr
4kg	9.3yr
5kg	10 yr

Formulas are opaque and difficult to use

No, they aren’t. Published formulas and their outputs can be verified by anyone. Online calculators are featured on many commercial and government websites and are trivial to use.

Current widely used brackets are already opaque. For instance, in California, speeding fines are calculated based on brackets: 1–15 mph over costs $35 as a baseline, but this is modified by as many as 15 separate legislative surcharges which multiply that figure by up to 7×, varying by county, producing a final bill that is virtually impossible to exactly predict in advance.

Additionally, many brackets are framed as “over this limit, X applies”, but X is often modified by “whatever the judge deems reasonable”, which is fundamentally more unpredictable, opaque, and administratively costly than a standard formula.

Formula-based systems are already used without catastrophic admin and communication issues. Child support calculations in 41 US states use an income share formula where both parents' incomes, the custody split, and number of children feed into a continuous dollar output. FICO credit scores compress dozens of variables into a single output, and there is broad agreement among economists that this produces more consistent lending decisions. Student loan repayments are calculated formulaically as well.

Defending bracket-based systems on the basis of administrative simplicity and public interpretability therefore relies on the often false premise that bracket-based systems are simple and interpretable, and that formulaic ones are not.

As long as “more is worse”, “a little more is a little worse”, and “a lot more is a lot worse”, brackets are the wrong tool for the job. We should make the world a bit saner and start using more formulas.

^{^}
At average height)
^{^}
This is modified by the discretion of a judge, who will often tack on extra sentences in proportion to the amount of drugs you have - acting much like a function
^{^}
Assuming you find the original paradigm mostly reasonable, a separate question

I like this and agree with you generally. In NZ there are some things like this already, the "In‑Work Tax Credit" is a benefit for those with children that shrinks based on your income. I think the formula is "Final payment = Base entitlement for your number of children – 27% × (income above threshold)".

I would probably watch out for things that look like a distribution but hide a categorical shift. In your drugs example the crime being punished at the low end, is having drugs for yourself. The crime being punished at the high end is having drugs for sale. There are thresholds that can be usefully set to distinguish those cases, and those cases have different harms, grams possessed is only a proxy for which case you're in.

I would probably watch out for things that look like a distribution but hide a categorical shift. In your drugs example the crime being punished at the low end, is having drugs for yourself. The crime being punished at the high end is having drugs for sale. There are thresholds that can be usefully set to distinguish those cases, and those cases have different harms, grams possessed is only a proxy for which case you're in.

If one uses "amount possessed" as a proxy for commercial/personal, and argues this categorical distinction calls for a discontinuous punishment, that's fine - but that argues for a single discontinuous jump, not more.

Assuming someone with 499g of cocaine is just a big time cokehead, but 500g is a distributor, then it makes sense to have a sharp jump in penalties at that point. But what about the second threshold? There seems to be no plausible categorical difference between 4999g and >5kg - both brackets are commercial quantities, and lots of commercial trafficking is worse than a little bit of commercial trafficking.

So, within the same category, it still makes sense to use a formula.

I agree, it's a bit werid as is. Maybe they were thinking different levels of dealing? Like your local small time vs your national distributor.

Continuous functions only work where the underlying reality is continuous.

Using speeding as an example, going 9 over the limit is de-facto legal. Cops can't pull you 90% over, so there's a step-change at 11 over where they start bothering to do it and you're suddenly de-facto illegal and will face a moderate fine. Similarly, you can't get sent 90% to jail or have your license 90% revoked (for a single offense), so there are another couple step changes in the punishment.

Same with kind-of writing up an accommodation plan for a sort-of disabled employee, barely retaking a class that you barely failed (or graduating with nearly-honors), or almost serving water that's almost safe.

I agree for some of these examples, but not for others.

You can't get sent 90% to jail, but you can get sent to jail for 90% as long. You can have your license revoked for 90% as long too.

And yes, this does only apply to continuous variables. So, to your point, pulling someone over makes sense as a threshold since it's discontinuous. However, most continuous distributions are handled with brackets, which is just as inappropriate as handling discrete qualitative step changes with continuous formulas.

Getting sent to jail for 90% as long is still getting sent to jail. There are practical minimums for it (one day, which is honestly a lot shorter than I thought) just like there are practical minimums for pulling someone over. Something that's 1/24 as serious as a one-day offense won't get you an incarceration record (for one hour), and can be conveniently swept under the rug when dealing with bureaucracy. Same with getting your license revoked: even if it's for a single minute (as opposed to a year or whatever), you still have to apply to get it reinstated and all that.

I think there are many hidden thresholds when various punishments (or rewards, or simple reactions) become relevant for things. Do you want baggage with "kind of heavy load" marked on it, that requires 1.03x the OSHA-standard amount of strength to handle safely? A fuzzy line between "general wellness" and "diagnosable medical condition" for weight?

You can totally be put to jail 90%. You can make a randomized decision with 90% jail probability.

Sure you can, but people tend to dislike arbitrary and capricious punishments. If you want a roll of the dice to hold up as valid legal reasoning in a sentencing hearing, you'll have to reform the legal system by quite a bit first.

Funnily enough I am thinking about a post proposing such a change. I am not claiming that this would be a good idea for jail or the legal system specifically. But maybe for community disputes where there is an extremely serious accusation where you know either the accuser or the accused has done something serious (accusation vs. false accusation). I still have to think more about it.

Functions are so underused I agree. You can even still publish a bracket as an approximation for the function if people really need the bracket for communicating the rough payoffs.

Income-Taxes are a hilarious example where they use a function but the derivative is still a bracket because BRACKET! (doesn't do that much damage in that case I guess though).

At least income taxes are continuous and monotonically increasing. That discontinuous second derivative is still a nuisance though, and I know of quite a few people who go out of their way to keep their income just under the threshold for their tax bracket.

Not to mention that the simultaneous use of brackets for both benefits and income taxes often leads to situations where final income after taxes and benefits fails to be a monotonically increasing function of gross pay. The result: serious economic distortion and people being disincentivised from working. Removing all those brackets would allow us to implement steeply progressive taxation and welfare systems without anyone being overly discouraged from working.

We could easily get rid of the brackets, and all those people who don't like formulas containing logarithms and exponents should hopefully be able to muddle through using graphs and online calculators (which is probably what they are doing already).

See this blog post for a smooth function that approximates the Norwegian taxation system:

https://tommyodland.com/articles/2024/smooth-taxes-without-brackets/index.html

Ehn. I mostly disagree, but a lot depends on specifics and the purpose of regulation.

For things with external reasons for regulation, easy measurement, and fairly low cost of enforcement/collection, I generally like the idea of coasean taxation - charge just enough to offset the harm.

That actually covers a small part of the spectrum of things we want to discourage. For a lot of things, where we'd effectively like "none", even if more is worse, Schelling lines make a lot of sense - if a bright line is published, and crossed, it just costs a lot less to measure and litigate the enforcement. And you usually need multiple bands of severity, to avoid the "a fine is just an expense" and the "death penalty for a very minor error" cases.

Yeah, agreed, and with that, I think there's a big difference between what kinds of changes make sense on the margin of our current legal systems, and what you might try if you were building a new system from the ground up.

Yeah! I hate those AI services that force me to pay for month instead of buying tokens directly

I would probably watch out for things that look like a distribution but hide a categorical shift. In your drugs example the crime being punished at the low end, is having drugs for yourself. The crime being punished at the high end is having drugs for sale. There are thresholds that can be usefully set to distinguish those cases, and those cases have different harms, grams possessed is only a proxy for which case you're in.

So, within the same category, it still makes sense to use a formula.

I agree, it's a bit werid as is. Maybe they were thinking different levels of dealing? Like your local small time vs your national distributor.

Continuous functions only work where the underlying reality is continuous.

You can totally be put to jail 90%. You can make a randomized decision with 90% jail probability.

Functions are so underused I agree. You can even still publish a bracket as an approximation for the function if people really need the bracket for communicating the rough payoffs.

Income-Taxes are a hilarious example where they use a function but the derivative is still a bracket because BRACKET! (doesn't do that much damage in that case I guess though).

See this blog post for a smooth function that approximates the Norwegian taxation system:

https://tommyodland.com/articles/2024/smooth-taxes-without-brackets/index.html

Yeah! I hate those AI services that force me to pay for month instead of buying tokens directly

76

Brackets Are a Bad Way to Regulate

76

76

76