LESSWRONG
LW

627
Wikitags

Quantilization

Edited by plex, Mateusz Bagiński, et al. last updated 2nd Dec 2024

A Quantilizer is a proposed AI design that aims to reduce the harms from Goodhart's law and specification gaming by selecting reasonably effective actions from a distribution of human-like actions, rather than maximizing over actions. It is more of a theoretical tool for exploring ways around these problems than a practical buildable design.

See also

  • Quantilizers: AI That Doesn't Try Too Hard by Rob Miles
  • Arbital page on Quantilizers
  • Quantilizers: A Safer Alternative to Maximizers for Limited Optimization by Jessica Taylor (original paper)
Subscribe
Discussion
1
Subscribe
Discussion
1
Posts tagged Quantilization
12
33Quantilizers maximize expected utility subject to a conservative cost constraint
Ω
jessicata
10y
Ω
3
12
26Another view of quantilizers: avoiding Goodhart's Law
Ω
jessicata
10y
Ω
2
11
65When to use quantilization
Ω
RyanCarey
7y
Ω
5
11
14Quantilal control for finite MDPs
Ω
Vanessa Kosoy
8y
Ω
0
11
9Computing an exact quantilal policy
Ω
Vanessa Kosoy
8y
Ω
0
7
119Soft optimization makes the value target bigger
Ω
Jeremy Gillen
3y
Ω
20
6
24Quantilizers and Generative Models
Ω
Adam Jermyn
3y
Ω
5
4
25Why don't quantilizers also cut off the upper end of the distribution?
QΩ
Alex_Altair, Jeremy Gillen
3y
QΩ
2
2
44[Aspiration-based designs] 1. Informal introduction
Ω
B Jacobs, Jobst Heitzig, Simon Fischer, Simon Dima
2y
Ω
4
2
37The murderous shortcut: a toy model of instrumental convergence
Ω
Thomas Kwa
1y
Ω
0
2
20Hedonic Loops and Taming RL
Ω
beren
2y
Ω
14
2
11Quantilizer ≡ Optimizer with a Bounded Amount of Output
Ω
itaibn0
4y
Ω
4
1
47How to safely use an optimizer
Ω
Simon Fischer
2y
Ω
21
1
38AISC team report: Soft-optimization, Bayes and Goodhart
Simon Fischer, benjaminko, jazcarretao, DFNaiff, Jeremy Gillen
2y
2
1
30Recursive Quantilizers II
Ω
abramdemski
5y
Ω
15
Load More (15/18)
Add Posts