Quantilization


A quantilizer is a proposed AI design that aims to reduce the harms from Goodhart's law and specification gaming by selecting reasonably effective actions from a distribution of human-like actions, rather than maximizing over all possible actions. It is more of a theoretical tool for exploring ways around these problems than a practical, buildable design.
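To make the idea concrete, here is a minimal sketch of the selection step, not a reference implementation. It assumes the "human-like" distribution is available as a finite list of sampled actions and that a utility function can score each one. The names `quantilize`, `base_samples`, `utility`, and `q` are illustrative choices, not taken from any library.

```python
import math
import random


def quantilize(base_samples, utility, q, rng=random):
    """Pick an action uniformly at random from the top q fraction of samples.

    base_samples: actions drawn from a base (e.g. human-like) distribution.
                  Assumption: a finite list of samples stands in for it.
    utility:      function scoring an action; higher is better.
    q:            fraction in (0, 1]. q = 1 just imitates the base
                  distribution; very small q approaches plain maximization.
    rng:          object with a choice() method (the random module by default).
    """
    if not 0 < q <= 1:
        raise ValueError("q must be in (0, 1]")
    if not base_samples:
        raise ValueError("base_samples must be non-empty")

    # Rank the sampled actions from best to worst under the utility function.
    ranked = sorted(base_samples, key=utility, reverse=True)

    # Keep the top q fraction, always retaining at least one action.
    k = max(1, math.ceil(q * len(ranked)))

    # Choose randomly among the retained actions instead of taking the best.
    return rng.choice(ranked[:k])
```

Calling `quantilize(actions, score, q=0.25)` returns some action from the best-scoring quarter of `actions`. Because the choice is random within that quarter rather than the single top-scoring action, an extreme action that exploits a flaw in `score` is only selected about as often as it appears among reasonably good human-like actions.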

See also