Vladimir_Nesov | v1.3.0 | Jun 26th 2023 | (+13/-12)
Alex_Altair | v1.2.0 | Jul 22nd 2022
plex | v1.1.0 | Sep 16th 2021 | (+434) wrote brief description, added links
A Quantilizer is a proposed AI design that aims to reduce the harms from Goodhart's law and specification gaming by selecting reasonably effective actions from a distribution of human-like actions, rather than maximizing over all actions. It is more of a theoretical tool for exploring ways around these problems than a practical, buildable design.
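The selection rule can be illustrated with a minimal sketch. It assumes the common sampling-based formulation: draw candidate actions from a base distribution standing in for human-like behavior, keep the top fraction q by utility, and pick uniformly among those. The names `quantilize`, `base_sampler`, `utility`, `q`, and `n_samples` are hypothetical, chosen for illustration, and do not come from any existing library.

```python
import random


def quantilize(base_sampler, utility, q=0.1, n_samples=1000, rng=None):
    """Pick one action at random from the top-q fraction of sampled actions.

    base_sampler: callable taking an RNG and returning one action; it stands
        in for the distribution of human-like actions (an assumption here).
    utility: callable scoring an action; higher is better.
    q: fraction of the sampled actions to keep, with 0 < q <= 1.
    """
    if not 0 < q <= 1:
        raise ValueError("q must be in (0, 1]")
    if rng is None:
        rng = random.Random()

    # Draw candidates from the base distribution instead of searching
    # the whole action space.
    candidates = [base_sampler(rng) for _ in range(n_samples)]

    # Rank by utility and keep only the top q fraction (at least one action).
    candidates.sort(key=utility, reverse=True)
    k = max(1, int(n_samples * q))

    # Choose uniformly among the kept actions rather than taking the argmax.
    return rng.choice(candidates[:k])


if __name__ == "__main__":
    # Toy example: actions are integers 0..99 and utility is the value itself.
    action = quantilize(lambda r: r.randint(0, 99), utility=lambda a: a, q=0.1)
    print(action)
```

In this sketch, setting q close to 0 approaches plain maximization over the sampled actions, while q = 1 simply imitates the base distribution, so q acts as a dial between effectiveness and staying close to human-like behavior.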
Quantilizers: AI That Doesn't Try Too Hard by Rob Miles