plex | v1.2.0Dec 28th 2020 | (+50/-1) Added link to arbital page | ||
TurnTrout | v1.1.0Aug 6th 2020 | (+188) |
Mild optimization is an approach for mitigating Goodhart's law in AI alignment. Instead of maximizing a fixed objective, the hope is that the agent pursues the goal in a "milder" fashion.
Mild optimization is an approach for mitigating Goodhart's law in AI alignment. Instead of maximizing a fixed objective, the hope is that the agent pursues the goal in a "milder" fashion.
Further reading: Arbital page on Mild Optimization