Mild Optimization

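Mild optimization is an approach for mitigating Goodhart's law in AI alignment. Rather than maximizing a fixed objective as hard as possible, the agent is meant to pursue the goal in a "milder" fashion, so that errors in the specified objective are less likely to be amplified into extreme outcomes.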
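To make the contrast concrete, here is a minimal sketch comparing a plain maximizer with a quantilizer, one proposal often discussed as a form of mild optimization. Instead of taking the single best-scoring action, a quantilizer samples at random from the top fraction q of actions. Everything in the sketch is an illustrative assumption rather than a canonical implementation: the function names, the finite action list, the uniform base distribution over actions, and the proxy_utility callable are all hypothetical.

```python
import math
import random


def maximize(actions, proxy_utility):
    """Hard optimization: return the single action with the highest proxy score."""
    return max(actions, key=proxy_utility)


def quantilize(actions, proxy_utility, q=0.1, rng=None):
    """Mild optimization sketch: sample uniformly from the top-q fraction of actions.

    Assumes a finite action list and a uniform base distribution (both are
    simplifying assumptions made for this illustration).
    """
    if not 0 < q <= 1:
        raise ValueError("q must be in (0, 1]")
    rng = rng or random.Random()
    # Rank actions from best to worst according to the (possibly flawed) proxy.
    ranked = sorted(actions, key=proxy_utility, reverse=True)
    # Keep at least one action, and otherwise the top q fraction (rounded up).
    top_k = max(1, math.ceil(q * len(ranked)))
    return rng.choice(ranked[:top_k])


if __name__ == "__main__":
    actions = list(range(100))

    def proxy(a):
        # Hypothetical proxy objective: larger numbers score higher.
        return a

    print("maximizer picks:", maximize(actions, proxy))
    print("quantilizer picks:", quantilize(actions, proxy, q=0.1, rng=random.Random(0)))
```

The design intuition is that if the proxy is badly wrong on only a few extreme actions, the maximizer will seek those actions out, while the quantilizer selects any one of them with only limited probability. Setting q closer to 0 recovers behavior closer to maximization; q = 1 ignores the proxy entirely and samples from the base distribution.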

Further reading: Arbital page on Mild Optimization

Created by TurnTrout at 1y