LESSWRONGTags
LW

Mild Optimization

EditHistorySubscribe
Discussion (0)
Help improve this page (2 flags)
EditHistorySubscribe
Discussion (0)
Help improve this page (2 flags)
Mild Optimization
Random Tag
Contributors
2TurnTrout
1plex

Mild optimization is an approach for mitigating Goodhart's law in AI alignment. Instead of maximizing a fixed objective, the hope is that the agent pursues the goal in a "milder" fashion.

Further reading: Arbital page on Mild Optimization

Posts tagged Mild Optimization
Magic (New & Upvoted)
8
91Soft optimization makes the value target biggerΩ
Jeremy Gillen
1mo
Ω
18
1
17Reward is not Necessary: How to Create a Compositional Self-Preserving Agent for Life-Long LearningΩ
Roman Leventov
24d
Ω
2
2
61Steam
abramdemski
8mo
9
1
21Exploring Mild Behaviour in Embedded AgentsΩ
Megan Kinniment
7mo
Ω
4
7
65When to use quantilizationΩ
RyanCarey
4y
Ω
5
2
19Stable Pointers to Value III: Recursive QuantilizationΩ
abramdemski
5y
Ω
4
2
29Quantilizers maximize expected utility subject to a conservative cost constraintΩ
jessicata
7y
Ω
0
2
14Quantilal control for finite MDPsΩ
Vanessa Kosoy
5y
Ω
0
2
11Optimization Regularization through Time PenaltyΩ
Linda Linsefors
4y
Ω
4
1
33Satisficers want to become maximisers
Stuart_Armstrong
11y
68
2
2Thoughts on QuantilizersΩ
Stuart_Armstrong
6y
Ω
0
Add Posts