Goodhart's Law

Diabloto96	v1.8.0Mar 19th 2023	(-2)
Ruby	v1.7.0Oct 1st 2020	(+227)
Ruby	v1.6.0Jul 26th 2020	C-Class on the basis of having some explanation of concept plus some elaboration in conjuction with a really strong posts list.
Ruby	v1.5.0May 4th 2020
Ruby	v1.4.0Apr 17th 2020	(+10/-10)
Ruby	v1.3.0Apr 17th 2020	(+9/-9)
Ruby	v1.2.0Apr 17th 2020	(+806/-8)
Ruby	v1.1.0Apr 17th 2020	(+559/-528)
Vladimir_Nesov	v0.0.3May 27th 2010	(+12) /* See also */
Vladimir_Nesov	v0.0.2May 27th 2010	(+14) /* See also */

Load More (10/12)

Diabloto96 v1.8.0Mar 19th 2023 (-2) 1

Goodhart's Law is of particular relevance to AI Alignment. Suppose you have something which is generally a good proxy for "the stuff that humans care about", it would be dangerous to have a powerful AI optimize for the proxy, in accordance with Goodhart's law, the proxy will breakdown.

Discuss this tag (0)

Ruby v1.7.0Oct 1st 2020 (+227) 2

Goodhart Taxonomy

In Goodhart Taxonomy, Scott Garrabrant identified four kinds of Goodharting:

Regressional Goodhart - When selecting for a proxy measure, you select not only for the true goal, but also for the difference between the proxy and the goal.
Causal Goodhart - When there is a non-causal correlation between the proxy and the goal, intervening on the proxy may fail to intervene on the goal.
Extremal Goodhart - Worlds in which the proxy takes an extreme value may be very different from the ordinary worlds in which the correlation between the proxy and the goal was observed.
Adversarial Goodhart - When you optimize for a proxy, you provide an incentive for adversaries to correlate their goal with your proxy, thus destroying the correlation with your goal.

Discuss this tag (0)

Ruby v1.1.0Apr 17th 2020 (+559/-528) 2

Goodhart's ~~law~~Law states that ~~once~~when a ~~certain indicator of success is made a~~proxy for some value becomes the target of optimization pressure, the proxy will cease to be a ~~social or economic policy, it will lose~~good proxy. Consider the ~~information content that would qualify it to play such~~Soviet story of a ~~role.~~

LESSWRONG
LW

See Also

Goodhart Taxonomy

Blog posts

See also