Goodhart's Law
• Applied to Resolutions to the Challenge of Resolving Forecasts by Noosphere89 at 1mo
• Applied to Soft optimization makes the value target bigger by Vladimir_Nesov at 1mo
• Applied to Don't align agents to evaluations of plans by TurnTrout at 2mo
• Applied to Don't design agents which exploit adversarial inputs by TurnTrout at 3mo
• Applied to Alignment allows "nonrobust" decision-influences and doesn't require robust grading by TurnTrout at 3mo
• Applied to Scaling Laws for Reward Model Overoptimization by leogao at 4mo
• Applied to Outer alignment and imitative amplification by Noosphere89 at 4mo
• Applied to Oversight Leagues: The Training Game as a Feature by Paul Bricman at 5mo
• Applied to Can "Reward Economics" solve AI Alignment? by Q Home at 5mo
• Applied to Reducing Goodhart: Announcement, Executive Summary by Ruby at 6mo
• Applied to The Dark Miracle of Optics by Noosphere89 at 6mo
• Applied to Bayesianism versus conservatism versus Goodhart by Noosphere89 at 6mo
• Applied to Circumventing interpretability: How to defeat mind-readers by Lee Sharkey at 7mo
• Applied to Proxy misspecification and the capabilities vs. value learning race by Ruby at 9mo
• Applied to Goodhart's Law Causal Diagrams by Raemon at 10mo
• Applied to Replacing Karma with Good Heart Tokens (Worth $1!) by jimrandomh at 10mo
• Applied to [Intro to brain-like-AGI safety] 10. The alignment problem by Steven Byrnes at 10mo
• Applied to Practical everyday human strategizing by [anonymous] at 10mo
• Applied to Why Agent Foundations? An Overly Abstract Explanation by plex at 10mo