antmaier

Message

4mo

Paper: Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization

TL;DR: This paper takes existing mathematical results to build the most general and rigorous case for why we should be very cautious about pushing optimization too far in General-Purpose AI systems, as it likely leads to catastrophic Goodhart failures, and ultimately loss of control. Written by Antoine Maier, AI Security...

Oct 28, 2025•14

Message

13 karma

1 post

Member for 4 months

antmaier — LessWrong

antmaier

Message

4mo

antmaier

Paper: Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization

Oct 28, 2025•14

Message

13 karma

1 post

Member for 4 months

Paper: Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization

antmaier

4mo

TL;DR: This paper takes existing mathematical results to build the most general and rigorous case for why we should be very cautious about pushing optimization too far in General-Purpose AI systems, as it likely leads to catastrophic Goodhart failures, and ultimately loss of control.

Written by Antoine Maier, AI Security Researcher at the General-Purpose AI Policy Lab, and Aude Maier, PhD student at the École Polytechnique Fédérale de Lausanne (EPFL).

LESSWRONG
LW

LESSWRONG
LW

antmaier

antmaier

antmaier

Paper: Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization

antmaier

antmaier

antmaier

Paper: Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization