Paper: Take Goodhart Seriously: Principled Limit on General-Purpose AI Optimization
TL;DR: This paper draws on existing mathematical results to build the most general and rigorous case for why we should be very cautious about pushing optimization too far in general-purpose AI systems: doing so likely leads to catastrophic Goodhart failures and, ultimately, to loss of control. Written by Antoine Maier, AI Security...