x
Interpretability Externalities Case Study - Hungry Hungry Hippos — LessWrong