x
If It Can Learn It, It Can Unlearn It: AI Safety as Architecture, Not Training — LessWrong