System 2 Alignment — LessWrong