x
Training goals for large language models — LessWrong