Training goals for large language models — LessWrong