Evaluating OpenAI's alignment plans using training stories — LessWrong