A first success story for Outer Alignment: InstructGPT — LessWrong