ChatGPT Agent: evals and safeguards — LessWrong