Self improving safety and alignment? — LessWrong