The limits of black-box evaluations: two hypotheticals — LessWrong