x
Untrusted AIs can exploit feedback in control protocols — LessWrong