Coup probes: Catching catastrophes with probes trained off-policy — LessWrong