There's no way to stop models knowing they've been rolled back — LessWrong