When is it Better to Train on the Alignment Proxy? — LessWrong