To Limit Impact, Limit KL-Divergence — LessWrong