Rényi divergence as a secondary objective — LessWrong