x
Rényi divergence as a secondary objective — LessWrong