Learning the prior and generalization — LessWrong