Briefly Extending Differential Optimization to Distributions — LessWrong