Learning from counterfactuals — LessWrong