Learning and testing environments — LessWrong