Building AI safety benchmark environments on themes of universal human values — LessWrong