[AN #91]: Concepts, implementations, problems, and a benchmark for impact measurement — LessWrong