Benchmarking Proposals on Risk Scenarios — LessWrong