RLLMv10 experiment — LessWrong