10x more training compute = 5x greater task length (kind of) — LessWrong