Log-linear Scaling is Worth the Cost due to Gains in Long-Horizon Tasks — LessWrong