Estimating training compute of Deep Learning models — LessWrong