Estimating efficiency improvements in LLM pre-training — LessWrong