Trading off compute in training and inference (Overview) — LessWrong