Bottom-Up: Principled Compression to Shrink LLMs — LessWrong