x
Things I wish I knew to save GPU minutes on Llama 405b model (and other beasts) — LessWrong