Things I wish I had known to save GPU minutes on the Llama 405B model (and other beasts)
This post shows how easy it is to load a Llama 405B model on Runpod, and how costly it can get if you don’t know a few things in advance. I hope it helps you save those precious GPU...
Nov 25, 2025