Training a Sparse Autoencoder in < 30 minutes on 16GB of VRAM using an S3 cache — LessWrong