Sparse Autoencoders (SAEs)

Applied to Past Tense Features by Can ago
Applied to Transformer Debugger by Joseph Bloom ago