x
Fluent dreaming for language models (AI interpretability method) — LessWrong