x
SAEBER: Sparse Autoencoders for Biological Entity Risk — LessWrong