tricky_labyrinth's Shortform
Mar 19, 20253
"Ever wanted to mindwipe an LLM? Our method, LEAst-squares Concept Erasure (LEACE), provably erases all linearly-encoded information about a concept from neural net activations. It does so surgically, inflicting minimal damage to other concepts. ... LEACE has a closed-form solution that fits on a T-shirt. This makes it orders of...