x
Weird Generalization & Inductive Backdoors — LessWrong