x
NOVA Stage 0: Can Safety Be Structural? A Mechanism Proof at 307M Parameters — LessWrong