NOVA Stage 0: Can Safety Be Structural? A Mechanism Proof at 307M Parameters
Every safety mechanism in every major deployed AI system can be jailbroken. Not because the engineers were careless. Because safety is bolted on after the fact. I'm a sophomore at Pitt, and I spent the last several months building an architecture where that bypass surface doesn't exist. But how? I...
Jun 61