The Temporal Immune System: Cross-Session Behavioral Monitoring as a Fourth Defense Axis
I'm new here. I've been doing independent AI safety research and wanted to share findings I think this community would be interested in. I'm sharing a preprint proposing a cross-session behavioral monitoring framework for detecting multi-turn jailbreak attacks and sabotage patterns that evade per-interaction defenses. The core problem: every defense...