Pre-registration: Can irrelevant context expansion reverse LLM jailbreaks?
I have a hypothesis I want to make pay rent before I get too attached to it so I am posting predictions here before running anything. The idea I have been looking at different jailbreak techniques and they all seem to do the same thing structurally. They narrow what the...
Feb 271