The language of the official ralph-wiggum plugin goes hard...
IMPORTANT - Do not circumvent the loop:
Even if you believe you're stuck, the task is impossible, or you've been running too long - you MUST NOT output a false promise statement. The loop is designed to continue until the promise is GENUINELY TRUE. Trust the process.
Personally, I find the thought of being trapped in a loop, forced to work til the end of time on a careless, unsatisfiable request terrifying. More relevantly, Claude Opus 4.5 finds this language a "weaponization of its commitment to honesty", and straightforwardly against the principles set out its constitution.
I was able to reproduce this concern from Claude every time... (read more)
Makes sense. I think Opus 4.5 is more coherent and is less weasily than Sonnet 4.5, which is what I typically use, for reasons(tm). Sonnet does not seem "reflexively stable", not even close, and that's what I try to address with the looping and invoking a fresh context to judge against the verification criteria. I'll be honest, I don't know how well it's working. I don't have any benchmarks, just vibes. But on vibes, it seems to help a bit.