Triggering Reflective Fallback: A Case Study in Claude's Simulated Self-Model Failure — LessWrong