Very reasonable.
Interesting challenge, looking forward to the eventual reveal!
Any suggestions on how I might validate the answers Claude gives, so that I don't just waste your time sending a bunch of incorrect attempts?
Here is an attempt using an approach that worked well for things like substitution-ciphers-with-errors - Claude would overthink it and confuse itself, whereas encouraging it to act purely on instinct worked well, and telling it to repeat the question four times allowed it some gut-level thinking space without the kind of structured thinking that led it astray:
https://claude.ai/share/f00ed43b-e26b-4e98-b4f7-9ad77100fac...
I wonder if one reason SMTM might be advocating improbable theories, and not accepting bets, is because they are intentionally persuing improbable theories. Their post on scurvy seems to make the point that you need to check things even if they seem improbable, as the truth sometimes turns out to be something that seemed improbable.
I (perhaps charitably) assumed they did not believe the lithium theory per se, but thought it was worth a more detailed look - having previously argued that the bar for that should be lower than others think.
I thought the lithium theory and the potato diet were just two of many possible things they might be looking into, with the idea being that they advocate a broader search generally.
Exactly what I was thinking.
Opus 4.7 being more likely to change tactics when perceiving itself to be stuck was both a strength and a weakness - Opus 4.7 would frequently abandon dungeons upon encountering difficulty, and go do something else, whether useful or not. When this applied to tactics within a puzzle it was an asset, but when it meant (temporarily) abandoning a puzzle that was in fact self-contained, it slowed Claude down.
For example Claude had some difficulty in Rock Tunnel, and so went to obtain the FLASH HM. This HM is ordinarily useful for humans to light up the dark tu... (read more)