Hmm, let me think step by step.
LLMs shaping human's writing patterns in the wild
I was having some trouble really grokking how to apply this, so I had o3-mini rephrase the post in terms of the Experiential Array:
1. Ability
Name of Ability:
“Miasma-Clearing Protocol” (Systematically cornering liars and exposing contradictions)
Description:
This is the capacity to detect dishonest or evasive claims by forcing competing theories to be tested side-by-side against all relevant facts, thereby revealing contradictions and “incongruent” details that cannot coexist with the lie.
Criterion:
“Ensuring that all relevant facts align with a coherent, contradiction-free explanation.”
Definition (for that Criterion):
To “ensure all relevant facts align” means systematically verifying that each piece of evidence is fully accounted for by a theory without requiring impossible or self-contradictory assumptions.
In practice, this translates to:
- Listing out every significant or relevant fact.
- Checking each fact against any proposed explanation.
- Tracking which theory remains consistent with every fact, and which theory fails on one or more points.
When modeling how someone successfully applies the Miasma-Clearing Protocol, two types of Cause-Effects often emerge:
Cause-Effect #1:
“By methodically listing all facts side by side with each theory, I create a clear structure that prevents isolated ‘deflections.’”
In other words, organizing all the evidence in a single framework enables a person to see where a theory’s contradictions lie.
Cause-Effect #2:
“By insisting that we restart from the beginning whenever a theory is modified, we ensure no contradictory details are lost.”
Thus, forcing a re-check of all facts enables us to capture newly revealed contradictions.
Cause-Effect #3:
“When I use the protocol and find a single, unscathed theory, I can be confident I’ve uncovered the truth and avoided being misled.”
Confidence and clarity motivate the pursuit of the protocol.
Cause-Effect #4:
“Uncovering a lie protects me (or my client) from severe negative consequences (e.g., wasted time, bad decisions, legal jeopardy).”
This motivates rigorous application of the protocol.
These are other beliefs that shape how someone carries out the protocol but are not the main drivers of it:
These beliefs color the attitude one has while running the protocol (e.g., staying patient, knowing contradictions will emerge).
A strategy describes the internal/external sequence for ensuring the Criterion (“all facts align with a coherent explanation”) is met.
Test:
- You see that each relevant fact (phone location, cookie allergy, timeline, etc.) is congruent with a proposed theory.
- You find no single fact that contradicts that theory.
When the protocol is working, you know the Criterion is satisfied because there is zero incongruence between the theory and any known fact.
Primary Operation (the main sequence of steps):
- List all relevant facts in a shared framework (e.g., bullet points, spreadsheet).
- Identify the competing theory (or theories) under consideration (“Jake ate the cookies” vs. “Gillian ate the cookies”).
- Check each fact side by side with each theory. Mark “Congruent” or “Incongruent.”
- Note any facts that create contradictions for a theory.
- If one theory remains fully congruent and the other is contradicted, highlight the contradiction and invite the person to explain or revise.
These come into play when a contradiction emerges or the liar tries to pivot:
Secondary Operation #1: Re-run the Gauntlet
- If the theory is modified (“Actually, the phone was stolen and the thief followed me!”), start from scratch with the entire fact list.
- This ensures no detail is lost in the shuffle.
Secondary Operation #2: Add New Facts
- If a new piece of evidence surfaces, add it to the list and re-check all theories from the top.
- A liar’s confetti (irrelevant details or pivoting to new stories) can be turned into new “facts” to test for consistency.
A “background” emotional state that keeps one persistent and systematic:
Sustaining Emotion:
- Calm Curiosity – The ability to remain unflustered, methodical, and genuinely interested in aligning facts with reality.
- Determination – The refusal to let emotional manipulation (“How could you say that about me?”) derail the step-by-step analysis.
These emotions maintain the mental environment needed to keep applying the protocol without succumbing to frustration or intimidation.
Key Observable Behaviors:
- Writing or visually mapping out facts and theories (e.g., “Let’s put this on the board.”).
- Insisting on going one by one through each piece of evidence: “Let’s not skip around; we’ll get to that point after we finish with the first.”
- Refusing to accept indefinite deflection: “We need to see how your new explanation fits every piece of evidence, not just one.”
- Asking direct clarifying questions whenever the other person tries to pivot: “Which fact does your new story explain better than the old one?”
These are abilities or conditions outside the main mental structure but crucial to success:
In summary, the Miasma-Clearing Protocol, as framed by the Experiential Array, is the ability to systematically confront a dishonest or evasive person by:
When done correctly, the protocol exposes incongruences that the liar cannot reconcile without further contradiction. It “clears the miasma” of deflections, so that the truthful theory remains standing, unscathed by contradictory evidence.
Object-level and meta-level norms on weirdness vary greatly. I believe it's true for your friends that it doesn't cost weirdness points to being them to your Zendo, and the same is true of many of my friends.
But, its not the case that it won't cost weirdness points for everyone, even those who want to be invited. They'll just think, "oh this a weird thing my friend does that I want to check out".
But if many of those things build up they may want to avoid you, because they themselves feel weirded out, or because they're worried that their friends will be weirded out.
Same for wearing a suit, or in my case, a sash. I've had many people who actually appreciate the sash, say it lends a sense of authority. Most won't mention it, but still have a slight sense of "this is a bit strange and I should be wary." One struggle with my ex was that she was sensitive to any hint of that wariness, whereas I am just ok with it and find it a great filter to bring the right people into close relationship.
It's very easy to not pick up on that wariness as people are supposed to hide it. Especially because the people who end up getting close to you are ones who it actually doesn't bother.
But you shouldn't mistake that for a universal "just do whatever you want in a respectful and confident way and others won't be bothered" rule. It's just not how everyone works.
.
Here's the part of the blog post where they describe what's different about Claude 3.7
We’ve developed Claude 3.7 Sonnet with a different philosophy from other reasoning models on the market. Just as humans use a single brain for both quick responses and deep reflection, we believe reasoning should be an integrated capability of frontier models rather than a separate model entirely. This unified approach also creates a more seamless experience for users.
Claude 3.7 Sonnet embodies this philosophy in several ways. First, Claude 3.7 Sonnet is both an ordinary LLM and a reasoning model in one: you can pick when you want the model to answer normally and when you want it to think longer before answering. In the standard mode, Claude 3.7 Sonnet represents an upgraded version of Claude 3.5 Sonnet. In extended thinking mode, it self-reflects before answering, which improves its performance on math, physics, instruction-following, coding, and many other tasks. We generally find that prompting for the model works similarly in both modes.
Second, when using Claude 3.7 Sonnet through the API, users can also control the budget for thinking: you can tell Claude to think for no more than N tokens, for any value of N up to its output limit of 128K tokens. This allows you to trade off speed (and cost) for quality of answer.
Third, in developing our reasoning models, we’ve optimized somewhat less for math and computer science competition problems, and instead shifted focus towards real-world tasks that better reflect how businesses actually use LLMs.
I assume they're referring to points 1 and 2. It's a single model, that can have its reasoning anywhere from 0 tokens (which I imagine is the default non-reasoning model) all the way up to 128k tokens.
Why didn't they run agentic coding or tool use with their reasoning model?
Fwiw I'll just say that I think jhanas and subspace are different things.
I think subspace is more about flooding the body with endorphins and jhanas are more about down regulating certain aspects of the brain and getting into the right hemisphere.
Although each probably contains some similar aspects.
I think this is one of the most important questions we currently have in relation to time to AGI, and one of the most important "benchmarks" that tell us where we are in terms of timelines.
FWIW it's not TOTALLY obvious to me that the literature supports the notion that deliberate practice applies to meta-cognitive skills at the highest level like this.
Evidence for this type of universal transfer learning is scant.
It's clear to me from my own experience that this can be done, but if people are like "ok buddy, you SAY you've used focused techniques and practice to be more productive, but I think you just grew out of your ADHD" (which people HAVE said to me), I don't think it's fair to just say "cummon man, deliberate practice works!"
I think your second objection is actually very strong
Okay, but I don't think you can practice at "thinking"
Not because you'll goodhart, but because people think it's plausible that the mind just isn't plastic on this level of basic meta-cognition. There's lots of evidence AGAINST that, and many times that people THINK they've find this sort of universal transfer, it often ends up being more domain specific than they thought.
Probably the most compelling evidence that it's possible is spiritual traditions (especially empirical ones like Buddhism) that consistently show that through a specific method, you can get deep shifts in ways of seeing and relating to everything that are consistently described the same way by many different people over space and time. But in terms of the experimental literature I don't think there actually IS much good support for universal transfer of meta-cognition via deliberate practice.
I would REALLY like to see some head to head comparisons with you.com from a subject matter expert, which I think would go a long way in answering this question.
It gives me everything I need to replicate the ability. I just step by step bring on the motivation, emotions, beliefs, and then follow the steps, and I can do the same thing!
Whereas, just reading your post, I get a sense you have a way of really getting down to the truth, but replicating it feels quite hard.