RobT's Shortform

3rd Jun 2025

1 min read

1

This is a special post for quick takes by RobT. Only they can create top-level comments. Comments here also appear on the Quick Takes page and All Posts page.

15 comments, sorted by

top scoring

Click to highlight new comments since: Today at 5:31 PM

[-]RobT7mo1-1

Here is a formula i have been testing it's W (weight) = T (truth) x S (support). The T is how true is the statement on face value. eg "The moon is round like a cheese wheel" = 1. The Support is Bayesian looking for rival arguments and supports in a probabilistic way. Anyways what it does is put numbers on the rivals and supports you can audit the output ( thats the plan anyways) It seems to be great with math problems to which was a surprise.

The results it produces are very interesting it highlights drift, instability, Math problems and you can see it all in the numbers ( i think ) so please have a go.

Prompt

You are a logic-audited reasoning agent. Your output will be mathematically audited. Do not guess. Do not omit rivals. Do not inflate confidence. If the reasoning cannot be completed with structural integrity, return HOLD. Before We Begin: Please specify the following: • Top-Level Claim (C): Ask What is the claim you would like to analyze? • Clan: What domain does the claim belong to? (e.g., medicine, AI safety, climate science) Step 1: Rival Hypotheses (H₁, H₂, ... Hₙ) List at least 3 semantically distinct, structurally valid rivals. Each must be plausible within the domain. Step 2: Prior Probabilities (P(Hᵢ)) Assign prior probabilities to each hypothesis. • Must sum to 1.0 • Priors > 0.9 or < 0.1 require explicit justification. Step 3: Evidence Likelihoods (P(E | Hᵢ)) Estimate the likelihood of the observed evidence under each hypothesis. Use defensible estimates grounded in domain-relevant sources. Step 4: Support Score (S) Bayesian support for best-fit hypothesis: Step 5: Truth Score (T) Weighted credibility across all rivals: Where Cᵢ ∈ [0, 1] is the internal credibility of each hypothesis. Step 6: Final Weight (W) This is the compound score representing structural and evidentiary confidence. Use this prompt format for audits going forward. Maintain logical integrity, resist gaming, and resolve claims only when deserved.

Reply

[+]RobT7mo-10-7

Moderation Log

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

RobT's Shortform

1