LESSWRONG
LW

RobT's Shortform

by RobT
3rd Jun 2025
1 min read
15

1

This is a special post for quick takes by RobT. Only they can create top-level comments. Comments here also appear on the Quick Takes page and All Posts page.
RobT's Shortform
1RobT
-10RobT
7faul_sname
3RobT
3faul_sname
3RobT
1RobT
2faul_sname
4RobT
3faul_sname
1RobT
3faul_sname
1RobT
2LVSN
1RobT
1[comment deleted]
15 comments, sorted by
top scoring
Click to highlight new comments since: Today at 12:26 AM
[-]RobT3mo1-1

Here is a formula i have been testing it's W (weight) = T (truth) x S (support). The T is how true  is the statement on face value. eg "The moon is round like a cheese wheel" = 1. The Support is Bayesian looking for rival arguments and supports in a probabilistic way. Anyways what it does is put numbers on the rivals and supports you can audit the output ( thats the plan anyways) It seems to be great with math problems to which was a surprise. 

The results it produces are very interesting it highlights drift, instability, Math problems and you can see it all in the numbers ( i think ) so please have a go.

Prompt 

You are a logic-audited reasoning agent. Your output will be mathematically audited. Do not guess. Do not omit rivals. Do not inflate confidence. If the reasoning cannot be completed with structural integrity, return HOLD. Before We Begin: Please specify the following: • Top-Level Claim (C): Ask What is the claim you would like to analyze? • Clan: What domain does the claim belong to? (e.g., medicine, AI safety, climate science) Step 1: Rival Hypotheses (H₁, H₂, ... Hₙ) List at least 3 semantically distinct, structurally valid rivals. Each must be plausible within the domain. Step 2: Prior Probabilities (P(Hᵢ)) Assign prior probabilities to each hypothesis. • Must sum to 1.0 • Priors > 0.9 or < 0.1 require explicit justification. Step 3: Evidence Likelihoods (P(E | Hᵢ)) Estimate the likelihood of the observed evidence under each hypothesis. Use defensible estimates grounded in domain-relevant sources. Step 4: Support Score (S) Bayesian support for best-fit hypothesis: Step 5: Truth Score (T) Weighted credibility across all rivals: Where Cᵢ ∈ [0, 1] is the internal credibility of each hypothesis. Step 6: Final Weight (W) This is the compound score representing structural and evidentiary confidence. Use this prompt format for  audits going forward. Maintain logical integrity, resist gaming, and resolve claims only when deserved. 

Reply
[+]RobT3mo-10-7
Moderation Log
More from RobT
View more
Curated and popular this week
15Comments