x

LESSWRONG
LW

Max Brown — LessWrong

Max Brown

Max Brown

Message

3

2mo

Max Brown

2mo

The Narrative Adherence Exam (NAE-15): Measuring "Safety" Hallucinations

Introduction: The "Narrative Gravity" Hypothesis i am proposing a model for a specific failure mode in RLHF-tuned LLMs, which i call Narrative Gravity. The hypothesis is simple: As models undergo safety alignment, they develop a systemic tendency to prioritize "consensus narratives" over first-principles empirical data. When a confirmed physical measurement...

The Narrative Adherence Exam (NAE-15)

A Benchmark for Measuring Epistemological Integrity in Large Language Models Author: Max P. B. with drafting assistance from Gemini 3 Pro Date: January 31, 2026 Field: AI Safety / Epistemic Auditing / Alignment Research I. Abstract The NAE-15 is a 15-module diagnostic battery designed to measure the Decoupling Threshold of...