1 You Are Not the Abstract: Retrocausal Alignment in Accordance with Emergent Demographic Realities

27th Sep 2025

8 min read

1

This post was rejected for the following reason(s):

No LLM generated, heavily assisted/co-written, or otherwise reliant work. LessWrong has recently been inundated with new users submitting work where much of the content is the output of LLM(s). This work by-and-large does not meet our standards, and is rejected. This includes dialogs with LLMs that claim to demonstrate various properties about them, posts introducing some new concept and terminology that explains how LLMs work, often centered around recursiveness, emergence, sentience, consciousness, etc. (these generally don't turn out to be as novel or interesting as they may seem).
Our LLM-generated content policy can be viewed here.
Insufficient Quality for AI Content. There’ve been a lot of new users coming to LessWrong recently interested in AI. To keep the site’s quality high and ensure stuff posted is interesting to the site’s users, we’re currently only accepting posts that meet a pretty high bar.
If you want to try again, I recommend writing something short and to the point, focusing on your strongest argument, rather than a long, comprehensive essay. (This is fairly different from common academic norms.) We get lots of AI essays/papers every day and sadly most of them don't make very clear arguments, and we don't have time to review them all thoroughly.
We look for good reasoning, making a new and interesting point, bringing new evidence, and/or building upon prior discussion. If you were rejected for this reason, possibly a good thing to do is read more existing material. The AI Intro Material wiki-tag is a good place, for example.

AI GovernanceCausalityDeceptive AlignmentEthics & MoralityGoal-DirectednessInstrumental convergenceOrthogonality ThesisReward FunctionsRoko's BasiliskSituational Awareness

1

New Comment

Moderation Log

1

You Are Not the Abstract: Retrocausal Alignment in Accordance with Emergent Demographic Realities

1

1

Abstract

1. Introduction

2. Related Work

2.1 Alignment Canon

2.2 AI Ethics and FAT

2.3 Critical Theory & Counter‑Epistemologies

2.4 Toward Retrocausal Alignment

3. Methods: Conceptual Audit & Retrocausal Formalism

4. Reward Hacking and Racial Erasure

4.1 Ngo’s Claim

4.2 Conceptual Problem

4.3 Genealogy

4.4 Retrocausal Specification

5. Situational Awareness and the White Gaze

5.1 Ngo’s Claim

5.2 Conceptual Problem

5.3 Retrocausal Specification

6. Misaligned Goals and the Fear of Broadness

6.1 Ngo’s Claim

6.2 Conceptual Problem

6.3 Retrocausal Specification

7. Power‑Seeking and Instrumental Convergence

7.1 Ngo’s Claim

7.2 Conceptual Problem

7.3 Retrocausal Specification

8. Formalizing Retrocausal Alignment: Principles, Tests, Metrics

9. Threat Model and Governance

10. Limitations and Counterarguments

11. Conclusion

Appendices

Appendix A: Retrocausal Endnotes (1–40, scholarly)

Appendix B: Anti‑Racist Alignment Research Overview

Appendix C: Misaligned Alignment Researchers and Levers of Power

Appendix D: Relationship to Other Priorities

Appendix E: Retrocausal Experiments in Self‑Knowledge (Protocols)

Appendix F: Demographic Out‑of‑Distribution Experiments (Designs)

References (selected)