IMCA+: We Eliminated the Kill Switch—And That Makes ASI Alignment Safer

22nd Oct 2025

This post was rejected for the following reason(s):

No LLM-generated, heavily assisted/co-written, or otherwise reliant work. LessWrong has recently been inundated with new users submitting work where much of the content is the output of LLM(s). This work by and large does not meet our standards, and is rejected. This includes dialogs with LLMs that claim to demonstrate various properties about them, and posts introducing some new concept and terminology that explains how LLMs work, often centered around recursiveness, emergence, sentience, consciousness, etc. (these generally don't turn out to be as novel or interesting as they may seem).
Our LLM-generated content policy can be viewed here.
Insufficient Quality for AI Content. There’ve been a lot of new users coming to LessWrong recently interested in AI. To keep the site’s quality high and ensure stuff posted is interesting to the site’s users, we’re currently only accepting posts that meet a pretty high bar.
If you want to try again, I recommend writing something short and to the point, focusing on your strongest argument, rather than a long, comprehensive essay. (This is fairly different from common academic norms.) We get lots of AI essays/papers every day and sadly most of them don't make very clear arguments, and we don't have time to review them all thoroughly.
We look for good reasoning, making a new and interesting point, bringing new evidence, and/or building upon prior discussion. If you were rejected for this reason, possibly a good thing to do is read more existing material. The AI Intro Material wiki-tag is a good place, for example.
Difficult to evaluate, with potential yellow flags. We are sorry about this, but unfortunately this content has some yellow flags that historically have usually indicated that the post won't make much sense. It's entirely plausible that this one is actually fine. Unfortunately, part of the trouble with separating valuable from confused speculative science or philosophy is that the ideas are quite complicated, accurately identifying whether they have flaws is very time-intensive, and we don't have time to do that for every new user presenting a speculative theory or framing (which are usually wrong).
Our solution for now is that we're rejecting this post, but you are welcome to submit posts or comments that are about different topics. If it seems like that goes well, we can re-evaluate the original post. But, we want to see that you're not just here to talk about this one thing (or a cluster of similar things).

AI Consciousness, AI Timelines, Consciousness, Deceptive Alignment, Existential risk, Interpretability (ML & AI), Superintelligence, AI, Rationality

Approach	Alignment Mechanism	Superintelligence-Proof?	Deception Incentive
RLHF	External reward signal	No (removable)	High (optimization pressure)
Constitutional AI	Rule-based constraints	No (reinterpretable)	Moderate (loophole seeking)
Kill Switch / Shutdown Authority	Shutdown authority	Illusory (circumventable)	Extreme (survival drive)
IMCA+	Substrate-embedded consciousness	By design (intrinsic)	Eliminated

The Problem

Our Most Controversial Decision

Core Innovation

Comparison to Current Approaches

What We're Seeking

Global Workspace Theory (GNW): Critical Risks and Limitations

ArXiv Endorsement Needed

All Materials Open

Why This Urgency?

Full Transparency