Emad — LessWrong

Why could current AI alignment strategies fail against unchained AIs?

Currently, AI alignment is more focused on a multi-layer safety strategy to refuse risky requests from users. This approach works well for LLMs and today’s chatbots. However, this ignores a more dangerous threat: The possibility that someone, at some point, runs an unchained AI without these limits. By unchained AI,...

Dec 2, 20251

Potentialism: A Proposal for Redefining Ethics in the Modern Age of AI

I’ve been working as an independent thinker on a framework for rethinking ethics, with a focus on psychology and how people relate to their own traits. I call this framework Potentialism. More recently, I realized that the same framework might also be useful for AI ethics – not as a...

Nov 22, 20251

Potentialism: An Operating System for Ethics in the Age of AI

Traditional moral frameworks are rigid and fragile. If we want both to align advanced AI and to stop fighting our own minds, we may need to shift from “fixed virtues” to “regulated potentials”. This text is a proposal and an invitation to critique. We’re standing at a crossroads. On one...

Nov 22, 20251