Why Ethics Can’t Be Compiled Away
On Moral Residue in Alignment Systems

This post is a diagnostic argument about alignment failures, not a proposal for a solution.

Introduction

In alignment work, it is common to treat ethics as something that can be settled in advance: encoded into objectives, constraints, constitutions, oversight procedures, or proofs of safety....
Jan 121