I’ve been working on an alignment framework that tries to address a gap I kept running into in existing work: most approaches either assume fixed values (reward functions, constitutions) or permit value learning without any clear notion of identity continuity. The core idea is to represent values explicitly as a...