LESSWRONG

bfitzgerald3132
4100

Posts


5
AI models inherently alter "human values." So, alignment-based AI safety approaches must better account for value drift
9mo
2