LESSWRONGTags
LW

Human Values

•

Applied to Shard Theory - is it true for humans? by Rishika 2d ago

•

Applied to The Alignment Problem No One Is Talking About by James Stephen Brown 1mo ago

•

Applied to How to coordinate despite our biases? - tldr by Ryo 2mo ago

•

Applied to Please Understand by samhealy 3mo ago

•

Applied to Antagonistic AI by Xybermancer 4mo ago

•

Applied to Impossibility of Anthropocentric-Alignment by False Name 4mo ago

•

Applied to What does davidad want from «boundaries»? by Chipmonk 4mo ago

•

Applied to Requirements for a Basin of Attraction to Alignment by RogerDearnaley 4mo ago

•

Applied to Value learning in the absence of ground truth by Joel_Saarinen 4mo ago

•

Applied to Alignment has a Basin of Attraction: Beyond the Orthogonality Thesis by RogerDearnaley 4mo ago

•

Applied to Ontological Crisis in Humans by Wei Dai 5mo ago

•

Applied to Shut Up and Divide? by Wei Dai 5mo ago

•

Applied to If I ran the zoo by Optimization Process 5mo ago

•

Applied to Trading off Lives by Gunnar_Zarncke 5mo ago

•

Applied to Safety First: safety before full alignment. The deontic sufficiency hypothesis. by Chipmonk 5mo ago

•

Applied to Agent membranes/boundaries and formalizing “safety” by Chipmonk 5mo ago