This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Tags
LW
Login
Human Values
•
Applied to
Shard Theory - is it true for humans?
by
Rishika
2d
ago
•
Applied to
The Alignment Problem No One Is Talking About
by
James Stephen Brown
1mo
ago
•
Applied to
How to coordinate despite our biases? - tldr
by
Ryo
2mo
ago
•
Applied to
Please Understand
by
samhealy
3mo
ago
•
Applied to
Antagonistic AI
by
Xybermancer
4mo
ago
•
Applied to
Impossibility of Anthropocentric-Alignment
by
False Name
4mo
ago
•
Applied to
What does davidad want from «boundaries»?
by
Chipmonk
4mo
ago
•
Applied to
Requirements for a Basin of Attraction to Alignment
by
RogerDearnaley
4mo
ago
•
Applied to
Value learning in the absence of ground truth
by
Joel_Saarinen
4mo
ago
•
Applied to
Alignment has a Basin of Attraction: Beyond the Orthogonality Thesis
by
RogerDearnaley
4mo
ago
•
Applied to
Ontological Crisis in Humans
by
Wei Dai
5mo
ago
•
Applied to
Shut Up and Divide?
by
Wei Dai
5mo
ago
•
Applied to
If I ran the zoo
by
Optimization Process
5mo
ago
•
Applied to
Trading off Lives
by
Gunnar_Zarncke
5mo
ago
•
Applied to
Safety First: safety before full alignment. The deontic sufficiency hypothesis.
by
Chipmonk
5mo
ago
•
Applied to
Agent membranes/boundaries and formalizing “safety”
by
Chipmonk
5mo
ago