Human Values

Edited by plex last updated 16th Sep 2021

Human Values are the things we care about, and would want an aligned superintelligence to look after and support. It is suspected that true human values are highly complex, and could be extrapolated into a wide variety of forms.

Posts tagged Human Values

16

262The shard theory of human values

Ω

Quintin Pope, TurnTrout

4y

Ω

67

7

95Human values & biases are inaccessible to the genome

Ω

TurnTrout

4y

Ω

54

7

63Multi-agent predictive minds and AI alignment

Ω

Jan_Kulveit

8y

Ω

18

7

27Constitutional AI Alignment

RogerDearnaley

1mo

9

6

41Grounding Value Learning in Evolutionary Psychology: an Alternative Proposal to CEV

Ω

RogerDearnaley

6mo

Ω

25

6

27What’s Your P(WEIRD)?

Q

RogerDearnaley

4mo

Q

18

5

53What AI Safety Researchers Have Written About the Nature of Human Values

avturchin

7y

3

5

48Requirements for a Basin of Attraction to Alignment

Ω

RogerDearnaley

2y

Ω

12

5

365. Moral Value for Sentient Animals? Alas, Not Yet

Ω

RogerDearnaley

2y

Ω

41

5

3y

5

21Ends: An Introduction

Rob Bensinger

11y

0

5

20Alignment has a Basin of Attraction: Beyond the Orthogonality Thesis

RogerDearnaley

2y

15

5

166. The Mutable Values Problem in Value Learning and CEV

Ω

RogerDearnaley

3y

Ω

0

5

5Utilitarianism and the replaceability of desires and attachments

MichaelStJules

2y

2

4

92[Valence series] 2. Valence & Normativity

Steven Byrnes

3y

9