
AI Success Models

Contributors: plex

AI Success Models are proposed paths to an existential win via aligned AI. So far they are high-level overviews rather than complete plans, but each presents at least a sketch of what a full solution might look like. They can be contrasted with threat models, which are stories about how AI might lead to major problems.

Posts tagged AI Success Models

- Solving the whole AGI control problem, version 0.0001 [Ω] — Steven Byrnes, 2y (63 karma, 7 comments)
- An overview of 11 proposals for building safe advanced AI [Ω] — evhub, 3y (206 karma, 36 comments)
- A positive case for how we might succeed at prosaic AI alignment [Ω] — evhub, 2y (79 karma, 46 comments)
- Conversation with Eliezer: What do you want the system to do? — Akash, 1y (120 karma, 38 comments)
- Interpretability’s Alignment-Solving Potential: Analysis of 7 Scenarios [Ω] — Evan R. Murphy, 1y (52 karma, 0 comments)
- a narrative explanation of the QACI alignment plan — Tamsin Leake, 4mo (53 karma, 28 comments)
- Any further work on AI Safety Success Stories? [Q] — Krieger, 8mo (7 karma, 6 comments)
- AI Safety "Success Stories" [Ω] — Wei Dai, 4y (115 karma, 27 comments)
- Various Alignment Strategies (and how likely they are to work) — Logan Zoellner, 1y (79 karma, 34 comments)
- Success without dignity: a nearcasting story of avoiding catastrophe by luck — HoldenKarnofsky, 3mo (64 karma, 8 comments)
- An Open Agency Architecture for Safe Transformative AI [Ω] — davidad, 6mo (56 karma, 22 comments)
- Conditioning Generative Models for Alignment [Ω] — Jozdien, 1y (53 karma, 8 comments)
- formal alignment: what it is, and some proposals [Ω] — Tamsin Leake, 5mo (43 karma, 3 comments)
- Towards Hodge-podge Alignment [Ω] — Cleo Nardo, 6mo (84 karma, 29 comments)
- AI Safety via Luck [Ω] — Jozdien, 2mo (65 karma, 6 comments)

(Showing 15 of 27 posts. [Ω] marks posts crossposted to the AI Alignment Forum; [Q] marks question posts.)