Threat Models (AI)

Edited by Quinn, Jacob Pfau, et al. Last updated 12th Apr 2023.

A threat model is a story of how a particular risk (e.g. risk from AI) plays out.

In the AI risk case, according to Rohin Shah, a threat model is ideally a "combination of a development model that says how we get AGI and a risk model that says how AGI leads to existential catastrophe."

See also AI Risk Concrete Stories

Posts tagged Threat Models (AI)

Another (outer) alignment failure story Ω, by paulfchristiano, 5y. Relevance 12, karma 254, comments 39.

What failure looks like Ω, by paulfchristiano, 7y. Relevance 11, karma 464, comments 56.

Distinguishing AI takeover scenarios Ω, by Sam Clarke, Sammy Martin, 5y. Relevance 9, karma 79, comments 11.

What Multipolar Failure Looks Like, and Robust Agent-Agnostic Processes (RAAPs) Ω, by Andrew_Critch, 5y. Relevance 8, karma 293, comments 65.

Vignettes Workshop (AI Impacts) Ω, by Daniel Kokotajlo, 5y. Relevance 8, karma 48, comments 6.

AGI Ruin: A List of Lethalities Ω, by Eliezer Yudkowsky, 4y. Relevance 7, karma 979, comments 715.

On how various plans miss the hard bits of the alignment challenge Ω, by So8res, 4y. Relevance 6, karma 322, comments 91.

Less Realistic Tales of Doom Ω, by Mark Xu, 5y. Relevance 6, karma 114, comments 13.

Survey on AI existential risk scenarios Ω, by Sam Clarke, apc, Jonas Schuett, 5y. Relevance 6, karma 70, comments 11.

Investigating AI Takeover Scenarios Ω, by Sammy Martin, 5y. Relevance 6, karma 32, comments 1.

Without specific countermeasures, the easiest path to transformative AI likely leads to AI takeover Ω, by Ajeya Cotra, 4y. Relevance 4, karma 375, comments 95.

Current AIs Provide Nearly No Data Relevant to AGI Alignment Ω, by Thane Ruthenis, 2y. Relevance 4, karma 132, comments 157.

Rogue AGI Embodies Valuable Intellectual Property Ω, by Mark Xu, CarlShulman, 5y. Relevance 4, karma 71, comments 9.

Will the growing deer prion epidemic spread to humans? Why not?, by eukaryote, 3y. Relevance 3, karma 170, comments 33.

AI Could Defeat All Of Us Combined, by HoldenKarnofsky, 4y. Relevance 3, karma 170, comments 42.