AI Safety Cases

Edited by Rauno Arike, last updated 19th Nov 2024

A safety case is a structured argument showing that a system is acceptably safe for a specific use in a specific environment. Safety cases typically include:

  • A description of the system's operational context
  • Identification of potential hazards and their consequences
  • A description of the risk controls that mitigate the hazards
  • An account of any residual risk
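
These components can be read as a simple schema. As a minimal sketch in Python (purely illustrative; the class and field names are hypothetical rather than any standard safety-case notation), the structure might look like:

    # Illustrative sketch only: hypothetical names, not a standard format.
    from dataclasses import dataclass, field

    @dataclass
    class Hazard:
        description: str        # the potential hazard
        consequence: str        # what happens if the hazard occurs
        controls: list[str]     # risk controls that mitigate the hazard
        residual_risk: str      # an account of any risk that remains

    @dataclass
    class SafetyCase:
        system: str               # the system being argued about
        operational_context: str  # the specific use and environment
        hazards: list[Hazard] = field(default_factory=list)

    # Hypothetical example: one hazard from a case for a code assistant.
    case = SafetyCase(
        system="example-assistant-v1",
        operational_context="internal code assistance by vetted employees",
        hazards=[
            Hazard(
                description="assistant suggests insecure code",
                consequence="vulnerabilities reach production",
                controls=["mandatory human code review",
                          "static analysis in CI"],
                residual_risk="low: reviews may miss subtle flaws",
            )
        ],
    )

Note that real safety-case notations such as Goal Structuring Notation (GSN) organize the argument as a tree of claims, evidence, and context rather than the flat record above; the sketch is only a reading aid.
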
Posts tagged AI Safety Cases

  • AXRP Episode 45 - Samuel Albanie on DeepMind’s AGI Safety Approach (DanielFilan, 3mo; 31 points, 0 comments)
  • Near- and medium-term AI Control Safety Cases (Martín Soto, 10mo; 9 points, 0 comments)
  • AI companies are unlikely to make high-assurance safety cases if timelines are short (ryan_greenblatt, 9mo; 145 points, 5 comments)
  • Anthropic: Three Sketches of ASL-4 Safety Case Components (Zach Stein-Perlman, 1y; 95 points, 33 comments)
  • New report: Safety Cases for AI (joshc, 2y; 91 points, 14 comments)
  • Toward Safety Cases For AI Scheming (Mikita Balesni, Marius Hobbhahn, 1y; 60 points, 1 comment)
  • A sketch of an AI control safety case (Tomek Korbak, joshc, Benjamin Hilton, Buck, Geoffrey Irving, 9mo; 57 points, 0 comments)
  • Notes on control evaluations for safety cases (ryan_greenblatt, Buck, Fabien Roger, 2y; 49 points, 0 comments)
  • The V&V method - A step towards safer AGI (Yoav Hollander, 4mo; 20 points, 1 comment)