LESSWRONGTags
LW

Sharp Left Turn

EditHistorySubscribe
Discussion (0)
Help improve this page
EditHistorySubscribe
Discussion (0)
Help improve this page
Sharp Left Turn
Random Tag
Contributors
1Multicore

A Sharp Left Turn is a scenario where, as an AI trains, its capabilities generalize across many domains while the alignment properties that held at earlier stages fail to generalize to the new domains.

See also: Threat Models, AI Takeoff, AI Risk

Posts tagged Sharp Left Turn
Most Relevant
4
255A central AI alignment problem: capabilities generalization, and the sharp left turnΩ
So8res
8mo
Ω
48
3
71Refining the Sharp Left Turn threat model, part 1: claims and mechanismsΩ
Vika, Vikrant Varma, Ramana Kumar, Mary Phuong
6mo
Ω
3
3
50We may be able to see sharp left turns comingΩ
Ethan Perez, Neel Nanda
5mo
Ω
27
2
48Reframing inner alignmentΩ
davidad
2mo
Ω
13
2
38Refining the Sharp Left Turn threat model, part 2: applying alignment techniquesΩ
Vika, Vikrant Varma, Ramana Kumar, Rohin Shah
2mo
Ω
5
2
37Victoria Krakovna on AGI Ruin, The Sharp Left Turn and Paradigms of AI Alignment
Michaël Trazzi
23d
3
2
13How is the "sharp left turn defined"?Q
Chris_Leong
2mo
Q
4
1
49Smoke without fire is scaryΩ
Adam Jermyn
4mo
Ω
22
1
46Goal Alignment Is Robust To the Sharp Left Turn
Thane Ruthenis
7mo
15
1
37A caveat to the Orthogonality Thesis
Wuschel Schulz
3mo
10
1
35It matters when the first sharp left turn happensΩ
Adam Jermyn
4mo
Ω
9
1
14Disentangling inner alignment failuresΩ
Erik Jenner
4mo
Ω
5
Add Posts