"Why Not Just..."

Aug 08, 2022 by johnswentworth

A compendium of rants about alignment proposals, of varying charitability.

Deep Learning Systems Are Not Less Interpretable Than Logic/Probability/Etc
Godzilla Strategies
Rant on Problem Factorization for Alignment
Interpretability/Tool-ness/Alignment/Corrigibility are not Composable
How To Go From Interpretability To Alignment: Just Retarget The Search
Oversight Misses 100% of Thoughts The AI Does Not Think
Human Mimicry Mainly Works When We’re Already Close
Worlds Where Iterative Design Fails
Why Not Just... Build Weak AI Tools For AI Alignment Research?
Why Not Just Outsource Alignment Research To An AI?
OpenAI Launches Superalignment Taskforce