LESSWRONGTags
LW

AI-assisted/AI automated Alignment

EditHistorySubscribe
Discussion (0)
Help improve this page (1 flag)
EditHistorySubscribe
Discussion (0)
Help improve this page (1 flag)
AI-assisted/AI automated Alignment
Random Tag
Contributors
6Ruby

Not obviously the best name for this tag, but maybe good to explore/rename. Wiki-tags are publicly editable!

Posts tagged AI-assisted/AI automated Alignment
Most Relevant
3
296CyborgismΩ
NicholasKees, janus
1mo
Ω
41
3
98Beliefs and Disagreements about Automating Alignment ResearchΩ
Ian McKenzie
7mo
Ω
4
2
138Godzilla StrategiesΩ
johnswentworth
10mo
Ω
64
2
90We have to Upgrade
Jed McCaleb
5d
30
2
81Cyborg Periods: There will be multiple AI transitionsΩ
Jan_Kulveit, rosehadshar
1mo
Ω
8
2
54My thoughts on OpenAI's alignment plan
Akash
3mo
2
2
52Reflections on Deception & Generality in Scalable Oversight (Another OpenAI Alignment Review)
Shoshannah Tekofsky
2mo
6
2
45What specific thing would you do with AI Alignment Research Assistant GPT?Q
quetzal_rainbow, janus
3mo
Q
9
2
44A survey of tool use and workflows in alignment researchΩ
Logan Riggs, Jan, janus, jacquesthibs
1y
Ω
5
2
27[Linkpost] Jan Leike on three kinds of alignment taxes
Akash
3mo
2
2
20Model-driven feedback could amplify alignment failuresΩ
aogara
2mo
Ω
1
2
16Discussion on utilizing AI for alignment
elifland
7mo
3
2
14Making it harder for an AGI to "trick" us, with STVsΩ
Tor Økland Barstad
9mo
Ω
5
2
11Getting from an unaligned AGI to an aligned AGI? Ω
Tor Økland Barstad
9mo
Ω
7
2
9Eli Lifland on Navigating the AI Alignment Landscape
ozziegooen
2mo
1
Load More (15/35)
Add Posts