x
This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
LW
Login
Alignment Tax — LessWrong
You are viewing version 1.0.0 of this page. Click here to view the latest version.
Alignment Tax
You are viewing revision 1.0.0, last edited by
markov
This page is a stub.
Subscribe
Discussion
1
Subscribe
Discussion
1
Posts tagged
Alignment Tax
Most Relevant
3
79
The case for a negative alignment tax
Cameron Berg
,
Judd Rosenblatt
,
Diogo de Lucena
,
Trent Hodgeson
1y
20
2
68
Alignment can be the ‘clean energy’ of AI
Cameron Berg
,
Judd Rosenblatt
,
Trent Hodgeson
10mo
8
2
59
Against ubiquitous alignment taxes
beren
3y
10
2
58
Safety-capabilities tradeoff dials are inevitable in AGI
Ω
Steven Byrnes
4y
Ω
4
2
50
The case for removing alignment and ML research from the training dataset
beren
3y
8
2
44
How difficult is AI Alignment?
Ω
Sammy Martin
1y
Ω
6
2
31
Safety tax functions
owencb
1y
0
2
27
[Linkpost] Jan Leike on three kinds of alignment taxes
Orpheus16
3y
2
2
22
AI safety tax dynamics
owencb
1y
0
1
142
Ten Levels of AI Alignment Difficulty
Ω
Sammy Martin
2y
Ω
24
1
106
Security Mindset and the Logistic Success Curve
Eliezer Yudkowsky
8y
49
1
28
A/B testing could lead LLMs to retain users instead of helping them
Daniel Paleka
1mo
0
1
18
On the Importance of Open Sourcing Reward Models
elandgre
3y
5
1
7
Labor Participation is a High-Priority AI Alignment Risk
alex
1y
0
1
5
The commercial incentive to intentionally train AI to deceive us
Derek M. Jones
3y
1