Reward Functions
Posts tagged Reward Functions (sorted by Most Relevant)
Draft papers for REALab and Decoupled Approval on tampering [Ω], by Jonathan Uesato, Ramana Kumar (2y): relevance 5, karma 47, 2 comments
$100/$50 rewards for good references [Ω], by Stuart_Armstrong (7mo): relevance 2, karma 20, 5 comments
Why we want unbiased learning processes, by Stuart_Armstrong (4y): relevance 2, karma 13, 3 comments
Thoughts on reward engineering [Ω], by paulfchristiano (3y): relevance 1, karma 30, 30 comments
The reward engineering problem [Ω], by paulfchristiano (3y): relevance 1, karma 26, 3 comments
Reward model hacking as a challenge for reward learning, by ejenner (2mo): relevance 1, karma 25, 1 comment
Reward functions and updating assumptions can hide a multitude of sins [Ω], by Stuart_Armstrong (2y): relevance 1, karma 16, 2 comments
Probabilities, weights, sums: pretty much the same for reward functions [Ω], by Stuart_Armstrong (2y): relevance 1, karma 11, 1 comment
Utility versus Reward function: partial equivalence, by Stuart_Armstrong (4y): relevance 1, karma 10, 5 comments
Reward function learning: the value function, by Stuart_Armstrong (4y): relevance 1, karma 9, 0 comments
Intuitive examples of reward function learning?, by Stuart_Armstrong (4y): relevance 1, karma 7, 3 comments
Reward function learning: the learning process, by Stuart_Armstrong (4y): relevance 1, karma 6, 11 comments