LESSWRONGTags
LW

Reward Functions

EditHistorySubscribe
Discussion (0)
Help improve this page (2 flags)
EditHistorySubscribe
Discussion (0)
Help improve this page (2 flags)
Reward Functions
Random Tag
Contributors
Posts tagged Reward Functions
Most Relevant
5
47Draft papers for REALab and Decoupled Approval on tamperingΩ
Jonathan Uesato, Ramana Kumar
2y
Ω
2
2
20$100/$50 rewards for good referencesΩ
Stuart_Armstrong
7mo
Ω
5
2
13Why we want unbiased learning processes
Stuart_Armstrong
4y
3
1
30Thoughts on reward engineering Ω
paulfchristiano
3y
Ω
30
1
26The reward engineering problem Ω
paulfchristiano
3y
Ω
3
1
25Reward model hacking as a challenge for reward learning
ejenner
2mo
1
1
16Reward functions and updating assumptions can hide a multitude of sinsΩ
Stuart_Armstrong
2y
Ω
2
1
11Probabilities, weights, sums: pretty much the same for reward functionsΩ
Stuart_Armstrong
2y
Ω
1
1
10Utility versus Reward function: partial equivalence
Stuart_Armstrong
4y
5
1
9Reward function learning: the value function
Stuart_Armstrong
4y
0
1
7Intuitive examples of reward function learning?
Stuart_Armstrong
4y
3
1
6Reward function learning: the learning process
Stuart_Armstrong
4y
11
Add Posts