Hello! I'm Guillermo, a fellow in the Winter25 cohort.
I have a background in mathematics and computer science, and in particular computational neuroscience.
In my project I am looking at the Reward Hypothesis in decision theory and reinforcement learning. I would like to write a digest of the main results that connect a preference order, via order-preserving (utility) functions, to expected utility maximization and then to reward functions with discount factors. I would furthermore like to formalize some of the key results in Lean.
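To give a taste of the kind of statement I have in mind: the first link in that chain is the ordinal representation of a preference order by an order-preserving function. A minimal Lean sketch (my own notation, not a finished formalization) might look like:

```lean
import Mathlib.Data.Real.Basic

-- A total, transitive preference relation on outcomes of type α.
structure Pref (α : Type) where
  le    : α → α → Prop
  total : ∀ a b, le a b ∨ le b a
  trans : ∀ a b c, le a b → le b c → le a c

-- u : α → ℝ represents the preference order P when it is
-- order-preserving in both directions.
def Represents {α : Type} (P : Pref α) (u : α → ℝ) : Prop :=
  ∀ a b, P.le a b ↔ u a ≤ u b
```

The digest would then trace how extra axioms (continuity, independence over lotteries) strengthen this ordinal representation to expected utility, and how further structure on sequential decisions yields discounted reward.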
Overall I am interested in topics that connect rationality and decision theory all the way to practical aspects of machine learning and reinforcement learning, with the aim of bridging these areas for AI safety.
Nice to meet you all!