Hello! I'm Guillermo, a fellow in the Winter25 cohort.
I have a background in mathematics, computer science and particularly computational neuroscience.
In my project I am looking at the Reward Hypothesis in decision theory and reinforcement learning theory and would like to write a digest of what are the main results that connect a preference order from order-preserving functions, to expected utility maximization and reward functions (with discount factors). I furthermore would like to formalize some of the key results in Lean.
Hello! I'm Guillermo, a fellow in the Winter25 cohort.
I have a background in mathematics, computer science and particularly computational neuroscience.
In my project I am looking at the Reward Hypothesis in decision theory and reinforcement learning theory and would like to write a digest of what are the main results that connect a preference order from order-preserving functions, to expected utility maximization and reward functions (with discount factors). I furthermore would like to formalize some of the key results in Lean.
Overall I am interested in top... (read more)