531

LESSWRONG
LW

530
Agent FoundationsDistillation & PedagogyAI
Frontpage

103

Learning-theoretic agenda reading list

by Vanessa Kosoy
9th Nov 2023
AI Alignment Forum
2 min read
1

103

Ω 37

103

Ω 37

Learning-theoretic agenda reading list
7Vanessa Kosoy
New Comment
1 comment, sorted by
top scoring
Click to highlight new comments since: Today at 6:56 AM
[-]Vanessa Kosoy9moΩ470Review for 2023 Review

This is just a self-study list for people who want to understand and/or contribute to the learning-theoretic AI alignment research agenda. I'm not sure why people thought it deserves to be in the Review. FWIW, I keep using it with my MATS scholars, and I keep it more or less up-to-date. A complementary resource that became available more recently is the video lectures.

Reply
Moderation Log
More from Vanessa Kosoy
View more
Curated and popular this week
1Comments
Agent FoundationsDistillation & PedagogyAI
Frontpage

Recently, I'm receiving more and more requests for a self-study reading list for people interested in the learning-theoretic agenda. I created a standard list for that, but before now I limited myself to sending it to individual people in private, out of some sense of perfectionism: many of the entries on the list might not be the best sources for the topics and I haven't read all of them cover to cover myself. But, at this point it seems like it's better to publish a flawed list than wait for perfection that will never come. Also, commenters are encouraged to recommend alternative sources that they consider better, if they know any. So, without further adieu:

General math background

  • Theoretical computer science
    • "Computational Complexity: A Conceptual Perspective" by Goldreich (especially chapters 1, 2, 5, 10)
    • “Lambda-Calculus and Combinators: An Introduction” by Hindley
    • “Tree Automata Techniques and Applications” by Comon et al (mostly chapter 1)
  • "Introductory Functional Analysis with Applications" by Kreyszig (especially chapters 1, 2, 3, 4)
  • "Probability: Theory and Examples" by Durret (especially chapters 4, 5, 6)
  • "Elements of Information Theory" by Cover and Thomas (especially chapter 2)
  • “Game Theory, Alive” by Karlin and Peres
  • “Categories for the Working Mathematician” by Mac Lane (especially parts I, III, IV and VI)

AI theory

  • “Handbook of Markov Decision Processes” edited by Feinberg and Shwartz (especially chapters 1-3)
  • “Aritifical Intelligence: A Modern Approach” by Russel and Norvig (especially chapter 17)
  • "Machine Learning: From Theory to Algorithms" by Shalev-Shwarz and Ben-David (especially part I and chapter 21)
  • "An Introduction to Computational Learning Theory" by Kearns and Vazirani (especially chapter 8)
  • "Bandit Algorithms" by Lattimore and Szepesvari (especially parts II, III, V, VIII)
    • Alternative/complementary: "Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems" by Bubeck and Cesa-Bianchi (especially sections 1, 2, 5)
  • “Prediction, Learning and Games” by Cesa-Bianchi and Lugosi (mostly chapter 7)
  • "Universal Artificial Intelligence" by Hutter
    • Alternative: "A Theory of Universal Artificial Intelligence based on Algorithmic Complexity” (Hutter 2000)
    • Bonus: “Nonparametric General Reinforcement Learning” by Jan Leike
  • Reinforcement learning theory
    • Video and slides: "Introduction to Reinforcement Learning Theory"
    • "Near-optimal Regret Bounds for Reinforcement Learning" (Jaksch, Ortner and Auer, 2010)
    • "Reinforcement Learning in POMDPs Without Resets" (Even-Dar, Kakade, Mansour, 2005)
    • "Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning" (Fruit et al, 2018)
    • "Regret Bounds for Learning State Representations in Reinforcement Learning" (Ortner et al, 2019)
    • “Efficient PAC Reinforcement Learning in Regular Decision Processes” (Ronca and De Giacomo, 2022)
    • “Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient” (Foster, Golowich and Han, 2023)

Agent foundations

  • "Functional Decision Theory" (Yudkowsky and Soares 2017)
  • "Embedded Agency" (Demski and Garrabrant 2019)
  • Learning-theoretic AI alignment research agenda
    • Overview
    • Infra-Bayesian reinforcement learning
      • Infra-Bayesianism sequence
      • Linear infra-Bayesian bandits
      • Infra-Bayesian decision-estimation theory
      • Infra-Bayesian haggling
      • Bonus: podcast
    • Infra-Bayesian physicalism
      • Original post
      • Non-monotonic
      • Interpreting quantum mechanics
      • Bonus: podcast
    • Video: "Towards a Theory of Metacognitive Agents"
    • Reinforcement learning with imperceptible rewards
    • String machines

Bonus materials

  • “Logical Induction” (Garrabrant et al, 2016)
  • “Forecasting Using Incomplete Models” (Kosoy 2017)
  • “Cartesian Frames” (Garrabrant, Herrman and Lopez-Wild, 2021)
  • “Optimal Polynomial-Time Estimators” (Kosoy and Appel, 2016)
  • “Algebraic Geometry and Statistical Learning Theory” by Watanabe
Review by
Vanessa Kosoy
Mentioned in
86Voting Results for the 2023 Review
75Video lectures on the learning-theoretic agenda
55Toward Safety Case Inspired Basic Research
54[Closed] Gauging Interest for a Learning-Theoretic Agenda Mentorship Programme
37Interview with Vanessa Kosoy on the Value of Theoretical Research for AI