The Learning-Theoretic AI Alignment Research Agenda — LessWrong