Research Areas in Evaluation and Guarantees in Reinforcement Learning (The Alignment Project by UK AISI) — LessWrong