Notes on Internal Objectives in Toy Models of Agents — LessWrong