Reinforcement Learning in the Iterated Amplification Framework — LessWrong