Expert Iteration From the Inside — LessWrong