Computational complexity of RL with traps — LessWrong