Least-problematic Resource for learning RL? — LessWrong