x
Problem: safe AI from episodic RL — LessWrong