Entropic Regret I: Deterministic MDPs — LessWrong