Avoiding Side Effects in Complex Environments
by TurnTrout and nealeratzlaff
Previously: Attainable Utility Preservation: Empirical Results; summarized in AN #105 Our most recent AUP paper was accepted to NeurIPS 2020 as a spotlight presentation: > Reward function specification can be difficult, even in simple environments. Rewarding the agent for making a widget may be easy, but penalizing the multitude of...
Dec 12, 202062