Online Learning 2: Bandit learning with catastrophes — LessWrong