Is behavioral safety "solved" in non-adversarial conditions? — LessWrong