Agent membranes/boundaries and formalizing “safety” — LessWrong