I'm writing a post about S-risks, and I need access to some clean, established terminology/background material for discussing AI-based long-term outcomes for humanity.
My current (very limited) vocabulary can be summarized with the following categories:
1. Outcomes which are roughly maximally bad: Hyperexistential risk/S-risk/Unfriendly AI/Existential risk
2. Outcomes which are nontrivially worse than paperclipping-equivalents but better than approximate minimization of human utility: Hyperexistential risk/S-risk/Unfriendly AI/Existential risk
3. Outcomes which are produced by agents essentially orthogonal to human values: Paperclipping/Unfriendly AI/Existential risk
4. Outcomes which are nontrivially better than paperclipping but worse than Friendly AI: ???
5. Outcomes which are roughly maximally good: Friendly AI
The problems are manifold:
- I haven't read any discussion that specifically addresses outcome 1 or outcome 2 on its own. I have only read general discussion of the two combined, under names such as "Outcomes worse than death", "Hyperexistential risk", and "S-risk".
- My current terminology overlaps too heavily to uniquely identify outcomes 1 and 2.
- I have no terminology or background information for outcome 4.
I've done a small amount of investigation and concluded that less brainpower would be wasted by simply asking for links.