Scott Guerin
1
2
First time post:
Thank you for this tremendous summary of risks and their historical analogues. I also see that you've written about SETI/METI and I look forward to having the time to read through that post.
What model did you use to assist your effort?
There must be a term of art in the Less Wrong community for the disturbing paradox, the catch-22, or what I think of as the "crossover effect" for using AI to write about its potential for catastrophe. Are we just giving it ideas? Is a Dark Forest solution needed akin to the Wallfacers?
I felt it recently...
As a layperson, "Syncroil" is the most amazing AI generated concept to have read in this fascinating paper or anywhere.
As for the attractor states, I imagine two multi-armed pendulums swinging chaotically through their physical states. Grok might have more arms hence more chaos. The pendulum's chaos within a prescribed space is a metaphor for the current limits of human knowledge that the models are trained on. Thus, they eventually settle into stillness, silence, or some kind of OCD behavior.
Hard not to be anthropomorphic about these results.