A few Alignment questions: utility optimizers, SLT, sharp left turn and identifiability
I am trying to find an Alignment direction that would be interesting for me to learn and work on. I've already received a couple of answers, but I want more opinions, and I get more motivation from discussion and cooperation.

1. As I understand that almost any agent can be...