I think this is a really interesting post, particularly the outline of the general relationships between self-reporting and sentience.

The idea that "Training an LLM to develop a model of its internal operations which enables it answer non-trivial questions about its mental states" could be a straightforward way to optimize models for sentience is very thought-provoking.

  • I'm generally curious about the nature of the unique identities of these hypothetically sentient models, as well as how those identities would develop. What exa
