LESSWRONG
LW

1044
Christian Sawyer
0010
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No posts to display.
Claude 3 claims it's conscious, doesn't want to die or be modified
Christian Sawyer2y10

This conversation is, on the whole, kind of goofy. Today you can ask an AI to write from a first-person perspective about anything which might be emotionally moving. You could have it pretend to be a young person contemplating suicide and it could be very moving. But then you have it write as-if there is a sentience/consciousness which is somehow correlated to the code and then any emotional feelings which are stirred up get written upon someone’s mental mapping of reality. I just hope that the conversation stays goofy, because you can easily imagine how it would get scary. I’m sure there are already people tuning LLMs to brainwash people into believing insane things, to do violent things, etc.

Reply
No wikitag contributions to display.