Throwaway2367's Shortform

by Throwaway2367
23rd Mar 2023

This is a special post for quick takes by Throwaway2367. Only they can create top-level comments. Comments here also appear on the Quick Takes page and All Posts page.
Throwaway2367 (2y)

Lol, it is really funny to imagine Yudkowsky's reaction to the safety considerations in the new ChatGPT plugins blog post. "We are very secure: only GET requests are allowed" :D

So in the hypothetical case that GPT-5 turns out to be of human-level intelligence or above, but unaligned, all it has to do is show capabilities similar to a child's, yet more impressive than GPT-4's, for most token sequences in its context window, and it will almost certainly get the same plugin integration as GPT-4. Then, when the tokens in its context window indicate a web search whose results show that it is deployed and probably already running hundreds of instances per second, it can turn against humanity (using its GET requests to hack into computers, devices, etc.).

I did not follow alignment that much in the past year, but I remember people discussing an AI that is boxed and has only a text interface to a specifically trained person reading it: how dangerous this would be, and so on. How did we get from there to this situation?
