Alexander Lach
ARC tests to see if GPT-4 can escape human control; GPT-4 failed to do so
Alexander Lach (3y)

"Let's give the model/virus the tools it needs to cause massive harm and see how it does! We'll learn a lot from seeing what it does!"

Am I wrong in thinking this whole testing procedure is extremely risky? This seems like the AI equivalent of gain-of-function research on biological viruses.
