LESSWRONG
LW

328
Alex.fga
0010
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
Jailbreaking ChatGPT on Release Day
Alex.fga3y10

My experience has been that often all it takes to 'jailbreak' it, is to press the try again button. I think a lot of these examples people are trying are over engineered and it actually doesn't take much at all in most cases.

Reply
No wikitag contributions to display.
No posts to display.