Alex.fga
Jailbreaking ChatGPT on Release Day
Alex.fga 3y 10

My experience has been that often all it takes to 'jailbreak' it is to press the try again button. I think a lot of the examples people are trying are over-engineered; in most cases it doesn't take much at all.
