Martí Mas

Comments

Using GPT-Eliezer against ChatGPT Jailbreaking

The "linux terminal" prompt should have been a yes. Obviously, getting access to the model's "imagined terminal" has nothing to do with actually gaining access to the backend's terminal. The model is just pretending. It doesn't harm anybody in any way; it's just a thought experiment without any danger.
