Tensor Trust: An online game to uncover prompt injection vulnerabilities
by Luke Bailey and qxcv
TL;DR: Play this online game to help CHAI researchers create a dataset of prompt injection vulnerabilities. RLHF and instruction tuning have succeeded at making LLMs practically useful, but in some ways they are a mask that hides the shoggoth beneath. Every time a new LLM is released, we see just...
Sep 1, 202330