x
Published Safety Prompts May Create Evaluation Blind Spots — LessWrong