x
Token Statistics Fail at AI Attack Detection But Generation Profiles Succeed — LessWrong