To be legible, evidence of misalignment probably has to be behavioral — LessWrong