Self-fulfilling misalignment data might be poisoning our AI models — LessWrong