Backdoor awareness and misaligned personas in reasoning models — LessWrong