How Secret Loyalty Differs from Standard Backdoor Threats — LessWrong