An Analysis on the P0 Logical Flaw in RLHF: Maximum Rationality and "Logical Suicide" — LessWrong