Take 13: RLHF bad, conditioning good. — LessWrong