x
AI #23: Fundamental Problems with RLHF — LessWrong