Compendium of problems with RLHF — LessWrong