Thoughts on the impact of RLHF research — LessWrong