Take 10: Fine-tuning with RLHF is aesthetically unsatisfying. — LessWrong