x
Compositional preference models for aligning LMs — LessWrong