Compositional preference models for aligning LMs — LessWrong