x
How to train any multiagent systems end-to-end from AI feedback — LessWrong