Augusto Bernardi — LessWrong

Approximating Human Preferences Using a Multi-Judge Learned System

by JoseFaustino, eitan sprejer, Fernando Avalos, and Augusto Bernardi

TL;DR: We present a conceptual discussion and a loose formalism of Expert Orchestration, with an emphasis on judges. We motivate the problem of how best to combine multiple judges' scores and present a solution: learning the combining function. Then, we present the architecture and low-level details of the experiments...

Jul 31, 2025•19