Control by Committee
I think I have an interesting new research direction: Aligning Committees. Make ensembles of agents and have them act together as only one agent in the world with some protocol for how they combine preferences. Main Motivating Construction: Given a target consequence T, we construct a committee out of 3...
Nov 6, 20252