New location of the previous file on GitHub: https://github.com/Alexhb61/Alignment/blob/main/committees/Control_by_committee.md
The main criticism I anticipated of this piece was the disassembly risk.
By adding a later step to this outer-alignment approach (transforming a committee into a modified learning problem), I believe I now have a much stronger angle on outer alignment.
I will continue to write about it here:
https://github.com/Alexhb61/Alignment/blob/main/committees/outer_alignment.md
I think I have an interesting new research direction: Aligning Committees.
Build ensembles of agents that act in the world as a single agent, with some protocol for combining their preferences.
Main Motivating Construction:
Given a target consequence T, we construct a committee of three expected-utility-maximizing agents with different utility functions: Planner, Wanter, and Unwanter. Planner proposes a plan, and Wanter and Unwanter each either sign off on it or veto it. A plan P that both sign off on is executed; otherwise the committee takes the safe but useless null action.
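To make the protocol concrete, here is a minimal Python sketch of the sign-off/veto step. Everything in it is an illustrative assumption on my part: the string `Plan` type, the `signs_off` rule (approve iff the plan looks at least as good as the null action), and the function names are not from the post, and the actual utility functions are the ones defined in the full post.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Hypothetical plan representation; the post does not fix one.
Plan = str
NULL_ACTION: Optional[Plan] = None  # the safe-but-useless default action

@dataclass
class Agent:
    """An expected-utility maximizer; each member has its own utility function."""
    name: str
    utility: Callable[[Plan], float]

    def signs_off(self, plan: Plan) -> bool:
        # Assumed decision rule: sign off iff the plan is at least as good
        # as the null action (normalized here to utility 0). The post leaves
        # the exact sign-off rule implicit.
        return self.utility(plan) >= 0.0

def committee_act(proposal: Plan, wanter: Agent, unwanter: Agent) -> Optional[Plan]:
    """Execute the Planner's proposal only if both Wanter and Unwanter
    sign off; otherwise fall back to the null action."""
    if wanter.signs_off(proposal) and unwanter.signs_off(proposal):
        return proposal
    return NULL_ACTION
```

The structural point is that a veto can only push the committee toward the null action, never toward a different plan; whether that is enough to make the whole committee a mild optimizer is exactly question 3 below.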
The Utility Functions
As the committee becomes smarter and more powerful, I believe it remains a mild optimizer.
Questions:
1. Is this already part of someone else's research agenda? If so, whose?
2. Is there anything I should definitely read before heading down this path?
3. Is the main motivating construction a mild optimizer? Why or why not?
If this is interesting to you, please read the full post at the link.