x
Weak-To-Strong Generalization — LessWrong