x
Half-baked alignment idea: training to generalize — LessWrong