Weak Evidence of Value Convergence Without RLHF — LessWrong