The hard core of alignment (is robustifying RL) — LessWrong