x
Is there any existing term summarizing non-scalable oversight methods in outer alignment? — LessWrong