x
3 Challenges and 2 Hopes for the Safety of Unsupervised Elicitation — LessWrong