Can we get an AI to "do our alignment homework for us"? — LessWrong