How AI could work around goals if rated by people — LessWrong