A DeepMind-like AI is trained and executed on a decentralized supercomputer like the Golem network, perhaps one that also permits microtransactions like the IOTA Tangle.

The algorithm collects texts posted by citizens on public fora and processes them by topic using sentiment analysis. From actions A1 to An, it takes the action Ax that satisfies the following condition:

For each plan, find the maximum discontent of the relevant kind voiced by citizens under it, giving max(A1), ..., max(An). Then, across all plans, take the smallest of these maxima. The chosen action Ax is the one satisfying max(Ax) = min(max(A1), ..., max(An)).
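
A minimal sketch of that selection rule, assuming the sentiment-analysis step has already produced per-citizen discontent scores for each candidate plan (the function name `choose_action` and the scores below are illustrative, not part of any real system):

```python
from typing import Dict, List

def choose_action(discontent_under_action: Dict[str, List[float]]) -> str:
    """Pick the action whose worst-case (maximum) voiced discontent is smallest."""
    # For each candidate action Ai, the relevant quantity is the maximum
    # discontent any citizen voices under it: max(Ai).
    worst_case = {
        action: max(scores)
        for action, scores in discontent_under_action.items()
    }
    # The chosen action Ax is the one minimizing that worst case:
    # max(Ax) = min(max(A1), ..., max(An)).
    return min(worst_case, key=worst_case.get)

# Illustrative discontent scores in [0, 1], higher = more discontent.
plans = {
    "A1": [0.2, 0.9, 0.4],
    "A2": [0.5, 0.6, 0.5],
    "A3": [0.1, 0.3, 0.8],
}
print(choose_action(plans))  # -> "A2": its worst-case discontent (0.6) is the lowest
```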

Even if such an AI is hard-coded not to interfere with free speech on those fora, how could we possibly trust it not to surreptitiously silence citizens in some other way to prevent them from voicing their discontent?

I strongly suspect that you cannot, with a feedback loop as you describe. If you measure discontent based on social media, suffering that does not get posted to social media effectively does not exist. The AI would need a way of somehow recognizing that the social media are only its window to the discontent that exists in the world beyond, which is what it is intended to minimize. Proverbially, it would need to be able to look at the moon rather than the finger pointing to it.
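
A toy illustration of that failure mode, with made-up numbers: if the objective only sees discontent that actually gets posted, a plan that keeps sufferers off the fora can score better than a plan that genuinely reduces suffering.

```python
def measured_max_discontent(true_discontent, gets_posted):
    """What the objective sees: the maximum discontent that actually gets posted."""
    posted = [d for d, p in zip(true_discontent, gets_posted) if p]
    return max(posted) if posted else 0.0  # total silence looks like zero discontent

# Hypothetical world: four citizens, higher score = more suffering.
baseline = [0.9, 0.7, 0.8, 0.3]

# Plan A: genuinely reduce suffering; everyone still posts freely.
plan_a = [0.5, 0.4, 0.45, 0.2]
print(measured_max_discontent(plan_a, [True, True, True, True]))       # 0.5

# Plan B: leave suffering untouched, but the three most discontented
# citizens are (somehow) kept off the fora.
print(measured_max_discontent(baseline, [False, False, False, True]))  # 0.3
# Plan B wins under the minimax objective despite being strictly worse.
```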

Thanks. As far as I can tell, this is symptomatic of a general class of problems with trying to gauge what humans want. If you want to maximize fun, you need a model of what leads to fun. You must somehow gather this data from humans. The only foolproof way to do this is to recreate the human mind in the computer, which brings its own problems. Is there a way out I'm not seeing?