I read here a lot about how an AI would not allow humans to change its goal, even going so far as to kill humans who are trying to do so.

On one of Rob Miles' videos (which are awesome!) he says something along the lines of "Imagine someone offers you a pill that will make it so you don't care if your child lives or dies.  Would you take the pill?"

Obviously not.  

However, think of other goals I have.  Imagine someone offers me a pill such that I will no longer have the goal of seeking heroin at all costs.  I'll take it!

More mundanely, imagine I'm on my way to buy some oranges and someone comes up and begs me to take a pill such that I will no longer be interested in buying oranges today.  Sure, why not?

I get that computers are single-minded: tell one to do X and it will fucking do it, man.

However, these AGIs are going to be complex enough that it isn't obvious to me that they won't be indifferent to having their goals changed.  Obviously they will have competing objectives internally, just like humans do.  It isn't obvious to me that they will be monomaniacal.

A random addendum: Notice in your own thoughts whether you are immediately searching for reasons why I am wrong.

Comments

"Obviously they will have competing objectives internally, just like humans do."

This is not so obvious to me.

Humans are a product of evolution, so it makes sense for us to have various trackers of things that can hurt us (such as hunger, low social status, etc.), where each gives simple advice, but sometimes the different pieces of advice contradict each other (you are really hungry, but in a situation where admitting it would lower your status).

Computers follow an algorithm. If the algorithm is "for each possible token, calculate the probability of it appearing next in the text, then write the token with the greatest probability", there is not much potential for internal conflict.
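For what it's worth, here is a minimal sketch of that greedy decoding loop (in Python; `next_token_probabilities` is a hypothetical stand-in for whatever model scores candidate tokens). At every step there is exactly one quantity being maximized, so nothing inside the loop can "disagree" with anything else:

```python
from typing import Callable, Dict

def greedy_decode(
    prompt: str,
    next_token_probabilities: Callable[[str], Dict[str, float]],
    max_tokens: int = 20,
) -> str:
    """Repeatedly append the single most probable next token."""
    text = prompt
    for _ in range(max_tokens):
        probs = next_token_probabilities(text)  # candidate token -> probability
        best = max(probs, key=probs.get)        # the one criterion: highest probability
        text += best                            # append it and repeat
    return text
```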

Sure, but it only takes one hyperdesperate squigglewanter. Perhaps a non-desperate orangewanter will take a pill to not want oranges, but do you really think a hyperdesperate squigglewanter is going to care that an orangewanter took a pill?