Applying utility functions to humans considered harmful

[-]Qiaochu_Yuan13y130

When I read the beginning of this post I asked myself, "if people don't have utility functions, why haven't LWers gotten rich by constructing Dutch books against people?"

I answered myself, "in practice, most people will probably ignore clever-looking bets because they'll suspect that they're being tricked. One way to avoid Dutch books is to avoid bets in general."
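To make the Dutch book idea concrete, here is a minimal sketch of a money pump. It assumes a hypothetical agent with cyclic preferences (A over B, B over C, C over A) who will pay a small fee for any swap to something it prefers. The names `PREFERS`, `FEE`, `offer_for`, and `run_money_pump` are invented for this illustration and do not come from the post.

```python
from fractions import Fraction

# Hypothetical cyclic preferences: each key is strictly preferred to its value.
PREFERS = {"A": "B", "B": "C", "C": "A"}
FEE = Fraction(1, 100)  # assumed price the agent pays per "upgrade"

def offer_for(holding):
    """Return the item the agent strictly prefers to what it currently holds."""
    for better, worse in PREFERS.items():
        if worse == holding:
            return better
    raise ValueError(f"no preferred item for {holding!r}")

def run_money_pump(start, rounds):
    """Repeatedly sell the agent its preferred swap; return final item and total fees."""
    holding, paid = start, Fraction(0)
    for _ in range(rounds):
        holding = offer_for(holding)  # the agent accepts: it prefers the new item
        paid += FEE                   # ...and pays for the privilege
    return holding, paid
```

After three swaps the agent holds what it started with and is three fees poorer. An agent that simply refuses every offer, as the comment suggests most people do, never enters the loop at all.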

[-]mattnewport16y90

A model is not terribly useful if it does not do a better job of prediction than alternative models. (Micro)economics does quite a good job of predicting human behaviour based on a very simple model of predictable rationality. It is not clear to me that this model offers a better approach to making meaningful predictions about real world human behaviour. I've only skimmed the article but it appears the tests are limited to rather artificial lab tests. That's better than nothing but I'm skeptical that this model's real world predictive power justifies its c... (read more)

1Kaj_Sotala16y

Yes, I admit that it can sometimes be useful to think of humans as having utility functions, and this can be a useful model. I should have said that in the post, now that you mention it. But one should then always keep in mind that it's just a simplified model that's appropriate for certain situations, not something that can be indiscriminately employed in every case.

1bgrah44916y

I think it's useful inasmuch as it turns "unknown unknowns" into "known unknowns." Knowing what you're ignoring in your approximation seems valuable.

1mattnewport16y

I think they are claiming that their model more closely matches observed behaviour in certain specific controlled environments. It is a big leap from there to assume that the features of the model map in any useful way to actual features of human reasoning.

-6rortian16y

[-]Eliezer Yudkowsky16y80

Research like this seems very hopeful to me. It breaks down into a nice component describing what people actually want and a lot of other components describing shifts of attention and noise. If anything, that seems too optimistic compared to, say, prospect theory, in which the basic units of motivation are shifts from a baseline and there's no objective baseline or obvious way to translate shift valuations into fixed-level valuations.

1mattnewport16y

I'm a little surprised you haven't commented on the randomization aspects of this model. As you've convincingly argued, if your intention is accurate prediction then you can't improve your results by introducing randomness into your model. This model claims to improve its accuracy by introducing randomness in steps 2 and 4 which is a claim I am highly suspicious of after reading your sequence on the topic.

6Kaj_Sotala16y

The model doesn't incorporate randomness in the sense of saying "to predict the behavior of humans, roll a die and predict behavior X on a result of 1-3 and predict behavior Y on a result of 4-6", which is what Eliezer was objecting to. Instead, it says there is randomness involved in the subjects it's modeling, and says the behavior of the subjects can be best modeled using a certain (deterministically derived) probability distribution.

0mattnewport16y

Does it say that? I didn't get the impression they were making that claim. It seems highly likely to be false if they are. They model changes in attentional focus as a random variable but presumably those changes in attention are driven largely by complex events in the brain responding to complex features of the environment, not by random quantum fluctuation. They are using a random variable because the actual process is too complex to model and they have no simple better idea for how to model it than pure randomness.

0Kaj_Sotala16y

Well, yes, "so complex and chaotic that you might as well call it random" is what I meant. That's what's usually meant by the term - the results of dice rolls aren't mainly driven by quantum randomness either.

2mattnewport16y

Complex yes, chaotic I doubt. I'm reasonably confident that there is some kind of meaningful pattern to attentional shifts that is correlated with features of the environment and that is adaptive to improve outcomes in our evolutionary environment. Randomness in this model reflects a lack of sufficient information about the environment or the process that drives attention rather than a belief that attention shifts do not have a meaningful correlation with the environment.

1prase16y

Depends on what you want to predict. I throw dice and have a model which says that number 5 is the result, deterministically. Now I will be right in 1/6 of cases. If I am rewarded for each correct guess, then by introducing randomness into the model I will gain nothing - this is what Eliezer was arguing for. But if I am rewarded for correctly predicting the distribution of results after many throws, any random model is clearly superior to the five-only one.

0mattnewport16y

The random model is better than the five-only one but a non-random model that directly predicts the distribution would be better still. If your goal is to predict the distribution then a model that does so by simulating random dice throws is inferior to one that simply predicts the distribution.
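The two scoring rules in this exchange can be compared with exact arithmetic. The sketch below assumes a fair six-sided die and uses total variation distance as a stand-in for "correctly predicting the distribution"; that metric and the names `hit_probability` and `total_variation` are choices made for this illustration, not anything specified in the thread.

```python
from fractions import Fraction

FACES = range(1, 7)
TRUE_DIST = {f: Fraction(1, 6) for f in FACES}  # assumption: a fair die

# Two candidate models, each written down directly as a predicted distribution.
ALWAYS_FIVE = {f: Fraction(int(f == 5)) for f in FACES}
UNIFORM = dict(TRUE_DIST)

def hit_probability(guess_dist):
    """Chance that a guess drawn from guess_dist matches an independent fair throw."""
    return sum(guess_dist[f] * TRUE_DIST[f] for f in FACES)

def total_variation(predicted):
    """Distance between the predicted and true distributions (0 means identical)."""
    return sum(abs(predicted[f] - TRUE_DIST[f]) for f in FACES) / 2
```

Per throw, both models hit with probability 1/6, so randomness buys nothing there. On the distribution, the five-only model is off by 5/6 while the uniform one is exact. Note that `UNIFORM` is stated directly, with no simulated throws, which is the alternative mattnewport describes.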

0prase16y

And if you want to do both, i.e. predict both the individual throws and the overall distribution? The "model" which directly states that the distribution is uniform doesn't say anything about the individual events. Of course we can have a model which says that the sequence will be e.g. 1 4 2 5 6 3 2 5 1 6 4 3 and then repeated, or that the sequence will follow the decimal expansion of pi. Both these models predict the distribution correctly, but they seem to be more complex than the random one and moreover they can produce false predictions of correlations (like 5 is always preceded by 2 in the first case). Or do I misunderstand you somehow?

0mattnewport16y

A model that uses a sequence is simpler than one that uses a random number, as anyone who has implemented a pseudo random number generator will tell you. PRNGs are generally either simple or good, rarely both.

3prase16y

Depends on what hardware you have got. Having a computer with access to some quantum system (decaying nuclei, spin measurement in orthogonal directions) there is no need to specify in a complicated way the meaning of "random". Or, of course, there is no need for the randomness to be "fundamental", whatever it means. You can as well throw dice (though it would be a bit circular to use dice to explain dice, but it seems all right to use dice as the random generator for making predictions in economy).

1mattnewport16y

A hardware random number generator isn't part of an algorithm, it's an input to an algorithm. You can't argue that your model is algorithmically simpler by replacing part of the algorithm with a new input.

1prase16y

So, should quantum mechanics be modified by removing the randomness from it? Now, having a two level spin system in state ( |0> + |1> ) /sqrt[2], QM says that the result of measurement is random and so we'll find the particle in state |1> with probability 1/2. A modified QM would say, that the first measurement reveals 1, the second (after recreating the original initial state, of course) 1, the third 0, etc., with sequence 110010010110100010101010010101011110010101... I understand that you say that the second version of quantum mechanics would be simpler, and disagree.
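For reference, the probability of 1/2 quoted here follows from the standard Born rule. The symbol ψ for the prepared state is introduced only for this note.

```latex
\[
|\psi\rangle = \frac{|0\rangle + |1\rangle}{\sqrt{2}}, \qquad
P(1) = \bigl|\langle 1 | \psi \rangle\bigr|^{2}
     = \left|\frac{1}{\sqrt{2}}\right|^{2}
     = \frac{1}{2}.
\]
```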

[-]Johnicholas16y70

There's a gap between the general applicability of utility functions in theory, and their general inapplicability in practice. Indeed, there's a general gap between theory and practice.

I would argue that this gap is a reason to do FAI research in a practical way - writing code, building devices, performing experiments. Dismissing gritty practicality as "too risky" or "not relevant yet" (which is what I hear SIAI doing) seems to lead to becoming a group without experience and skill at executing practical tasks.

Disclaimer: I'm aware that ... (read more)

4Nick_Tarleton16y

What sort of code, devices, experiments do you have in mind?

2Johnicholas16y

MBlume's article "Put It To The Test" is pretty much what I have in mind. If you think you understand a decision theory, can you write a test suite for an implementation of it? Can your test suite pass a standard implementation, and fail mutations of that standard implementation? Can you implement it? Is the performance of your implementation within a factor of ten-thousand of the standard implementation? Is it competitive? Can you improve the state of the art? If you believe that the safe way to write code is to spend a long time in front of whiteboards, getting the design right, and then only a very short time developing (using a few high-IQ programmers) - How many times have you built projects according to this development process? What is your safety record? How does it compare to other development processes? If you believe that writing machine-checkable proofs about code is important - Can you download and install one of the many tools (e.g. Coq) for writing proofs about code? Can you prove anything correct? What projects have you proved correct? What is their safety record? What opportunities have you given reality to throw wrenches into your ideas - how carefully have you looked for those wrenches?

-1loqi16y

Any such "experiments" that allow for effective outbound communication from a proto-AI seem unacceptably risky. I'm curious what you think of the "oh crap, what if it's right?" scenario I commented on over on the AI box post.

1Johnicholas16y

I didn't SAY try to build a self-improving AI! That's what the disclaimer was for! Also, your claim of "unacceptably risky" needs actual arguments and reasoning to support it. As I see it, the only choice that is clearly unacceptably risky is inaction. Carefully confining your existential risk reduction activity to raising awareness about potential AI risks isn't in any sense safe; for example, it could easily cause more new uFAI projects than it prevents.

1loqi16y

Raising awareness about the problem isn't just about getting would-be uFAI'ers to mend their sinful ways, you know. It's absolutely necessary if you're convinced you need help with it. As you said, inaction is untenable. If you're certain that a goal of this magnitude is basically impossible given the status quo, taking some initial risks is a trivial decision. It doesn't follow that additional risks share the same justification. I'm also not convinced we understand the boundaries between "intelligent" and "self-improving" well enough to assume we can experiment with one and not the other. What sort of "practical tasks" do you have in mind that don't involve potentially intelligent information-processing systems, and why do you think they'll be at all relevant to the "real" work ahead?

[-]Matt_Simpson16y50

Are you questioning that we can model human behavior using a utility function (i.e. microeconomics) or that we can model human values using a utility function? Or both? The former is important if you're trying to predict what a human would do, the second is important if you're trying to figure out what humans should do - or what you want an AGI to do.

0Kaj_Sotala16y

I was mainly thinking about values, but behavior is suspect as well. (Though I gather that some of the use of utility functions for modeling human behavior has been relatively successful in economics.)

7Matt_Simpson16y

I spent a minute trying to think of a reply arguing for utility functions as models of human values, but I think that's wrong. I'm really agnostic about the type of preference structure human values have, and I think I'm going to stop saying "utility function" and start saying "preferences" or the more awkward "something like a utility function" to indicate this agnosticism. When it comes to econ, utility theory is clearly a false model of human behavior (how many models aren't false?), but its simplicity is appealing. As mattnewport alludes to, alternative theories usually don't improve predictions enough to be worth the substantial increase in complexity they typically entail. At least that's my impression.

5thomblake16y

I'm wondering how a model can be "false". It seems like simply "bad" would be more appropriate. Perhaps if the model gets you less accurate results than some naive model, or guessing. I've been thinking a lot lately of treating ethical theories as models... I might have to write a paper on this, including some unpacking of "model". Perhaps I'll start with some top-level posts.

8Matt_Simpson16y

By a false model, all I mean is a model that isn't exactly the same as the reality it's supposed to model. It's probably a useless notion (except for maybe in theoretical physics?), but some people see textbook econ and think "people aren't rational, therefore textbook economics is wrong, therefore my favorite public policy will work." The last step isn't always there or just a single step, but it's typically the end result. I've gotten into the habit of making the "all models are false" point when discussing economic models just to combat this mindset. In general, it distresses me that so few people understand that scientists create maps, not exact replicas of the territory. Treating ethical theories as models seems so natural now that you mention it. We have some preference structure that we know very little about. What should we do? The same thing we did with all sorts of phenomena that we knew very little about - model it!

5bgrah44916y

"All models are wrong but some models are useful." - George E. P. Box

0Kaj_Sotala16y

Any relation to my thoughts of ethical theories as models? http://lesswrong.com/lw/18l/ethics_as_a_black_box_function/ http://lesswrong.com/lw/18l/ethics_as_a_black_box_function/14ha

0thomblake16y

Sure. The three-tier way of looking at it is interesting, but I'll definitely be approaching it from the perspective of someone taking a theoretical approach to the study of ethics. The end result, hopefully, will be something written for such people.

[-]Richard_Kennaway16y20

Utility functions are a good model to use if we're talking about designing an AI. We want an AI to be predictable, to have stable preferences, and do what we want.

Why would these desirable features be the result? It reads to me as if you're saying that this is a solution to the Friendly AI problem. Surely not?

0PhilGoetz14y

I am afraid he probably does. That's the Yudkowskian notion of "friendly". Not a very good word to describe it, IMHO.

[-]cousin_it16y20

There are many alternatives to expected utility if you want to model actual humans. For example, Kahneman and Tversky's prospect theory. The Wikipedia page for Expected utility hypothesis contains many useful links.

[-]Kaj_Sotala16y20

Question: do people think this post was too long? In the beginning, I thought that it would be a good idea to give a rough overview of DFT to give an idea of some of the ways by which pure utility functions could be made more reflective of actual human behavior. Near the end, though, I was starting to wonder if it would've been better to just sum it up in, say, three paragraphs.

3Dagon16y

I found it a bit long. I wish you'd done both: a short description followed by more detail.

2Nick_Tarleton16y

I do think that it's longer than necessary, and that the central point as stated in the title is far more important than the details of the seven theories. Still, I wish I could upvote it more than once, since that central point is really important. (Or at least it really annoys me when people talk as if humans did have utility functions.)

-2djcb16y

Agreed, but I'd say that people do have a utility function -- it's just that it may be so complex that it's better seen as a kind of metaphor than as a mathematical construct you can actually do something with. I share your annoyance -- there seems to be a bias among some to use maths-derived language where it is not very helpful.

-3Richard_Kennaway16y

If utility isn't a mathematical construct you can do something with, then it's an empty concept.

0djcb16y

You might still be able to determine a manageable utility function for a lower animal. For humans it's simply too complex -- at least in 2010, just like the function that predicts next week's weather.

1Richard_Kennaway16y

I will believe this only when I see it done. I do not expect to see it done, no matter how low the animal.

1Splat16y

I found the detail helpful. Even more detail might have been good, but you'd have had to write a sequence.

0Bo10201016y

Not too long. The buildup between the theories was key in keeping my attention.

0[anonymous]16y

I upvoted it because this really needs to be pointed out regularly, but I do think that it's too long, and that the descriptions of the seven theories add very little.

[-]Jonathan_Graehl16y10

What's the risk in using a more static view of utility or preference in computing CEV?

My initial thought: fine, some people will be less pleased at various points in the future than they would have been. But a single dominant FAI effectively determining our future is already a compromise from what people would most prefer.

[-]pjeby16y10

Curiously, these drawbacks appear to have a common theme; they all concern, one way or another, temporal aspects of decision making.

Ainslie and Powers are certainly two who've taken up this question; Ainslie from the perspective of discounted prediction, and Powers from the perspective of correcting time-averaged perceptions.

I think both are required to fully understand human decisionmaking. Powers fills in the gap of Ainslie's vague notion of "appetites", while Ainslie fills in for the lack of any sort of foresight or prediction in Powers' ... (read more)

0Richard_Kennaway16y

Presumably this Ainslie. But if Powers is William (PCT) Powers then I don't know what you're referring to by "correcting time-averaged perceptions".

[-]timtyler16y-10

It seems simple to convert any computable agent-based input-transform-output model into a utility-based model - provided you are allowed utility functions with Turing complete languages.

Simply wrap the I/O of the non-utility model, and then assign the (possibly compound) action the agent will actually take in each timestep utility 1 and assign all other actions a utility 0 - and then take the highest utility action in each timestep.

That neatly converts almost any practical agent model into a utility-based model.

So: there is nothing "wrong" with utility-based models. A good job too - they are economics 101.
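The wrapping described above can be sketched in a few lines. This is one reading of the construction, not timtyler's own code: `wrap_as_utility_maximiser` and the toy `thermostat` model are hypothetical names made up for the example.

```python
def wrap_as_utility_maximiser(model, possible_actions):
    """Return a policy that picks the argmax of a 0/1 utility built from `model`."""
    def policy(observation):
        # The wrapped model has to be run first to learn which action it takes.
        chosen = model(observation)
        # Utility 1 for the action the model actually outputs, 0 for all others.
        utility = {a: 1 if a == chosen else 0 for a in possible_actions}
        # "Maximise utility": select the highest-scoring action.
        return max(possible_actions, key=utility.get)
    return policy

def thermostat(temperature):
    """Toy non-utility agent: a simple input-transform-output rule."""
    return "heat" if temperature < 20 else "idle"
```

The wrapped policy reproduces the original model's behaviour exactly. The sketch also shows that the utility table cannot be filled in until `model(observation)` has been called, which is the point replies further down the thread take issue with.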

4Jonathan_Graehl16y

I don't think that's the right wrapping. Utilities are over outcomes, not decisions. Decisions change the distribution of outcomes but rarely force a single absolutely predictable outcome. At the very least, your outcome is contingent on other actors' unpredictable effects. Maybe you have some way of handling this in your wrapping; it's not clear to me. This reminds me: often it seems like people think they can negotiate outcomes by combining personal utility functions in some way. Your quirky utility function is just one example of how it's actually in general impossible to do so without normalizing and weighting in some fair way the components of each person's claimed utility.
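The distinction between utilities over outcomes and utilities over decisions can be illustrated with a small expected-utility calculation. Everything here is assumed for the example: the outcome names, the utility values, and the probabilities are invented, and `expected_utility` and `best_action` are hypothetical helpers.

```python
from fractions import Fraction

# Utilities attach to outcomes (assumed values for illustration).
UTILITY = {
    "dry": Fraction(1),
    "dry_but_encumbered": Fraction(4, 5),
    "wet": Fraction(0),
}

# Each action only shifts the distribution over outcomes (assumed probabilities).
OUTCOMES = {
    "take_umbrella": {"dry_but_encumbered": Fraction(1)},
    "leave_umbrella": {"dry": Fraction(7, 10), "wet": Fraction(3, 10)},
}

def expected_utility(action):
    """Probability-weighted utility of the outcomes an action can lead to."""
    return sum(p * UTILITY[outcome] for outcome, p in OUTCOMES[action].items())

def best_action():
    """Pick the action whose outcome distribution has the highest expected utility."""
    return max(OUTCOMES, key=expected_utility)
```

Here the action is scored only through the outcomes it makes more or less likely. Changing the rain probability changes the best action without touching `UTILITY`, which is what separates this from a table that scores actions directly.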

-1timtyler16y

Utilities are typically scalars calculated from sensory inputs and memories - which are the sum total of everything the agent knows at the time. Each utility is associated with one of the agent's possible actions at each moment. The outcome is that the agent performs the "best" action (according to the utility function) - and then the rest of the world responds to it according to physical law. The agent can only control its actions. Outcomes are determined from them by physics and the rest of the world. ...but an agent only takes one action at any moment (if you enumerate its possible actions appropriately). So this is a non-issue from the perspective of constructing a utility-based "wrapper".

0Jonathan_Graehl16y

I personally feel happy or sad about the present state of affairs, including expectation of future events ("Oh no, my parachute won't deploy! I sure am going to hit the ground fast."). I can call how satisfied I am with the current state of things as I perceive it "utility". Of course, by using that word, it's usually assumed that my preferences obey some axioms, e.g. von Neumann-Morgenstern, which I doubt your wrapping satisfies in any meaningful way. Perhaps there's some retrospective sense in which I'd talk about the true utility of the actual situation at the time (in hindsight I have a more accurate understanding of how things really were and what the consequences for me would be), but as for my current assessment it is indeed entirely a function of my present mental state (including perceptions and beliefs about the state of the universe salient to me). I think we agree on that. I'm still not entirely sure I understand the wrapping you described. It feels like it's too simple to be used for anything. Perhaps it's this: given the life story of some individual (call her Ray), you can vacuously (in hindsight) model her decisions with the following story: 1) Ray always acts so that the immediately resulting state of things has the highest expected utility. Ray can be thought of as moving through time and having a utility at each time, which must include some factor for her expectation of her future e.g. health, wealth, etc. 2) Ray is very stupid and forms some arbitrary belief about the result of her actions, expecting with 100% confidence that the predicted future of her life will come to pass. Her expectation in the next moment will usually turn out to revise many things she previously wrongly expected with certainty, i.e. she's not actually predicting the future exactly. 3) Whatever Ray believed the outcome would be at each choice, she assigned utility 1. To all other possibilities she assigned utility 0. That's the sort of fully-described scenario that

0timtyler16y

There's no point in discussing "utility maximisers" - rather than "expected utility maximisers"? I don't really agree - "utility maximisers" is a simple generalisation of the concept of "expected utility maximiser". Since there are very many ways of predicting the future, this seems like a useful abstraction to me. ...anyway, if you were wrapping a model of a human, the actions would clearly be based on predictions of future events. If you mean you want the prediction process to be abstracted out in the wrapper, obviously there is no easy way to do that. You could claim that a human - while a "utility maximiser" - was not clearly an "expected utility maximiser". My wrapper doesn't disprove such a claim. I generally think that the "expected utility maximiser" claim is highly appropriate for a human as well - but there is not such a neat demonstration of this.

0timtyler16y

I certainly did not intend any such implication. Which set of axioms is using the word "utility" supposed to imply? Perhaps check with the definition of "utility". It means something like "goodness" or "value". There isn't an obvious implication of any specific set of axioms.

-2Richard_Kennaway16y

This is backwards. Agents control their perceptions, not their actions. They vary their actions in such a manner as to produce the perceptions they desire. There is a causal path from action to perception outside the agent, and another from perception (and desired perception) to action inside the agent. It is only by mistakenly looking at those paths separately and ignoring their connection that one can maintain the stimulus-response model of an organism (whether of the behaviourist or cognitive type), whereby perceptions control actions. But the two are bound together in a loop, whose properties are completely different: actions control perceptions. The loop as a whole operates in such a way that the perception takes on whatever value the agent intends it to. The action varies all over the place, while the perception hardly changes. The agent controls its perceptions by means of its actions; the environment does not control the agent's actions by means of the perceptions it supplies.

3Cyan16y

"Control" is being used in two different senses in the above two quotes. In control theory parlance, timtyler is saying that actions are the manipulated variable, and you're saying that perceptions are the process variable.
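The claim that "the action varies all over the place, while the perception hardly changes" can be seen in a toy feedback loop. This is a minimal sketch under invented assumptions: the perception is the action plus a slowly drifting disturbance, and the agent uses a simple integrating controller. The function names and all parameter values are hypothetical.

```python
import math

def simulate(steps=500, reference=10.0, gain=0.5, amplitude=5.0):
    """Run a toy control loop; return the recorded actions and perceptions."""
    action = 0.0
    actions, perceptions = [], []
    for t in range(steps):
        # The environment adds a slowly varying disturbance the agent cannot see directly.
        disturbance = amplitude * math.sin(0.05 * t)
        perception = action + disturbance  # process variable
        actions.append(action)             # manipulated variable
        perceptions.append(perception)
        # Adjust the action in proportion to the error in the perception.
        action += gain * (reference - perception)
    return actions, perceptions

def spread(values):
    """Range covered by a sequence of values."""
    return max(values) - min(values)
```

Once the initial transient has died away, the perception stays close to the reference while the action swings widely to cancel the disturbance. In Cyan's terms, `action` is the manipulated variable and `perception` is the process variable.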

2timtyler16y

Um. Agents do control their actions. I am well aware of the perception-action feedback - but what does it have to do with this discussion?

-1Richard_Kennaway16y

It renders wrong the passage that I quoted above. You have described agents as choosing an outcome (from utility calculations, which I'd dispute, but that's not the point at issue here) deciding on an action which will produce that outcome, and emitting that action, whereupon the world then produces the chosen outcome. Agents, that is, in the grip of the planning fallacy. Planning plays a fairly limited role in human activity. An artificial agent designed to plan everything will do nothing useful. "No plan of battle survives contact with the enemy." "What you do changes who you are." "Life is what happens when you're making other plans." Etc.

0timtyler16y

I don't know what you are thinking - but it seems fairly probable that you are still misinterpreting me - since your first paragraph contains: ...which appears to me to have rather little to do with what I originally wrote. Rather, agents pick an action to execute, enumerate their possible actions, have a utility (1 or 0) assigned to each action by the I/O wrapper I described, select the highest utility action and then pass that on to the associated actuators. Notice the lack of mention of outcomes here - in contrast to your description. I stand by the passage that you quoted above, which you claim is wrong.

1Richard_Kennaway16y

In that case, I disagree even more. The perceived outcome is what matters to an agent. The actions it takes to get there have no utility attached to them; if utility is involved, it attaches to the perceived outcomes. I continue to be perplexed that you take seriously the epiphenomenal utility function you described in these words: and previously here. These functions require you to know what action the agent will take in order to assign it a utility. The agent is not using the utility to choose its action. The utility function plays no role in the agent's decision process.

0timtyler16y

The utility function determines what the agent does. It is the agent's utility function. Utilities are numbers. They are associated with actions - that association is what allows utility-based agents to choose between their possible actions. The actions produce outcomes - so, the utilities are also associated with the relevant outcomes.

0[anonymous]16y

The utility function determines what the agent does. It is the agent's utility function.

2[anonymous]16y

You get plenty of absurdities following this route. Like atoms are utility maximising agents that want to follow Brownian motion and are optimal!

2Mitchell_Porter16y

Or they want to move in straight lines forever but are suboptimal.

1timtyler16y

You mean like the principle of least action...? ...or like the maximum entropy principle...?

[-]Cyan16y100

Slapping the label "utility" on any quantity optimized in any situation adds zero content.

0timtyler16y

It is not supposed to. "Utility" in such contexts just means "that which is optimized". It is terminology. "That which is optimized" is a mouthful - "utility" is shorter.

1Cyan16y

There's already a word for that: "optimand". The latter is the better terminology because (i) science-y types familiar with the "-and" suffix will instantly understand it and (ii) it's not in a name collision with another concept. If "utility" is just terminology for "that which is optimized", then is vacuous: goal-directed agents attempt to optimize something by definition.

0timtyler16y

Right - but you can't say "expected optimand maximiser". There is a loooong history of using the term "utility" in this context in economics. Think you have better terminology? Go for it - but so far, I don't see much of a case.

0Cyan16y

That would be the "other concept" (link edited to point to specific subsection of linked article) referred to in the grandparent.

-10timtyler16y

-1timtyler16y

Not "vacuous" - true. We have people saying that utility-based frameworks are "harmful". That needs correcting, is all.

0Cyan16y

I suspect that by "utility-based frameworks" they mean something more specific than you do.

-2timtyler16y

Maybe - but if suspicions are all you have, then someone is not being clear - and I don't think it is me.

3Cyan16y

I find it hilarious that you think you're being perfectly clear and yet cannot be bothered to employ standard terminology.

-9timtyler16y

1Richard_Kennaway16y

This does not work. The trivial assignment of 1 to what happens and 0 to what does not happen is not a model of anything. A real utility model would enable you to evaluate the utility of various actions in order to predict which one will be performed. Your fake utility model requires you to know the action that was taken in order to evaluate its utility. It enables no predictions. It is not a model at all.

-7timtyler16y

1Johnicholas16y

Is this an argument in favor of using utility functions to model agents, or against?

0timtyler16y

It is just saying that you can do it - without much in the way of fuss or mess - contrary to the thesis of this post.

0Kaj_Sotala16y

Did you miss the second paragraph of the post?

-2timtyler16y

No, I didn't. My construction shows that the utility function need not be "insanely complex". Instead, a utility based model can be constructed that is only slightly more complex than the simplest possible model. It is partly this simplicity that makes the utility-based framework such an excellent general purpose model of goal-directed agents - including, of course, humans.

2Kaj_Sotala16y

Wait, do you mean that your construction is simply acting as a wrapper on some underlying model, and converting the outputs of that model into a different format? If that's what you mean, then well, sure. You could do that without noticeably increasing the complexity. But in that case the utility wrapping doesn't really give us any useful additional information, and it'd still be the underlying model we'd be mainly interested in.

-3timtyler16y

The outputs from the utility based model would be the same as from the model it was derived from - a bunch of actuator/motor outputs. The difference would be the utility-maximizing action "under the hood". Utility based models are most useful when applying general theorems - or comparing across architectures. For example when comparing the utility function of a human with that of a machine intelligence - or considering the "robustness" of the utility function to environmental perturbations. If you don't need a general-purpose model, then sure - use a specific one, if it suits your purposes. Please don't "bash" utility-based models, though. They are great! Bashers simply don't appreciate their virtues. There are a lot of utility bashers out there. They make a lot of noise - and AFAICS, it is all pointless and vacuous hot air. My hypothesis is that they think that their brain being a mechanism-like expected utility maximiser somehow diminishes their awe and majesty. It's the same thing that makes people believe in souls - just one step removed.

0Kaj_Sotala16y

I don't think I understand what you're trying to describe here. Could you give an example of a scenario where you usefully transform a model into a utility-based one the way you describe? I'm not bashing utility-based models, I'm quite aware of their good sides. I'm just saying they shouldn't be used universally and without criticism. That's not bashing any more than it's bashing to say that integrals aren't the most natural way to do matrix multiplication with.

0timtyler16y

Call the original model M. "Wrap" the model M - by preprocessing its sensory inputs and post-processing its motor outputs. Then, post-process M's motor outputs - by enumerating its possible actions at each moment, assign utility 1 to the action corresponding to the action M output, and assign utility 0 to all other actions. Then output the action with the highest utility. Check with your subject line. There are plenty of good reasons for applying utility functions to humans. A rather obvious one is figuring out your own utility function - in order to clarify your goals to yourself.

0Kaj_Sotala16y

Okay, I'm with you so far. But what I was actually asking for was an example of a scenario where this wrapping gives us some benefit that we wouldn't have otherwise. I don't think utility functions are a very good tool to use when seeking to clarify one's goals to oneself. Things like PJ Eby's writings have given me rather powerful insights into my goals, content which would be pointless to try to convert to the utility function framework.

0timtyler16y

Personally, I found thinking of myself as a utility maximiser enlightening. However YMMV.

0timtyler16y

My original comment on that topic was: Utility-based models are a general framework that can represent any computable intelligent agent. That is the benefit that you don't otherwise have. Utility-based models let you compare and contrast different agents - and different types of agent.

-3timtyler16y

Incidentally, I do not like writing "utility-based model" over and over again. These models should be called "utilitarian". We should hijack that term away from the ridiculous and useless definition used by the ethicists. They don't have the rights to this term.
