Terminal Values and Instrumental Values

[-]douglas18y30

The disticintion between instrumental values and terminal values is useful in thinking about political and economic issues (the 2 areas I’ve thought about so far…) I’m running into a problem with ‘terminal’ values, and I wonder if this isn’t typical. A terminal value implies the future in a way that an insturmental value does not. The instrumental value is for an action carried out in a finite time and leads to an outcome in the foreseeable future. A terminal value posits all futures—this is an endless recusive algorithm. (At least I don’t have an end to the future in my thinking now). When I ask myself, “How do I want things to be in the future?” I can carry this question out only so far, but my concept of the future goes well beyond any currently imaginable scenarios.

[-]igor218y40

Eliezer, what's this with your recent bias against boredom? Are you sure it's rational or efficient or even simply useful in any way to cultivate a constant (and possibly boring) battle against boredom?

[-]J_Thomas18y70

Douglas, in principle you ought to consider the entire state of the future universe when you set a terminal value. "I want my sister not to be killed in the next few weeks by flesh-eating bacteria" is a vague goal. "My sister not being killed by flesh-eating bacteria because the world fell into a black hole and tidal effects killed her" is not an adequate alternative.

In practice we set terminal values as if they're independent of everything else. I assume that giving my sister penicillin will not have any side effects I haven't considered. As far as I know she isn't allergic to penicillin. If it will bankrupt me then that's something I will consider. I assume the drug company is not sending its profits to support al qaeda unless somebody comes out and claims it is and the mass media take the claim seriously. I assume the drug company won't use my money to lobby for things I'd disapprove of. I completely ignore the fact that my sister's kidneys will remove the penicillin and she'll repeatedly dose her toilet with a dilute penicillin solution that will encourage the spread of penicillin-resistant bacteria. If I did think about that I might want her to save her urine so it could be treated to destroy the penicillin before it's thrown away.

In practice people think about what they want, and they think about important side effects they have learned to consider, and that's all. If we actually had a holistic view of things we would be very different people.

[-]Joshua_Fox18y00

What is the difference between moral terminal values and terminal values in general? At first glance, the former considers other beings, whereas the latter may only consider oneself -- can someone make this more precise?

[-]Peter_de_Blanc18y00

Huh? Considering only oneself is less general than considering everything.

[-]Silas18y30

n moral arguments, some disputes are about instrumental consequences, and some disputes are about terminal values. If your debating opponent says that banning guns will lead to lower crime, and you say that banning guns lead to higher crime, then you agree about a superior instrumental value (crime is bad), but you disagree about which intermediate events lead to which consequences. ... This important distinction often gets flushed down the toilet in angry arguments. People with factual disagreements and shared values, each decide that their debating opponents must be sociopaths.

I don't think it's possible to find a truer statement about political debates on the internet.

I've lost count of how many exchanges I've been in that have gone like this:

me: Plan X would better reduce environmental impact at lower cost. them: So, in other words, you think the whole global warming thing is a myth?

And then, of course, people sometimes can't get keep straight which consequence you're debating:

me: The method you've described does not show a viable way to produce intellectual works for-profit without IP. them: I disagree with your claim that no one has ever produced any intellectual works without IP protection.

[-]donjoe9y00

I'm noticing this very late, and I'm going to be off-topic, but I still have to stop to note that there's no such thing as "IP", not in actual laws (unless they've been infected by this term very recently and I just haven't found out about it). It's a bogus name lumping together things that the law does not lump together at all, a term invented purely for use in corporate propaganda, nothing more. https://www.gnu.org/philosophy/not-ipr.en.html

[-]Richard_Hollerith218y-10

I’m running into a problem with ‘terminal’ values . . .

A terminal value posits all futures — this is an endless recusive algorithm. (At least I don’t have an end to the future in my thinking now).

I believe this is a real problem, and my way of resolving it is to push my terminals values indefinitely far into the future, so for example in my system for valuing things, only causal chains of indefinite length have nonzero intrinsic importance or value. To read a fuller account, click on my name.

[-]Jef_Allbright18y00

I simply want to express my great appreciation for Eliezer's substantial efforts to share his observations of the journey, his willingness (in principle) to update his beliefs, and his presently ongoing integration of the epistemologically undeniable "subjective" with the hardcore reductionist "objective." I'm joyfully anticipating what comes next!

[+]david218y-60

[-]Joshua_Fox18y10

Peter de Blanc "Huh? Considering only oneself is less general than considering everything."

Certainly. But can you give a succinct way of distinguishing moral terminal values from other terminal values?

[-]Dojan14y-10

Define what you mean by "moral" and I think the answer will give itself.

[-]Peter_de_Blanc18y10

Certainly. But can you give a succinct way of distinguishing moral terminal values from other terminal values?

No. What other sorts of terminal values did you have in mind?

[-]Stan18y00

Good post!

[-]Joshua_Fox18y10

Peter de Blanc

No. What other sorts of terminal values [other than moral] did you have in mind?

Well, one could have a terminal value of making themselves happy at all costs, without any regard for whether it harms others. A sadist could have the terminal value of causing pain to others. I wouldn't call those moral. I'm interested in a succinct differentiation between moral and other terminal values.

[-]Peter_de_Blanc18y30

Josh, I would say that making oneself happy is a morality, and so is causing pain to others. It sure isn't our morality. If you could find a short definition of our morality, I would be totally amazed.

[-]douglas18y00

J Thomas--"in principle you ought to consider the entire state of the future universe when you set a terminal value." Yes, and in practice we don't. But as I look further into the future to see the consequences of my terminal value(s), that's when the trouble begins.

igor--I want to defend Eliezer's bias against boredom. It seems that many of the 'most moral' terminal values (total freedom, complete knowledge, endless bliss...) would end up in a condition of hideous boredom. Maybe that's why we don't achieve them.

Richard- I read your post. I agree with the conclusions to a large extent, but totally disagree with the premises. (For example- I think the only valueable thing is subjective experience) Isn't that amazing?

[-]George_Weinberg218y20

I have a question about this picture.

Imagine you have something like a chess playing program. It's got some sort of basic position evaluation function, then uses some sort of look ahead to assign values to the instrumental nodes based on the terminal nodes you anticipate along the path. But unless the game actually ends at the terminal node, it's only "terminal" in the sense that that's where you choose to stop calculating. There's nothing really special about them.

Human beings are different from the chess program in that for us the game never ends, there are no "true" terminal nodes. As you point out, we care what happens after we are dead. So wouldn't it be true that in a sense there's nothing but instrumental values, that a "terminal value" just means that a point at which we've chosen to stop calculating, rather than saying something about the situation itself?

[-]Liliet B6y10

I would propose an approximation of the system where each node has a terminal value of its own (which can be 0 for completely neutral nodes, but actually no they cannot - reinforcement mechanisms of our brain inevitably give something like 0.0001 because I heard someone say it was cool once or -0.002 because it reminds me of a sad event in my childhood)

As a simple example, consider eating food when hungry. You get a terminal value on eating food - the immediate satisfaction the brain releases in the form of chemicals as a response to recognition of the event, thanks to evolution - and an instrumental value on eating food, which is that you get to not starve for a while longer.

Now let's say that while you are a sentient optimization process that can reason over long projections of time, you are also a really simple one, and your network actually doesn't have any other terminal values than eating food, it's genuinely the only thing you care about. So when you calculate the instrumental value of eating food, you get only the sum of getting to eat more food in the future.

Let's say your confidence in getting to eat food next time after this one decreases with a steady rule. For example, p(i+1)=p(i)*0.5. If your confidence that you are eating food right now is 1, then your confidence that you'll get to eat again is 0.5, and your confidence that you'll get to eat the time after that is 0.25 and so on.

So the total instrumental value of eating food right now is limit of Sum(p(i) * T(food)) where i starts from 0 and approaches infinity (no I don't remember enough math to write this in symbols).

So the total total value of eating food is T(food) + Sum (p(i)*T(food)). It's always positive, because T(food) is positive and p(i) is positive and that's that. You'll never choose not to eat food you see in front of you, because there are no possible reasons for that in your value network.

Then let's add the concept of 'gross food', and for simplicity's sake ignore evolution and suggest that it exists as a totally arbitrary concept that is not actually connected to your expectation of survival after eating it. It's just kinda free floating - you like broccoli but don't like carrots, because your programmer was an asshole and entered those values into the system. Also for simplicity's sake, you're a pretty stupid reasoning process that doesn't actually anticipate seeing gross food in the future. In your calculation of instrumental value there's only T(food) which is positive, and T(this_food) which can be positive or negative depending on the specific food you're looking at appears ONLY while you're actually looking at it. If it's negative, you're surprised every time (but don't update your values because you're a really stupid sentient entity and don't have that function).

So now the value of eating food you see right now is T(this_food) + Sum (p(i)*T(food)). If T(this_food) is negative enough, you might choose to not eat food. Of course this assumes we're comparing to zero, ie you assume that if you don't eat right now you'll die immediately and also that's perfectly neutral and you don't have opinions on that (you only have opinions on eating food). If you don't eat the food you're looking at right now, you'll NEVER EAT AGAIN, but it might be that it's gross enough that it's worth it! More logically, you're comparing T(this_food) + Sum (p(i)*T(food)) to Sum(p(i)*T(food)) * p(not starving immediately). The outcome depends on how high the grossness of the food is and how high you evaluate p(not starving immediately) to be.

(If the food's even a little positive, or even just neutral, eating it wins every time, since p(not starving immediately) is <1 and not having it there wins automatically)

Note that the grossness of food and probability of starving are already not linear in how they correlate in their influence on the outcome. And that's just for the idiot AI that knows nothing except tasty food and gross food! And if we allow it to compute T(average_food) based on how much of what food we've given it, it might choose to starve rather than eat gross things it expects to eat in the future! Look, I've simulated willful suicide in all three simplifications so far! No wonder evolution didn't produce all that many organisms that could compute instrumental values.

Anyway, it gets more horrifically complex when you consider bigger goals. So our brain doesn't compute the whole Sum( Sum(p(i)*T(outcome(j)))) every time. It gets computed once and then stored as a quasi-terminal value instead. QT(outcome) = T(outcome) + Sum( Sum(p(i)*T(outcome(j)))), and it might get recomputed sometimes, but most of the time it doesn't. And recomputing it is what updating our beliefs must involve. For ALL outcomes linked to the update.

...Yeah, that tends to take a while.

[-]manuelg18y00

The very first "compilation" I would suggest to your choice system would be to calculate the "Expected Utility of Success" for each Action.

1) It is rational to be prejudiced against Actions with a large difference between their "Expected Utility of Success" and their "Expected Utility", even if that action might have the highest "Expected Utility". People with a low tolerance for risk (constitutionally) would find the possible downside of such actions unacceptable.

2) Knowing the "Expected Utility of Success" gives information for future planning if success is realized. If success might be "winning a Hummer SUV in a raffle in December", it would probably be irrational to construct a "too small" car port in November, even with success being non-certain.

Eliezer, I have a question.

In a simple model, how best to avoid the failure mode of taking a course of action with an unacceptable chance of leading to catastrophic failure? I am inclined to compute separately, for each action, its probability of leading to a catastrophic failure, and immediately exclude from further consideration those actions that cross a certain threshold.

Is this how you would proceed?

[-]Richard_Hollerith218y00

it's only "terminal" in the sense that that's where you choose to stop calculating..

No, the way Eliezer is using "terminal value", only the positions that are wins, losses or draws are terminal values for the chess-playing agent.

So wouldn't it be true that a "terminal value" just means a point at which we've chosen to stop calculating, rather than saying something about the situation itself?

Neither. A terminal value says something about the preferences of the intelligent agent.

And Eliezer asked us to imagine for a moment a hypothetical agent that never "stops calculating" until the rules of the game say the game is over. That is what the following text was for.

This is a mathematically simple sketch of a decision system. It is not an efficient way to compute decisions in the real world.

Suppose, for example, that you need a sequence of acts to carry out a plan? The formalism can easily represent this by letting each Action stand for a whole sequence. But this creates an exponentially large space, like the space of all sentences you can type in 100 letters. As a simple example, if one of the possible acts on the first turn is "Shoot my own foot off", a human planner will decide this is a bad idea generally - eliminate all sequences beginning with this action. But we've flattened this structure out of our representation. We don't have sequences of acts, just flat "actions".

So, yes, there are a few minor complications. Obviously so, or we'd just run out and build a real AI this way. In that sense, it's much the same as Bayesian probability theory itself.

But this is one of those times when it's a surprisingly good idea to consider the absurdly simple version before adding in any high-falutin' complications.

[-]Adirian18y00

Terminal values sound, essentially, like moral axioms - they are, after all, terminal. (If they had a basis in a specific future, it would be a question of what, specifically, about that future is appealing - and that quality would, in turn, become a new terminal value.) When treating morality as a logical system, it would simplify your language in explaining yourself somewhat, I think, to describe them as such - particularly since once you have done so, Godel's theorem goes a long way towards explaining why you can't rationalize a conceptual terminal value down any further. (They are very interesting axioms, since we can only consistently treat them conceptually and as variables, but nevertheless axiomatic in nature.)

Speaking of people coming to think of B as a good thing itself, many of those in favour of banning guns treat gun abolition as a terminal value in its own right - challenging those in favour of gun freedoms to prove that guns reduce crime, rather than asserting that they increase it. That is, they treat the abolition of guns as a positive thing in its own right, and only the improvement of another positive thing, say, by reducing crime, can balance the inherent evil of permitting people to own guns.

[-]g18y10

Adirian, re gun control, are you sure? I haven't studied people's attitudes to that issue, but what you describe sounds very strange and quite unlike the thought processes of the only pro-gun-control person whose thought processes I know really well, namely me. Allowing people to do things is (in itself) just about always positive; gun control is desirable (if it is) because of effects such as (allegedly) reducing gun crime, reducing accidents involving guns, making it less likely that people will think of killing people as a natural way to deal with conflicts, etc.

At least, that's how I think, and so far as I can tell from the few gun control discussions I've been in it's also how other people who are in favour of gun control think. I'd guess (though obviously I could be very wrong) that anyone who thinks of either gun abolition or gun ownership as a terminal value or disvalue is doing so as a cognitive shorthand, having already come to some strong opinion on the likely consequences of having more guns or fewer guns.

I'm sure there are plenty of people for whom guns produce a positive or negative visceral reaction (e.g., because they're seen as representing gratuitous violence, or freedom, or power over potential attackers, or something). I don't think that's the same thing as treating gun abolition or gun ownership as a terminal value; it's just another source of bias which, if they're wise, they'll try to overcome when thinking about the issue. (Few people are wise.)

It's hardly surprising if pro-gun-control people prefer to frame the issue by challenging their opponents to show that guns reduce crime, or if anti-gun-control people prefer to frame it by challenging theirs to show that guns increase crime. Everyone likes to put the burden of proof on their opponents. (Remark: "Burden of proof" is a rather silly phrase. What's really involved in saying that the burden of proof lies on the advocates of position X is the claim that the probability of X, prior to any nonobvious arguments that might be offered, is low. This is a nice example of something Eliezer has pointed out a few times: we tend to phrase what we say about reasoning in quasi-moral terms -- A "owes" B some evidence, B has "justified" her position, etc. -- when it is generally more useful to think in terms of probability-updating. Or belief-updating or something, if for some reason you don't like using the term "probability" for these things. End of remark.)

I don't understand your appeal to Goedel's theorem. Thinking of ethics as (like) a logical system and applying Goedel might lead to some conclusion like "There will always be situations for which your principles yield no clear answer", though actually I don't see why anyone would expect the conditions of Goedel's theorem to hold in this context so I'm not even convinced of that; but once you decide to think of terminal values as axioms you've already explained (kinda) "why you can't rationalize a conceptual terminal value down any further".

[-]Adirian18y00

It is a terminal value, however - you are regarding B as something other than B, something other than a stage from which to get to C. To exactly the ends you permit your visceral reaction to the guns themselves shape your opinion, you are treating the abolition or freedom to use guns as an ends, rather than a means. (To reduce crime or promote freedom generally, respectively.) Remember that morality itself is the use of bias - on deciding between two ethical structures which is the better based on subjectively defined values - so to say that something is bias in a moral framework means that it is being treated as a moral axiom, a terminal value.

Your commentary means one of two things - either your don't believe ethics is a rational system to which logic can be applied, or you don't accept that axioms have a place in ethics. Addressing the latter, it is certain that they do, as in any rational system. At the very least you must accept the axioms of definition - among which will be those axioms, those values, by which you judge the merits of any given situation or course of action. "Death is bad" can be an axiom or a derived value - but in order to be derived, you must posit an axiom by which it can be derived, say, that "Thinking is good," and then reason from there, by stating, for example, that death stops the process of thinking. Which applies no matter which direction you come from - from the side of the axioms, trying to discover what situations are best, or from the side of the derived values, trying to figure out what axioms led to their derivation.

Regarding the latter argument - then you take ethics itself as a thing which cannot further be defined, and so claim that morality is itself the terminal value, the axiom. Which I don't think would be your position.

[-]g18y10

I think there's a distinction that I'm trying to make and you're trying to elide, between actually thinking something's a terminal value and behaving sometimes as if it is. Obviously all of us, all of the time, have all sorts of things that we treat as values without thinking through their consequences, and typically they fluctuate according to things like how hungry we are. If all you meant is that some people have an "eww" reaction to guns then sure, I agree (though I find it odd that you chose to remark on that and not on the equally clear fact that some people have an "ooo" reaction to guns) and we're merely debating about words.

I have literally no idea on what basis you say that I either don't believe ethics is a rational system to which logic can be applied or don't accept that axioms have a place in ethics. For what it's worth, I think any given system of ethics (including the One True System Of Ethics if there is one) is a somewhat-rational system to which logic can be applied, and that there's a place for first principles, but that ethics isn't all that much like mathematical logic and that terms like "axiom" are liable to mislead. And I certainly don't think that any real person's ethics are derived from any manageable set of clearly statable axioms. (One can go the other way and find "axioms" that do a tolerable job of generating ethics, but that doesn't mean that those axioms actually did generate anyone's ethics.)

I also have no idea how you get from "axioms have no place in ethics" to "morality itself is a terminal value and an axiom". Unless all you mean is that whatever ethics anyone adopts, you can just take absolutely everything they think about right and wrong as axioms, which is possibly true but useless.

[-]Adirian18y00

Our behavior is nothing more than the expression of our thoughts. If we behave as though something is a terminal value - we are doing nothing more than expressing our intents and regards, which is to say, we THINK of it as a terminal value. There is no distinction between physical action and mental thought, or between what is in our heads and what comes out of our mouths - our mind moves our muscles, and our thoughts direct our voice. There is no "actual thought" and - what? Nonactual thought? As if your body operated of its own will, acting against what your actual thoughts are. The mind is responsible for what the body does. I'm not eluding the distinction. I'm denying it.

Your language explains precisely why I said that you don't believe ethics is rational. Somewhat-rational means irrational - that is, something that is rational only some of the time it is, in fact, irrational. Either a thing is rational, and logic can reasonably and consistently be applied to it - or it isn't. There isn't "mathematical logic" and then "otherwise logic." Many have been going to great lengths to explain, among other things, how Bayesian Reasoning - derived entirely from a pretty little formula which is quite mathematical - is meaningful in daily thinking. There is just logic. It's the same logic in mathematics as it is in philosophy. It is only the axioms - the definitions - which vary.

Because axioms exist where rationality begins - that is their purpose. They are the definitions, the borders, from which rationality starts.

Incidentally, if you don't think ethics is like mathematical logic, and you've been reading and agreeing with anything Eliezer posts on the subject, you should take a foundations of mathematics course. He is going to great lengths to describe ethics in a way that is extremely mathematical, if the language has been stripped away for legibility. (For example, he explains infinite recursion, rather than using the word.) Which may, of course, be why he avoids the use of the word "axiom," and instead simply explains it. I'd also recommend a classical philosophy course - because the very FIELD of ethics is derived from precisely the thing you are suggesting is ridiculous, the search for mathematical, for logical, expressions of morality. The root of which I think it is clear is the value code upon which an individual builds their morality - a thing without rational value in itself, save as a definition, save as an axiom.

That is almost what I meant by axioms. Values. Terminal values, specifically. And also the basis of any individual's ethical code. The entire point of my post was linguistics - hence the sentence that axioms would be a simpler way of explaining terminal values. What I meant by "morality itself is a terminal value and an axiom," however, is akin to what you suggest - it is that if morality is treated as an irrational entity, as you seem want to do, then yes, absolutely everything someone thinks about right and wrong must be treated in a rational ethical system as an axiom. Which is, as you say, possibly true - but thoroughly worthless.

[-]g18y00

Adirian, I have done post-doctoral research in pure mathematics; I don't need a course in the foundations of mathematics. But thanks for the suggestion. And I've read plenty of philosophy, and so far as I can judge I've understood it well. Of course none of that means that I'm not the idiot you clearly take me for, but as it happens I don't think I am :-).

I didn't say "eluding", I said "eliding". "Denying" is fine, too. I understand why you think the distinction is unreal. I disagree, not because I imagine that there's some fundamental discontinuity between thought and action, but (ironically, in view of the other stuff going on in this discussion) because our thoughts are logically (and often not quite so logically) connected to one another in ways that our actions and feelings aren't. If on one occasion my visceral response when thinking about guns is "eww, killing and violence and stuff" and on another it's "ooo, power and freedom and stuff" then I'm not guilty of any inconsistency, whereas anything that seriously purports to be a moral system rather than just a vague fog of preferences needs to choose, or at least to assign consistent weights to those considerations.

"Somewhat rational" does not mean "irrational". There are three different ways in which something can be said to be rational. (1) That reason can be applied to it. Duh, reason can be applied to everything. (2) That it's prosecuted by means of reason. Ethical thought sometimes proceeds by means of reason, and sometimes not. Hence, "somewhat rational". (3) That applying reason to it doesn't show up inconsistencies. Perhaps some people have (near enough) perfectly consistent ethical positions. Certainly most people don't. It's not unheard of for philosophers to advocate embracing that inconsistency. But generally there's some degree of consistency, and sufficiently gross inconsistencies can prompt revision. Hence, again, "somewhat rational".

I haven't suggested that looking for logical expressions of morality is "ridiculous", and once again I have literally no idea where you get thate idea from. You have repeatedly made claims about what I think and why, and you've been consistently wrong. You might want to reconsider whatever methods you're using for guessing. (I apologize if I've done likewise to you, though I don't think I have.)

[-]Paul_Gowder18y10

I feel like I ought to make my ritual attempt to fly the deontology flag on this site by reference to the possibility of attaching do/don't do evaluations directly to actions without reference to any outcome-evaluations at all.

Yet... the end of this post might actually be the most interesting argument I've heard in a while for the existence and permanence of what Rawls calls "the fact of reasonable pluralism" -- Elizer offers us the useful notion that interconnections between our values are so computationally messy that there is just no way to reconcile them all and come to agreement on actual social positions without artifically constraining the decision-space.

[-]michael_vassar318y00

I think that part of the problem here is that humans are actually structured in a manner that leads to instrumental values fairly easily becoming terminal values, especially in the case of intense instrumental values. Furthermore, we place a terminal value on this fact about ourselves, at least with regard to positive instrumentalities becoming positive terminal values. A big part of liberalism is essentially the decision not to let negative instrumental values become negative terminal values.

I have difficulty interpreting the following paragraphs, could you expand on them? Are you equating sociopathy with differing terminal values?

"In moral arguments, some disputes are about instrumental consequences, and some disputes are about terminal values. If your debating opponent says that banning guns will lead to lower crime, and you say that banning guns lead to higher crime, then you agree about a superior instrumental value (crime is bad), but you disagree about which intermediate events lead to which consequences. But I do not think an argument about female circumcision is really a factual argument about how to best achieve a shared value of treating women fairly or making them happy.

This important distinction often gets flushed down the toilet in angry arguments. People with factual disagreements and shared values, each decide that their debating opponents must be sociopaths. As if your hated enemy, gun control / rights advocates, really wanted to kill people, which should be implausible as realistic psychology."

[-]Kenny_Easwaran18y00

This post crystallizes some arguments I've been trying to make in decision theory. Certain representations of decision theory suggest that propositions (or "events") get values, but I've thought that only "states" (maximal descriptions of the complete state of the world) should get values. Their position, as far as I can tell, comes down to thinking that since every proposition has an expected value, we can use this as the value of the proposition. Thinking of this as a type error cuts right through that. (ps, I'm a philosopher too, arguing against some other philosophers - I don't think there's a disciplinary boundary issue here, though perhaps some disciplines are more likely to think of these things one way than another)

[-]J_Thomas18y00

Me: "in principle you ought to consider the entire state of the future universe when you set a terminal value."

Douglas: 'Yes, and in practice we don't. But as I look further into the future to see the consequences of my terminal value(s), that's when the trouble begins.'

Me: Doctor, it hurts when I do this.

Doctor: Then don't do that.

[-]Adirian18y00

""Somewhat rational" does not mean "irrational". There are three different ways in which something can be said to be rational. (1) That reason can be applied to it. Duh, reason can be applied to everything. (2) That it's prosecuted by means of reason. Ethical thought sometimes proceeds by means of reason, and sometimes not. Hence, "somewhat rational". (3) That applying reason to it doesn't show up inconsistencies. Perhaps some people have (near enough) perfectly consistent ethical positions. Certainly most people don't. It's not unheard of for philosophers to advocate embracing that inconsistency. But generally there's some degree of consistency, and sufficiently gross inconsistencies can prompt revision. Hence, again, "somewhat rational"."

The second is the only situation by which somewhat rational makes sense, but was not the context of the argument, which was, after all, about moral systems, and not moral thoughts - as for the third, inconsistent consistency, I think you will agree, is not consistency at all.

Since we're having a conversation, I might hazard a suggestion that it is what you are saying that is giving me the impressions of what it is you think. And I stated my reasons in each case why I thought you were thinking as you were - if you wish to address me, address the reasons I gave, so I might know in what way I am failing to understand what it is you are attempting to communicate.

[-]g18y00

Adirian, I've been trying to address the reasons you've given, in so far as you've given them. But for the most part what you've said about my opinions seems to consist of total non sequiturs, which doesn't give me much to work on in ways more productive than saying "whatever you're doing, you're getting this all wrong".

If you don't think it's reasonable to call a system of ethics "somewhat rational" when some of its bits are the way they are because of chains of reasoning and others aren't, and when the person or society whose system of ethics it is sometimes treats inconsistencies as meaning that revision is needed and sometimes not, then clearly we have a terminological disagreement. Fair enough.

[-]Vladimir_Nesov218y10

Since there are insanely many slightly different outcomes, terminal value is also too big to be considered. So it's useless to pose a question of making a difference between terminal values and instrumental values, since you can't reason about specific terminal values anyway. All things you can reason about are instrumental values.

[-]donjoe14y10

"instrumental values have some strange life of their own, even in a normative sense. That, once you say B is usually good because it leads to C, you've committed yourself to always try for B even in the absence of C. People make this kind of mistake in abstract philosophy"

... not to mention economics, where some people confuse the instrumental goal of "maximizing profit" with a terminal goal - instead of using something like "maximizing the total Human Quality of Life" - and end up opening car doors obsessively, all day every day, and preaching that everyone should do the same, no matter what pathological consequences that leads to or how far that takes them from any higher purpose they might agree with when pressed with enough "but why?" questions.

[-]Ronny Fernandez13y00

A real deadlock i have with using your algorithmic meta-ethics to think about object level ethics is that I don't know who's volition, or "should" label I should extrapolate from. It allows me to figure out what's right for me, and what's right for any group given certain shared extrapolated terminal values, but it doesn't tell me what to do when I am dealing with a population with none-converging extrapolations, or with someone that has different extrapolated values from me (hypothetically).

These individuals are rare, but they likely exist.

[-]Benya13y30

I'm writing to report that the following piece of writing just had a useful teaching effect on me:

If your debating opponent says that banning guns will lead to lower crime, and you say that banning guns lead to higher crime, then you agree about a superior instrumental value (crime is bad), but you disagree about which intermediate events lead to which consequences.

And a few paragraphs later:

If you say that you want to ban guns in order to reduce crime, it may take a moment to realize that "reducing crime" isn't a terminal value, it's a superior instrumental value with links to terminal values for human lives and human happinesses.

When re-reading this post just now (I hadn't read it in a long time), I did wonder "isn't that a typo?" when reading the first of these quotes. I did figure it out for myself, but (and I am embarrassed to admit this) it did take me a moment. I'm hoping the feeling of "ouch" when I did realize will help to make the lesson stick this time around.

I'm not sure whether the effect was intended (my guess is it was), but in any case, perhaps that's a useful data point on this kind of writing.

[-][anonymous]12y10

Where is justification for dividing values in these two categories?

[This comment is no longer endorsed by its author]Reply

[-]halcyon12y00

Does this pseudocode resemble any particular programming language?

[-]helicase11y10

This actually seems to be explicitly represented in (Mandarin) Chinese:
"须要" cannot be used with nouns, and prescribes that something should be done in a certain way (instrumental values)
"需要" is mostly for nouns, and indicates that you need it/should have it (terminal values)

Or, the difference between these two programming paradigms:

Imperative languages specify how you want the computer to do something (sometimes down to the machine code level)
Functional languages specify what kind of result you want (add these two sets of numbers together, I don't care how, multithread if appropriate)

[-]tdb8y01

"cognitive archaeology", tee hee. I thought he was making it up, it turns out he's just misapplying it.

https://en.wikipedia.org/wiki/Cognitive_archaeology

[-]FCCC5y*10

Damn. This formalism is similar to one I developed (except I did it much later) for determining when a goal is good or not. Did Eliezer come up with those four pieces himself or is this based on someone else's work?

[-]Adam Zerner3y20

Part of the problem, it seems to me, is that the human mind uses a rather ad-hoc system to keep track of its goals - it works, but not cleanly. English doesn't embody a sharp distinction between means and ends: "I want to save my sister's life" and "I want to administer penicillin to my sister" use the same word "want".

Very interesting thought.

[-]Nick M3y10

typo?

"In particularly, I've noticed people get confused when..."

should say 'particularly' or 'in particular'

[-]JJ Lee3y10

I can’t quite grasp the idea of having multiple terminal values, values other than happiness. It seems to me that the mother believes that if she DOESN’T save her child, the rest of her life her mental state will be poor, both from her son being dead but also the guilt of not saving him when she could. So, she is still picking between future mental states: either having a negative future mental state or having no mental state at all. She judges that the death of her son and the guilt that she would feel is great enough that her mental state will go down and never recover. This may not be TRUE, but she probably isn’t thinking very clearly. The point is she believes that she will never recover. So she decides that the better alternative is to end her life by sacrificing her son.

She could just commit suicide after her son dies to seemingly the same effect, but she probably believes that her ending moments will be happier if she actually is saving her son.

I’m uncertain in this, but I don’t understand how people just “gain” terminal values. Maybe I just have a bad picture of the human psyche but the explanation I provided makes more sense to me than “she just randomly had this specific terminal value for her son’s life”.

Another possible explanation for the mother’s actions are her acting irrationally. Humans are bad at imagining what death looks like. Even if she does not believe in an afterlife, she might still have this feeling that saving her son’s life will make her happy. In fact, the idea of people having inaccurate instrumental values is mentioned in this article. Perhaps the mother is so used to the instrumental value of “help my son” that she continues to help her son, even when it isn’t in her best interests.

I’m not sure, maybe I’m running this too far to the ground. Is there another good example of a person exhibiting behaviours seemingly going against their beliefs about their future mental state?

[This comment is no longer endorsed by its author]Reply

[+]Donatas Lučiūnas1y*-50

LESSWRONG
LW

LESSWRONG
LW

122

Terminal Values and Instrumental Values

122

122