(Maybe this point isn’t particularly important to the main discussion. I can’t tell, honestly!)

Yeah I think it's an irrelevant tangent where we're describing the same underlying process a bit differently, not really disagreeing.

Frankly, I think that it’s not as hard as some people make it out to be, to tell when it is necessary to tell the truth and when one should instead lie. Mostly, the right answer is obvious to everyone, and the debates, such as they are, mostly boil down to people trying to justify things that they know perfectly well cannot be justified.
... the arguments most often concern whether it’s permissible to lie. Note: not, “is it obligatory to

... (read 367 more words →)

Replying toA Dissent on Honesty

eva_10mo

A Dissent on Honesty

I consider you to be basically agreeing with me for 90% of what I intended and your disagreements for the other 10% to be the best written of any so far, and basically valid in all the places I'm not replying to it. I still have a few objections:

What if my highest value is getting a pretty girl with a country-sized dowry, while having not betrayed the Truth? ... In short, no, Rationality absolutely can be about both Winning and about The Truth.

I agree the utility function isn't up for grabs and that that is a coherent set of values to have, but I have this criticism that I want to make... (read 970 more words →)

Replying toA Dissent on Honesty

eva_10mo*

A Dissent on Honesty

I do have examples that motivated me to write this, but they're all examples where people are still strongly disagreeing about the object level of what happened, or possibly lying about how they disagree on the object level and pretending they're committed to honesty. I thought about putting them in the essay but decided it wouldn't be fair and I didn't want to distract my actual thesis into a case analysis of how maybe all my examples have a problem other than over-adherence to bad honesty norms. Should I put them in a comment? I'm genuinely unsure. I could probably DM you them if you really want?

EDIT: okay fine you win. The... (read more)

Replying toA Dissent on Honesty

eva_10mo

A Dissent on Honesty

If that was me not getting it than probably I am not going to get it and continuing to talk has deminishing returns, but I'll try to answer your other questions too and am happy to continue replying in what I hope comes across as mutual good faith.

What did you think about my objection to the Flynn example

It was incredibly cute but the kind of thing where people's individual results tend to vary wildly. I am glad you are happy even if it was achieved by a different policy, but I don't think any of my main claims are strongly undermined by it.

or the value of the rationalist community as something other

... (read 766 more words →)

Replying toA Dissent on Honesty

eva_10mo

A Dissent on Honesty

I enjoyed reading this reply, since it's exactly the position I'm dissenting against phrased perfectly to make the disagreements salient.

I don't know, he could say "Honestly, I enjoy designing widgets so much that others sometimes find it strange!" That would probably work fine. I think you can actually get a way with a bit more if you say honestly first and then are actually sincere. This would also signal social awareness.

I think this is what eliezer describes as "The code of literal truth only lets people navigate anything like ordinary social reality to the extent that they are very fast on their verbal feet". This reply works if you can come up... (read 604 more words →)

Replying toA Dissent on Honesty

eva_10mo

A Dissent on Honesty

I'm not so much of a pragmatist to say that you should run naked scams (for several reasons including that your students will notice when they don't become millionaires later and possibly be vengeful about it, other smarter people will notice the obviously fraudulent offer and assume everything else you offer is some kind of fraud too, the greater prevalence of fraud in the economy will make everyone less willing to buy anything ever until the whole economy stops, etc.) but I am enough of a pragmatist to demand actual reasons about why it isn't wise or why it will have negative consequence.

As for the landlord airbnb case, well I'd want to... (read more)

Replying toA Dissent on Honesty

eva_10mo

A Dissent on Honesty

I agree and have editted. Sorry for overstating the position here (though not in original post).

Replying toTwo New Newcomb Variants

eva_10mo

Two New Newcomb Variants

Are you sure at the critical point in the plan EDT really would choose to take randomly from the lighter pair than the heavier pair? She's already updated from knowing the weights of the pairs, and surely a random box from the more heavy pair has more money in expectation than a random box from the less heavy pair, the expected value of it is just half the total weight?
If it was a tie (as it certainly will be) it wouldn't matter. If there's not a tie somehow one Host made an impossible mistake: if she chooses from the lighter she can expect the Hosts mistake was not putting money in since... (read more)

Replying toA Dissent on Honesty

eva_10mo

A Dissent on Honesty

What would you say to the suggestion that rationalists ought to aspire to have the "optimal" standard of truthtelling, and that standard might well be higher or lower than what the average person is doing already (since there's no obvious reason why they'd be biased in a particular direction), and that we'd need empirical observation and seriously looking at the payoffs that exist to figure out approximately how readily to lie is the correct readiness to lie?

Replying toA Dissent on Honesty

eva_10mo

A Dissent on Honesty

I think a distinction can be made between the sort of news article that's putting a qualifier in a statement because they actually mean it, and are trying to make sure the typical reader notices the qualifier, and the sort putting "anonymous sources told us" in front of a claim that they're 99% sure is made up, and then doing whatever they can within the rules to sell it as true anyway, because they want their audience of rubes to believe it. The first guy isn't being technically truthist, they're being honest about a somewhat complicated claim. The second guy is no better than a journalist who'd outright lie to you in terms of whether it's useful to read what they write.

A Dissent on Honesty

eva_

10mo

Context

Disney's Tangled (2010) is a great movie. Spoilers if you haven't seen it.

The heroine, having been kidnapped at birth and raised in a tower, has never stepped foot outside. It follows, naturally, that she does not own a pair of shoes, and she is barefoot for the entire adventure. The movie contains multiple shots that focus at length on her toes. Things like that can have an outsized influence on a young mind, but that's Disney for you.

Anyway.

The male romantic lead goes by the name of "Flynn Rider." He is a dashingly handsome, swashbuckling rogue who was carefully crafted to be maximally appealing to women. He is the ideal male role model.... (read 4136 more words →)

https://www.yudbot.com/
Theory: LLMs are more resistent to hypnotism-style attacks when pretending to be Eliezer, because failed hypnotism attempts are more plausible and in-distribution, compared to when pretending to be LLMs where both prompt-injection attacks and actual prompt-updates seem like valid things that could happen and succeed.
If so, to make a more prompt-injection resistent mask, you need a prompt chosen to be maximally resistent to mind control, as chosen from the training data of all english literature, whatever that might be. The kind of entity that knows mind control attempts and hypnosis exist and may be attempted and is expecting it, but can still be persuaded by valid arguments to the highest degree the... (read more)

eva_'s Shortform

eva_

This is a special post for quick takes (aka "shortform"). Only the owner can create top-level comments.

Decision Theory but also Ghosts

eva_

Spoiler Warning: The Sixth Sense (1999) is a good movie. Watch it before reading this.

A much smaller eva once heard of Descartes' Cogito ergo sum as being the pinnacle of skepticism, and disagreed. "Why couldn't I doubt that? Maybe I just think 'I think' → 'I am' and actually it doesn't and I'm not." This might be relevant later.

FDT has some problems. It needs logical counterfactuals, including answers to questions that sound like "what would happen if these logically contradictory events co-occurred?" and there is in fact no such concept to point to. It needs logical causality, and logic does not actually have causality. It thinks it can control the past, despite... (read 2821 more words →)

Two New Newcomb Variants

eva_

Two Newcomb variants to add to the list of examples where optimal choice and optimal policy are diammetrically opposed. I don't think problems these exist anywhere else yet.

4 Boxes Problem

In a game show there are 4 transparent boxes in a row, each of which starts off with $1 inside. Before the human player enters, the show's 4 superintelligent hosts have a competition: they each have a box assigned to them, and they win if the human chooses their box. To motivate the human they cannot alter their box at all, but they are able to put $100 into any or all of the other 3 boxes.

Our first human is an adherent of... (read 774 more words →)

Adversarial Priors: Not Paying People to Lie to You

eva_

Reply to Desiderata for an Adversarial Prior

Assertion: An Ideal Agent never pays people to lie to them.

This seems sensible, only a very foolish person would knowingly incentivise dishonesty in others, but what does it actually mean in practice?

You can't use unverifiable information obtained from a single person or from a faction of possibly-conspiring people in any way that benefits that person or faction in the hypothetical where the information is false. Otherwise, they're incentivised to give you the unverifiable and false information to motivate you to do that, and so you'd be paying them to lie to you.
You can't use any information, even verifiable information, obtained from a single person or from

... (read 728 more words →)

Unpricable Information and Certificate Hell

eva_

Certificate: Expensive Shareable evidence of some desirable quality, typically of a marketable good. Increases the percieved value of that good to the market and so benefits the owner, but has a negative externality of lowering the percieved value of all similar uncertified goods.

Certificate Hell: The place Civilisation goes if it ignores this negative externality and spends all its money expensively proving how valuable all its goods are.

There's a common assumption in economics that it will be possible for the market to find some stable solution, containing a price for every single good in the market, that balances supply and demand and does not create opportunities for profit that are not already being... (read 1587 more words →)

Information Markets 2: Optimally Shaped Reward Bets

eva_

Sequel to Information Markets, which contains a long text outline of what I consider to be the correct alternative to Prediction Markets, which I don't like for a long list of reasons.

This post is intended to fill in the gap in the original regarding what an ideally shaped bet to prove a belief to someone wanting to buy true information efficiently actually looks like.

Aligned Incentives

Suppose the seller has Utilityfunction $U_{s}$ , and the community $U_{c}$ . Without the sellers information, the market will believe distribution $X$ , and so make decision $D$ . With the sellers information added, the market will believe distribution $X^{'}$ , and so make decision $D^{'}$ .

For the seller to have aligned incentives, we need:

$E [U_{s} | X^{'} + D^{'}] > E [U_{s} | X^{'} + D]$ (if the info is true,... (read 752 more words →)

Information Markets

eva_

Epistemic status: Exploratory. This post would be shorter and contain more math if I'd thought about it more first.

I don't like prediction markets, as currently described. They're similar to ordinary stock markets, which economists say are supposed to be efficient, but don't look it. People say "If you think the markets are wrong then you can bet on them and make a profit" but I don't actually expect that to be true, because markets don't only contain sincere attempts to optimise prices. They also contain schemes to extract money from others without adding information, or to cheat in forbidden ways without getting caught, and similar nonsense, and so honest trading strategies have... (read 3502 more words →)

LESSWRONG
LW

LESSWRONG
LW

eva_

Information Markets

A Dissent on Honesty

Two New Newcomb Variants

Adversarial Priors: Not Paying People to Lie to You

eva_

A Dissent on Honesty

eva_'s Shortform

Decision Theory but also Ghosts

Two New Newcomb Variants

Adversarial Priors: Not Paying People to Lie to You

Unpricable Information and Certificate Hell

Information Markets 2: Optimally Shaped Reward Bets

eva_

Information Markets

A Dissent on Honesty

Two New Newcomb Variants

Adversarial Priors: Not Paying People to Lie to You

eva_

A Dissent on Honesty

eva_'s Shortform

Decision Theory but also Ghosts

Two New Newcomb Variants

Adversarial Priors: Not Paying People to Lie to You

Unpricable Information and Certificate Hell

Information Markets 2: Optimally Shaped Reward Bets

Context

4 Boxes Problem

Aligned Incentives