"Waymo’s speed disadvantage does add up on longer trips, like this comparison showing Waymo 50 minutes slower than an Uber if traversing the entire length of the covered area down to Burlingame, due the whole ‘always obey all the traffic laws and rules of the road and almost never have an accident’ thing."
Traffic laws and rules of the road are an effect, but the big reason here is that Waymo doesn't currently go on the highway/freeway (I believe they are approved to, but haven't yet done so). Like in the LA case that went around before, the slow down is that they are taking surface streets the whole way. This is probably also part of why they don't serve SFO, while you can get to SFO by surface streets it's exceedingly awkward to do so from most places (you'd think it would be the chaos of the curb drop-off / pick-up and luggage handling, but for the former SFO already dedicates part of the main parking garage to ride-share etc. which would be easy to man with a fixer and prob worthwhile to do so).
"Cooking scales well, but for single people the economics are remarkably bad. Stop telling single people not to order delivery."
It does if you cook one meal at a time. The sweet spot for cooking, depending on the dish, is probably 4-8 people most of the time. Any more and you have to start doing multiple batches or get larger cookware and any less and you aren't amortizing the fixed costs. So the home cooking route is to cook 4 portions and save 3 for later. Mix and match a bit between sides and mains and you can have quite a bit of variety (most foods freeze quite well too) still. I'm skeptical delivery can every compete on cost or takeout on time outside of maybe NYC and a few similar places, but if you have the money spend it on whatever you like.
Last time I was in the US, I often relied on the following pasta recipe to prepare multiple meals at once:
- fettuccine
- cook bacon in a pot with just a drip of olive oil (no other additions)
- cook pre-sliced mushrooms in another pot (no other additions)
- throw everything into the drained fettuccine in a pot
- add a few cheese singles and mix everything until they are melt and blended
The thing about this recipe is that I've observed it keeps the original flavor if kept in the fridge and re-heated in the microwave, I kept it in the fridge up to 10 days. Many other pasta sauces alter their flavor in a bad way if so processed, and freezing was even worse in my experience.
When someone makes a list of claims and some of the words are clickable, I expect the link to lead to some evidence for the claim, or at least a very clear example if the claim is based on common knowledge. Instead, the claim "Europe’s war against air conditioning continues to be truly absurd." does not lead to anything that would illustrate a "war" (not even a metaphorical war) against air conditioning, it does not show anything about "Europe’s" current policy at all but instead just leads to a tweet by Rob Wiblin who says "European countries with hot summers should have AC in most buildings, and we should install solar panels that supply the necessary electricity just fine on hot and sunny days. It's crazy AC is uncommon in the UK — doubly so in France." (It is unclear what is controversial in the first sentence. The second sentence may be based on an accurate description of the situation, but it is unclear in the tweet.), who retweets a statement by French nationalist M. Le Pen about air conditioning. That statemen is a mix of some policy intention and claims about French "leaders" and "elites".
I think the old norms according to which politics content was seen as potentially problematic and therefore should at least be based on good epistemics had their advantages. But maybe I misremember those times.
I too have grown increasingly skeptical that meta-analysis in its typical form does anything all that useful.
Unfortunately, people can be bad at understanding meta-analyses. If you have studies that disagree like 50/50, it's not necessarily true that half did something wrong. It's possible there is a legitimate hidden moderator that changes the effect of the variable (probably being revealed by the meta-analysis but not picked up sufficiently by popular reporting). Or even revealing that half have a fatal flaw would be a contribution of the meta-analysis! Sometimes the effects are not totally comparable, in which case that should either be modeled/adjusted, or excluded (probably already considered by the meta-analyst [though the salient counter-examples where the researcher(s) screw up confirm that not all papers are perfect, though this is not unique to meta-analyses]). It is indeed problematic when a meta-analysis concludes the average effect is the effect, particularly with a bimodal distribution of effect sizes (would be crazy to conclude that in that case!).
The alternative "look at the studies individually" suffers from all the same issues: garbage in, garbage out; hidden moderators; non-comparability; over-concluding from an average. At least meta-analysis brings some systematic thinking to the evaluation of the literature. A strong meta-analysis interacts with these issues and hopefully avoids the pitfalls because it does what a weak meta-analysis does not - it explains inclusion criteria, considers variability, and explains differences rather than just reporting a mean.
Agree that good meta analyses are good, and don't get these things wrong.
If only most papers, including meta analyses, didn't suck. Alas.
this reflects consumer demand for more features and not caring much about durability
And because durability is largely illegible. And even where it is legible, it is largely stochastic and untethered to relative cost of repair vs. replacement.
When people tell you who they are, believe them (with obvious exceptions). In particular, if they explicitly describe themselves as evil, or demonic, or uses other similar terms, definitely believe them.
These replies seems correct:
...but some people in the thread clearly don't get it.
Cooking scales well, but for single people the economics are remarkably bad. Stop telling single people not to order delivery.
Probably works better when you have a big freezer, but yeah.
Derek Thompson looks back at how poor we were in 1776. We are, by comparison, unfathomably rich. George Washington spent $15k/year in today’s dollars on candles to keep the lights on. Heat was so expensive Jefferson couldn’t write in winter because his ink would freeze.
I agree with the overall point and I could believe the Washington statistic. But it rarely freezes outdoors in Monticello, and he could have just moved his desk closer to the fireplace.
using urban firewood prices for what was mostly rural consumption.
Bernard Stanford: If you value all informal economy firewood production at market price, and then compare it to extant GDP estimates, you need to make sure ALL informal economic production is similarly valued, or you’ll massively overestimate firewood’s share of GDP. Seems to be what happened!
The approach seems to have a serious flaw in assuming that THIS sector of the informal economy was underestimated, but surely not any OTHER sector.
There are two different problems being raised here.
The outside quote is Zvi pointing out the issue of what price is used to multiply by quantity output to get value estimates. This is a good point because we want to multiply by a representative price, not a too-high price.
The inside quote is a related but distinct issue, which I disagree with. Bernard is arguing that if you are going to include an informal element in GDP, you have to include all informal elements. Sure, it's the ideal, but 1) that's not what's going on here, and 2) even if it were, I'd be fine with it. First, firewood is already semi-included in GDP, this process is just attempting to get a better estimate. Imagine BLS sends people out to grocery stores to price milk and estimate milk consumption, but they only end up getting urban prices and only end up observing 1/3 of the milk consumed in the US. It'd actually be great to add back in the missing consumption. How do you price it? I guess with the only price you have (and Zvi is right that it's a problematic price to use). Second, imagine BLS sends people out to start to price household dishwashing and estimate household dishwashing service provision, but they don't bother to collect household vacuuming. It'd actually still be great to do that, though it'd be difficult to pull off.
So, I think what the authors did is actually a nice exercise, but the price issue means 28% could be an overestimate. How bad of one? The paper discusses Figure A11, recreating Barger's (1955) estimates (though it only goes back to 1869), so what was the GDP share using that one? In the 1870s, the NBER paper in question reports a share of about 8% (on $500M of firewood). Barger reports about $350M of firewood (because of the lower price index), which would imply a share of about 5.4%. If the 28% was a 50% overestimate, that would mean the "true" share in the 1830s (if you assume Barger's price index is "true") could be more like 18.7%. That's still quite high and far from "half an order of magnitude" overestimate, more like 1/5 of one. Is it absurd? Figure 6 here [pdf via academia.edu] has British and Swedish shares at 20% and 40%, respectively.
Welcome to the monthly roundup of things that don’t fit into other categories and don’t rise to the level of their own posts.
Bad News
When people tell you who they are, believe them (with obvious exceptions). In particular, if they explicitly describe themselves as evil, or demonic, or uses other similar terms, definitely believe them.
Did you know 67% of all college students bet on sports? That’s a group that is majority female, so that statistic is wild. This is in the context of Ron Yorko developing a class in sports betting awareness from a neuroscience perspective for CMU freshman.
Cooking scales well, but for single people the economics are remarkably bad. Stop telling single people not to order delivery.
Chase Sapphire Reserve annual fee increases to $795 from $550, you get a $300 travel credit. That should cut down considerably the number of people who get value here.
Claim that AirBnB and Vrbo are headed downhill, which directionally matches my experiences, although it’s obviously not as bad as this portrays things. Revealed preference is that there was a period when I defaulted to an AirBnB, and I have definitely switched back to renting hotel rooms in most situations.
More cautionary tales of AirBnB, I continue to update towards using hotels unless I have strong need of something bigger.
That claim that chess grandmasters burn 6000 calories a day during intense play? Not only is it Obvious Nonsense, the story of how it got repeated a lot is even stupider than you think.
Europe’s war against air conditioning continues to be truly absurd. It’s even more absurd considering how well it lines up with solar power. If the solar panels can’t produce the energy to run the air conditioning, then you didn’t need to turn it on. It also is the obvious response any time someone says ‘their lived experiences are better.’
This does seem like a good heuristic:
There are cases where the person is actually asking nicely and they clearly are hoping you tell them yes, as in ‘have you done this procedure before?’ or ‘are you familiar with [X] method?’ That’s different.
When someone says this in a way that clearly implies that they think the answer is no and they are using that to dismiss you, then yeah, doesn’t matter, it will change nothing, and you should likely write them off whether or not you can answer yes.
Government Working
I am doing my best to avoid commenting on politics. As usual my lack of comment on other fronts should not be taken to mean I lack strong opinions on them. Yet sometimes, things reach a point where I cannot fail to point them out.
If you are looking to avoid such things, I have split out this section, so you can skip it.
FDA has a new pilot program that can slash FDA’s drug review time from 10-12 months to 1-2 months, by evaluating things along the way during clinical trials, which was what they did during Operation Warp Speed. That would straight up accelerate the deployment of such drugs by most of a year. It would also greatly encourage future investment, not only is the process faster the drug companies know where they are at throughout and can adjust accordingly. The term ‘AI’ does not once appear in the report.
Which demands the obvious question, why the hell are we only doing this now?
As per Levels of Friction, yes, you should have anticipated the results we got when moving things into Tier 1 where they are legal and ubiquitous without limiting principles:
It doesn’t have to be game designers. Ordinary capitalists should be fully equipped to reason this out.
Click-to-cancel, which I agree with Sheel Mohnot was by far the best thing Lina Khan did at the FTC, has been stopped by a panel of three Republican judges so the industry could get ‘more time and process’ to explain why they opposed the rule. The story here about his failure to cancel a gym membership is rage inducing and completely standard.
Sir, when she had a job you complained, how you complain again, please make up your goddamned mind. Also offer me a click so I can cancel.
It seems the UK government literally got an injunction forbidding the press from talking about what the government was doing with respect to Afghan migrants? Regardless of what you think of what was being done, forbidding the press from discussing it feels like a Declaration of Independence, time-to-start-over-with-a-new-government level of violation of basic principles of freedom?
Jones Act Watch
Balsa Research can’t keep up, as the House suddenly and overwhelmingly passed the American Cargo for American Ships Act that would require 100% of transportation project [DOT related] materials transported over oceans to go on US ships. So we’re going to make it a lot more expensive to use ships for projects that are ‘procured, furnished or financed by’ the DOT. No, this is not ‘worse than the Jones Act,’ the blast radius is far smaller and it only applies the flagging requirement, but this plus the Jones Act is worse than only the Jones Act.
That’s in addition the cataclysmic regulations we helped fight back against earlier.
Meanwhile, you know how the Jones Act was supposed to promote American shipbuilding?
Instead, the beneficiaries of the Jones Act, via owning existing Jones Act ships, have enlisted the government to actively sabotage American shipbuilding even further.
As in, and I quote: “Some Jones Act companies now expressing fear that building new ships could devalue their current fleets.”
I’d say ‘mask off moment,’ but it’s not. What mask?
That’s right. We literally got an offer to invest $20 billion in US shipbuilding, and the Trump administration said no, we won’t support that. No non-US-built ships can be used, and also no US ships can be built. Also tariffs on things like steel.
So, no ships, then. Except the handful that exist, which get to profit.
The corruption is staggering. It can always get worse.
Prize Fund
The answer is (as always) transaction costs.
At one point, I coordinated with Paul Christiano to put out an AI Alignment Prize. On a per dollar basis, I am confident we generated and highlighted excellent work. However, we also had to put in a ton of time evaluating all the entries. A lot of other would-be prizes will have a similar problem, and once you announce a prize people can get very persnickety about details.
Also you have to use part of your social bandwidth to communicate the prize.
However, yes, you should be doing it more. And I should be doing it more.
One cool variant is to create a Manifold market on ‘will [X] happen?’ where the [X] is something you want to happen and that someone can go make happen. The absolute value of the prize is low but in my experience this is highly motivating, and for example got my hands on a Switch 2. There is tons of alpha in offering a symbolic but real prize that shows you care at all.
Dining Out
Somehow you can still get 5 million views by posting that you were stupid enough to use Uber Eats in New York City instead of Caviar, then counting sales tax and the tip as part of the delivery fee and saying you paid $30 for delivery.
By comparison, on Caviar, I tested out a similar sized order, subtotal was $94, sales tax was over $8 and total charge was $109.03. I mean, you can be an idiot and press the Pay More button if you want, I suppose.
Explanation of why all airport restaurants get similar Yelp ratings, they’re all run by the same group of people. Except no, that still makes no sense, because the food still mostly tastes the same as it does on the outside. If you go to Shake Shack you still get a Shake Shack burger, you go to Dunkin Donuts you get their donuts, and so on. So yes, there is a bit of equalization in service, perhaps, but that doesn’t explain it? I know that I will almost always make the same choices at airport restaurants I would at a similar outside food court.
So I think this is still a mystery, that likely has more to do with how people rate restaurants when they are being charged a lot and are travelling? As in, you’re always happy to eat something at all, always frustrated by the price and options and conditions, so you end up around 2.5-3 star averages almost no matter what? I guess?
A ghost kitchen Xi’an Famous Foods is doing bonkers business in Alexandria. Xi’an Famous Foods is quite good, I recommend the Lamb Noodles like everyone else does, but you have to eat it right away (and also I made the mistake of looking, and it is a lot of calories, so I don’t do it often). This isn’t only me, they’ve been consistent about insisting on the eating right away part, which applies here way more than usual. I worry many customers aren’t getting the full experience.
Joe Weisenthal says all cities have good food now. Nate Silver calls out Boston as being somewhat lacking among the top metro areas as do many others, which he attributes to it being a college town, and many others question the premise.
My understanding can be summarized this way:
While I Cannot Condone This
Yes you can take a systematic approach to anything and very often you should do it.
Perhaps the supposedly ‘normal’ humans should also be doing more systematic study of how to do you do, fellow humans? They seem to have skipped over some things.
The parties like this were taken from me at the time (in the 90s and early 00s) and I never experienced them, but I did understand they existed and I was sad about this.
Cate Hall comes out against the concept of willpower. I see this post as correctly attacking people who simply tell you to Use The Try Harder and think doing hard things through ‘sheer willpower’ is virtuous and those who don’t have it deserve to suffer or anything like that.
I strongly agree that the best way to get good results is to set things up to be easy, and that anyone who says any form of ‘you don’t need [X] you only need willpower’ is usually the asshole in a given situation. Engineering solutions are great.
I still think the post goes too far in treating willpower as a non-useful concept. Willpower is a highly useful handle for an important tool that one can and should cultivate and learn how to use wisely. You can also choose to call it something else, if you prefer.
Cate Hall asks ‘are you stuck in movie logic?’ in particular highlighting one form of Idiot Plot where the whole problem could be cleared up in five minutes if people would simply talk to each other and Say The Thing rather than repeatedly and conspicuously dancing around it and not Saying The Thing. As she says, there is a time and place for not Saying The Thing but on the margin you should say it.
Technically when you register for the LSAT you are representing and affirming that you are doing so for the sole purpose of seeking admission to law school, wait what?
I too am strongly against lying but there are exceptions and this is one of them. Technical attestations with no legitimate purpose or ability to be enforced, and no person who is relying on them in any way to be accurate, don’t count.
What are protests actually for? Ben Landau-Taylor asserts that if you want your protest to exert any political pressure, this requires that you demonstrate the capacity for violence (ideally while carefully avoiding any actual violence). Otherwise, no the state won’t respect your demonstration of support, so the purpose of the protest is as a pep rally for the participants (and I would add a signal to others in other ways, which can then indirectly pressure the state in various ways), which can be worthwhile but should not be confused with political pressure.
I think this model goes too far but is essentially correct, with the caveat that you can also credibly threaten things other than violence, but you have to credibly threaten something.
Most of this Scott Sumner post is about underconfidence in monetary policy, where I find little to disagree with, but what I want to talk about here are ChatGPT’s examples of underconfidence:
These are terrible examples.
Clark Kent does not have an underconfidence problem with his powers at any point that I can see. He has a lack of control problem, which is a very real issue. He does have regular person underconfidence problems as Clark Kent, but that’s different.
Peter Parker, in every example I have seen, is initially radically reckless and overconfident. He does things that risk getting him killed if he lacks Required Secondary Powers he has not yet verified.
Have appliances declined in durability? The answer is yes, but only modestly, this reflects consumer demand for more features and not caring much about durability, and also largely reflects government requirements for water and energy efficiency. Besides, prices have declined a lot, so it is fine.
Something to watch out for:
Good News, Everyone
Argentina grows at 7.6% YoY in Q2, exceeding expectations. Economists surveyed by Arentina’’s central bank in May expected 5.2% annual growth in 2025. Also note from March 31 that poverty has fallen sharply from 53% to 38%.
TSA stops requiring us to take off our shoes even if we didn’t pay for TSA Pre.
A fungus was discovered that can eat even hard to break down plastics, so you could plausibly throw it into a landfill and it would do the rest? It is rarely that simple and there are obvious things to check first, but yes we do get bailed out like this every so often. Also note that if you build superintelligence, things like this will tend to happen a lot more often in a variety of ways.
John Wentworth advises us to centrally seek wizard power, the ability and skills to do and create things yourself, rather than king power, which is dominance and bargaining power and directing others, mostly in ways that can only get you what money can buy and involves you marching in front of parades thinking you decide where the parade goes. This allowed him to reorient his own drives in this way.
He also highlights a comment from there noting that rationalist types can present depression very differently than others, in a comment I’m quoting in full:
Scott Aaronson officially admits to being a rationalist.
Polymarket is really hitting the big time, with more visits than FanDuel or DraftKings.
The true gambling kings do remain Robinhood and Coinbase.
Cracking down on alcohol in the USSR in the 1984-1990 period made big differences, and they mostly seem to be clear improvements. Note that divorce rates went up.
Derek Thompson looks back at how poor we were in 1776. We are, by comparison, unfathomably rich. George Washington spent $15k/year in today’s dollars on candles to keep the lights on. Heat was so expensive Jefferson couldn’t write in winter because his ink would freeze.
Religious attendance by the young is way up in the UK, as in by a factor of four or more, and France’s Catholic Church did more baptisms this year (17k) then they have in 20 years, in what some call The Quiet Revival. American bible sales are up 22%. I have seen similar statistics in a few places. What I have yet to see is an explanation of why this is happening, but also I have never seen a satisfying explanation of past cycles of religious revival.
Opportunity Knocks
OpenPhil is hiring, including for their new Abundance and Growth team (Generalist JD, Specialist JD).
Antisocial Media
I strongly endorse this, although I doubt we’ll get it. AI parsing for topics is a bonus.
Indeed. I can think of a number of accounts where I highly value their opinions on [X], usually things like games or AI highly relevant to my interest, and very much do not value their comments on [Y], often political but sometimes simply something boring.
This is not a coincidence because nothing is ever a coincidence, also obviously, although the impact here is dramatically overstated of course:
One did not simply post to classic LessWrong. It was so intimidating that I at the time was worried to post there, which I shouldn’t have been, but if you weren’t ready the response was super harsh, you would be effectively shown the door. There was tons of filtering. Even if you weren’t shown the door, you wouldn’t get to be a true part of the community, although you could still have for example gotten an early line on Bitcoin.
There were also strong attractors. If you were the type of person who could be there, there was a substantial chance you ended up there. It’s true that there is a ‘invisible graveyard’ of other LessWrong people that would have been right at home and never found it, but I don’t think it is that much larger than the actual group. Same with MIRI.
Going forward, for future groups, I expect the effects will be similar, so long as it remains humans who are shaping our future. Let’s hope that lasts.
Similarweb says Threads now has slightly more monthly active users than Twitter? But it also says Twitter has about 35 times as much web traffic. I don’t buy this?
I wonder about this situation, and what is really going on.
As in, a good portion of those who see Brah’s post are going to notice that Freiman’s post saying ‘constant 2022 dollars’ right there in large friendly letters. I do think the true situation is more complicated than the chart suggests, but yes people are getting richer by these measures.
Ryx Commar notes a problem, and correctly identifies it as a sorting problem, not an average quality issue:
The solution is to rely instead on other markers. Stick almost entirely to curated following-or-listed-only feeds (did you know even YouTube, Facebook, Instagram and TikTok let you do this, if you dare go to all such places?), except where algorithms are very good.
Social media likes and views are still a very rich indicator, but you have to control for circumstances, starting with the account posting but also the subject and the way it is constructed. With enough skill you can still get benefit out of them but it’s tricky.
Technology Advances
Apple made the stop button on the alarm small because if you don’t force people to wake up to find the button they oversleep 30% more, whereas an easy to find snooze button only buys you a few minutes.
In ‘it’s worse than you know’ news:
That is how badly it performs in a non-adversarial situation. This is how your age verification works when everyone is scanning their actual faces with no attempt to fool the system. If you’re facing kids who want to fool the system? I mean just give up, even if you mysteriously ruled out the ‘hey other kid can you do the verification for me’ strategy. Sonnet thought you could probably just literally use a fake mustache.
I do not understand this either: Why do all laptops, or at least all not-dirt-cheap ones, not have the same connectivity features as smartphones?
A Patrick McKenzie tale of how to allow kids to make phone calls on their Amazon Fire tablets, which for them required multiple non-intuitive steps.
I thought I had a lot of open tabs. I counted 139 including all my tab groups, of which probably half are actually necessary. I was incorrect, this does not appear to be ‘a lot of open tabs.’
Also, really, Safari?
For Science!
I too have grown increasingly skeptical that meta-analysis in its typical form does anything all that useful.
Tracing Woods here makes a similar argument for education meta-studies in particular, that the different studies have dramatically different setups and criteria, and you need to look at the studies individually if you want to learn anything. I buy it.
If you post a graph showing a small effect, but it is zoomed in, people get the wrong idea, so try not to do that when this would be a problem.
Various Things On Fire
Firewood alone was supposedly 28% of GDP. Except wait, does that actually make any sense? A quarter of economic activity was firewood? We should believe that because a paper said so?
Actually, calling out absurd numbers as absurd is The Way.
Spending 10% of labor on firewood is still a lot. Firewood was a huge deal. But 10% is very different from 30%, and makes a lot more sense. If it was 30%, that would have it be approaching farming in terms of how much labor goes into it even directly for the farmers, and this simply does not intuitively make any sense.
In theory if any given necessity gets sufficiently difficult to obtain it can become an arbitrarily large cost. But that definitely was not the way to bet, and indeed we have an explanation for what was going on: They were valuing all firewood at market price (which is well above typical cost) and then comparing to GDP estimates that treat production very differently than that, and I think the problem goes at least one step deeper than Bernard identifies, they likely aren’t even including this type of measurement of firewood itself in the denominator, and also they are using urban firewood prices for what was mostly rural consumption.
The question is then, why don’t we feel rich? The reason we don’t feel rich is that we are not permitted to live in between how we lived then and how we are supposed to live now.
And yet, we could do so much better. Which is all very good news.
You Need Bigger Nametags
Fully endorsed. I do not know of a single example of a too-large name on a badge.
Variously Effective Altruism
Oh no.
We also would have accepted Jonah Hill, as per Atomic. This is not merely a technical historical accuracy thing, I think it’s actually important. Also, given this and the name The Altruists, and also the description I – I mean seriously, what, no, that’s not how any of this worked – I presume they have zero idea what they are doing.
How should we think about Warren Buffet’s $6 billion donation going entirely to other foundations, mostly to the Gates Foundation? This is definitely not first best, but he is getting quite old, so I don’t think asking him to manage the money himself is a reasonable ask, trying to force generic additional foundation into existence without his focused attention seems unlikely to work out, and at this scale there are few options available. Obviously I have some suggestions I think are much better places to put a good chunk of these funds, but I’m not mad at it.
NYT once again pulls the Kevin Bacon Game, as in ‘[X] is associated with [Y] which has a similar name to [A] which includes [Z] so obviously [X] is linked to [Y].’
Elon Musk is no longer a senior official. It would still be rather rude.
This is rapidly evolving into a generalized weapon against everything good.
As in:
We see this in its pure form with David Sacks, saying anyone opposed to anything he wants must be an EA in a mask, and that we have to ban states from passing laws about AI because all state laws about AI would of course be the result of a global conspiracy of evil EAs. But you can do the same thing about anything, anywhere.
As Henry Shevlin says, you have to know your EAs.
In a French experiment, they report that imposing a maximum donation increased likelihood and quantity of giving, at least as effectively as a suggested donation, but what they actually did was paid 10 Euros for completing a questionnaire and then offered people the chance to donate either 0-10, 0-10 (with a suggestion of 6) or 0-6 Euros. And yes, in this case 0-6 did better, but this obviously doesn’t either describe what they claim it does or generalize. It does suggest the important principle that you want to appear reasonable.
For Your Entertainment
There are two distinct problems here: That on the margin there are huge rewards to learning to work the system, and that the intrinsic motivations have perhaps changed.
My money is on the problem being mostly about the reward systems rather than the motivation. Yes, some people primarily want to be famous and successful, but that has always been true. What changed is that if you pursue excellence, the excellence that gets rewarded and that you can measure is largely about working the system, whereas making the underlying products ‘better’ matters too but it is a slower process that on the margin doesn’t pay off for a long cycle. Success is so reliant on virality.
That’s one reason I am so grateful for Substack. It is one of the few places where virality is great when it happens, but it matters remarkably little for long term success.
The New York Times comes out with its best 100 movies of the 21st Century, as voted on by influential Hollywood people.
My main takeaway was, wow, there are a lot of movies and I have seen not many. My secondary takeaway was, well, this does explain a lot, I suppose.
My evaluation:
Have seen, excellent pick (definitely would have made my list): 14
Have seen, good pick (would be happy to have this on the list): 10
Have seen, questionable pick (I mean weird flex, not my pick): 8
Have seen, actively bad pick (no, seriously, no, don’t watch this): 2
Haven’t seen, probably good pick, but I because of reasons I never saw it: 11
Haven’t seen, can’t tell: 52
Haven’t seen, probably bad pick: 3
If we look only at the 34 that I’ve seen, that ratio isn’t that bad, but you have highly favorable selection working for you there.
Recent movie pickings have been slim. A lot of people liked Superman. I did not.
As a reverse experiment, I went through my Letterboxd diary list (as in, what have I watched since I started tracking, that was released in the 21st century.) The ones that 100% should be on the list are Anora, The Fall Guy and Looper. All three are missing, and I get that the other two are quirky opinions but I don’t think there’s any excuse for excluding Anora. The bubble for my list would be somewhere in the 4.5 range. Of my 4.5 star movies recorded, NONE of them made it either: Challengers, Poor Things, Megalopolis, Weird: The Weird Al Story, Deadpool and Wolverine, Predestination, You Hurt My Feelings and May/December. Some of that is that the list clearly has an anti-recency bias, there are literally zero movies from 2024 or 2025. Who knows.
I think a lot of the problem was that they only asked each person to vote for 10 movies rather than 100 movies. That introduces some odd distortions.
For better opinions, here are Scott Sumner’s latest movie reviews. There is also well-earned praise for Lighthaven and the events there. I have been seeing less movies lately in favor of watching more television shows, and because few movies this year have appealed to me. I do hope to turn that back around, especially now that (by the time you read this) Love Island USA is done for the year, but I also think going through phases of intense interests and jumping around is actually correct.
Here’s another example of ‘whatever you are doing, commit to the bit.’
I don’t know how much this generalizes or how much it depends on it having been a physical skill like ceramics, but yeah. Get into something.
In one of the weirdest arguments I’ve seen in a long time, Tyler Cowen says people read less and perhaps have lower literacy skills but the ‘most likely culprit for our current problems’ is the decline of network television and people’s willingness to obey Walter Cronkite and be duller and more conformist. I suppose the point is that reading was already gone and mostly we’re substituting out of TV and there are some cultural downsides to that?
But that has nothing to do with the question about reading, and also that’s a different set of problems? Surely, if English Majors Can’t Read, that isn’t caused by their failure to watch a bunch of NBC. My read on the post covering the reading debate here is that it’s a mirage, reading hasn’t actually declined that much, we’re now constantly interacting via text, it’s more that attention spans for long texts have declined and this isn’t obviously wrong, and the reason students 100 years ago sound so much better is that they are a highly selected group.
To the extent there really is an issue, I say the problem was caused by… network television, which shifted a ton of consumption away from reading to video. After that, the recent changes didn’t make things worse (I think?) but substituted something else for the network television.
YouTube Shorts is now averaging over 200 billion daily views. There are only ~8 billion people on Earth, so that’s 25 per person. And then Reels and probably TikTok are both bigger than that. Yikes.
Robin Hanson points out our consumption of fiction and music is dramatically higher than it used to be, these are rough AI estimates, I note that o3-pro for me estimated 9-14 hours a week for all fiction rather than 24, although Opus was 15-20 hours:
The post doesn’t focus on music, and I would ignore it. There is no real sense in which we ‘spend’ three hours a day on music. o3-pro estimates 97% of our music consumption is passive, so active consumption may even have gone down. There’s no reason to presume this is or is not adaptive.
I consume far less because I find music reduces my productivity, but it brings me joy and I should probably consume more.
Fiction however is presumably being consumed as a primary activity. So this change, largely in response to vastly superior supply of both fiction and free time, is plausibly maladaptive. Certainly 24 sounds like a ton, although 14 seems a lot more sane to me.
One could decompose this change into leisure consumption over time, and the share of that consumption that is fiction or actively listening to music. It seems plausible that given the decision to consume so much leisure, it is not a mistake to consume this much fiction and music, or it is a much smaller mistake. So to the extent we worry about a cultural error here, the focus should be on our potentially maladaptive increase in total leisure.
A paper’s model of ‘inefficient bargaining’ puts a 2% lower bound on the chance a TV show is cancelled even if it would be efficient to continue, higher if there is asymmetric information. That’s the nature of any similar negotiation, if you’re not risking a 2% chance any given negotiation blows up you are not negotiating very hard.
Game Theory
I’ve talked about it before but I seriously can’t get over that the world works this way.
If all you ever do is throw the number in the average, and all you care about is the average, then yes, rating something 3/5 is silly. But you don’t directly benefit that much from the average, so all you have to do is have the ratings also do something else, especially if they help you track things or help algorithms or AIs make predictions, or you get a reward for a reasonable distribution, or people are reading your reviews directly, and so on. Movie ratings do survive with a reasonable distribution for similar reasons, even in America.
The problem is that if you try to force calibration in various ways, that opens up other ways to cheat the system, so this would work if and only if people weren’t adjusting.
Gamers Gonna Game Game Game Game Game
It was Monster Train 2 month. We’re back, baby.
I centrally describe Monster Train 2 as More Monster Train. Had fun the first time? Have fun again, with a bunch of cool new features, figure out the new clans, and climb. As before, the goal of Monster Train is to do Something Utterly Ludicrous, or more precisely something that wins the run, which means knowing exactly what does and does not win runs. There are particular battles that are run killers if you don’t realize the danger.
Ultimately I decided that I had fun for enough hours I was happy I bought and played the game, but that I’d had this experience before, I could keep going and achieve more things but my experience had peaked and I was done after 14 hours. Which is fine.
I am now on Clair Obscur Expedition 33. I agree with everyone else that, some frustrations with navigation aside it has been a great experience so far. I do have notes, especially that certain choices are not well balanced.
Recommendations for how to maximize your Clair Obscur Expedition 33 experience. The first is minimize spoilers. The others are out of your hands and are minor spoilers, so I’m not going to tell you, and you shouldn’t click the link until after you play.
If I am understanding this right, XBox is going to transition to a modular platform that will be fully compatible with PCs and basically be a way to play PC games on console and handheld formats? They lost to Sony so they’re going after Valve?
I agree with dCrusius that retro games both classic and new are pretty awesome, and it is not only nostalgia, and there’s a reason my kids like them so much. Restrictions breed creativity and I love being able to actually fully grok everything. There are still great modern games too, of course.
Reid Duke reports from PT: Final Fantasy. Sounds like old times. DI Goetschel also reports as well, the first part is highly particular but the second part involves universal principles that don’t require you know what the cards do.
There was a poker tournament where one player got a $1 million dollar extra payout if he won, which was much larger than all the other prizes. So the other finalist let him win. All Magic: The Gathering players and game theorists are unsurprised, but in poker this is a real problem, because poker depends on the ability of various players to do various insane prop bets and competitions and such that create weird incentives, and for the other players to not respond by coordinating to make the conditions happen, whether or not they then directly get (or negotiated for) a cut.
I do miss the original Railroad Tycoon.
Railroad Tycoon was great because it focused on actually interesting decisions, and simulated actually interesting things in ways that felt real and forced you to think and work with a variety of real concepts. Alas, yeah, these types of games seem to have gone very downhill, even though one could very easily make them great by making the retro version and then using modern tech to make it better. But no one does it.
A common risk and gaming pattern:
The VC’s question seems highly valid, and there are at least two very distinct plausible answers, although one probably means he was flying a bit too high.
The thing about poker and gambling is that you only have to gamble enough to make you care. It can’t be $0, but if I can get excited by amounts of money that mean nothing to me, why not? The excitement is the point, I’m certainly not making my hourly. If I ever do get to play a major tournament, which is the only time I might plausibly play for stakes that actually matter to me at this point in real terms, it will be because of the competition and the title.
I do remember what it was like to be gambling actually important, life changing amounts of money on a daily basis. I never actually got to the point where I enjoyed that aspect, but I did it because that’s the only way to get the alpha.
By default, never tell a streamer any potentially new-to-them game information they aren’t explicitly asking for you to tell them, and wondering aloud does not count as asking. I am fully with Jorbs here.
Who Wants To Be An American Citizen?
DHS Is Considering Reality Show Where Immigrants Compete for Citizenship, from the producer and writer of Duck Dynasty. I would have tapped Mark Burnett, creator of Survivor and The Apprentice, because obviously.
To be clear, this is extremely funny, but also we should totally do this, because skill-based immigration rules as does wholesome family entertainment.
The challenges might need some work, though?
An alternative pitch, of course, would be Green Card Marriage. Relationships on The Bachelor tend not to last, so let’s raise the stakes. If you don’t actually marry and make it two years we kick you back out of the country. Remember, you can’t be 4TWR when coming to America is always the right reason. So all bets are off.
I Was Promised Flying Self-Driving Cars
Waymo expands, now so tantalizingly close to SFO.
Plus this area of course:
For now maybe a shuttle or quick taxi ride for the last mile into SFO?
Waymo’s speed disadvantage does add up on longer trips, like this comparison showing Waymo 50 minutes slower than an Uber if traversing the entire length of the covered area down to Burlingame, due the whole ‘always obey all the traffic laws and rules of the road and almost never have an accident’ thing.
A key question on self-driving cars is, are we going to use them to give children better freedom of movement, because now they can safely go anywhere without having to drive? Are we perhaps also going to let them walk around because the primary threat (other than police) was always cars and the self-driving cars are vastly safer for pedestrians? Or are we going to be totally crazy and not let them do any of it?
I also disagree that they will make traffic worse, because self-driving cars can coordinate traffic very well, even if humans would end up in a pointless jam that feeds on itself, and because the cars can coordinate their movements much better, also we could vastly improve parking issues. But yes, ultimately if we want to get optimal road use we need to charge to use the roads.
A cool thought experiment, 23 million autonomous vehicles could take care of all car rides, a 90%+ reduction in vehicles, by an o3 estimate. This seems right to me at least if you exclude isolated people’s vehicle needs.
For now, we’re a little short.
That’s one of those statistics that is both impressive and disappointing at the same time. It is great to double the size of the fleet, but why only double? Why not 10x, or 100x? I want my self-driving cars.
A bill was introduced in Washington, DC to allow fully self-driving cars. For the last few months Waymo has been forced to have dummy human drivers behind the wheel, with rides for customers in Washington DC, which will be their seventh city, only slated for 2026.
Sports Go Sports
There is a new culturally important sport in town, which is Love Island USA. Make no mistake, this is a sport, and a rather excellent one. Season 7 was reportedly several times the size of the former peak of Season 6 by audience and size of online discussion, so chances are Season 8 is going to be huge next year. The best part is that there is still so much room for improvement in the format.
NIL Go is the new attempt to get a handle on payments to athletes in college sports, requiring all substantial payments to go through them so they can check the deal and approve it, with arbitration if you object. It seems likely this will fail and we’re simply going to face a full market for student athlete services, with extra steps, but at least they are trying once more.
The Lighter Side
An SMBC is very much not how any of this works, which was the joke, but the problem is that SMBC is too often actually describing how things do work, such that Eliezer felt compelled to point out all the ways this one was wrong, which only made the whole thing funnier.
Refuse the call to adventure today!
Maybe learn a foreign language instead?
Or, if you must do more, here’s a handy guide.