Yes, I wrote that in the review. I think that they made the wrong choice because that is not how most people will consume the content.
Do you happen to have a good argument why the book proper couldn't be 20% longer to better make their case?
I think 20% longer does meaningfully put it over a threshold. Being short enough that a not-particularly-invested person would still choose to read it matters a lot.
I also don't know that I buy that being 20% longer would really help. There are basically two kinds of people reading the book – people who just want the basic thrust of the arguments, and people who want a good enough understanding to contribute intellectually in some way. I currently think the book does a pretty decent job of conveying the most central arguments.
I think I also disagree that most people will read the book vs. the online content (disclosure: I worked on the website for the online resources). I think most people won't finish the book (even among people who buy it – people mostly don't read books), but a lot of people are likely to read at least some of the free online content.
(It's a bit easier for me to believe this in part because I expect to exert some control/taste on how the online resources evolve over time and have ideas for making them really good)
I’ve met a large number of people who read books professionally (humanities researchers) who outright refuse to read any book >300 pages in length.
I chose ~20% for a reason, but we can be more precise and say 15% to still keep it under 300 pages.
Also, if you truly think space is at such a premium, then the scenario could be scaled back in favor of explaining how the policy proposals would work.
I am not arguing about the optimal balance and see no value in doing so. I am adding anecdata to the pile that there are strong effects once you near particular thresholds, and it’s easy to underrate these.
In general I don’t understand why you continue to think such a large number of calls are obvious, or imagine that the entire MIRI team, and ~100 people outside of it, thinking, reading, and drafting for many months, might not have weighed such thoughts as ‘perhaps the scenario ought to be shorter.’ Obviously these are all just marginal calls; we don’t have many heuristic disagreements, and nothing you’ve said is the dunk you seem to think it is.
Ultimately Nate mostly made the calls once considerations were surfaced; if you’re talking to anyone other than him about the length of the scenario, you’re just barking up the wrong tree.
More on how I’m feeling in general here (some redundancies with our previous exchanges, but some new):
https://www.lesswrong.com/posts/3GbM9hmyJqn4LNXrG/yams-s-shortform?commentId=yjnTtbyotTbEnXqa9
While this has been a decent exchange, I'm not sure it's that useful to either of us going forward?
Regarding anecdata, you also have to take into account Scott Alexander disliking the scenario, Will being disappointed, Shakeel thinking the writing was terrible, and Buck thinking that they didn't sufficiently argue their case. And that's not even including the people who outright disagree with the main argument.
Anyway, we shall see how it turns out (and I sincerely hope it has a positive impact)
Most of these people claim to be speaking from their impression of how the public will respond, which is not yet knowable and will be known in the (near-ish) future.
My meta point remains that these are all marginal calls, that there are arguments in the other direction, and that only Nate is equipped to argue them on the margin (because, in many cases, I disagree with Nate’s calls, but don’t think I’m right about literally all the things we disagree on; the same is true for everyone else at MIRI who’s been involved with the project, afaict). Eg I did not like the scenario, and felt Part 3 could have been improved by additional input from the technical governance team (and more detailed plans, which ended up in the online resources instead). It is unreasonable that I have been dragged into arguing against claims I basically agree with on account of introducing a single fact to the discussion (that length DOES matter, even among ‘elite’ audiences, and that thresholds for this may be low). My locally valid point and differing conclusions do not indicate that I disagree with you on your many other points.
That people wishing the book well are also releasing essays (based on guesses and, much less so in your case than others, misrepresentations) to talk others in the ecosystem out of promoting it could, in fact, be a big problem, mostly in that it could bring about a lukewarm overall reception (eg random normie-adjacent CEA employees don’t read it and don’t recommend it to their parents, because they believe the misrepresentations from Zach’s tweet thread here: https://x.com/Zach_y_robinson/status/1968810665973530781). Once that happens, Zach can say “well, nobody else at my workplace thought it was good,” when none of them read it, and HE didn’t read it, AND they just took his word for it.
I could agree with every one of your object-level points, still think the book was net positive, and therefore think it was overconfident and self-fulfillingly nihilistic of you to authoritatively predict how the public would respond.
I, of course, wouldn’t stand by the book if I didn’t think it was net positive, and hadn’t spent tens of hours hearing the other side out in advance of the release. Part I shines VERY bright in my eyes, and the other sections are, at least, better than similarly high-profile works (to the extent that those exist at all) tackling the same topics (exception for AI2027 vs Part 2).
Do you happen to have a good argument why the book proper couldn't be 20% longer to better make their case?
I think there's a curve of how many people pick up the book at all that depends on length. I didn't do this estimation explicitly--my guess is the authors and publishers were doing it implicitly rather than explicitly--but my guess is that you get something like 20% fewer readers if the book is 20% longer, while the number of additional people the extra length convinces is something like 5% of readers, and I think that means increasing the length is suboptimal.
(Like, in my favorite world we could A/B test this with the ebook or w/e, where we dynamically include material and see which pieces to include, or have something Arbital-style where people can expand sections for elaboration as needed. But this is very challenging to do with a physical book.)
Yes, a test would be nice but impossible.
I'll just say I strongly disagree that 20% more length means 20% fewer readers. I would think it wouldn't change readership much at all. The people who would read such a book wouldn't drop off quite so dramatically.
Fair review. As I've now said elsewhere, after listening to IABIED I think your book Uncontrollable is probably still the best overview of AI risk for a general audience. More people should definitely read your book. I'd be down to write a more detailed comparison in a week or two once I have hardcopies of each book (still in the mail).
I’ve reflected on whether that perception is largely subjective preference or a honed sense of how others come to understand things but I can’t know (time will tell).
Both professionally and in my personal capacity, I do a lot of communication about AI safety to audiences with different levels of background knowledge, and I got the same sense that this is not how you bridge the gap between the general population/DC folk/the intellectual side of genpop/etc. and what you're actually trying to communicate. I basically agree with Yudkowsky on all of his claims. My primary problem with his writing has always been that it only works on those who are already at least a bit rationalist and have the capacity to become more so. I assumed from the preliminary reviews that the contributions from Nate and the editing team had fixed this, that they had finally turned Yudkowsky's writing into good general-audience writing, and I was surprised and disappointed to find out this was not the case. The praise for the book from outsiders still gives me some hope that I'm wrong on this front, but this doesn't meaningfully affect my assessment of its quality.
I think the text is meaningfully more general-audience friendly than much of the authors’ previous writing.
It could still be true that it doesn’t go far enough in that direction, but I’m excited to watch the experiment play out (eg it looks like we’re competitive for the Times list rn, and that requires some 4-figure number of sales beyond the bounds of the community, which isn’t enough that I’m over the moon, given the importance of the issue, but is some sign that it may be too early in the game to say definitively whether or not general audiences are taking to the work).
To both, have either of you read my book for comparison?
Alice,
Yes, we feel the same way on multiple fronts. I still don't understand why certain decisions were made that reduced some easy wins. Oh well, we shall see.
yams,
It's true that it's better, but there is SO much further it could have gone.
Actually, I think 6,000-8,000 copies can be largely driven by the community (funding book groups), and there was a substantial institutional push, which should help.
The concern is that they get the sales but it's the wrong book. So the thing one actually wants - the reader now being aware/engaged - happens less than it would with something else. Perhaps it will polarize in bad ways... or good ways. Experiment indeed.
Can’t say too much about current sales numbers, mostly because nobody really has numbers that are very up to date, but I was starting with a similar baseline for community sales, and then subtracting that from our current floor estimate to suggest there’s a chance it’s getting traction; a second wave will be more telling, the conversation will be more telling, but the first filter is ‘get it in people’s hands’, and so we at least have a chance to see how those other steps will go.
In both this and other reviews, people have their theory of What Will Work. Darren McKee writing a book (unfortunately) does not appear to have worked (for reasons that don’t necessarily have anything to do with the book’s quality, or even with Darren’s sense of what works for the public; I haven’t read it). Nate and Eliezer wrote a book, and we will get feedback on how well that works in the near future (independent of anyone’s subjective sense of what the public responds to, which seems to be a crux for many of the negative reviews on LW).
I’m just highlighting that we all have guesses about what works here, but they are in fact guesses, and most of what this review tells me is ‘Darren’s guess is different from Nate’s’, and not ‘Nate was wrong.’ That some people agree with you would be some evidence, if we didn’t already strongly predict that a bunch of people would have takes like this.
Oh yes, guesses all over the place. And very difficult to meaningfully arbitrate.
(FYI, my opinion is that mine hasn't reached more people for several reasons, such as not having name recognition, an existing larger following, or institutional support.
But whenever someone reads it, they seem to really like it.)
Strong upvote, I appreciate the inside-view context that you have from publishing a similar book. I bought it as a result of this review.
I cannot, alas, promise a side-by-side review. However, there are a couple of questions I am primed to look for, foremost among them right now: how much detail is invested in identifying the target audience? The impression I am getting so far is that it has been approximately defined as not us, but a lot of complaints seem to turn on this question. I see a lot of discussion about laymen but that's an information level, not a target audience. I don't know if I have seen much discussion of the target audiences at all outside of the AI policy area, come to think of it.
Great, look forward to hearing what you think.
I can't speak to exactly who IABIED was targeting, but I spent a lot of effort to make mine as accessible as possible to someone who (a) would read a non-fiction book about AI, but (b) has no background in science (this includes many people with influence). The logic being that one might lose non-science people if it were written more towards science people, but would be unlikely to lose science people if it were written generally but engagingly.
I particularly agree with the point about the style being much more science-y than I'd expected, in a way that surely filters out large swathes of people. I'm assuming "people who are completely clueless about science and are unable to follow technical arguments" are just not the target audience. To crudely oversimplify, I think the target audience is 120+ IQ people, not 100 IQ people.
I mention this for transparency, but also because some seem to be rallying around IABIED, even with its shortcomings, because they don't think there is another option.
I think IABIED should be rallied around because "the MIRI book" is the obvious Schelling point for rallying around. It has brand recognition in our circles, its release is a big visible event, it managed to get into best-seller categories meaning it's visible to the mainstream audiences, etc. Even if there are other books which are moderately better at doing what IABIED does, it wouldn't be possible to amplify their impact the same way (even if, say, Eliezer personally recommended them), so IABIED it is.
Further, even if it's possible to coordinate around and boost a different book the same way, this would require additional time; months or years (if that better book is yet to be written). We don't have much of that luxury, in expectation.
This still wouldn't be a good idea if IABIED were actively bad, of course. But it's not. I think it's reasonably good, even if we have our quibbles; and MIRI's pre-release work shows that it seems convincing to non-experts.
We could think about crafting better persuasion-artefacts in the future, but I think rallying around IABIED is the only option, at this point in time. And it may or may not be a marginally worse option compared to some hypothetical alternatives, but it's not a bad option.
I think the target audience includes those in various positions and with various backgrounds that would benefit from a more thorough presentation of the ideas, so it's not just the style issue.
It might depend on what, exactly, rallying means and how you see the implications of that. I thought EY's appearance on Hard Fork, for example, wasn't good and the message of AI safety might have been better presented by someone else.
As you read, I agree with Buck that the book doesn't sufficiently argue its main points, and this makes it problematic.
We may just disagree on how difficult it would be to recommend my book (as an example) along with IABIED?
There are a wide range of options and some require little effort and wouldn't take away much from IABIED compared to the benefit (yes, I think the difference is that great).
I went into IABIED trying to take on the mindset of a layperson (hard of course!) and actually came away thinking it did a really great job. Of course, as you say, time will tell.
Some of your complaints of the book seem to stem from the fact that you are "For Y" and Y&S are "Not X". If you believed as strongly as they do in "Not X", do you think some of the decisions in the book would make more sense?
I thought the length of the book was great for people new to the topic. Readers will likely have counterarguments while reading the book. But if you even tried to address those a little, the book would quickly grow well beyond 20% longer. The decisions on what to include made sense to me.
The scenario in part 2 does a great job responding to the common question "but how exactly will AI take over and kill us all?". I feel very confident most readers would much much rather have a clear story than extrapolations. It's true that stories of how AI will kill us carry lots of risk of hole-poking and discarding. But I actually think they handled that very well by adding plenty of clear caveats before, during, and after the scenario.
I think their proposal, aside from the 8 GPUs (I would choose a higher threshold), makes sense as is. They admit their lack of knowledge on how to implement it, IIRC. I think that's completely fine. I'm glad they don't go into detail about what they don't know. The main point of the book is right there in the title. What logically follows from the title is that you need international agreements, similar to how we've handled the threat of nuclear war. I assume they hope that others who read the book and have more knowledge of how to get to such a place will be motivated to act.
This book is the first of Yudkowsky I actually managed to finish. When I heard Shakeel talk about torturous language and others complaining about the parables, I was worried (because those are exactly the reasons I couldn't finish his other works). But I ended up really surprised by how much I enjoyed the writing and all of the parables. And funnily enough I thought the leaded gasoline one was one of the most boring ones. But perhaps I was so pleasantly surprised because of the low expectations I had going in. And I can definitely imagine how they might still be too sciencey/sci-fi for laypeople. Good point!
Haven't read your book yet, so I can't say how it compares!
TL;DR Overall, this is a decent book because it highlights an important issue, but it is not an excellent book because it fails to sufficiently substantiate its main arguments, to explain the viability of its solutions, and to be more accessible to the larger audience it is trying to reach.
As such, it isn’t the best introduction for a layperson curious about AI risk.
IABIED usefully highlights key things everyone should know about AI risk: A superintelligent AI could cause humanity lots of harm, we don’t know how to fully ensure it won’t, we don’t know how long we have to figure it out, and everyone should be more concerned than they are.
I liked the use of evolution as a way to glimpse how initial goals/purposes (the evolved desires for sex or sugar) can be thwarted by intelligence down the line (the hacks of birth control and artificial sweeteners), and how predicting an entity's actions as it changes over time is very difficult/impossible.
I also liked the refrain that AI systems are grown not built (to highlight our lesser understanding/control).
Important to end the book on a message of hope (but not sure how hopeful it will actually make readers feel).
(The rest of the review focuses more on the shortcomings because I’ve seen fewer of those mentioned and I keep seeing positive blurbs without much analysis. )
The book repeatedly suffers from insufficient exploration of a concept/issue. This harms a deeper understanding/encoding of that concept/issue and how it fits into the overall picture of AI safety and risk.
For many of the chapters in Parts 1 and 3, as they came to their end, I wondered “That’s it? Where is the rest? You were just getting going!”
This is all the more significant because of the bold claim in the title and the text, and how the reader is explicitly told the main claim is not hyperbole - everyone will die! If so, there had better be reams of argumentation to support this thesis. That support should be both intellectual and intuitive, using intuition pumps to break down cognitive/emotional walls and enable greater acceptance of AI risk arguments (perhaps this is what the parables were supposed to do, but if so, it wasn’t sufficient).
For comparison, instead of IABIED’s parables with aliens, Uncontrollable met people where they are, using common/everyday experiences to illustrate AI risk/safety issues. For example, listening to music and baking cookies (the power of intelligence), ordering pizza and cleaning your room (alignment problem), your smartphone (control), and food poisoning (risk). The logic was: what is the activity/thing that almost everyone has experience with? The less common or accessible the introduction to a chapter, the fewer people will be reached, or the depth of that reach is lesser. Conceptual learning/integration suffers.
I’m aware that each chapter ends with a QR code but I believe that most people will not use these, and those listening in audio are even less likely to do so. It’s hard to compare lengths without word count but the audio of IABIED is over 6 hours and the audio of my Uncontrollable is over 10 hours. This means Uncontrollable is ~40-60% longer, and I didn’t even have a scenario. So, IABIED spends much less time talking about the power of intelligence, how machine learning works, the alignment problem, control, risk, etc.
I think IABIED should have been at least 20% longer. This would still allow it to be under 300 pages (as I assume a shorter book was the goal) while providing a lot more detail about the key aspects of AI safety/risk. If someone is hearing about the alignment problem for the first time, they really need to be walked through it.
A minor example that is a bit representative: on page 203, it says “It might help if more people understood just how spooked experts and engineers are about artificial intelligence” and “It might also help if more people understood how fast this field is moving” (p. 204).
Yes, agreed, it sure would! …But then each of those lines is followed only by a very short overview explanation. Why not many paragraphs/pages? With real-world examples and quotes? Make the case, please make the case!
Further, IABIED doesn’t address, in the main body of the book, many of the main criticisms that AI safety/risk positions receive, and more space would have allowed for that. There should be an abundance of responses of the “but what about/why doesn’t…” variety, but there were only some. I don’t know why the decision to truncate and put more online was made to this degree, but I think it weakened the book quite a bit.
Overall, the book is easy to read in the sense that it isn't too long, the spacing/margins are large, and the text is not visually dense. Many sentences are straightforward, with the occasional flourish. Overall, though, I think the style was too science/sci-fi oriented. This is a drawback because I believe IABIED is also trying to reach those without a science background, and many of the examples/explanations/parables would not be that accessible to such people (an empirical question, though).
To elaborate further in two parts…
First, the issue of the parables. Some will like these, some will not. I didn't like most of them, so that made many chapters a rough start. I did enjoy the story of leaded gasoline, though, and wanted more like that. In short, I think real-world examples are best, not aliens/bird aliens making observations. Further, in many of these parables, characters spoke in an odd way that felt forced (to make the point), which makes them less effective as a communication device.
Second, science stuff is great for people who know/like science, but it doesn’t work as well for those who don’t. Most people don’t really know how evolution works, so if that is going to be the analogy used to help understand the limitations of AI alignment/control/prediction, it’s best to unpack it a bit more. Even more so when discussing sexual selection - that part was just too brief to stick if you’re hearing it for the first time (people will only re-read the same paragraph so much).
There were also technical details about LLMs, nuclear reactors, and a sprinkling of numerous other phrases used by people who know science (or want to show that they know science)… but I don’t see that as a strength. You gain points among a certain subpopulation but lose them from another, larger subpopulation (the one I think IABIED is trying harder to reach). The risk of such language isn’t worth the reward when another style can work for all audiences.
I won’t give a summary, as Scott and others have already done that, but I will say that it was disappointing for me.
Scenarios are tricky. Without them, people often say, “Yes, but HOW would it happen?” Yet, with them, people poke holes, say “that specific thing is unlikely,” and then disregard the larger point. A quirky thing about human brains, eh? The more detailed a narrative, the more psychologically plausible it seems… while at the same time, every added specific makes it less likely to actually occur.
As I didn’t have a scenario in Uncontrollable, I had hoped that the one in IABIED would be something I could point towards. Alas, I don’t think I can. This is because I wanted a scenario that was less sci-fi narrative and more evidence-based good non-fiction extrapolation. Less a story, more a foresight scenario.
For example, make extremely clear which real-world events have already happened, use those as a launching pad to the near-future events that are likely to happen, and then present a range of possible outcomes flowing from those. IABIED’s scenario felt more like a story, which I think hurts the main goal of the book: too many people already think AI risk is a sci-fi story, so a description of how things could come about should take pains to read less like a story and more like a series of reasonable extrapolations. People will disagree about this (time will tell).
The main proposal of IABIED is to shut it all down. This means ceasing all AI progress, having an oversight/control regime to ensure that that happens, and bombing data centers if needed to enforce said regime.
These proposals are so outlandish on their face that there should have been LOTS of details about how exactly they would plausibly come to fruition and how the numerous complications and various wicked problems would be addressed.
There were not. And it’s baffling to me. Why such an own goal?
8 GPUs?!
Presently, the most powerful AI models are trained on thousands/tens of thousands of GPUs. There are plans for 100,000-GPU data centers. IABIED proposes that it should be illegal for any corporation/person to have more than 8 GPUs unless they are part of a monitoring/treaty oversight regime.
Honestly, this seems laughable but it’s important to have an open mind, so I was ready to take the position seriously. Take us through the steps of how this could actually work in practice. How different players - governments, corporations, billionaires - would react and how that proposal would be politically feasible, legally appropriate, etc.
Nothing like that appears. The same goes for when they say there should be no more AI research published. No detailed consideration of the complexities, or of how some of these proposals might tank the economy (so any politician proposing them would be replaced by one who doesn’t, etc.).
This is a recurring pattern, and I don’t know why such detail wasn’t included in the book (itself).
While I can appreciate the sentiment behind the belief “X will kill us if we don’t stop doing A, B, C”, it’s not enough to say that those things are necessary and sufficient for safety.
You have to make the case. You have to spell it out. You have to convince skeptics.
But they don’t.
They say it won’t be easy or cheap, but it’s required.
That is not enough to take such proposals seriously.
If the next response is something like, ‘However implausible you think these proposals are, they are our only hope, so there is no room for flexibility’ - this isn’t obviously correct.
They hint at this when IABIED argues there should be a single focus on curbing AI progress instead of a broader tent, because unrelated issues (like deepfakes) might detract from that focus, or because people are sensitive to regulatory overreach.
Maybe?
Make the stronger case so people aren’t left wondering: why not have a larger tent, find allies where you can, put forward 5-10 different, more palatable AI-safety-related proposals, and make some progress in smaller steps? …instead of a sudden, much larger ask that appears completely and utterly unrealistic?
Strategically, IABIED has chosen to argue “Not X” (i.e., no more AI progress) while in Uncontrollable I argue “For Y” (i.e., safe AI innovation). I think having a range of policy proposals that lead to safe AI innovation is more plausible to succeed than what IABIED proposes (even if both are in the ‘unlikely’ category).
[Meta: I can see I’m running out of steam. I wish I had more time/energy to give to this review but I don't, so some of my positions may appear less substantiated than if I had more time to quote/excerpt lots of the text. Such is life]
It was a great effort to create and launch IABIED. I sincerely hope it has a wonderfully positive impact and makes us all more safe. But it may not, given its numerous missed opportunities and various shortcomings. Time will tell.
I would love for someone other than me to compare/contrast IABIED and Uncontrollable. If the cost of my book is the limiting factor, let me know and I’ll get you a free one.
I would normally consider this poor form in various ways, but given the critical importance of increasing acceptance of AI risk and some being unaware of its existence, it becomes more reasonable to say that Uncontrollable does a better job than IABIED at introducing AI risk to a layperson audience without a science background.