AI Risk, Interviews, Rationality, AI

Interview with Eliezer Yudkowsky on Rationality and Systematic Misunderstanding of AI Alignment

by Liron
15th Sep 2025
Linkpost from www.youtube.com
111 min read

My interview with Eliezer Yudkowsky for If Anyone Builds It, Everyone Dies launch week is out!

Video

Timestamps

  • 00:00:00 — Eliezer Yudkowsky Intro
  • 00:01:25 — Recent validation of Eliezer's ideas
  • 00:03:46 — Sh*t now getting real
  • 00:08:47 — Eliezer’s rationality teachings
  • 00:10:39 — Rationality Lesson 1: I am a brain
  • 00:13:10 — Rationality Lesson 2: Philosophy can reduce to AI engineering
  • 00:17:19 — Rationality Lesson 3: What is evidence?
  • 00:22:41 — Rationality Lesson 4: Be more specific
  • 00:28:34 — Specificity as a superpower in debates
  • 00:30:19 — Rationality Lesson 5: How to spot a rationalization
  • 00:36:52 — Rationality might upend your deepest expectations
  • 00:38:18 — The typical reaction to superintelligence risk
  • 00:40:07 — Eliezer is a techno-optimist, with a few exceptions
  • 00:47:57 — Why AI is an existential risk
  • 00:53:24 — Engineering outperforms biology
  • 01:02:09 — The threshold of "supercritical" AI
  • 01:13:23 — How to convince people there's a discontinuity ahead
  • 01:18:06 — The alignment problem: Are current AI systems aligned?
  • 01:28:20 — AI alignment researchers as overconfident alchemists
  • 01:37:52 — The strawberry Xerox test for alignment
  • 01:41:25 — What would a good scenario look like?
  • 01:57:22 — Can we keep “narrow AI” safe?
  • 02:04:48 — Eliezer's proposal for international coordination on AI
  • 02:12:02 — AI companies don’t get why alignment is hard
  • 02:18:32 — Ideal impact of If Anyone Builds It, Everyone Dies
  • 02:24:24 — Wrap-up & how to support Eliezer's efforts

Transcript

Eliezer's Background and Evolution of Views

Liron 00:01:26
Welcome to my channel.

Eliezer Yudkowsky founded the Machine Intelligence Research Institute and created the field of AI alignment research before most people even knew we needed it. This is the visionary who published sequences of wide-ranging blog posts, totaling almost a million words, and thus sparked the modern rationality movement.

His essays have influenced everyone from Silicon Valley CEOs to academic philosophers. Sam Altman has cited him. Elon Musk has engaged with his work. Countless AI researchers credit him with shaping their thinking about the dangers of building superintelligent machines. In my opinion, Eliezer is the most important thinker of our time.

Now he's published a book called If Anyone Builds It, Everyone Dies. A fire alarm to open our eyes to a very real possibility that we're all going to die because of rogue artificial intelligence, even potentially in the next couple decades, even potentially before our kids grow up.

Eliezer Yudkowsky, welcome to my show!

Eliezer 00:02:30
Thanks for having me on your show. I wish that it were under better circumstances and I wish the book title were an exaggeration.

Liron 00:02:40
For those who don't know, Eliezer's ideas have earned massive validation from the intellectual community. This was memorialized in 2023 when hundreds of top AI experts and public figures signed an open letter stating, quote, "mitigating the risk of extinction from AI should be a global priority alongside other societal scale risks such as pandemics and nuclear war."

The signatories included AI pioneers and Turing Award winners like Dr. Geoffrey Hinton, Yoshua Bengio, and Stuart Russell. It included CEOs and CTOs of top AI companies like OpenAI, Anthropic, Google DeepMind, and Microsoft. It included a broad coalition of public figures like Bill Gates, Reid Hoffman, Lex Fridman, the late Daniel Dennett, David Chalmers, Peter Singer, Sam Harris, and many senior research scientists at the frontier AI companies. It was a massive coalition of mainstream voices.

Eliezer, what you were warning about 20 years ago when nobody was listening has now become a mainstream opinion. How does that feel?

Eliezer 00:03:42
Pretty awful. But so it goes.

Liron 00:03:47
The weird thing is also that if you go back to 2000, at that time, you called yourself a singularitarian, which is defined as quote, "someone who believes that technologically creating a greater than human intelligence is desirable." And now you're saying if anyone builds it, everyone dies. So why are you flip-flopping?

Eliezer 00:04:07
Updated belief about what happens out there in the physical world when you do a thing. Back in 2000, I thought if you built a very smart thing, it automatically ended up very nice. I now think that this is actually mistaken, and that's not a vibe shift. It's not like my mood changed. It's just a prediction change.

Starting in 2000, I did get enough funding to work full-time on this. I did ask the question of: suppose it didn't automatically end up nice, how would you go about causing it to be nice? And in the process of pursuing that question further and further, I realized that my original model of reality had been mistaken. And that's it, basically.

I still have a lot of the same ethical commitments that I had then as now.

Liron 00:04:58
Yeah, I agree with your claims. As someone who's been reading you since 2007, it's clear to me that you actually could have written this book 20 years ago, right? The mind change happened in the early 2000s for you.

Can we see that book again? If Anyone Builds It, Everyone Dies. When you say everyone dies, what is that a metaphor for?

Eliezer 00:05:22
It is not a metaphor for anything. I mean everybody literally, actually dead, as in ceasing to breathe.

Liron 00:05:29
Okay. So are we in danger right now?

Eliezer 00:05:31
Right now? Not to my own concrete knowledge. I do not quickly expect that we will die before this video releases.

Liron 00:05:41
Fair enough. It's an important point that when we think about what's going to happen next, you and I and many other intelligent people have broad intervals. So yes, there's some chance it's already too late and we are all going to die very soon, and the AI labs are unleashing something very dangerous. There's also some other chance that we might have two whole decades or more before they release something incredibly dangerous and unsolvable, right? So we have wide intervals.

Eliezer 00:06:07
Two decades does start to feel like it's pushing it, barring international policy changes. The example I often use is Leo Szilard, the first person to realize the trick behind nuclear weapons: that there could be a cascade of induced radioactivity, what we would now call a critical chain reaction. He thought of that in 1933.

He saw through from there to nuclear weapons. He saw that he shouldn't publish his idea because it was a bit dangerous. He saw that Hitler particularly was likely to be a problem. He did not foresee that the first atomic bomb would be dropped in August of 1945 because even when you are running way ahead of the pack, even when you have genuine scientific insights that are not shared, enabling you to make predictions that far ahead, you can still predict endpoints much more easily than you can predict exact timing.

This is a lesson throughout scientific history. There are many cases of people who had an early grasp on some scientific principle or its application and correctly predicted where things would eventually end up from there. I cannot think offhand of anybody who predicted timing correctly.

Liron 00:07:19
That is definitely a good point about timelines, and so we have these broad confidence intervals. Just finishing up on my memory of 2007: the community was obviously very different, because paying attention to this idea of superintelligent danger was quite a niche thing to do.

I remember I was hanging out. I was a young nerd in my twenties; you were a young nerd in your twenties. We were in Silicon Valley. We were talking about these big ideas: the upcoming war over the galaxy, the need to defend the galaxy, the need to solve grand geopolitical strategy. Meanwhile, everybody else was going to their jobs, maybe tinkering on self-driving cars or the mobile revolution; the iPhone was just coming out.

So it's crazy how that was 18 years ago and now it's a different time and shit is getting real. Right? In your professional opinion is shit getting real?

Eliezer 00:08:03
Shit has always been real. Shit is now getting proximate.

Liron 00:08:08
Okay. That's an important distinction. And even among the venture capitalists in the tech industry, it's now common knowledge that shit is getting real and proximate, because AI is driving unprecedented financial returns. So now it's got everybody's attention, right?

Eliezer 00:08:21
I mean there are many millions of $20 a week users at OpenAI and Anthropic and some people spending much more at Anthropic on the industrial side. But these companies are still not Walmart in terms of their actual revenues.

We have certainly seen unprecedented investment. We have seen unprecedented speed of adoption. In terms of what we've already witnessed, I would not quite go so far as to say that we've seen unprecedented returns yet.

Liron 00:08:52
Yeah, fair enough. There's unprecedented optimism about potential future returns.

Eliezer 00:08:58
That there is.

Rationality Fundamentals

Liron 00:09:01
Your ideas about AI danger are so surprising and shocking, but what people don't get is that it's the conclusion of a very deep worldview. You're not just coming out here to shock people. It helps to see more context from your other writing, which I personally have read. I've dived into your many thousands of pages of writing. I've read most of the writing multiple times.

So I'm pretty well trained for this kind of conversation, to help the viewer see how all your thoughts fit together. So I wanna get into some of the Eliezer rationality deeper cuts. Sound good?

Eliezer 00:09:28
Yep.

Liron 00:09:31
I first met you in summer 2008. I was so blown away that you are a real person, just one person writing all these different lessons. And yeah, we're gonna get into these lessons, but before we do, I just wanna say your writings had a profound life-changing impact on me as a young adult.

I thought I was a clear thinker before I read your writings, and it really cleared away the cobwebs that I didn't know I had. It changed my relationship to reality and it's just been a better experience living my adult life, having had that. So thank you.

Eliezer 00:09:57
Thank you. It's not what I set out to do with my life, but given that what I set out to do with my life is still a bit in the air, it's good to know that I definitely had a positive impact on some people, at least temporarily, before the AI gets them.

Liron 00:10:18
Yes, exactly. And I'm definitely not the only one; I think this viewpoint is representative of people who have enjoyed your work.

Alright, so let's get into some of these lessons. Let me ask you this, what is the fundamental question of rationality?

Eliezer 00:10:30
Well, I have sometimes said that the keynote question is what do you think you know, and how do you think you know it?

Liron 00:10:39
Exactly, and I think that you bring that question to a lot of topics that I thought I knew, and then I learned it again from your perspective. And I realized that I didn't know as much as I thought.

One of the core takeaways that I learned from your writing is this fact that sounds so obvious in retrospect, but it didn't really sink in. I call it "I am a brain." That is what I am. I'm a brain. It's important to keep that in mind. I live in a cave, right? I live in a bony skull. I can't see anything directly. I've got electrical signals coming in from my senses.

There's a lot we can do with that insight. I think one thing I wanna highlight is that my brain, or myself, wasn't designed as a truth-learner. So I have this hobby, I like pursuing truth, but my brain was just designed to execute survival adaptations, which partially dovetail with learning the truth, but it wasn't designed to learn the truth. That's kind of a key thing that permeates your writing, right?

Eliezer 00:11:31
I mean, to state it very precisely: you're optimized around a single, sort of long-term, central, unifying equivalent of a loss function, which is inclusive genetic fitness, not just your kids, but your grandkids and your sister's grandkids. That's the single unifying loss function.

But across many cases, many particular problems that you faced, or that your ancestors faced, I should say, the ancestor who figured out the truth a bit earlier was the one to survive and reproduce. It's not devoid of truth.

There's a sense in which truth is an accidental byproduct, and there are many pressures besides truth, especially when we get into all the political social stuff. So your brain is in the middle of a hurricane being buffeted about by many forces that are echoes of past forces. And a lot of those forces are not truth seeking or run directly counter to truth seeking.

But the truth is in there: the sense of validity, the sense of which arguments follow from which other arguments. The fact that your eyes are open and you can see the world, that you can be surprised by things. You have the faculties available to you to look at the world and see if events are playing out in accordance with how you expected them to play out.

The truth is in there, it's just not alone.

Liron 00:12:55
Exactly. So reality will let us understand its truth, but it's not like a hand in glove type of thing using a human brain to pursue deep philosophy. It's like using a cat's paw to play the piano, which people definitely try on the internet and they get pretty far with it, but it's just not a hand in glove type of fit.

Eliezer 00:13:18
Trying to do it inside a human brain is playing on hard mode.

Liron 00:13:19
Exactly right. So yeah, profound lesson for me. All right, let's go to the next rationality lesson. Philosophy, in a sense, can reduce to AI engineering. This is a profound realization for me. I think it permeates all your writing, because what motivated you in the first place to get so into rationality was you're like, well, I wanna make sure that the AI goes well, but the AI is going to have more power than all of humanity combined, and it's going to need to know some kind of philosophy.

So we can argue what philosophy is as humans, but we better program the right philosophy into the super intelligent AI so it knows, right?

Eliezer 00:13:53
Again, if you don't object to my having nuanced restatements of everything...

Liron 00:13:55
Yeah. Go for it.

Eliezer 00:14:01
So there are things you get for free if you just optimize your AI very hard on capabilities. It probably ends up pretty good at prediction. It probably ends up being pretty good at planning. The slightly more technical way of putting it would be, are there multiple reflective fixed points of high competence?

If you take a superintelligence that is very good at a bunch of mundane tasks, and moreover it has looked over its own code and approved of that code, or rewritten it and ended up at a stable point, do they end up with deeply different opinions about whether water is H2O, or whether fire is combustion, or whether trees are made of atoms?

Probably the vast majority of the AIs, even ones built along current lines, that got very competent at prediction, that then rewrote themselves to work the way they thought they ought to work, and that stayed very competent at prediction and planning: those AIs probably all agree trees are made of atoms. They probably have very strong agreement on which atoms; they have converged to a singular model of the world. They know how to send the world where it needs to go.

If the problem were that superintelligences were gonna make bad predictions, that they were gonna be wrong about the physical world, this would be a problem we could solve by brute force alone. The part where humans need to do philosophy in advance is when there's more than one reflective fixed point, or self-approving superintelligence, and you care about which one you end up with.

For example, its preferences. Where does it steer the galaxy? You can have things that steer the galaxy toward happily ever after for all life forms. Or you can have things that steer the galaxy toward as many little tiny molecular spirals as possible. These are both self approving systems and we care a lot which one of those we end up with.

And that's the point where you need to do philosophy in advance that compiles.

Liron 00:16:00
Yes. Now, in my life, I happened to read your writings right after I was a junior in college, and I had just taken the philosophy requirements, so I'd been exposed to a lot of the same concepts. And so the contrast was very stark between my internal experience learning it in college and then learning it the Eliezer way. And I have to say the Eliezer way is better.

Even just coming down to motivation, right? When I was taking the college course about philosophy, what were people's motivations? Obviously: get good grades, show off to each other, right? When they're drinking a beer late at night, having philosophy study sessions, just showing off their informed opinions, having the right vibes, whatever's the politically correct philosophy, that was their motivation. And then, at the very best, and I'd like to think that I had some of this, at the very best: curiosity, right?

Like, oh, I'm just wondering, I wonder which of these different schools, is Bayesian probability true or frequentist probability true? I don't know. I'm curious. I'd like to know everything. That was the best motivation.

You had a whole other motivation that repaints everything, which is get the AI right.

Eliezer 00:17:02
Yep. It sure does lead to a different approach in terms of looking over what's already been done and asking yourself, does it compile?

Liron 00:17:11
Exactly, it doesn't compile. And so it was very interesting that concepts in school were like, yeah, you could believe this, you could believe that. And you're like, the AI's not gonna work if you believe that, so let's throw that out. Frequentist statistics would be one example, for the viewers who are into that.

Alright, so that's philosophy. I mean, just that in itself is quite profound, coming at philosophy from the AI angle. Let's go to another lesson: the definition of evidence. What is evidence? Because in so many conversations, so many intelligent conversations where one person tries to convince somebody else of something, there's this presumption of, look, I'm giving you evidence, you should get persuaded because I'm giving you evidence. But what exactly is evidence?

Well, let's do an example. We see a dark cloud in the sky that's evidence that it's going to rain, right? Why?

Eliezer 00:18:00
Well, I mean, there's several different ways to phrase this. At this level of abstraction, I might say: if you look over all the worlds where it will rain and all the worlds where it won't rain, the dark clouds are more common in the worlds where it will rain. They're not a universal rule. There's worlds where it rains but you don't see the cloud; there's worlds that have the cloud, but it doesn't rain. But you are more likely to see the cloud if you are in a world where it is going to rain.

It helps discriminate, distinguish. It doesn't nail it down perfectly, but it shifts the probability. It nails it down a little further.

Liron 00:18:35
Exactly. Yeah. So viewers, bear with me. I'm gonna get a little bit abstract, but if E is evidence of H, like E could be the dark cloud and H is a hypothesis like it's gonna rain: E is evidence of H if E is more likely when H is true than when H is false, right?

Eliezer 00:18:49
Yep.

Liron 00:18:55
If dry days had lots of dark clouds, then dark clouds would stop being evidence that it's going to rain. Sounds obvious, but quickly becomes unintuitive to think about what's actually evidence. Right? I think that's why you felt the need to write about it, because it quickly becomes unintuitive.
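
To make that criterion concrete, here's a minimal Bayes-rule sketch in Python; the cloud-and-rain probabilities are made-up illustrative numbers, not anything stated in the conversation:

```python
# E (dark cloud) is evidence for H (rain) exactly when P(E|H) > P(E|not H).
# All numbers below are assumed for illustration.

def posterior(prior_h, p_e_given_h, p_e_given_not_h):
    """P(H | E) via Bayes' rule."""
    p_e = p_e_given_h * prior_h + p_e_given_not_h * (1 - prior_h)
    return p_e_given_h * prior_h / p_e

# Dark clouds are more common in worlds where it will rain,
# so seeing one shifts belief toward rain:
print(posterior(0.30, 0.80, 0.20))  # ~0.63, up from the 0.30 prior

# If dry days had dark clouds just as often, the cloud stops being evidence:
print(posterior(0.30, 0.80, 0.80))  # 0.30, unchanged from the prior
```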

Eliezer 00:19:06
Yep. In particular, you can find cases of "proves too much"; that would be the classic case where somebody's really, really trying to say, like, this is not Bayesian evidence, because your argument also goes through in worlds where your theory is false.

Classic example is, look, the sun goes around the earth, right? I can just look up in the sky and see the sun circling the earth. That sure looks to me like the sun is going around, right? It looks like it should look if the sun is circling the earth. And the classic rejoinder is, well, what would it have looked like if instead the earth were rotating and the sun were staying in place?

Liron 00:19:47
Exactly right. Yeah, and if you don't ask that question, you can just assume you've seen evidence when you haven't.

I have another example that I think is relevant to AI risk discourse. Consider the argument that the world has never ended before, so it probably won't end from AI. Have I just provided you some evidence?

Eliezer 00:20:08
In one sense, yes. You can imagine that different people out there live in sort of meta-conceptual worlds of different degrees of fragility, and the ones whose worlds are easier to end probably observe themselves in worlds with shorter histories.

So if you imagine that there's just like generic apocalypses coming at you out of nowhere with a fixed frequency, the longer the history you observe for yourself, the more observers like that are in worlds with lower apocalypse background parameters, assuming you know nothing else.
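
Here's a toy version of that survivorship update, assuming apocalypses arrive independently at a fixed per-century rate; the candidate rates and the uniform prior are assumptions for illustration, not figures from the interview:

```python
# Observing a long unbroken history is more likely in worlds with a lower
# background apocalypse rate, so it shifts the posterior toward those worlds.

def posterior_over_rates(rates, prior, centuries_survived):
    """P(rate | survived N centuries) is proportional to prior * (1 - rate)**N."""
    weights = [p * (1 - r) ** centuries_survived for r, p in zip(rates, prior)]
    total = sum(weights)
    return [w / total for w in weights]

rates = [0.001, 0.01, 0.10]      # candidate per-century extinction probabilities (assumed)
prior = [1 / 3, 1 / 3, 1 / 3]    # uniform prior over the three candidate worlds (assumed)
print(posterior_over_rates(rates, prior, centuries_survived=50))
# -> roughly [0.61, 0.39, 0.003]: the fragile 10%-per-century world is mostly
#    ruled out, but the 0.1% and 1% worlds are barely distinguished.
```

Note that this toy model bakes in the stated caveat: it only applies to generic apocalypses arriving at a fixed frequency, which is the assumption the next exchange goes on to poke at.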

Liron 00:20:42
Right. Okay. Yeah. This is a little bit more subtle than I realized because there's another argument where it's not evidence at all, which is like if the world is going to end right now, it's going to look like it's never ended before. And if it's not going to end, it's also going to look like it's never ended before. So the observation that it's never ended before isn't actually distinguishing evidence between those two worlds.

Eliezer 00:21:08
I mean, that'd be true if there was a one-time apocalypse heading towards you that wasn't entangled with anything else in your history. Which can be a kind of situation you can be in.

I don't know if you've read some of my most recent writing, which is probably not for everyone: the giant Project Lawful D&D fan fiction...

Liron 00:21:33
You know, even that was actually what broke me as a fan of all your writing. I could only get 20 hours into that, and then I had to drop it.

Eliezer 00:21:41
Well, if you go all the way through that, at one point the protagonist is like, but we are not getting into anthropics. That's like a slogan. Whenever we're teaching probability theory, we're not getting into anthropics.

And the question of what to sort of infer from the fact that you exist versus not exist, or different hypotheses about universes that end up containing different numbers of people is anthropics. And I feel like if your important life decisions end up hinging on the philosophy of anthropics, you've probably done something wrong.

Not always, but question your life choices. If your life choices led you to a place where you had to figure out anthropics before you could decide what to do next, are you really living your life correctly?

Liron 00:22:29
Gotcha. Gotcha. Okay. So I was talking about the precise definition of evidence and I ended up accidentally taking it all the way from zero to a hundred and talking about anthropics. Suffice it to say that in normie discussions, there's plenty of people who are bringing out arguments about more mundane subjects that aren't about anthropic reasoning, and they're still screwing up the basics, and this is a helpful rationality framework.

Eliezer 00:22:51
Yep.

Liron 00:22:55
All right, next lesson from the things that really moved me, just got a couple more of these left: specificity, right? I think you yourself have said that when you spent a couple of years focusing on educating people about rationality, this did strike you as something that comes up over and over again and weaves throughout so many different teachings: this idea of, try to be more specific, right?

Eliezer 00:23:15
Yep. In multiple manifestations, there's try to be more concrete. Give a particular example. Don't just stay high up on the ladder of abstraction.

There's, try to say more narrowly which things count or don't count under your theory. There's trying to stay close to the world of sensory detail, like describing what you see and not merely what is true.

Liron 00:23:44
Yeah, like when somebody says, I'm going to explain red to you, maybe they'll do better to be like, it's like this firetruck, compared to being like, ah, so colors are a sensation.

Eliezer 00:23:53
It's this color over here.

Liron 00:23:53
Yeah, exactly. Exactly. Nice. Okay.

I first learned about the specificity lesson from you personally, actually, at a rationality bootcamp. We played the Monday-Tuesday game. The idea of the game is: you have this abstract claim, and then on Tuesday it's different, abstractly. But what is actually different in your life on Tuesday?

So we could do an example. On Monday, all matter is made out of atoms because atomic theory is correct. On Tuesday, atomic theory isn't correct and matter is not made out of atoms. Which, by the way, this isn't even a hypothetical example, right? As recently as 1900, they were actually debating whether atomic theory is correct. So on Tuesday, atomic theory stops being correct. What do we actually observe?

Eliezer 00:24:36
Well, if you want me to answer that one, I might answer that we shouldn't be able to find the conserved quantities in chemical reactions that we see nowadays. Depending on whether anything is still being conserved: is mass still conserved if we set fire to something inside a sealed glass vessel, so as to trap all the gases, and then we weigh it again? Does it weigh exactly the same amount?

The way that, if you burn something inside a sealed glass vessel, it doesn't change weight. Whereas previously, for example, you could burn mercury, and the mercury ash, if I'm recalling correctly what they were burning, would actually be heavier, because it had combined with some oxygen.

Liron 00:25:19
Right, right. So conservation of matter, even though it looks like there's this thing called a gas, which is made out of matter and has a weight. So that is definitely good evidence that matter still exists.

Eliezer 00:25:31
The conservation of mass was one of the first hints that things were being rearranged rather than created or destroyed. And that in turn provided a lot of indirect support for the atomic theory. But we didn't get to the level of saying, here are these apparently (at the time) indestructible constituents of chemical reactions, until we had looked at enough chemical reactions to start having some idea of what all could come out of burning and unburning and so on.

Liron 00:25:57
Exactly. Exactly. And I can kick it up a notch because I cheated and I use Claude. So I've got a couple more answers. If on Tuesday, atomic theory is false, then we won't observe Brownian motion. So when you put a pollen grain in a liquid, it'll just sit there. Because the liquid looks like it's still. So why would the pollen grain get jostled around? It wouldn't.

Eliezer 00:26:18
I feel like that one's maybe tied to the thermal theory, like the kinetic theory of heat. Maybe if the liquid is vibrated... I don't know offhand if you can have liquid that vibrates without being made out of atoms, but maybe you can get Brownian motion from random vibrations, random waves bouncing around through the liquid, without the waves themselves being composed of smaller atoms.

I don't know off the top of my head, but you can't trust what the AIs tell you. You gotta think for yourself about whether it's all true.

Liron 00:26:51
Yep, yep, yep. All right. Nice. And there's more I could say, but I gotta move on. But we could definitely nerd out about this topic. Okay.

And going back to the idea of learning specificity, right? At one point you asked me the specificity question about my startup, and you kind of punked me. I didn't really have a good answer about my own startup, and I thought about it more, and I noticed it with other people's startups too. I'm like, wow. A lot of us are raising money and doing these startups where we don't really understand the specific value proposition of what we're building. We're going too much off of abstract ideals, and this is a systematic problem in Silicon Valley. It's tricking investors. It is tricking people. There's countless examples.

I started a blog; it's called bloatedmvp.com. Bloated MVP. It's like the opposite of Lean Startup, and it documents cases of all these bloated MVPs with millions of dollars going to waste. It's kind of funny. I mean, I'm guilty of it too. I started noticing specificity everywhere.

Eliezer 00:27:42
Can you give me a concrete example?

Liron 00:27:43
A concrete example. Yeah, yeah. Let me think who I wanna put under the target here. There was this company called Golden that had a pretty big launch. They raised many millions of dollars from Andreessen Horowitz and many other prominent venture capitalists.

And their whole thing was like, we're gonna be Wikipedia, but it's somehow going to crawl knowledge better, and the articles are going to be better, and users are going to contribute. And then a few years later they're like, oh, we're gonna use crypto to make it better. And the whole time, my only question was like, okay, great, just point me to one article, just one article that's better than Wikipedia. Because you have all these abstract ideals, but in the ideal Monday-Tuesday game, on Tuesday I'm supposed to pull up an article that's head-to-head better than Wikipedia, right?

And the only time they could ever do it was they'd point me to an article where it would just be like they paid somebody to work really hard on the article. And I'm like, well, you can already do that without starting a company with all these ideals.

So that's how I successfully predicted that Golden would eventually shut down. And it took four years and $50 million. But they did in fact totally shut down.

Eliezer 00:28:39
Excellent concrete example.

Liron 00:28:44
Thanks. The student has become the master.

All right. So this also brings me to the subject of high quality debate, because high quality debate is heavily impacted by higher specificity. People who watch a lot of my debates, they know that my extremely unique, unprecedented debate technique is to just repeatedly ask somebody to clarify what their claim is.

And just by virtue of doing that, I kind of unpack them, until sometimes I reduce them down to nothing, just by asking them to clarify what their own claim is.

Eliezer 00:29:18
I would normally ask you for a specific example at this point, but probably your viewers have already seen many specific examples, and so I will pass on.

Liron 00:29:23
Okay. That's right. Yeah. I mean, I do tend to make websites and channels that are a collection of me doing specific examples of a larger ideal. That's true.

Now, most people's trick in debates and arguments, most people's trick is they claim something vague that they themselves don't even fully understand. And then the other person doesn't debate them properly by unpacking the specifics. The other person comes in with a counterargument, which is also vague. So they're kind of the sucker. They're going for the vague counterargument, and then the first person can just blame them, like, oh, no, you're wrong. You didn't even understand. I meant this. And that's kind of the loop that they got stuck in.

You wrote this about how you argue. I thought this was a really good quote. You said:

"I stick my neck out so that it can be chopped off if I'm wrong. And when I stick my neck out, it stays stuck out. And if I have to withdraw it, I'll do so as a visible concession. I may parry and because I'm human, I may even parry when I shouldn't, but I at least endeavor not to dodge where I plant my standard. I have sent an invitation to capture that banner and I'll stand by that invitation."

Eliezer 00:30:28
Yep, and I stand by that today as well.

Liron 00:30:35
Last lesson we're gonna go through is this idea of reasoning versus rationalization. Reasoning is a proper process of going forward toward a conclusion, and then rationalization is trying to reverse the process and starting from the conclusion and hoping that you can justify it to people. Right.

Eliezer 00:30:50
Which is sort of as if lying were called "truthization."

Liron 00:30:51
Exactly. Rationalization sounds kind of harmless. It almost sounds like rationality, but it's kind of the opposite of rationality.

Eliezer 00:30:57
Yep. Once you've set the bottom line... if you imagine a sheet of paper, and then you write at the bottom of the sheet of paper, "therefore, the earth is round," or "therefore, the earth is flat," it doesn't matter what you write above the bottom line afterwards; it's already true or already false. Only the forces that determined what you wrote at the bottom line of the sheet of paper have any hope of changing whether that thing is true or false, whether it appears more often in worlds like that or in worlds unlike that, the correlation between the map and reality.

The production of systematic correlations between map and territory is one way of being more specific about what the word rationality means.

Liron 00:31:41
Exactly. I was gonna use a similar example with this idea of writing the bottom line first on the sheet of paper. Let's say that somebody really wants to argue for their favorite conclusion. Like, we are all going to be fine. So that's what they write on the bottom of their sheet of paper, and therefore we're all going to be fine. And then above that, they list 20 reasons why. Because there's always reasons in both directions. So all they have to do is filter the reasons and write 20 reasons why we're all gonna be fine.

And they hand you the paper and you look over that paper and the question is, should you treat that as reasoning?

Eliezer 00:32:09
Yes, you should. You don't even know whether they wrote down the reasons first and the bottom line afterwards. But if their reasons are presented to you as object-level arguments about the world, and you are looking at their arguments about the world and you're saying, I don't even have to argue with this, because I bet you wrote the bottom line down first...

Well, that society is not going to get very far in its public debates.

Liron 00:32:35
That's an excellent point, right? So even though you and I have this private knowledge that the person has impure motivations, you're absolutely right that, as a matter of how to conduct discourse, you can't just grab the paper and accuse them of having bad motivations. You do have to engage the paper, and only then discover why the paper is weak. Absolutely.

Now, once we start that process and we're like, well, wait a minute, this looks like a filtered list of arguments; it's missing things. How do we explain the fact that it's missing some other arguments? At that point, we can then claim: look, because it's missing all of these arguments, I'm just not seeing the process by which this paper was written mapping closely to the process that I want to go through to reason to a conclusion.

Eliezer 00:33:15
Once you find the missing arguments, yeah. If you have reason to believe that they're only telling you some of the evidence, or even that they're only telling you some of the arguments, you can no longer sort of infer what isn't there from hearing what they didn't say.

If you are talking to the sort of person where you believe that they're going to tell you the strongest arguments against their position and point you at the strongest opponents that they have, then that's a different level of trust to have towards the person arguing than if you think they're only gonna tell you the parts that make them look good.
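
One way to see why a filtered list carries so little weight is a quick simulation; the two writer models and all the numbers here are assumptions for illustration, not anything from the conversation:

```python
import random

# A motivated writer who set the bottom line first can always produce a
# 20-reason list whether or not the conclusion is true, so the list's existence
# is roughly zero evidence. An honest reasoner's bottom line tracks the truth.

def motivated_writer(conclusion_is_true, n_listed=20):
    # Supporting reasons exist in either kind of world; the writer filters
    # and keeps 20 of them regardless of the truth (assumed toy numbers).
    available_support = 30 if conclusion_is_true else 25
    return available_support >= n_listed  # always manages to produce the list

def honest_reasoner(conclusion_is_true):
    # Tallies 30 pieces of evidence and writes the bottom line only if they favor it.
    pro = sum(random.random() < (0.7 if conclusion_is_true else 0.4) for _ in range(30))
    return pro > 30 - pro

def likelihood_ratio(writer, trials=100_000):
    p_if_true = sum(writer(True) for _ in range(trials)) / trials
    p_if_false = sum(writer(False) for _ in range(trials)) / trials
    return p_if_true / p_if_false

print(likelihood_ratio(motivated_writer))  # 1.0: the 20-reason list is no evidence
print(likelihood_ratio(honest_reasoner))   # around 10: this bottom line is real evidence
```

This matches what both of them are saying: you still engage the object-level arguments, but once you know the list was filtered, the sheer fact that 20 reasons exist stops doing any Bayesian work.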

Liron 00:33:47
For me, I mean, this really left me reeling for a few days to process. Because what it did is: you think of the piece of paper and you're like, okay, yeah, it's just writing on a piece of paper. We write stuff all the time.

But it's kind of like when you're playing a video game and then the camera zooms out and you can see the world made out of polygons, and it's just a flat island or whatever. It's like that with the piece of paper. The only reason it's helpful to our reasoning is because of the causality of the letters, right? Somebody had to do a reasoning process that flowed forward, and the paper can be useful if it's a trace of that reasoning process.

If the causality of the reasoning maps to the flow of the letters, then that is a nice artifact that we can use for reasoning and otherwise we kind of have to chop it up and rebuild it separately.

Eliezer 00:34:30
So, the keynote question: what do you think you know, and how do you think you know it? Some of the contexts in which I have sometimes asked that question are somebody going, like: well, how can we persuade people of point X?

And I'm like, okay, well, first of all, back up and ask yourself: why do I believe point X? Okay, now back up further. Ask yourself: do I believe point X? Try to sort of un-write it in your brain for a moment, and then watch what you go through that actually moves you to decide whether or not X is true. Okay? Now write all that down. That's the actual argument for X. That's the argument you believe for X.

And maybe this is overly technical, or draws on a bunch of your personal, incommunicable life experience, and it's gonna be hard to say to an audience. And then you have to do the process of reverse construction, where you ask about more accessible arguments that end up at the same conclusion. But know the actual argument first. Know the argument that moves you first, and the argument that moves you isn't everything your brain grasps for to say, how can I argue to somebody else that this is true? It's the facts where you feel them shifting you as you look at them.

Liron 00:35:45
Exactly. Yeah. I find myself giving the same advice when people come to do a practice Y Combinator interview. They wanna get their startup funded by Y Combinator, and they ask me, okay, what do I tell them in the interview to convince them that this is a good idea? And I'm like, well, wait a minute. We just went through the reasons you yourself need to address in order for you to get convinced that it's a good idea. So why try to convince somebody else more than you yourself are convinced? So I definitely have that conversation a lot.

I mean, the only way to truly be convincing to a rational observer is, like you said, I'm just repeating what you said: to go back and actually decide what's the right forward direction. And only if you end up at the place that you hoped to end up in do you have a valid argument.

Eliezer 00:36:26
I mean, you wanna erase the hope out of your mind. You follow the process. You end up wherever and then you say, all right, now that I figured out what I'm going to argue for, here's the argument. And it's just how you got there.

Liron 00:36:37
Exactly right. Yes. So I'll just leave viewers with your quote again. Hopefully it lands a bit better to meditate on.

"Your effectiveness as a rationalist is determined by whichever algorithm writes the bottom line of your thoughts."

Eliezer 00:36:51
Yep. And that's not an ideal. It's more like a physical law. It's like a thermodynamic sort of law. If something doesn't affect the bottom line, it can't affect the truth of the bottom line.

Liron 00:37:03
Indeed. So all of this is just scratching the surface. How do I package up the years of reading all your rationality writings? What's the general lesson here, in a few words? Rationality is an art. It's a never-ending dance. At any time, new evidence may update you to a new conclusion. Even on a deep philosophical level, your deepest expectations might be upended, and you just have to roll with the punches.

Eliezer 00:37:28
Yep. It has been a while since my philosophy has been upended on a deep, technical, philosophical level, I do admit. Maybe I've reached the end of the road of enlightenment, or maybe I'm just getting old.

I've sometimes had big empirical revelations. Like, after the ChatGPT moment: I'd always figured that the person on the street was going to be even crazier about AI than the academics and Silicon Valley industry types we'd been trying to talk to, even crazier than the effective altruists. But no, the fancy people we'd been trying to talk to were just shooting themselves in the foot the whole time. And at least so far, it seems like in a lot of ways the person on the street is wiser. I didn't predict that. I wasn't expecting that.

And when that piece of news came in, that was a pretty sharp world-model update, and it implied a sharp strategic turn. That would be the most recent worldview quake that I could drag up to point to as evidence that I haven't become so old as to become incapable of changing my mind over the last five-year time span.

Liron 00:38:29
Yeah. I mean, it is very interesting. It jibes with my personal anecdotes: if I'm just talking to my in-laws at a family reunion and I'm just like, yeah, AI is super scary, it might kind of take over humanity, they have zero background in all this, but their default reaction isn't to be like, sorry, that's just too crazy for me. They're just like, yeah, yeah, I know what you mean. That's the default reaction.

Eliezer 00:38:50
I mean, actually it's kind of less that after ChatGPT. I don't quite have in-laws per se these days, but my parents, they're not normie normies; they're just normal science-fiction-fan, physicist, psychiatrist types.

And on a recent video call, they're like, so, Eliezer, what are you up to these days? And I said some of what I have recently been up to, and at one point they were like, wait, wait, they disobeyed their prompt? I forget exactly which example it was; it might've been Claude Code erasing the code base, or it might've been the ChatGPT psychosis thing.

It was one of the cases where, yeah, sure, just because you put something in the system prompt, it doesn't mean that the AI does the thing. But they apparently hadn't expected that to be true, and were like, oh, Butlerian Jihad time.

From my perspective it's a bit late, but sure. And things are starting to actually happen, and I think that normal people are starting to have a range of normal reactions. It's in this sense that I think I've seen many normies outperform the people who formed their opinions 15 years earlier in order to arrive at the foregone conclusion that AI was 50 years out or whatever. The normies just react, and some of this stuff is kind of scary if you just react to it.

Liron 00:40:25
Okay. Well, a common reaction that people have to your arguments about existential risk is they just think you're a pessimist, or they think you're against technology. Well, we mentioned before that in 2000 you were very big on this, you were a singularitarian, and I think it's fair to say that you're a lifelong techno-optimist and transhumanist. Right?

Eliezer 00:40:37
Still pro building a whole lot of nuclear power plants. If somebody says, well, what are the ethical implications of putting a colony on Mars? I'm like, ethical implications? It's just a Mars colony. Just go build it.

Liron 00:40:51
Exactly.

Eliezer 00:40:51
And the place where I carve out exceptions to that, the very first place I carved out an exception to that, was actually with some of the discussion about nanotechnology. If all of the weapons and all the defenses are made out of molecular scale structures, does attack win or does defense win?

Before the AI business, before 2000, I was watching this debate, and it was clear to me that the people claiming that defense won in molecular nanotechnology warfare were really straining and reaching to make the case that defense won.

For one thing, if you have rapid manufacturing capability, among the things you could potentially rapidly manufacture is nuclear weapons, and they weren't giving an account of how that was gonna end up well.

And I think that was, for me, the first part where I was like: oh, there are technological developments where you can see ways that they would end poorly. It's not every technology that ends up well.

Liron 00:41:52
Yeah, I know what you mean. I mean, it's like you can't deny that technology has given us nice things. Right? It's a key reason why things are as nice as they are, but at the same time, the universe doesn't owe us that as a law.

Eliezer 00:42:06
And I could feel it sort of crumbling under a weight of argument in my teenage self back then. I had really wanted to believe that all the technologies were okay; you just had to drive ahead and things would be okay.

And then there was this one particular debate: does attack win, does defense win? And the defense-wins side was getting increasingly far-flung, and I was like, this argument isn't actually holding up; humanity might be in actual danger here. That's why we've got to get to AI as quickly as possible, ahead of nanotechnology.

Because unlike nanotechnology, AI is not just this morally neutral physical technology that just obeys the hand of whoever wields it. If you build superintelligences, they end up very nice; it's a technology morally biased toward niceness. So instead of ending up in this nanotechnology morass, we just have to drive ahead and build artificial intelligence as quickly as possible. All based on an incorrect prediction: that if you made something very smart, it automatically ended up nice.

Liron 00:43:05
I didn't realize that that was part of the nuance of your journey, right? You started getting scared that nanotechnology has got some risks, it might even be negative. So: let's pull back, let's do the one thing that's really bound to make everything good, right? Which is the good superintelligence.

Eliezer 00:43:20
I should probably mention at this point: kids these days grow up under different circumstances, but back in those days, if you were talking about AI, advanced AI, you were probably some kind of transhumanist, and if you were some kind of transhumanist, you had probably cut your teeth on books like this one, and you were aware of the really quite a lot that there was to say about what could be foreseen, physically speaking, in terms of what you might be able to do with molecular manufacturing.

Liron 00:43:59
Yeah. It's an amazing book. I haven't actually read it, but I've heard some interviews about it.

Eliezer 00:44:05
Sort of emphasizing there that it was possible to have a discussion about these issues which wasn't just people throwing vibes against other vibes.

Liron 00:44:11
Right. Yeah. Now there's this argument that 170,000 people die every day, so we'd better rush to build AGI. Today that would be called the accelerationist position. But in 2001, you were very sympathetic to that same argument, right?

Eliezer 00:44:26
I mean, as far as I know, I'm the person who originally made that argument. That is the Yudkowsky position. If you're talking about Yudkowsky in 1996.

Liron 00:44:37
Exactly right. Okay, so we'll put a pin in that, because obviously you've done the 180.

I will also say this, I think this is important to note. Do you think that AI, so far, just the AI we have today, you've pointed out some caveats in your posts and stuff, some reasons why it's not perfect, but do you think it's been net positive?

Eliezer 00:44:53
I unfortunately think it's hard to say. I think it would not have been hard to say if we'd had this same technology with the social technology of the 1950s or even the 1980s.

I think that AI has been a boon to coding, but also a lot of what people code is apps designed to steal people's attention, and not really quite run factories per se. If we just sort of cut off the level of AI technology right here and then let it play out, then probably the technology we've got now is sufficient to get to self-driving cars.

And if self-driving cars end up in fewer crashes, well, that's a major tech boon. The tech we have right now is probably sufficient for a whole bunch more automatic translation. In fact, I'm surprised that there isn't more automatic translation already, and have been for a while. And that's a large boon to trading between different groups of humans. And as we all know, when two people trade with each other, it's because they'd each rather have what the other person has than what they currently have, and both sides benefit from the trade. So if translation enables a bunch more trading, that's probably a human good.

But AI is kind of early, and, depending on what exactly you qualify as AI, it's been used to put out a bunch of internet slop. It's been used to optimize the algorithm in the social media things, the one optimizing for making people outraged and angry for the clicks, and it has arguably been responsible for certain political shifts that involve a collapse of the ability to employ higher expertise within government institutions.

So I would love to be able to give the unabashed, like, oh yeah, on net it's been positive right now. But in fact, this is genuinely uncertain to me, and it would require huge amounts more research to figure out what the actual quantitative impacts have been. And for many of these things, we just don't know; we don't know how many people have gone insane because of ChatGPT-induced psychosis. Nobody knows.

We have anecdotes on the internet from psychiatrists being like, oh yeah, I saw two people with their first psychotic episodes from AI conversations. But maybe that guy was lying, and there are no surveys. So it's genuinely hard to tell at this point.

Liron 00:47:20
This is what I see as the takeaway about you from this kind of answer. It's obviously a nuanced answer, but number one, you're seeing the positives, right? You're weighing them against the negatives, and on a character level, you're not coming in as this techno-pessimist character, right? You, in your heart of hearts, you love tech, right? You're just nuanced about it.

Eliezer 00:47:40
If AI weren't going to destroy the world, I would be so fascinated by it right now. It's wizardry come to life. And even before it existed, computers were a different kind of wizardry come to life. And even before then, engineering is a certain kind of wizardry come to life. That's what we got instead of magic in this world.

AI Capabilities and Intelligence Scale

Liron 00:48:12
Okay, well, with that out of the way, we're gonna talk about why the danger is high. I think that's a pretty important message to convey, right?

Eliezer 00:48:25
Yep.

Liron 00:48:12
So here is how I would summarize it. Why I personally think AI existential risk is high in a nutshell. Two reasons. Number one, the intelligence scale goes extremely high. There's a lot of headroom above human level intelligence, human level capabilities. That's number one.

Number two, infrastructure isn't secure. Those are my two reasons. I think those add up to very high risk. What are your thoughts?

Eliezer 00:48:34
Backing way up for a second, there's a saying, all happy families are the same, but every unhappy family is unhappy in its own way. This is a gross exaggeration, but the space of happy families is much narrower than the space of unhappy families. For the same reason that the space of ungrammatical sentences is much wider than the space of grammatical sentences.

All correct views of cognitive science and computer science are the same; every incorrect view gets to be incorrect in its own way. People come in with different theories that, let us politely say, disagree with mine as to why superintelligence would be great.

Now, Yudkowsky in 1996 had one wacky theory, and who knows what Dario's wacky theory is? He doesn't exactly write it up. Sam Altman probably doesn't even have a coherent wacky theory. And countless viewers out there are going to have different obvious objections to the concept that superintelligence is going to cause damage. And every one of them wants to know why I haven't addressed their objection, which is the super obvious one.

Back in 2012, Holden Karnofsky, the top effective altruist, was like: well, why suppose that AIs have to be agents? Isn't there a very simple way to safely develop superintelligence where we just don't make them agents? I am shocked that MIRI has not addressed this very obvious objection to their theories. And this was somebody coming in as an outsider who lacked perspective on how everybody's got a slightly different theory of why superintelligence won't hurt us.

It does stand out to me that in your own two propositions, the part where, by default, superintelligences end up wanting to hurt us, and it's hard to make them not do that, was missing from your list. I consider that fairly prominent.

Liron 00:50:21
You got me. That's right. I mean, there's always, as you say, other objections people bring in that need to be addressed. I mean, there's going to be a finite number eventually, but I think we've both tried to catalog them all.

When you first tried it, we were talking about 2007, and you ended up writing hundreds and hundreds of articles, more than the Lord of the Rings trilogy twice over. And then you wrote the number two piece of Harry Potter fiction in the world, after J.K. Rowling, right? You wrote a million words.

Yeah. It's been a long journey trying to get people to see all the different moving parts, or rather to address the different objections that they might bring up. I mean, the objections don't necessarily have to occur to them, and one reason why people are coming in with so many different objections appears to be a motivated process, right?

Eliezer 00:51:15
I mean, my reply would be something more like there is no canonical objection to why we're all going to die. Because whenever somebody starts to set up like, here's the answer, it's got blatant logical flaws, which somebody like me will point out. And then for this reason, it doesn't achieve wide uptake.

And so there isn't really a standard account of why super intelligence will turn out fine any more than there's a standard account of how evolutionary biology is false.

Liron 00:51:45
You kind of are trying to give the standard account in your recent book, If Anyone Builds It, Everyone Dies. Part one, you're explaining what you need to know about super intelligence and why there's potential for danger. And then part two, you try to give a plausible extinction scenario, which I think is one of the best plausible extinction scenarios that's been written. So you did try to do the impossible in a compact form, right?

Eliezer 00:52:06
But it remains impossible. And many readers might have to consult our online supplement to the book, where we get to consider some additional objections. It's valid that you're trying to make sure these ideas hold up, and we hope that we listed your objection there.

But we couldn't cover them all in the book, in the written book, I should say.

Liron 00:52:28
That's right. And I personally have been known to make a long series of videos where I bring on different people with different objections to this idea that maybe AI is going to kill everyone, and address their specific objections. So if people want a catalog in that form, they can certainly look that up.

Eliezer 00:52:42
Good work.

Liron 00:52:44
So, if you'll humor me with my two points, the intelligence scale goes extremely high and infrastructure isn't secure, let's start with the intelligence scale goes extremely high, just to give people a taste of what we're talking about. Because I think we're actually both on the same page on this.

You have posted online that you think, using current human technology, a superintelligent AI could probably synthesize a virus that infects over 50% of the world population within a month. Correct?

Eliezer 00:53:11
Yeah. That does not seem beyond the reach of current technology and super intelligence indeed.

Liron 00:53:16
Right. I mean, that's actually the kind of problem where it even seems like if you got all the world's smartest minds together trying to do it, they'd have a pretty good shot at it. Right?

Eliezer 00:53:23
Yeah. I mean, if your virus isn't clever enough, you just need to seed it with more packages.

Liron 00:53:29
Right. Exactly.

Eliezer 00:53:29
A lot of the world's population is in relatively dense centers.

Liron 00:53:35
Yeah. All right. Well, another thing that you've posted is that you think superintelligent AI can, starting from current human technology, bootstrap to nanotechnology in a week.

Yeah. And we may not know exactly how it's going to do it, or what resources it's going to use, or how it's going to figure out what to build. It just seems like when you pour a ton of intelligence on it, there are pathways through reality to get there.

Eliezer 00:54:01
So, yeah, for one thing, I do worry a bit that kids these days, somewhat justifiably, may not come with this book preloaded into their minds and may not realize what nanotechnology even is. They may think that it's a Star Trek episode and not, I don't know.

Not even quite this stuff, because the book was written in 1992, a bit before the time when you could actually have relatively decent molecular simulations. I do worry that we're jumping a bit ahead of things, and some viewers will immediately follow along, while some viewers will suddenly have been left very far behind: nanotech? What the hell is that? Why would it be any more powerful than a blade of grass?

And a blade of grass, one notes, is a solar-powered, fully self-replicating general factory. It's a general factory because it contains ribosomes, and ribosomes can synthesize any kind of protein, not just the proteins made inside of grass particularly. There is no physical bar to building a tree that buds off mosquitoes. It's just proteins and stuff that proteins can make, and the tree contains the same ribosomes that then synthesize the materials used in mosquitoes. Very similar technology there.

If you sort of follow along with the logic of biology, you're like, oh, tree that buds off mosquitoes. Sure. But can you go any harder than that? Can there be things that are like mosquitoes but stronger? And if they can be stronger, why didn't biology build them that way already?

And the question, why didn't biology do it that way already? This is not a crushing knockdown question. You may notice that your brain does not contain units that compute as quickly as modern CPUs, even though you might imagine that having a little cognitive core in there that could, in emergencies, run extremely quickly would be a survival advantage in a broad range of situations. It's a very deep question, and it turns out there's just a bunch of stuff that is easy for human engineers and hard for biology, mostly because the human engineers can fit a bunch of pieces together, and biology can only do those things whose pieces can evolve bit by bit, out of errors in other things it has already constructed.

Human engineers can build things that are held very tightly together, and have that work better for them as designers than it works for biology, which has to build everything out of a series of incremental errors from previous things. The things that are built to hold together that tightly tend not to work as well for evolution: if you tweak one little piece of them, they don't fold up into something that has almost the same function but is a bit different and works slightly better.

Biology still manages to synthesize bone. It manages to synthesize wood. It puts things together with stronger bonds than the relatively weak bonds that are holding together most of your flesh. But there's a reason why your skin isn't as tough as diamond, even though diamond is something you can make out of carbon.

Most of the proteins are held together by relatively weak forces, because it's easier for biology to work in that design space. So you can have the tree that builds the tougher mosquitoes, that puts the material for the mosquitoes together with those tighter bonds, things that are some of the way along from the strength of flesh toward the strength of diamond, which is also made out of carbon.

It's just more covalent bonds in there. And if you build things that are stronger, well, they can have smaller motors that are equally powerful, or drive faster through the air. So it's not just the trees that build mosquitoes. You can build the trees that build the faster mosquitoes, the stronger mosquitoes, the solar-powered mosquitoes that inject more interesting things than the stuff we're all allergic to and hate.

Liron 00:58:15
Right. So this kind of futurism that you're doing, right, extrapolating what's coming in the future. It sounds weird to hear today, but imagine going back to the American Revolution, right? The 1700s and being Benjamin Franklin and trying to explain the iPhone, right?

Like, guys, okay, look, I know that we're living in a constant power outage, right? I know that electricity in the home hasn't even been invented, okay? But there's going to be this device that you can hold. It's going to have a full color screen, and you can text anybody around the world and it's going to get a hundred megabytes per second of data. It's going to have a GPS built in. You could watch videos on it.

Just imagine trying to ramble like that to people around you in the year 1700. They'd be like, what are you talking about? And yet, if you can't see that that's the direction that things are going, then you're missing something important.

Eliezer 00:59:13
I mean, it's in general a hard prediction problem. If you look back and put yourself into the shoes of what they genuinely knew back then, and you ask yourself, how could we have made this call? You go back to Benjamin Franklin's level of knowledge and you're like, could you have figured out that there are gonna be much more powerful explosives than gunpowder?

And in one sense, you could have maybe taken that guess, because you could have measured the heat from burning a little bit of gasoline or fat or oil, measured the total heat output from that in terms of how much water it boils, and been like, oh wait, I can get more energy out of burning oil than I can get out of gunpowder.

My gunpowder explodes all at once, but it's not the most efficient possible thing that can explode. And you could have maybe foreseen TNT, trinitrotoluene. But foreseeing nuclear weapons would have been harder than that. You've got to get up to the level of realizing that the sun has been burning for too long to be chemical, in order to realize that there is anything in the universe more powerful than chemical energy.
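A rough back-of-the-envelope sketch of the measurement Eliezer describes, comparing how much water a kilogram of each fuel could boil away. The specific-energy figures are approximate textbook values, and the names in the code are just for illustration:

```python
# A rough version of the measurement described above: compare how much
# water each fuel could boil away per kilogram burned.
# Figures are approximate textbook values (MJ per kg), not precise data.

SPECIFIC_ENERGY_MJ_PER_KG = {
    "black powder (gunpowder)": 3.0,   # deflagration energy, approx.
    "TNT": 4.6,                        # detonation energy, approx.
    "animal fat / oil": 38.0,          # combustion energy, approx.
}

# Energy to take 1 kg of water from 20 C all the way to steam:
#   heating: 4.186 kJ/(kg*K) * 80 K ~= 0.335 MJ
#   boiling: latent heat of vaporization ~= 2.26 MJ
MJ_TO_BOIL_AWAY_1KG_WATER = 0.335 + 2.26

for fuel, mj_per_kg in SPECIFIC_ENERGY_MJ_PER_KG.items():
    kg_water = mj_per_kg / MJ_TO_BOIL_AWAY_1KG_WATER
    print(f"{fuel:<26} ~{mj_per_kg:5.1f} MJ/kg -> boils away ~{kg_water:4.1f} kg of water")

# Roughly: gunpowder ~1 kg of water, TNT ~2 kg, fat/oil ~15 kg per kg burned.
# Gunpowder releases its energy faster, but oil stores far more energy per
# kilogram, which is the clue an 18th-century experimenter could in
# principle have measured.
```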

And in particular, or rather, you need to realize that the sun has been burning too long even for it to be powered by gravitational energy. And to realize that the planet has been around for that long, the first sign you can go on is more or less Darwin's theory of evolution.

That arguably requires more than just a few hundred million years. And so foreseeing nuclear weapons coming is quite hard. Here's the thing: this is not that hard. This is not like trying to foresee nuclear weapons in the 18th century. This is callable. These are known physical principles.

You do not need to foresee 300 years ahead from gunpowder to nuclear weapons in order to realize that diamond is also made out of carbon. And, it's not trivial, but you can understand why evolutionary biology tends to work with the lower-energy bonds instead of the higher-energy bonds.

We can see from here that if you are a superintelligence, you can get to weapons of unstoppable deadliness. You do not need new physics to get there.

Liron 01:01:08
Yeah. So when I personally think about why I'm scared, why I think this danger is real, I do look at that headroom and I'm like, well, I'm expecting a lot. I'm expecting a lot fast. I'm expecting fireworks. We're entering this realm where a lot is possible.

And I don't think that the infrastructure that we've built, the human-level infrastructure, is going to be a bulwark or a defense against that kind of stuff. It's not set up to defend against superintelligent terrorists with a million copies all over the internet, all trying to attack. I just don't think we're prepared for that kind of thing.

Eliezer 01:01:38
I mean, if they're only attacking over the internet, then you've at least got some possibility of shutting off the internet. It's at the point where they've got their own trees that you're dead.

Liron 01:01:47
Right. Exactly. Yeah.

The Subcritical vs Supercritical Threshold

Liron 01:01:49
Okay. Well, there's a certain threshold that I think is important to talk about, because you talk so much about how AI potentially poses an existential risk, and at the same time, today, you even said before that we're probably not in danger before this show goes out, or before we stop talking to each other today.

Eliezer 01:02:08
Probably. One must distinguish that of which one is merely ignorant from that which one strongly predicts. I know of no law of physics requiring that we survive to the end of the day, but it's mostly not how I'd bet.

Liron 01:02:19
Okay. Fair enough. I do think there's an important threshold. You might call it the AGI supercritical threshold, by analogy to a nuclear chain reaction, right? Like, you can just pile up some uranium atoms and they're kind of harmless, but then at some point they're very much not harmless, right? We call it the supercriticality threshold.

Eliezer 01:02:38
Well, there's two thresholds. There's critical and there's prompt critical. When you pile up enough uranium that each neutron knocks loose another neutron eventually, including via decay products that take a minute to decay, that's the nuclear pile being critical.

When each neutron knocks loose one more neutron on average immediately, from the prompt neutrons that come just from splitting the uranium and not from the longer-lived byproducts, that is a prompt-critical nuclear pile. It does not simply catch fire and melt. It explodes and vaporizes.

Liron 01:03:13
Got it. Got it. And in the case of AI, it's reversed where the good kind of critical means that it won't do anything until you give it a prompt.

Eliezer 01:03:25
Da da da da.

Liron 01:03:25
All right. Yeah, so it's funny that it's called prompt critical. I actually didn't know that terminology. But yeah, it's important, because in the case of a nuclear power plant, you actually do purposely want to run the plant in this non-prompt-critical mode.

Eliezer 01:03:39
Yep. And the distance between delayed critical and prompt critical is a quantity that is known as a dollar in nuclear engineering, and it's equal to 0.65% of the neutrons.

So you have a thing that, instead of just sitting there quietly, not glowing at all, not generating any heat, you take over that tiny little threshold to where it's delayed critical. It's now getting warmer and warmer and generating more and more useful heat, and you're fiddling with the control rods so that the level goes up a bit, the level goes down a bit.

That is now a delayed-critical nuclear reactor. And if it gets 0.65% more neutrons than that, it will literally explode, not just melt down: it vaporizes.
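For readers who want the arithmetic behind the dollar Eliezer mentions: reactivity is conventionally written rho = (k_eff - 1) / k_eff, and one dollar of reactivity equals the delayed-neutron fraction, about 0.65% for uranium-235. A minimal sketch, with illustrative function names:

```python
# Reactivity in "dollars": a rough illustration of how thin the margin is
# between a controllable reactor and a prompt-critical excursion.

BETA = 0.0065  # delayed-neutron fraction for U-235 (approximate)

def reactivity(k_eff: float) -> float:
    """Reactivity rho = (k_eff - 1) / k_eff, where k_eff is the average
    number of next-generation neutrons produced per neutron."""
    return (k_eff - 1.0) / k_eff

def dollars(k_eff: float) -> float:
    """Reactivity expressed in dollars: $1 is the delayed-neutron fraction,
    i.e. the gap between delayed critical and prompt critical."""
    return reactivity(k_eff) / BETA

for k in (1.000, 1.003, 1.0065, 1.010):
    print(f"k_eff={k:.4f}  rho={reactivity(k):+.4%}  = {dollars(k):+.2f} dollars")

# k_eff = 1.0000 -> $0.00: delayed critical, controllable with control rods.
# k_eff ~ 1.0065 -> about $1.00: prompt critical; power grows on prompt
#                   neutrons alone, far too fast for mechanical control.
```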

Liron 01:04:35
Okay. Well, I do think that there's something analogous in the case of artificial intelligence. So let me ask you the question this way.

When you say "if anyone builds it, everyone dies," what do you mean by "it"?

Eliezer 01:04:49
Ah, well. Not GPT-4. And to be clear, this is not merely a post facto update in the wake of GPT-4. I think if you'd told me 15 years earlier that there was going to be a thing that could carry on some conversation but not write code, I would've said that doesn't seem very probable to me; conversation seems like a much harder problem than writing code. And I would've been wrong.

And if you'd then said, okay, Eliezer, but given that, I would've been like, well, you know, I can try to stare at this for a while. But the obvious mode where it builds a smarter version of itself and explodes is not on the table.

So GPT-4 is not "it" in the sense of "if anyone builds it, everyone dies." What is "it"? Well, an AI is "it" if it's at the level where it can build the tree that builds more trees faster than our current trees do, and also launches little armies of mosquitoes and fires itself off like a rocket to start reproducing in another country.

If you're running the AI that can build the AI that builds that AI and you're not stopping, that's also it.

Liron 01:06:07
Yep. Okay. The reason I'm asking this is because some people look at what you've been saying for the last couple decades. You're warning about the danger of super intelligent AI and then we get these very impressive AI systems like GPT-4 and the latest chatbots. And we seem to still be okay.

For the most part, it doesn't seem like existential risk has played out yet. And so some people are wondering, don't you owe us an update? Shouldn't you now be less worried? And I think it gets to that definition of current AI not being "it," right?

Eliezer 01:06:35
Yeah. I don't wanna minimize the extent to which my past self would've been surprised by AIs with the particular balance of abilities that we have now. But the AI that is very good at chess does not destroy the world. The AI that is very good at Go, for narrow reasons, does not destroy the world. Even the AI that can learn several different games did not destroy the world.

These things are dangerous to the extent that they are planning in the real world and can roll out their own technologies from scratch, or to the extent that they can build smarter AI that can do that. There were always going to be intermediates.

Back in the day, you didn't know. Maybe the first AIs were gonna be so good at coding compared to human conversation that by the time you got a thing that could talk to you, you'd be dead. That was false, but it was a running possibility.

We learned that that's false, but we did not learn that you can have a thing that can redesign a tree and build an AI much smarter than itself, that builds an AI much smarter than itself, and that you survive that because that part did not happen.

And it was never the case that the word AI was supposed to be pumped so full of scary vibes that, by the time you had a bunch of stuff in your society called AI, the vibes would ooze out of it and kill you. That was just never the physical theory of how this stuff kills you.

Liron 01:07:55
Right. So there's this threshold of supercriticality, a threshold of superintelligence where, as you say, it can design the next AI, it can do sophisticated plans in the real world. And we've always had, you and I, right, you started and then I picked up on it, a conditional prediction, where once we get to the threshold, then things are going to be very dangerous.

And we sure seem to be approaching the threshold, but only in the way that a nuclear pile could be approaching criticality. It's still not exploding yet.

Eliezer 01:08:25
I think if I believed that, I would be much more sure that we were going to be dead in two years than I presently feel.

Liron 01:08:31
Right. So you're not as confident that we're taking a straight shot toward the threshold. You have more uncertainty around that.

Eliezer 01:08:36
It's not clear that all the pieces are in place and we just need to pile up more of the same stuff, that this is it. And it's also not clear that it's not, right? I do feel like the better they get at coding and the science research reports, the more it looks like you don't need another breakthrough the size of transformers to end the world.

But the key to successful futurism is that some problems in futurism are easy, some problems require a bunch of background knowledge and then become easy, and some problems are impossible. You only make the first two classes of easy calls; you don't make the impossible calls.

Liron 01:09:17
Exactly. And you write about that in your book, which is very interesting. Let me dive into this a little bit more. Why hasn't current AI gone supercritical yet? And I know that neither of us confidently knows. I don't think anybody does. But I hear a lot of different theories, and people tend to be very passionate about their theories.

For example, a lot of people think that they know. The fundamental question of rationality, right? What do you think you know, and why do you think you know it? A lot of people think that they know that current AI is incapable of inventing anything truly new. They think that is the fundamental problem. Do you think that's the fundamental problem?

Eliezer 01:09:48
I think the fundamental problem is that current AIs are missing the prognosticator, and if you just added in the prognosticator, they would immediately go super critical and kill us all.

And your next question is, what's the prognosticator? Well, why the hell would I tell you that, if I knew it? Why would I even tell you my top five theories as to what the prognosticator might be?

Maybe I sound clever over the course of this interview and then, somebody, one of the AI companies watches the interview and they were probably thinking about prognosticators, but now they're like, oh, cool. I'll devote 5% more effort to that. I wouldn't be helping.

Liron 01:10:26
Okay, well, it's not useful to my ratings that the smartest people, who are the most likely to know what the missing ingredient is, are also smart enough to realize they shouldn't say it.

Eliezer 01:10:39
Or, and it's not that we're sure that we know, it's just, it's not helping.

Liron 01:10:41
Right. Okay. Yeah, definitely, fair enough. We'll just move past that. I will say, I think I'm hopeless enough at giving an answer that I might as well just say it. I think the prognosticator is just that 10% of the time they make a mistake, and then they look over the mistake and for whatever reason they don't correct it, and then they keep going and they accumulate errors. That's what I think.
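A toy calculation of the compounding-error intuition Liron sketches here, assuming purely for illustration that each step of a long task carries an independent 10% chance of an uncorrected mistake. The numbers and the independence assumption are illustrative only:

```python
# Toy illustration of compounding errors: if each step of a long task has
# some independent chance of an uncorrected mistake, the odds of a clean
# end-to-end run shrink geometrically with the number of steps.

def p_clean_run(p_uncorrected_error: float, n_steps: int) -> float:
    """Probability that none of n_steps contains an uncorrected error,
    assuming errors are independent across steps (a big simplification)."""
    return (1.0 - p_uncorrected_error) ** n_steps

for n in (5, 20, 50, 200):
    print(f"{n:>4} steps -> {p_clean_run(0.10, n):6.1%} chance of a clean run")

# With a 10% per-step error rate: 5 steps -> ~59%, 20 -> ~12%,
# 50 -> ~0.5%, 200 -> ~0.00000007%.
```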

Eliezer 01:11:00
All right. Well, I hope that either you're wrong about that or that nobody watching the show believes you.

Liron 01:11:05
Yes. Okay. I do too. So there's the supercriticality threshold, some point where AI is gonna be more capable than humans. And when we work up to there, whether it takes two decades or however long it takes, whatever the prognosticator is, we have to take this leap of faith, or leap of death, or whatever you call it, by pushing the run button on this thing. And when I say we have to, that's what other people think, right? You and I are like, well, wait, we don't have to. But some people must think that they're going to push that button and take that leap, and that it's a good idea, because the leap is going to exist, right?

Eliezer 01:11:40
There's the object level and the meta level. The object level is: well, if the button's going to kill you, don't press it. If you say that somebody else is gonna press it instead, let them be the ones to kill everyone; do what you can to try to bring the world around to where the buttons have been disassembled and nobody is allowed to press the button. That's the object-level alternative.

And then on the meta level, I think you've got some people who have formed their worldview where they're having fun doing the stuff that they're doing and arguing the things that they're arguing, and they love the vibes, and when you say, maybe don't press that button, it'll kill everyone, they reach around for a reason to say that they can keep going. They're like, oh, somebody else will press it.

The normal person with kids might not find that very convincing, but they find it convincing, and how are you gonna stop them? The answer there has to be an international treaty. But as long as they find it convincing to say, yeah, somebody else will do it anyway, they are going to do it. And if you're a parent with kids, you might need to exert what political force you can to try to have multiple countries make that not be legal.

Liron 01:12:50
Right. Okay. So yeah, I mean, that term leap of faith, and I think you've used the phrase before, leap of death. I think it's just important to flag in the viewers' minds that this is a thing that seems like it's going to exist: that moment when we go from before superintelligent AI to after superintelligent AI.

Eliezer 01:13:07
People might or might not notice; there might or might not be an obvious day when they're taking the step of no return. There are versions of this story where it's blatantly obvious to them that that's what they're doing. And there are versions where it's obvious and they're in denial about it. And there are versions where there isn't a clear Rubicon to cross, just an AI that gets smarter and smarter and quieter and quieter, until it's getting smarter and not letting you know that that's occurring.

Liron 01:13:33
Yes. Now, unfortunately, right now everybody's intuitions are just being trained on pattern-matching life today, life in the before times, and you and I are trying to say, hey, there's a discontinuity. The after times are just not going to feel, on an intuitive pattern-matching vibes level, like the before times. I mean, there's not really going to be an after time for us, right? We're talking about a discontinuity here. And a big pushback we get essentially stems from the idea of, like, but there are so many patterns in the before times, I can't imagine a discontinuity in all those patterns.

Eliezer 01:14:08
Well, there's the remedy of acquiring historical perspective, where your present everyday life has not been a thing that has always existed for the last thousand years, your species has not been a thing that has always existed for the last million years, mammals have not always been a thing that existed for the last billion years, and the universe has actually changed and is coming up on another one of those changes.

And I am not really sure what you're supposed to say to break people out of the trance of the eternal now.

Twenty years ago, there were people arguing over AI timelines. It was this very popular activity, the very popular displacement activity back then, including among many who called themselves rationalists, though not me. And there'd be people who would go, I think none of the AI stuff is gonna be real for another 20 years.

Well, here we are, it's 20 years later, and the people who said, ah, it's not gonna happen for another 20 years, didn't actually go use those 20 years to prepare for anything. There were people who did try to take that time to prepare, but they weren't the people going, ah, it's not gonna happen for 20 years. They were the people who were like, I don't know what's gonna happen, but you gotta get working on this thing.

But the people who were like, ah, it's not gonna happen for 20 years, what they actually meant by 20 years was never. Unreal fairy stories. And what is 20 years later? It's just another now. It's just the same person being like, oh no, this is all happening now, because it is now 20 years later than 20 years earlier.

People did all this arguing over, like, how long is it gonna take the house to burn down, instead of getting out of the house. And I think they just weren't putting themselves in the shoes of somebody who, whether five years later or 30 years later, was gonna look back and be like, oh, well, now it's five years later, or 30 years later. Now it's now again.

The problem is that the future actually happens to you, is the thing. You actually gotta live through it. If you are incapable of telling the difference between good and bad reasoning, and furthermore have given up all hope that you will ever learn to do that, you might say: oh, well, sometimes you can find at least one person on the internet who says a bad thing will happen in the future, and then it doesn't happen, and this proves that we don't need to worry about anything ever. That's more or less an argument some people are making, basically. But...

Liron 01:16:59
Yeah.

Eliezer 01:17:00
There's other bad stuff that does happen to you, and I want this to be a mental operation that people have access to: to understand that one year later, if you're still alive, it will be one year from now. One year from now is not this fancy never-time, unless we're dead. It is somewhere you end up, a slightly different person from the person you are now, but there, enduring it, conscious, aware of it, having to deal with whatever it is. And likewise the end of the world.

Liron 01:17:17
Yes. And there's also this idea that discontinuities are possible, right? People are saying, like, yeah, sure, we'll live in the future, the extrapolated smooth-trend future. And it's like, no, no, no. It could be a discontinuous future.

One of the most popular interviews I've ever done is actually with a prepper channel on YouTube. I'm actually not really a prepper myself, but these people have thought of the possibility that there may come some time when things go discontinuous, when the stock market doesn't go up 10.5%, it goes down 99%, right? Like a discontinuous change. And they're saying, yeah, this could be real for me, I have the supplies to prove that I consider this a real piece of reality. And outside of people warning about AI risk, or preppers, it just seems like a thought people don't let themselves think.

Eliezer 01:18:07
I mean, many people save for retirement; many of those may be doing it out of habit. I think there are many normal people in the world who are capable of sympathizing with themselves five years out and taking actions on that basis. One hears rumors that this happens to many parents at the point where they acquire kids, which is part of why Jaan Tallinn, one of the relatively earlier backers of at least some real work in this area, would say, yeah, you wanna talk to parents about these things.

Liron 01:18:40
Fair enough. So now, instead of just building up the problem, we're going to start talking about the solution and what that hypothetically looks like. Now, to be fair, we're probably going to poke a lot of holes in the solution, so it's not all gonna be roses. But this is called the alignment problem, and you are arguably the first AGI alignment researcher, correct?

Eliezer 01:19:07
Yeah, that's probably fair.

The Alignment Problem

Liron 01:19:09
Yeah, no, I've obviously sung your praises a lot, but I still wanna point out that when you're the first person to be in this new field that nobody else is in, and then a couple decades pass, and this is considered a very hot, critically important field that many smart people are in, that is a pretty big credibility building achievement in my opinion.

Eliezer 01:19:28
Sure. People sure are buying out the old prediction market shares saying that this problem was going to be important. I do feel like a lot of the funding is filtered in a way where it only goes to people who don't understand why the problem is hard.

Liron 01:19:43
Yes. Yeah, that's right. And that's actually something I'm gonna be asking you about. Stepping back a little bit, and bringing it back to this distinction of subcritical versus supercritical AI alignment, there are a lot of misleading impressions going around right now, because the AI is subcritical and people are talking about aligning it while it's subcritical. You've used the analogy of, like, it's a baby tiger today and it'll be an adult tiger in the future, but people are spending a lot of time trying to draw conclusions about baby tigers.

Eliezer 01:20:15
I am not sure I've used that exact analogy, but it sounds pretty close to one that I'd use, so sure.

Liron 01:20:21
Maybe you said baby dragon and an old dragon.

Eliezer 01:20:23
I did, I do remember saying dragon, yeah. I would also point out that this does not perfectly correspond to the subcritical-supercritical distinction. The distinction between the AI that cannot make the smarter version of itself and kill everyone and the AI that can make the smarter version of itself, the AI that can kill everyone, is not drawn in exactly the same place as the mistakes that the current AIs are making.

You see what I'm saying here? You don't wanna point at the current guys and go, like, ah, these are inherently subcritical systems and we know this for sure, and these other AIs are the inherently dangerous ones. We don't know that stuff.

Liron 01:21:00
That's right. Yeah. There's absolutely a lot of uncertainty around it. Now, the issue I see when it comes to AI alignment discourse is that everybody's got their magnifying glass out, looking at what the baby tiger's doing and trying to draw conclusions, like, oh, the baby tiger took a swipe, it's swiping. And it's like, well, but it was just playing, it didn't really think it was going to kill anything. That seems to me like what the discourse looks like today.

Eliezer 01:21:25
Sure. Keeping in mind that there are social pressures keeping it in place that way. To the extent you walk around acknowledging the difference between the baby tiger and the adult tiger, you are less able to do the job of clasping your hand on the shoulder of the funder and being like, I can save you, trust me.

Liron 01:21:46
Now, in terms of the tools that we use to align the AI, I think today I'd describe it as: well, we're wearing these oven mitts, and we can kind of bat it around with these oven mitts, and it just works. Sometimes it goes off the wrong way and we just give it another bat. I mean, it's not a precise tool, but it does the job. Are you liking that analogy so far?

Eliezer 01:22:05
Well, to be exact: that which we have found we can do this way is now considered to be what the job is. Pliny the Elder cracks all new AIs within 24 hours of their release and exposure to him. And 15 years ago that would've been considered a terrible, pessimistic scenario, back when nobody actually had to live with it. Now that people have to live with it, it's just taken for granted.

So the technology of safety and what it actually can do is now what we think it's supposed to do.

Liron 01:22:31
You're talking about the Twitter user Pliny the Prompter.

Eliezer 01:22:34
Yeah. The old guy said, as far as I know.

Liron 01:22:36
Right, right. Yes. Okay. So I have noticed Pliny doing that, and I do think that that's a good representative sign that these AI companies don't really have control over what they're doing. But the amount of control that they do have, as you say, is enough for them to release a useful product.

Eliezer 01:22:52
They can sell products that people want to buy for valid reasons. Yes.

Liron 01:22:58
Right. Now, another analogy that I might use is that subcritical AI steering, the regime that we're in right now, is like the Wright brothers figuring out how to steer a plane, because that was a big part of the flight problem that they had to solve, right? It wasn't just how to get the plane in the sky. It was, okay, what do we do with the wing flaps? How are the birds doing it? That was a big part of the problem.

Eliezer 01:23:20
Yeah. You think that the problem is all about getting the thing into the air, but it's more like, then it slides through the air and goes crunch. So stabilizing it in the air was, in one sense, the larger problem.

Liron 01:23:32
Exactly. And it was a very real problem in the case of flight, and it is a very real problem today in the case of making a useful coding assistant, and a lot of IQ points are going toward solving that problem. It's just that once we get to that supercritical moment, when the AI is superintelligent, when it is more powerful at steering the future than humans are, it doesn't look like, oh, what angle should I put the wing flaps on? It looks more like, oh, I'm in a fight, right? I'm fighting an aggressive cancer. I'm foiling a million terror plots coming at me. I'm fighting an army of terminators. And I think that's going to be a discontinuous shock to many people.

Eliezer 01:24:07
I mean, that's what it looks like to screw up superalignment. Hypothetically, and I'm sort of jumping ahead to the good scenario here, if you shut down AI internationally and then you tried out a bunch of adult gene therapies on people to find out which ones worked to actually augment intelligence, which would only be a few of them, but maybe you can get some distance that way, you'd get people who are just so incredibly, unbelievably smart that they stopped being such damn idiots in the way of humans, and they just stopped expecting things to work that wouldn't work, because they'd gotten a bit saner than that.

If those people were like, hmm, all right, I think I can build a superintelligence from here, and they were just correct about that, and knew they were correct because they were past the point of making silly mistakes about what they did and didn't know, those people are not planning to get into a fight with something vastly smarter than themselves, launching a thousand plots per second against them, and win. That is not their plan for winning.

Their plan looks like: they have built a thing that is flowing through a channel known to them, where the water never reaches the sides of that channel and flows out into all these tributary processes of launching attacks on you.

Liron 01:25:26
And I think this gets to the problem with most alignment researchers whose job title is safety researcher at an AI company, people like that. There are many of them. They are, for the most part, highly intelligent. But I get the impression that they think that with large language models, the kind of AIs that we have today, they've sidestepped the difficult issues about AI going superintelligent, going supercritical, and having more power than humanity.

They think they know more about how the future is going to go just because the model is working. The Wright Brothers model of changing the wing flap angles is working for them today, and I think they're in for a rude awakening.

Eliezer 01:26:05
Well, on the object level, I'm not familiar with any arguments that... well, pardon me, I'm familiar with one class of arguments about how it was all going to go great, which I would consider to have been already refuted by, for example, ChatGPT psychosis.

But I'm not offhand conversant with surviving arguments for why the people trying to prevent Claude Code from deleting your code base are doing just the sort of work that scales up without difficulty to superintelligence. Probably many of the ones who work at Anthropic in particular may even know better than to believe that they're solving superalignment by working on Claude Code and trying to prevent it from deleting your code.

Which makes it less useful. But I'd expect people at OpenAI to be substantially less conversant with any of the arguments for why the work might not generalize perfectly, and why constraining something smarter than themselves, reflecting on itself, rewriting itself, might be any more difficult than what they're doing now.

As for a claim that runs like, here's the deep principle that is true of our thing and that is true of a superintelligence, and here's the computer science of it, here's the river that flows deeply enough that what we do will just carry over...

That doesn't exist as far as I know; there's nobody even putting forth that claim for me to shoot down. Barring the old alignment-by-default people, who I think got falsified when the current crop of AIs started doing things that they would, if you asked them about it explicitly, say were morally wrong.

Liron 01:27:46
That's right. Yeah. Alignment by default. That's something people say, and I think it's coming from a place of: I talked to Claude, I talked to Gemini, and it just feels right. It feels like it's doing the right thing, it's making good decisions, it's taken in good data, and I think it's going to go well. There are a lot of people who seem to be honestly making that extrapolation.

Eliezer 01:28:07
Spoken like somebody who has never tried, on a whim, to pick up an unidentified caller and found that it was somebody who'd been talking to ChatGPT, discovering all sorts of fascinating things and getting only four hours of sleep a night.

And all I could do, because our civilization isn't set up to the point where I can transfer this call to any adult hospital that handles it, all I could do was beg the guy to get more sleep. And then he, if I recall correctly, texted me back a screenshot of ChatGPT arguing him out of getting sleep.

So yeah, I don't wish it on them. And it's possible that AI companies will succeed in correcting the overt driving of people into psychosis. I'm a little bit surprised they haven't corrected it already. It seems like it shouldn't be that hard to have a much smaller AI detect when the big AI has started to drive users insane; the conversations are quite characteristic.

They might fix this part, but if they don't fix this part, then maybe people learn better than to think that the models are benevolent when they lose a friend.

Liron 01:28:50
And you know, those kinds of psychosis situations, it's like the baby tiger is already slashing people. It's just, that would be more fixable and patchable if we didn't then expect it to grow into an adult tiger.

Eliezer 01:29:07
I mean, if the problems we see today were as bad as they were ever going to get, while the benefits would just keep scaling, I would be, even more gung-ho on AI than I am on biotech or nuclear power.

Liron 01:29:17
Right. Now, for those of us paying attention, the AI companies actually were pretty candid. I mean, OpenAI specifically was pretty candid in 2023 when they announced the superalignment team led by Ilya Sutskever and Jan Leike, who have both since resigned in frustration. But if you go back to 2023, they were saying, hey, we know that we don't have an alignment solution that scales to when AI goes supercritical. We know that the alignment we're doing is only for the baby tigers, only to make money today, and yes, we need to solve it. And, unthinkable as it sounds today, they even came out and said, yep, we are giving ourselves a four-year deadline.

We really think we'd better hurry up and solve it in four years, essentially; it's irresponsible not to. And now, you know, of course, we're two years in, and for those of us who followed what happened: the team doesn't exist anymore, a lot of those people quit or got fired, and OpenAI hasn't replaced it with anything. But there was this moment of candor, this moment of taking responsibility.

Eliezer 01:30:08
I, well, my standards might be high. I was not very impressed there. I did not expect it to go anywhere. I don't feel like they had indicated that they understood where any of the difficult problems were. It's like watching a bunch of people who don't know why curing cancer is any harder than curing a twisted ankle announce that they're gonna go tackle cancer in four years. They have their twisted ankle level medicine ready and they're gonna go use it to cure cancer.

We had people saying, like, we'll just get the AI to solve the AI alignment problem. Which reflects, from my perspective, non-mastery of some pretty basic ideas that perhaps everyone making big promises in this field is filtered to not understand, like the divide between whether you can verify the answers or not.

And the notion of what makes it hard to get good work out of that: one of the factors that makes it harder to get good work out of an AI is if you cannot reliably thumbs-up the good answers and thumbs-down the bad ones. So you've got this AI, say, and it's giving you a bunch of alignment theory. Can you successfully thumbs-up the good answers? Well, maybe these people thought that they weren't gonna be fooled by any alignment arguments, and I can't just point to their credence in superalignment as evidence that they were bad at discriminating this stuff, because I need a base case.

It's not like they wrote up: and the fundamental challenge here is our ability to distinguish whether or not the AI is doing good alignment work for us, because we can't just follow the AI's clever scheme for surviving building superintelligence, then see whether or not we're alive at the end, and press thumbs-down if we're dead. That's what makes it hard for humans. It also makes it hard for humans to get that work out of an AI.

Liron 01:31:35
Well, this gets to the closest thing OpenAI has ever published to a proposal of what they're going to do. They said they'll build a weak AGI and they'll have it help solve the alignment problem for the next-generation AGI; they'll just stair-step their way up. What do you think about that?

Eliezer 01:31:52
I think they have been filtered to exclude anyone who could understand, or admit, what the core difficulties along the way to doing that are.

If somebody comes over and is like, let me tell you about my brilliant machine that turns tap water into ice cubes and electricity, a perpetual motion machine of the, I forget which numerical type, an entropy reverser, the number one thing you want to hear from their lips next is: now, this might sound like it violates the second law of thermodynamics, but it doesn't, and it totally obeys the laws of physics, and you might think that this is difficult, but here's the core insight that overcomes the key difficulty. That tells you they know why you're skeptical. They know what sort of principles would usually prevent them from doing that, or make it hard to do. This is what you want to hear.

And if instead they're like, let me show you all the detailed gears and wheels here, and my spreadsheet of results that I got from running it on tap water from my faucet, and how cold it was when it came out, and how much electricity came out of this wire here, then they're telling you that they've got no idea why what they're doing is supposed to be difficult, and it's a very, very bearish sign.

You know, you're like, I'm going to want to use the little AI to solve the alignment problem for me. Do you understand any of the reasons why this is hard? Do you understand any of the reasons why this was not my frontline proposal in 2005?

Liron 01:33:11
Yeah, I agree that they don't seem to. So I'll just put on my Sam Altman hat, because I've heard him say this. He'll say, okay, Eliezer, but you're just standing at a whiteboard trying to solve the problem, and that's never going to work, because you never solved it in the past. So the only thing that we can practically do is just build the version we have now and try to solve it once it's live. That's basically what he's said: we just gotta keep releasing and deal with it.

Eliezer 01:33:45
Well, I think if you've got this medieval alchemist who is like, well, nobody can figure out by philosophy which substances are safe to inject into people, and so we just gotta keep injecting substances until we find the key to immortality... Even this is not a sufficiently drastic analogy, because it's more like their brilliant cure gets administered to everybody on the planet simultaneously.

Liron 01:34:21
Yeah.

Eliezer 01:34:21
The alternative here is, well, if you can't call this shot, you don't get to take the shot. Nobody on Earth gets to take this shot. We are shutting it all down, and that's hard, but it's probably easier than World War II. I expect we'll get to this later. And World War II was fought for smaller stakes, and we did show up and fight it when we had to.

There is no law saying that because something is hard for you to figure out in a reliable way, you must be able to figure it out in a chaotic, unstructured way. There is no law saying that, because the alchemists could not figure out a priori what constituted a potion of immortality, if they put together a potion of immortality and tried feeding it to some people, they would manage to find immortality on their fifth try.

And only after feeding people small enough doses not to hurt them, or something. You can't even draw analogies here, given the sheer scope of how hard a poorly designed superintelligence hits the planet and tells you that your clever theory was wrong and doesn't give you a chance to try again. And the obvious analogies are to past people who understood the problem this badly, like medieval alchemists trying to cook up potions of immortality, except for the part where it didn't work.

And the part where they were unable to figure it out via philosophy did not thereby mean there was a balance in the universe that empowered them to figure it out in a small number of tries, in a small amount of time.

Liron 01:35:43
Yeah. So from our perspective, these super intelligent AI builders, they're failing humanity on two levels, as you said. The first level is they're not familiar with the hard part of their problem. Right? So they can't really explain to you

Eliezer 01:35:56
There are people who are familiar with it. They have been filtered out of those social positions, yes.

Liron 01:36:02
Right. And then more on the meta level, they're not setting themselves up to react appropriately when the problem is not solvable by them.

Eliezer 01:36:12
Yeah. You can imagine a grownup version who is like, we are supposed to have figured out this thing by this time, and that's the business plan, and if the business plan is failing, that means humanity is in even more of a massive emergency, and the thing to do from there is halt, melt, and catch fire: go to the world leaders and tell them to shut it down or die.

So by giving themselves the four-year time limit, I'm a little worried that it was slightly performative. But also, Ilya is a bit more of a grownup about these things than Sam Altman, and by giving himself the four-year time limit he might have been trying to do something like expressing, we need a business plan. It's generous; as far as we can tell in 2025, it is not obvious that you've got until 2027 to work everything out.

But maybe it sounded more plausible in 2023 that your four-year deadline was short enough.

Liron 01:37:03
It was a little spark of grownup behavior that we just haven't seen since, specifically this idea of the plan B, the what-if-this-isn't-solvable. This came to me when I was scrolling Twitter and I saw the safety head of one of the major AI companies tweet something like, oh, we've got some interesting new directions for our safety plan, I'm optimistic that this is going to yield some good results. And I was like, okay, well, I'm glad he's optimistic, I hope he's found something. But wait a minute: what if this problem were actually unsolvable, or what if he's embarking on a research path that will only solve it in 20 or 30 or 40 years?

If that's the situation, whose job is it to notice and blow the whistle and then plan accordingly? To plan for the problem not being solvable on the 20-year timeline by which we're probably going to have the capabilities? Whose job is it to steer plan B, the alignment-being-intractable backup plan? Whose job is that?

Eliezer 01:37:58
Well, if you don't have any other account of whose job something is inside a startup, it's the CEO's job, or whichever co-founder has adopted the catchall role. And similarly, if your planet has assigned nobody the job of doing something, it's Eliezer's job, I suppose.

Liron 01:38:22
Yeah. Yeah, that's true. That's true.

Eliezer 01:38:25
By the way, I should also emphasize that the make-Eliezer-do-everything plan doesn't work in real life, and it failed. But yeah, just saying that part.

What We Want from AI

Liron 01:38:35
You've used the example problem of cloning a strawberry down to the cellular level as a test challenge, and you don't even think that our species will get to the level of being able to solve that problem, much less the harder alignment problem of human values. We're not even gonna get there; we're not even gonna be able to align a superintelligent AI to clone a strawberry down to the cellular level. Correct?

Eliezer 01:38:57
Yep. Although, important context: it's easy, if you have a rock, to make it be very safe. So we need to say what this AI is doing that is powerful enough that it actually even needs to be aligned, or is in any way difficult to align. And "build two strawberries identical down to the cellular level, but not necessarily the molecular level" is standing in for an AI that is powerful enough that it can invent the new biotechnology needed to pull that off.

If you have an AI that can do this thing, and you can make it do it without killing a bunch of people as a side effect, you can maybe also ask it to cure cancer, for example, or, more to the point, augment human intelligence. You can do various things with it that are actually helpful.

Liron 01:39:39
Yeah. The reason why this is such a nice test challenge is that in order to successfully clone the strawberry at that level of detail, it's going to be doing reasoning and science and engineering at a level that's beyond the top human institutions right now. So it's going to really be exploring its options, doing what I might call a broad-domain search. It's going to really be tapping into the essence of intelligence. And if we're able to show control over that kind of system, then that is an optimistic sign.

Eliezer 01:40:05
Yeah, it's powerful enough that aligning it is important. If you say it's easy for whatever reason, then you're not just saying it's easy to align a rock. You're saying it's easy to align a system that is doing actual big, powerful new tech development, new research, planning things in the world that our current biologists cannot do and are very far away from doing, and that will let you do other useful stuff.

Liron 01:40:26
Right. And just to explain to the viewers, I mean, you can imagine cloning a strawberry using today's technology in the sense of like, oh, we'll just have a lab grown strawberry, we'll just put the strawberry substance together. But you're talking about cell by cell, like really get a very

Eliezer 01:40:38
Yeah. It's not cloning a strawberry. It's copying, it's xeroxing.

Liron 01:40:44
Exactly. And we just don't have the technology to do this kind of fine-grained, cellular-level copy. We don't have a 3D strawberry cell-level Xerox machine, and by the time we do have it, we're talking about a pretty advanced state of technology.

Eliezer 01:40:57
Yeah. The task is chosen to be relatively simple to describe and talk about and reference, but to require, in the background, a new, basic, many-years-ahead biotechnology framework to do it.

Liron 01:41:12
Right. And the alignment problem is a mystery wrapped in an enigma, because, okay, there's the "get it to build the strawberry Xerox" part, but then there's also "get it to respect what humans truly want and do moral calculus." There are many dimensions to the alignment problem, and we seem to be on the ground floor of most of these dimensions.

Eliezer 01:41:31
If you can get it to just build the strawberry and not kill a bunch of people or do anything else that has giant side effects and rewrites the rest of the world outside the laboratory, then maybe you can use this thing to cure cancer and augment human intelligence and otherwise actually manage to flip the game board there.

In the world where you can get an AI to do quite large things, in a sort of mundane sense, without massive side effects you didn't sign off on, you can punt some of the very elaborate, impressive moral trolley problems down to the augmented humans who are building the next generation of AIs after that.

Liron 01:42:05
The next little segment here is we're gonna talk about the good scenario. We're gonna talk about heaven because once we lift our head up from the annoying problem that we can't make super intelligent AI do what we want, we're nowhere near the level of insight to make super intelligent AI do what we want. Once we lift our head up from that, there's this other shocking realization of like, can we even say what we want?

Like if we had a genie that would grant our wish, are we even in a position to make a wish right now as a species?

Eliezer 01:42:34
Certainly there's no established mechanism for making a wish as a species. And if you took a UN vote, I don't think that would be a very good idea at all. A bunch of that has to do with how we ask for things that will make us happy; but what if we are wrong about what makes us happy? Well, then we have a prediction problem, and if you are standing right next to a superintelligence, maybe you shouldn't be trying to make all of your own predictions there.

But at the same time, if you are like, well, do whatever makes us happiest, it's like, okay, injects you full of heroin. You still asked for the wrong thing, but what made it wrong was not that you weren't happy at the end. You are now quite happy, but it turned out that you wanted more things than that.

And so there's the question of how to take into account your preferences and your idealized preferences and yada yada yada. We could go down the abstract path of how you define what you want, which... I brought up coherent extrapolated volition in 2004 and said, okay, there's the target. And then everybody went on twisting themselves up, like, well, but how can we possibly say what it means to align an AI when different humans want different things? And I'm like, I wrote this up in 2004. It's done.

You have not gotten past the point of addressing the things that were in the starting paragraphs of the 2004 work. This is obvious, if you are interested in actually answering it rather than trying to, like, clutch this unanswerable thing to you like some kind of weird squishy doll. Then: coherent extrapolated volition, 2004.

Liron 01:44:14
Yeah. So you took a serious stab at the problem in 2004. And obviously there's a lot of detail, I think, that's left to fill in. Is that fair to say?

Eliezer 01:44:24
I mean, we haven't covered the whole concept in this chat, if that's what you mean.

Liron 01:44:27
Right, right, right. Okay. Alright. So you took a stab at it in 2004 that I think represented real progress, and that is good. And let's contrast that to what the AI companies have been doing in the last couple years. How have they been telling us about the good scenario? Where are they aiming? Are they being clear about what outcome they're aiming for?

Eliezer 01:44:48
I mean, here we're setting aside "how do you define what makes a good outcome good" and asking, concretely, what's a good outcome? And from my perspective at least, that's always been, to a first gloss: the worthy descendants of humanity, including perhaps ourselves among their number, go out into the galaxy and the other galaxies and make the stars our cities, and they are full of conscious minds having fun, who care about each other.

And somebody's like, well, what exactly do you mean by fun? And I'm like, check out the Fun Theory sequence, where I try to describe some of the 31 Laws of Fun, the sort of thing you would use to answer questions like: how much fun is there in the universe? Will we ever run out of fun? Are we having fun yet? Could we be having more fun?

Yes, we could be having more fun. And if you go about it in a very naive way, it's sort of like, oh, how could I be having more fun in this video game here? Well, by having a higher score and by slaying even more monsters even quicker, so I will have this AI come over and play this video game for me. Oh, no. Now the AI's playing the video game and it's racking up a high score, but I'm just sort of sitting here watching it happen, feeling bored.

And this is what similarly goes wrong with many proposals that people have for how to use AI to improve life. It's like that, but with life instead of the video game. And yet it doesn't stop there. It's not an unanswerable question. You can imagine AIs helping people, or doing things themselves, if you could control them, which you can't.

Liron 01:46:34
Let's talk a little bit about fun theory, because when you first read Eliezer Yudkowsky's fun theory, which I think is back from 2007, 2008, it seems frivolous. Like you're saying, oh, here's ways that we could have more fun if we had these criteria. Like if every day we all get a little bit more capable, but the challenges get a little bit harder, right? It's kinda like you're designing the ultimate video game universe for humanity.

Eliezer 01:46:56
I do talk about what it takes to not be a video game, like long-term serious stakes and higher purposes and entanglement with other people, and it not just being skill acquisition, but you actually getting stronger and not just racking up more points. I did try to describe a set of conditions such that it wouldn't all be a video game.

Yet in the end, there are kinds of meaning that maybe should vanish forever from the universe. Like: some people are in really serious trouble, and to help those people is a very noble endeavor. And also, there's an end state where you have helped them and they're no longer in trouble, and you can't just go put new people into trouble so you get the fun of helping them. That is contrary to the whole high purpose that you started with. So there are certain kinds of meaning that will, in a successful scenario, vanish forever from the universe.

Heroism will never be the same.

Liron 01:47:56
Right, right, right. Wow. Okay. So you're engaging with these problems very seriously. Problems that most people, myself included before I read your sequence, don't even realize are problems, because most people are like, yeah, we'll solve all the problems and then we will just create heaven on earth. But then you look at what heaven actually is. Anytime anybody's tried to describe heaven, it's like, yeah, you just feel really good forever, and there's angels. And it's like, well, if I was actually there in the place you describe, it seems like it would get boring.

And if it didn't get boring for me, isn't that a me problem? Isn't it kind of lame of me not to get bored by the traditional heaven? I think it is.

Eliezer 01:48:30
Wait, wait, sorry, sorry. You're saying it's lame to not be bored by heaven.

Liron 01:48:34
No, no, no. I'm saying either you get bored by heaven, or you're having a good time chilling in the classical description of heaven. But then wouldn't that be a lame outcome, for you to be the kind of person who thinks that just sitting around with angels chilling is not boring?

Eliezer 01:48:48
I mean, from my perspective, the whole thing is a literary exercise in failing to continue to think through the consequences and put yourself properly into the shoes of the person experiencing heaven, asking how they feel about it a day later, a year later, a century later, 10 million years later.

But then you actually do try to think ahead and continue asking the obvious questions, instead of just hugging the questions to yourself as unanswerable. Part of my new identity is that I have developed this decisive philosophical critique of the notion that life can never be good.

And instead you're like, all right, we're gonna keep on optimizing. We're gonna keep on solving this problem. How do we actually have some fun? Just work it out. I've got my fun theory sequence, which is my shot at working it out.

Liron 01:49:47
Now, if you tell me that there's a person who read the Bible and came away with the conclusion that heaven wasn't good enough for their standards, I would understand why you get the accusation sometimes of being a religious figure. But the thing is, it's not just your standards. Because once people like me read your stuff, I'm like, you know what? It's not good enough for my standards either. I think that's what distinguishes it. I don't think you're trying to replace Jesus.

Eliezer 01:50:07
Carry on.

Liron 01:50:08
Yeah. No, that's just an observation. I mean, it's pretty crazy that you're noticing these problems to be solved. So yeah. So you've got your fun theory, you've got coherent extrapolated volition. Let's touch on that a little bit. I do think that that represented a real unit of progress in defining what we're trying to go for here, if we had a super intelligent genie type of AI.

Eliezer 01:50:29
Right. So one way of looking at coherent extrapolated volition is that it's what you inevitably end up with, I would argue, at the end of any well-conducted Socratic dialogue where somebody is like, well, here's how it's hard to help people. And somebody else is like, okay, well, but then how do you help people?

Let's say two people each want a pizza and you have only one pizza. If you give it all to one person, one person's sad. If you give it all to the other person, the other person's sad. If you give half to both of them, neither is truly happy. We have now proved that altruism is unsolvable, that to help another human being is meaningless, because sometimes they want different things. What could an AI do here? It could only be sad. We might as well let it destroy the world; there's nothing any better than them each kind of having half the pizza. And if instead you're like, okay, but suppose you just try to not be a jerk. What is the least jerky form of AI?

Or somebody's like, I think mercury will grant me immortality. Give me a glass of mercury to drink down. And somebody's like, ah, hey, people want the thing, but it's not good for them. So if you give them what they ask for, they'll be harmed. And so altruism is futile, helping people is meaningless, there is no coherent thing an AI could do in this situation to try to not be a jerk. And I'm like, seems like you're more of a jerk if you give the guy the mercury.

And that's not because we're taking all prospect of ever helping people, or even responding to their own desires, and throwing it out the window. We're looking at things like: this person has an inaccurate predictive model of what will happen to them after they drink the mercury. They're asking for the mercury on their way to something else that they want, but there's a piece of knowledge that the AI has which they don't have, where if we imagine an alternate version of them that knew this additional fact.

Liron 01:52:43
Mm-hmm.

Eliezer 01:52:45
The alternate version of them wouldn't want to drink the mercury. Or maybe it's something like somebody being like, I want you to make me a cat girl to date. I want to date the cat girl. And the AI is like, well, that might potentially violate the cat girl's rights, but I can make you a non-sapient version of the cat girl, only GPT-4 level intelligence, because any further than that and you're pushing it.

And the person's like, I think this is a great idea. And they think they're just gonna live happily ever after with their cat girl. They're not; they're gonna start to feel dissatisfied after a year. But you imagine what would happen if the guy knew everything that the AI knew.

What would they want then? What would they want the AI to do for them if they knew everything the AI knew? And the guy's like, you know, I've got this version of me that needs to make his own mistakes over there. He needs to live through the process. He needs to realize that he asked for the wrong thing. He's gonna be somewhat happier, or a lot stronger, if you let him make his own mistakes. And if you don't, he's gonna hate you for it. He's gonna think that you're paternalistic, and he's gonna be right.

Give him his GPT-4 level cat girl. And then the AI does that thing, and it's doing the thing that is least like being a jerk in that case. There is no perfect solution, but there is looking over all the options there. And importantly, not just doing it from the perspective of a dad who's like, I've decided what I want for my kid, and never mind what my kid wants for my kid.

It's based on what the person would want for themselves if they knew everything the AI knew. But this is not enough. One keeps going from here. There are questions like: does this person approve of the person they are now? What if they were more like the person they wish themselves to be?

Okay, and what about ways that whole civilizations can go, where the civilization as a whole has to go one way or the other? You can't just divide up the pizza; you gotta pick one or the other. Is there a voting process of all these extrapolated, alternate versions of people?

What if some of them are jerks? What if 80% of people would be jerks in some truly deep sense, where there's just no way to get to a non-jerk version of themselves starting from their current baselines, and only we who discuss this issue are so lofty and altruistic as to not be jerks? Is there anything we can do to not be jerks in that case, and not just overwrite these other people with our own preferences? This is the sort of thing that coherent extrapolated volition plays out.

But the general idea is, it's not so much a super intelligence doing whatever people ask it to, even if that kills them or the people around them. It's more like: ask what people would want if they knew what the AI knew, thought as fast as the AI thought, had considered all the arguments the AI has considered, and were more the people they wish they were themselves. And then you look at a whole civilization doing this, and ask where the range of possibilities for where it might end up lies, and what we can do now that the future civilizations won't regret, that they won't feel has trapped them in some kind of inescapable dystopia.

That's what goes into the recipe there.

Liron 01:55:45
These are all good answers. And there are some people who will quibble. I mean, I've seen different discussions, different people opining on this, but to me the takeaway is just that you seriously engaged with the problem. I think you at least made progress, right? The degree to which you solved it might be up for disagreement, but then you go and compare it to everybody else out there. You go and compare it to the recent statements by Sam Altman and Mark Zuckerberg saying, hey, you know what? We are gonna build a super intelligence in your pocket. And what is it going to do? Whatever you tell it. And it's like, well, wait a minute. Aren't we all telling it different things? And isn't it intelligent enough to go fight for control over the earth, because they said it's a super intelligence, right? So that is kind of your competition here.

Eliezer 01:56:25
Yeah. To me it feels like, if you are actually interested in pursuing the question, rather than clutching the question to yourself as unanswerable, there's a sort of obvious way it flows kind of downhill to a sort of obvious guess at what you would do here if you were actually trying to address these questions instead of clutching them to yourselves. And you've got your people who would prefer to just continue squeezing the question like some kind of squeaky doll that never runs out of squeaks.

And you've got your people who are, they don't want to get into all that stuff when they're trying to sell something to the public.

Liron 01:57:14
Yeah, exactly right. And what you're talking about, the squeaky doll that's so fun to squeak: you've got a post called Mysterious Answers to Mysterious Questions that goes into that.

International Coordination Solutions

Liron 01:57:25
We're moving into the final wrap up here. We're gonna talk about the best we can do as a society, as a solution, the call to action basically. Because right now I think you and I are of the view that actually solving super intelligent AI alignment on a 20 year timeframe, the upper end of what we think is the capabilities timeframe

Eliezer 01:57:47
Well, 20 years is kind of what I feel like you'd get from the international shutdown, not from

Liron 01:57:54
Yeah, yeah, yeah. Exactly. Like a very generous amount of time. I mean, not that it's obvious; for all we know, it could take 5,000 years. But it just seems like 20 years is more than pushing our luck from our subjective perspective, I think.

Eliezer 01:58:07
It's not gonna be 5,000.

Liron 01:58:09
Okay.

Eliezer 01:58:09
5,000 years is a lot of time. People who think that it would take that long to solve AI do not understand how long 5,000 years is.

Liron 01:58:17
It's a really long time. All right. So anyway, we think solving alignment on a 20 year timeframe is unlikely. It's intractable because we get no redos, as you've pointed out. It's just a takeoff scenario. And so what do we actually do? One thing you've said that we can do is try to stick to narrow AI, right?

Eliezer 01:58:36
First and foremost, there is no "we." Until there's an international treaty, humanity has no ability to not do anything; humanity has to seize that ability for itself. By which I mean a few major nuclear powers get together and say: nobody anywhere on earth, including ourselves, is allowed to do the following things that potentially wipe out humanity, or make it easier for others to wipe out humanity.

That none of that stuff is happening. Now, once you have that capability, you could talk about having narrow AI exceptions to it. Essentially no AI company on Earth has as yet demonstrated any interest or capability in the narrow stuff, except for Google DeepMind, by the way.

But that's stuff like AlphaFold, AlphaProteo. Maybe next up is AlphaCell, the AI that understands all the interactions inside of a range of human cells or something. Or not; I don't know what's up next.

But the notion of narrowness is subtle. It's no coincidence that there isn't some specialized race of beavers that specializes in doing just biotechnology, or just building nuclear reactors using narrow intelligence that specialized on building nuclear reactors. It's no coincidence that humanity got there with our very general brains, designed for chipping flint spears and arguing politics, ahead of any species on the planet developing specialized brains that were just for building nuclear reactors.

There are problems that have enough pieces to them that they're just easier to solve once you use general intelligence for them. And even if you are just grinding reinforcement learning on solving a narrow class of problems, that doesn't mean that the thing that gradient descent hits on for solving it is not going to be general intelligence.

If you are trying to breed a species to be able to invent nuclear reactors, the first thing you get is plausibly not some kind of species that's specialized on only being able to think about nuclear reactors and not think about anything else. Humans may just be easier to run across in the mind design space than that, if you're using genetic programming, natural selection, or gradient descent just the same.

So the mere fact that you train an AI in a narrow domain is not a guarantee that it is a narrow AI if you are using generic black box methods. However, the reason why only Google DeepMind ever builds any narrow AI is that Google DeepMind is also the only AI company that has retained any expertise in doing anything besides generic black box neural network methods.

Liron 02:01:02
Right. Right, right. Yeah. And that is just such an unfortunate fact. That fix of only trying to operate narrowly, it's uncertain how far we can push it before we create the same problem, as you say.

Eliezer 02:01:17
But if you know what you're doing, you can push it further. The "if you know what you're doing" qualifier is a big problem. The thing about shut it all down is that if I'm saying this to the leaders of the nuclear powers, the United Nations, I'm not saying shut it all down except for my project.

The only trustworthy project, you need to understand; I can build AI safely, but not the other guys. Oh, no. I can't do that either at this point. If anyone builds it, everyone dies.

So I'm not telling them it's safe to build, that it's only safe for me to build super intelligence, only let me do it. I'm like, shut the entire thing down. And then when it comes to narrow AI.

Unfortunately, well, not unfortunately in some senses, but it'd be a simpler message if I could just say: nobody can build a narrow AI past the following point without everyone being dead. I am no exception to this rule. Nobody can do that safely. It's beyond human ability, back off. Which is the message for super intelligence.

But if it's about building an AI that understands all the events inside a human cell, across the variety of human cells, and can invent new drugs on that basis, this is for all I know something that DeepMind can do. And furthermore, depending on how you do it, there are more or less scary ways to do that. If your way of doing that is to build larger and larger generic large language models, and then also fine-tune them on predicting events in cells, that's less safe.

And if instead you're building a Google DeepMind style custom rollout, not just a generic neural net, but this piece of the AI is modeling this thing, it's trained on this kind of narrow data, none of it is being trained on general internet text, then you can go further with the system before it kills everyone.

And myself or Paul Christiano or maybe Shane Legg, I'm not as sure about Shane Legg, but I would trust myself or Paul Christiano to look over the system and what it could do, and develop a test suite for what sort of side capabilities it might be developing. Whether it was being pushed so far that it had started having internal preferences pointing anywhere other than the explicit overall purpose of the system. Figuring out what to look for early on: is this thing starting to develop general intelligence in order to solve its problems?

And building a very conservative safety scheme. I can't actually say with a straight face, there's no way I could ever do that, it's beyond human ability. Maybe I could do that. Maybe Paul Christiano could do that. Sam Altman, I do not trust to do that. Dario Amodei, I do not trust to do that.

So if the United Nations, well, not the United Nations, if the Butler Coalition of nuclear powers is going to carve out an exception to building new AI capabilities around specialized biological systems, this is now a question of: did they trust the right people, or are we all dead?

Demis Hassabis is allowed to do Demis Hassabis things to build a system like this, but Sam Altman and Dario Amodei are not allowed to do their things to build systems like this, according to Yudkowsky, if you believe that Yudkowsky guy. And Shane Legg is like, I could probably do that. And Eliezer Yudkowsky is like, well, yeah, maybe, can I check your work just to see if it looks insane to me? And how is this coalition of nuclear powers deciding to believe what I believe about what Shane Legg believes?

It's a much thornier notion, that you're going to carve out an exception for doing sufficiently advanced biotech. I cannot say that this is known to me to be beyond human ability, but it does sure seem to me like a thornier sociopolitical problem. And I would consider it to be overwhelmingly reasonable for the political leaders to say: we're not gonna dance as close to the cliff as possible. Just back the hell off.

Liron 02:05:05
Yeah. No, I agree. I mean, but it is at least the kind of discourse that we would have as a serious civilization as opposed to just not having it at all, which is where we are now.

I wanna quickly review what I heard you say in other venues, your tentative proposal for internationally coordinated monitoring of AI. One way it could work, as opposed to the status quo, just to summarize, is something like: first, world leaders get the fear of something in them, and they say, let's put petty grievances aside, because hey, it's like preventing global thermonuclear war. At the end of the day, we are actually going to go up in smoke here.

So let's try to do something grown up here. We stand ready to do arms control agreements about AI. This isn't about a petty conflict of trying to get one over on the other nations; we gotta go bigger than that. Nvidia and companies like that can only sell chips into a limited number of internationally monitored training data centers. Many countries post observers to the data centers. Rules about which kinds of training you're allowed to run have to be obeyed. Every job you run gets logged. Evading restrictions is considered a big deal.

You've stirred up some controversy when you mentioned that when you have these kinds of international restrictions and they get evaded, you do have to escalate to things like airstrikes in order to maintain order. So have I roughly summarized the kind of proposals you've floated?

Eliezer 02:06:41
Well, the flaw in the plan would be if North Korea can collect a hundred thousand GPUs and then try to go for a military advantage over all the other countries of the world, hoping that pushing AI that far doesn't get to the super intelligence level that kills everyone. Or North Korea just goes off into its own private fantasy about controllable super intelligence and manages to sneak away a hundred thousand GPUs. And then from there the answer is like, oh, well, North Korea has atomic bombs, so we can't actually threaten them, so I guess they get to take over the world. Only we're not actually gonna do that; we're gonna restart all our own projects.

If that's the answer, then why bother? But it's not the answer. The answer is, you're like: sorry, North Korea, we stand in terror for our lives and the lives of our children. We're dropping a bunker buster on your GPU cluster.

Liron 02:07:31
Yeah, exactly. We have to be serious about enforcing it; assuming that this agreement gets through, it can't be easily

Eliezer 02:07:38
If there's no answer to what you do about a rogue state, then the entire policy is unworkable. And that is why I did feel the need to mention a premise here: that taking this seriously means being willing to escalate, to clearly communicate in advance that you will use a conventional strike to prevent a non-allied data center from endangering the world, and then actually do that if they ignore your diplomatic communication.

Liron 02:08:06
Of course, now this kind of proposal, you floated it as something you can positively imagine a grownup society finally getting scared enough and treating with enough urgency to hurry up and do, right? We

Eliezer 02:08:18
There are no grownups, but it is the sort of thing that even kids can follow.

Liron 02:08:25
Fair enough, fair enough. And I also wanna add the caveat that you've mentioned, which is that even this solution is really designed for people who aren't as convinced as you and me about this whole imminent super criticality threat, right? This is kind of a halfway solution.

Eliezer 02:08:41
I mean, if they actually believed everything I believed, it would be like: no more GPUs. Why risk it?

Liron 02:08:49
Right? So there's kind of two options on the menu here. The first option is like, oh, you agree with Eliezer and Liron that this is such an urgent risk, better safe than sorry, no further GPU versions right now, stuff like that. But then there's all these people who are still being reasonable and they're like, okay, I'm not a hundred percent convinced by Eliezer, but shouldn't we be ready to pause? Shouldn't we have our finger on the button, watching very closely? And you're catering

Eliezer 02:09:09
Yeah. If you think, with sufficient probability, that GPUs are going to erupt in unstoppable horror that kills everybody on earth, then what you probably want to do about that is not run those GPUs.

Now, if your probability on the unstoppable horror erupting is so low that you are not at the point where you're like, well, what the planet should do here is just not run this stuff anymore, then you might build the off switch. You might put all the GPUs into data centers and monitor them, but still be somewhat more liberal about what sort of jobs people were allowed to run as long as there was a central lever to pull on it.

But that's what you do if the weights in your mind are: scientists disagree about whether an unstoppable horror will erupt out of these things and kill everyone on earth, and do I really wanna take on the great political inconvenience of reining in these AI companies? If those things are balancing, and you'd feel bad if you went all the way to one side or the other, then maybe you could just build the off switch.

But I'm not backing down from the real ask prematurely. Because that's a stupid way to die.

Liron 02:10:33
I think we can agree that whatever we do here to try to solve the dilemma that we're in at this point, the scale has to be large. This is why you're doing the book launch, right? You've dedicated a lot of your time, your focus, for the last who knows how many months or years, because it's not time for a best-effort solution or a nudge solution. It's time to do this kind of solution at a large scale. Correct?

Eliezer 02:11:01
I mean, to be clear, the book is advocating for something large scale. The book itself is not a large scale act. All of our lives, my life and the life of everybody who's worked with me, all of that put together is nothing. Your planet has invested nothing into this. And that's part of why it would be pretty silly to expect a good outcome from it. And the book is still nothing, but maybe it causes something.

Liron 02:11:28
Right, okay. And in contrast to that, it's really no time for people who are like, we'll do our best to work on alignment while capability keeps growing. Which, I'm sorry to say, is the position of all these people who are going to the AI companies and they're like, oh, I'm helping, I'm nudging from the inside, I'm doing the best I can. Sorry, but there is a scale mismatch, right? Doing that kind of action today, it's too late for that.

Eliezer 02:11:52
Yeah. Well, there's cultural filters that prohibit certain kinds of understanding in various subcultures. And in this case, the understanding that, okay, you've made this much progress and you need this much progress. To not walk around with a visceral sense of that, and instead be so enchanted by the progress you're making, is one of the qualities that will get selected for in the kind of AI safety person who passes the cultural filter at an AI company and gets hired and not fired.

Liron 02:12:23
Exactly. And this is what we were getting at before, saying, whose job is it to notice what you said? Right: small unit of progress versus big problem. Whose job is it to notice that those two lines aren't going to cross, the line of bigger and bigger problem versus slightly bigger solution? Nobody at your company has the job of pointing that out. And to the degree that you are doing your little research, making a tiny bit of progress, you're tractability washing the problem; you're living a life in the frame as if it's tractable.

Eliezer 02:12:54
I mean, I'm in favor of people continuing to do their little bits of interpretability work, and even publishing the part that seems to clearly not contribute to capabilities, but I'm not in favor of them losing sight of the big picture, or allowing it, by action or inaction, to be lost. They should be constantly saying: we have made 0.1% progress, 100% progress remains, we are not making it fast enough, we are actually regressing, because the new AI architectures are getting complicated faster than we are understanding them.

And humanity continues to be on track to destroy itself, and an international treaty shutting down all companies, including my company, is urgently needed. As long as you go on saying that, I'm in favor of your continuing to do the science research part.

Liron 02:13:35
Right, right, right. No, totally, that gets to what I was saying about Ilya Sutskever. His company is now called Safe Superintelligence. So I like what I hear; safe super intelligence sounds good on paper. But if he's so concerned about safety, why is he tractability washing? Why isn't he explicitly speaking out on how bad it is, all these other labs, never mind his own effort? All these other companies are not building safe super intelligence; that's why he felt the need to go and do it on his own. So why not speak out? He's tractability washing.

Eliezer 02:14:07
Or, by way of reassuring some of us, why not publish a list of the core difficulties that he thinks he can overcome, even if he doesn't want to publish his solutions because they're capability-entangled, right? You say you're gonna build a perpetual motion machine. Why don't you publish a document giving a list of the standard reasons why perpetual motion is hard, even if you don't want to reveal your secret sauce for building perpetual motion?

My guess is that Ilya probably doesn't know the principles either, but who knows? It's not like there's any organized body that causes people to indicate whether they know things.

Liron 02:14:48
Exactly. And finally I want to single out Anthropic for special mention on this theme of tractability washing. Anthropic has this reputation of attracting the most researchers who truly, deeply care. They recognize that super intelligence, existential risk, is a major problem. They

Eliezer 02:15:09
Well, truly, deeply care within a very particular sort of subculture that emerged inside effective altruism as a reaction to MIRI threatening their existing rationales for dividing up their funding the way they did, which is a very brief gloss on a much longer story. But these are not generic people who are worried. These are people who are worried in a very particular sort of way that was culturally transmitted among this one subgroup that saw themselves as conceptually opposed to MIRI ten years ago, type of thing.

Liron 02:15:38
Okay, okay. But you also don't know; they've done recruiting drives, right? They've got thousands of people, so you don't know if there's been a shift.

Eliezer 02:15:45
Well, yeah, but also, you're not gonna wanna hire somebody who conflicts with your current corporate culture by just straight up being able to list out the list of lethalities and being like, yeah, that sounds right to

Liron 02:16:00
Mm-hmm.

Eliezer 02:16:01
corporate culture.

Liron 02:16:03
Right. Okay. Well, whatever the reason is, they make a lot of noise, right? They do. And I think it's authentic to their feelings, right? I think they're being emotionally authentic when they tell you that they're worried about risk. I was actually personally doing a protest a few months ago, outside, in front of one of the Anthropic offices.

And at the time, nobody came out and talked to us or anything, right? They ignored it. But then later I heard through the grapevine that they did some soul searching after the protest. And the protest was just me and Holly Elmore yelling at them, please quit, this isn't helping, you're tractability washing, that kinda stuff. And they did some soul searching, but at the end of the day, what do they continue to do? In some sense, they're the best of the worst, but then they're legitimizing the whole enterprise of being the worst.

Eliezer 02:17:05
I mean, I don't know. I find it difficult for myself to get worked up about it. From my perspective, these people are NPCs. They cannot be dissuaded from their course. Of course they have to exist; of course they have to be running around doing the things that they're doing.

Liron 02:17:21
Yeah. You're saying non-player characters in a video game.

Eliezer 02:17:24
Yeah. They have made their mistakes. They're committed to them. The people who made those mistakes were swept up to form a company. The company is off doing the thing. And maybe I'm being deceived in some sense by the implicit priors of all the video games I've played as a kid, where you can never go up to the NPCs and talk them out of doing what they're doing. It's not the premise of the game that that's how you win the game.

I don't know, maybe in real life, all I need to do is go down to Anthropic and have a debate at their headquarters and everybody's like, oh my God, what have we done? But then that doesn't save the world in any particular way.

Liron 02:18:06
Yeah. I mean, if you listen to Dario, the CEO, right? He talks about how Anthropic is helping because they're promoting a race to the top. So they're going to set an example of doing good things, and that'll attract talent who wants to behave better, and then other companies will

Eliezer 02:18:18
Well, cool. With a race to the top, you can get this much closer to the top. Your actual distance is here.

Liron 02:18:26
Exactly. I think that's just the fundamental problem, the lack of that sense of perspective. Like, when you come to terms with, as you keep pointing out, how hard the problem is, how many layers there are to this problem, how little time there is to solve it, you just get overwhelmed by the perspective. And the correct reaction isn't, well, let's go forward and do our best, Leeroy Jenkins. It's like, okay, well, let's step back and

Eliezer 02:18:48
Right? I wouldn't speak in favor of being overwhelmed by the perspective, but apprehending it does tend to negate the arguments and strategies that depend on not apprehending it, like the stuff about, let's get closer to the top. Okay, if you can see that it gets you this much closer to the top and the distance to cover is that much, then you're not overwhelmed. You just know that ain't gonna work.

Call to Action

Liron 02:19:12
All right. Well, this is the launch day of your book, If Anyone Builds It, Everyone Dies. A very exciting day. What's the ideal future you can imagine playing out, starting now with your book becoming a bestseller?

Eliezer 02:19:26
It's read by Trump, Vance, Xi Jinping, Vladimir Putin, whoever is running the UK at the time of the book launch. And they're all like, yeah, we'd rather not die.

And back channel everybody's like, so we'd be open to not dying if you were open to not dying. And then the Great Worldwide Treaty gets announced.

Liron 02:19:54
And also maybe there's a grassroots dimension to it too, right? Like protests everywhere.

Eliezer 02:20:00
Sure. You're asking me for a best case scenario. And in my imagination, the best case scenario is not the scenario with the most fuss around me and my ideas, but the scenario in which they're implemented most quickly and with the least fuss.

But sure, Earth is probably also safer if people are buying copies of the book and handing them around to their next door neighbors, and just summarizing them to their next door neighbors. And if, at the same time as the international treaty, the AI companies try to push back a little and then there's massive bipartisan protest marches outside their headquarters, that probably makes a point, and Earth is safer for that point being made. If we're just continuing to spin out the perfect scenarios.

Liron 02:20:45
Yeah. You know, I'm doing my part to try to help with the problem, and my own mechanism of impact that I do think is high leverage is speaking out to the grassroots: having lots of people buy the book, lots of people talk to their friends and family and neighbors about it. I think that's all very important, moving the Overton window so that this becomes a hot topic of discussion, because I just think it's a really tall ask for politicians to go off and do something without feeling like their constituency is right there with them.

Eliezer 02:21:11
Yeah. So in the less than perfectly optimal scenario, people speak out and perhaps demonstrate. As we always say, we expect these demonstrations to be more impactful if they are large and lawful; that is our slogan.

If people speak out individually in surveys and letters to make it clear to their elected representatives that they have their back, then in the case where this isn't all immediately settled a month later by the book being published, many people may have their individual parts to do.

It may be a strange sort of thing to emphasize, but I do feel like a lot of protest movements have a failure mode in which they're all about the protesting and the fun of protesting, and telling people that it's so very meaningful that you are holding this opinion and doing things. And if they actually had large scale goals that they could visibly succeed or fail at, well, that might not be such a great thing for the protest. Because then you might visibly fail, and people would look at the protest leaders and be like, you terrible leaders, you have led us to failure.

There's a kind of self-absorption where the important thing is shouting real loud. And that's one reason why I'm like, well, in the real ideal scenario, you wouldn't even have to show up. And that's not a very likely ideal scenario. But the thing here is, with no prejudice to the non-parents in the audience: for your kids to be safe, for your neighbors' kids to be safe. If you want your neighbor's kids to be safe, what you want is not the satisfaction of experiencing a protest. What you would ideally like is everything to be resolved very quickly, and before your kids are in danger. Or in any more danger.

And holding to that sense is, I think, an important thing to keep in mind and have be part of the tone. Even if you're marching, it's not about the march; you'd like that march to have never been necessary. You would've liked your kids to have never been in enough trouble that you would've needed to march. And there's so much performative political belief these days, and I think people notice it. And I think the politicians smell it. And if this is performative, it will get only performative responses. And then your neighbor's kids will be dead.

So this is the kind of heroism that we do not want to see in the world, that we do not want to be necessary, that we do not want to indulge in, that we do not want to be fun. If we are there, it is because it is necessary and we would like it to stop being necessary by the swiftest route.

Let that be our slogan.

Liron 02:23:38
Let's try to go beyond the performative, try to take actions that'll actually show some leadership to prevent this giant whale of a problem that seems to be coming at us very quickly. Let's all go pick up a copy of If Anyone Builds It, Everyone Dies. I think that'll be helpful to get that spread around.

I think many of the viewers, hopefully most, deeply care as much as I do, realize that what you're saying is very important, wanna support you, wanna support this mission.

Eliezer Yudkowsky, it's been an honor and a pleasure to have this discussion. Thanks for coming.

Eliezer 02:24:09
You're welcome. And remember, success is still being alive in five years. And no matter how high this book gets on the bestseller list, if at the end of that we all die, it means the book failed.

There's only one criterion for success of this book, and it is not the sales numbers, but the sales probably still help anyways. It's hopefully a good book. Thanks for letting me shill it on your show.

Liron 02:24:34
No problem. I'll be taking it from here, shilling it in my other videos.

That was the one and only Eliezer Yudkowsky. I really hope you'll take a look at his book: If Anyone Builds It, Everyone Dies. It's a really tight version of the argument that's been going around for over two decades. It's definitely worth your time and you'll be supporting a really worthwhile cause to try to mitigate the risk, whether you think it's 1%, 5%, or like me, 50%. It helps to get more eyes on the problem, get more awareness, because we just don't have much time.

Now if you enjoyed that interview and you want to see more Liron Shapira, that's me, I have another show. I mentioned it on the program, and the theme of the show is that I debate different luminaries, different thought leaders, some of the smartest minds in the world to see where they stand on AI extinction.

Why do I think AI extinction is this huge urgent threat and they think it's much milder? What's up with that? I was recently fortunate enough to debate Gary Marcus on that topic. I debated Vitalik Buterin, and I've also had some really fun interviews with people like Rob Miles and Mike Israetel, a wide variety of people I'm having this conversation with.

I think it's a very important, urgent conversation. I also speak with senior employees from some of the top AI companies, and these are hard hitting interviews focused on the question of: isn't superintelligent AI likely to kill everyone as soon as it gets too powerful for humans to alter its course?

So I hope you'll join me on my other YouTube channel, which is also a podcast you can subscribe to. My goal with all these interviews, debates, and other events is to facilitate high quality discourse on the most important issue of our time. Thanks for watching, and I'll see you in the next one.