Alignment Newsletter One Year Retrospective

[-]Wei Dai7yΩ11340

The main value to me is being updated on all the research that is going on in this field. If the newsletter went away and nothing else changes, I don't know how I would find all the new relevant papers and posts that come out.

I think I've commented on your newsletters a few times, but haven't commented more because it seems like the number of people who would read and be interested in such a comment would be relatively small, compared to a comment on a more typical post. A lot of people who read your newsletters are doing so by email and won't even see my comment, and someone who does read them through LW/AF might not be interested in the particular paper (or your opinion of it) that I want to discuss. Plus, the fact that you avoid giving strong negative opinions (which BTW seems sensible to me for a newsletter format) makes it less likely that I feel an urgent need to correct something.

One idea you can consider is to create individual link posts on AF for the most important papers/posts that you include in the newsletter (with your summaries and opinions) that haven't already been posted to AF, which would create focal points for discussing them. I think if I had a thought on some paper that is mentioned in your newsletter, I'd be more inclined to write a comment for it under its own link post as opposed to under your newsletter post. I would also be more inclined to comment on your summaries and opinions if there was a chance to correct something before it went out to your email subscribers. This could also be a way for you to solicit summaries from random readers.

[-]ESRogs7y60

I think I've commented on your newsletters a few times, but haven't commented more because it seems like the number of people who would read and be interested in such a comment would be relatively small, compared to a comment on a more typical post.

I am surprised you think this. Don't the newsletters tend to be relatively highly upvoted? They're one of the kinds of links that I always automatically click on when I see them on the LW front page.

Maybe I'm basing this too much on my own experience, but I would love to see more discussion on the newsletter posts.

[-]Rohin Shah7yΩ240

Thanks! Link posts on AF are an interesting idea; my current expectation is that very few people apart from you would comment on them, but it seems worth trying.

I would also be more inclined to comment on your summaries and opinions if there was a chance to correct something before it went out to your email subscribers.

This makes sense, will think about how to make it happen.

[-]Rob Bensinger7yΩ360

One option that's smaller than link posts might be to mention in the AF/LW version of the newsletter which entries are new to AIAF/LW as far as you know; or make comment threads in the newsletter for those entries. I don't know how useful these would be either, but it'd be one way to create common knowledge 'this is currently the one and only place to discuss these things on LW/AIAF'.

[-]Rohin Shah7yΩ6140

Comment thread for the question: What is the value of the newsletter for you?

[-]Raemon7yΩ4120

Copied from my answer in the feedback form:

I'm a layman, attempting to help with infrastructure for technical people, who reads the newsletter sporadically to keep up with the overall trends in AI and AI Safety.

Right now I read the newsletter fairly sporadically. I think it might benefit me to read, once a year or maybe once a quarter, a higher-level summary that goes over which papers seemed most important that year, and which overall research trends seemed most significant. I'm not sure if this is worth the opportunity cost for you, but it'd be helpful to me and probably others.

(I'd be interested in that both from the standpoint of my own personal knowledge, as well as tracking how stable your opinions are over time – when you list something as particularly interesting or important do you tend to still think so a year later?)

I also think it'd make more sense for LessWrong to curate a "highlights of the highlights" post once every 3-12 months, than what we currently do, which is every so often randomly decide that a recent Newsletter was particularly good and curate that.

[-]Rohin Shah7yΩ360

I think it might benefit me to read, once a year or maybe once a quarter, a higher-level summary that goes over which papers seemed most important that year, and which overall research trends seemed most significant. I'm not sure if this is worth the opportunity cost for you, but it'd be helpful to me and probably others.

A slightly different option would be to read the yearly AI alignment literature review, use that to find the top N most interesting papers, and read their summaries in the spreadsheet. This also has the benefit of showing you a perspective other than mine on what's important -- there could be an Agent Foundations paper in the list that I haven't summarized.

(I'd be interested in that both from the standpoint of my own personal knowledge, as well as tracking how stable your opinions are over time – when you list something as particularly interesting or important do you tend to still think so a year later?)

I think that the stability of my opinions is going up over time, mainly because I started the newsletter while still new to the field.

I also think it'd make more sense for LessWrong to curate a "highlights of the highlights" post once every 3-12 months, than what we currently do, which is every so often randomly decide that a recent Newsletter was particularly good and curate that.

This seems good; I'm currently thinking I could write something like that once every 25 newsletters (which is about half a year), which should also help me evaluate the stability of my opinions.

[-]Rana Dexsin7y60

I browse this newsletter occasionally via LW; I am not subscribed by email. I am not so far seriously involved in AI research, and I don't wind up understanding most of it in detail, but I have a longer-term interest in such issues, and I want to keep a fraction of a bird's eye on the state of the field if possible, so that if I start in on deeper such activities a few years from now, I can re-skim the archives and try to catch up.

[-]Rana Dexsin7y10

Speculative followup: seeing a few other people say similar things here and contrasting it with what seems to have been implied in the retrospective itself makes me guess there's a seriousness split between LW and email "subscribers". Does the former have passersby dominating the reader set (especially since it'll be presented to people who are on LW for some other reason), whereas anyone who cares more deeply and specifically will primarily consume the newsletter by email?

[-]Rohin Shah7y20

Oh, I think there are a lot of email subscribers who skim/passively consume the newsletter. I didn't focus very much on them in the retrospective because I don't think I'm adding that much value to them.

It might be true that all of the people who read it thoroughly are subscribed by email, I'm not sure. It's hard to tell because I expect skimmers far outnumber thorough readers, so seeing a few skimmers via the comments is not strong evidence that there aren't thorough readers.

[-]ryan_b7y60

It is hits-based for me, where the hit is usually using analogies or models I otherwise have a better understanding of than alignment. Because I am a relative layperson I do not get a deep understanding of the papers, but the questions of the field are intrinsically interesting to me and I find the difference in viewpoints between the papers and opinions/summaries I do hit on very useful for trying to keep a 'shape of the field' in mind should I ever need to engage with it more deeply.

[-]Ofer7y50

The newsletter is extremely helpful for me for keeping up to date with AI alignment research. I also find the "Other progress in AI" section very helpful.

Both the summaries and the opinion segments are extremely helpful for me!

Overall, I think that reading (or listening to) all the ANs that I've read so far was an extremely high EV-per-hour time investment.

[-]Rohin Shah7y20

Thanks!

[-]dearken7y50

I'm a second-year college student. I hope to pursue a career in computing ethics, but I'm not sure I'll end up specifically in AI safety. I've attended some AI safety research meetings at my school, but I don't expect to actually begin doing my own research until next year.

I laughed at your idea that some people subscribe to the newsletter to feel like part of an elite group... yeah, that might be me at this point! However, I think it will be very useful for me when I have more time this summer to spend on deciphering the content. If I don't understand something in your summary, I look it up, so I've already begun to organically build a useful knowledge base.

Also, the newsletter provides me with a regular dose of reassurance and inspiration. Even when I don't have time to thoroughly read the summaries, skimming them reminds me how interesting this field is.

Thanks for your work, and I enjoyed reading the retrospective!

[-]Rohin Shah7y20

Thanks!

If I don't understand something in your summary, I look it up, so I've already begun to organically build a useful knowledge base.

This seems like a great way to use the newsletter :)

Also, the newsletter provides me with a regular dose of reassurance and inspiration. Even when I don't have time to thoroughly read the summaries, skimming them reminds me how interesting this field is.

[-]Jsevillamol7y120

Some back of the envelope calculations trying to make sense out of the number of subscribers.

The EA survey gets about 2500 responses per year from self-identified EAs and I expect it represents between 10% and 70% of the EA community, so a fair estimate is that the EA community is about 1e4 people.
They ask about top priorities. About 16% of respondents consider AI risk a top priority.
Assuming representativeness, that means about 2e3 EAs who consider AI risk a priority.
Of those I would expect about half to be considering actively pursuing a career in the field, for 1e3 people.
This checks out with the newsletter number of subscribers.
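The chain of estimates above can be written out as a short calculation. This is only a sketch of the arithmetic as stated in the comment: the function and parameter names are made up for illustration, and the 1e4 community size is taken directly from the comment rather than derived from the 10%-70% range.

```python
def fermi_estimate(
    survey_responses=2500,      # responses per year, from the comment
    coverage_low=0.10,          # survey covers at least 10% of the community
    coverage_high=0.70,         # ... and at most 70%
    community_point=1e4,        # the comment's point estimate (assumed, not derived)
    ai_priority_share=0.16,     # share who consider AI risk a top priority
    career_share=0.5,           # share of those considering a career in the field
):
    """Reproduce the back-of-the-envelope estimate step by step."""
    # Bounds on community size implied by the coverage range.
    community_min = survey_responses / coverage_high
    community_max = survey_responses / coverage_low
    # People who consider AI risk a top priority, assuming representativeness.
    ai_priority = community_point * ai_priority_share
    # People considering a career in the field.
    career = ai_priority * career_share
    return {
        "community_min": community_min,
        "community_max": community_max,
        "ai_priority": ai_priority,
        "career": career,
    }


if __name__ == "__main__":
    for name, value in fermi_estimate().items():
        print(f"{name}: {value:,.0f}")
```

Without rounding between steps the final figure comes out nearer 800 than 1e3, because the comment rounds 1600 up to 2e3 before halving. Either way the result is the same order of magnitude as the subscriber count.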

[-]Rohin Shah7y130

Hmm, this seems roughly plausible. It doesn't gel with my experience of how many people seem to be trying to enter the field (which I would have estimated almost an order of magnitude less, maybe 100-200), but it's possible that there's a large group of such people who I don't interact with who nonetheless are subscribed to the newsletter.

We also might have different intended meanings of "career in the field".

[-]Rohin Shah7yΩ5110

Comment thread for the question: Am I underestimating the risk of causing information cascades? Regardless, how can I mitigate this risk?

[-]ryan_b7y40

I believe the risk of information cascades due to the newsletter is very low. The biggest factor in my expectation is this one:

As a result, an opinion from a researcher who didn't do the work can help contextualize the results that makes it easier for less involved readers to figure out the importance of the ideas.

That is to say, it is very clear that this is a newsletter, and that your opinion differs from that of the authors of the papers. This goes a long way to preventing the kind of uncritical agreement that typifies information cascades.

Also, consider the case where nothing in the newsletter ever becomes the subject of wide agreement: this suggests to me that either the field is not making enough progress to settle questions (which is very bad), or that the newsletter is by accident or design excluding ideas upon which the field might settle (which seems bad from the perspective of the newsletter).

Finally, I expect this field and the associated communities are unusually sensitive to information cascades as a problem, and therefore less likely to fall victim to them.

So there is a mechanism working against cascades, other reasons to expect things in the newsletter to be widely agreed on over time, and a community less likely to fall victim to cascades even if the former two items did not apply.

Mitigation: other people writing summaries and opinions seems to me the best mitigation, because a diversity of opinion works directly against the cascade and it also seems a likely course for handling the increased volume of research. In particular, if two or more contributors to the newsletter wrote up different opinions on a paper, this would probably kill the cascade dead for that particular paper and further signal the importance of direct evaluation for the others.

[-]Rohin Shah7y20

Also, consider the case where nothing in the newsletter ever becomes the subject of wide agreement: this suggests to me that either the field is not making enough progress to settle questions (which is very bad), or that the newsletter is by accident or design excluding ideas upon which the field might settle (which seems bad from the perspective of the newsletter).

Certainly when my opinions are right I would hope that they become widely agreed upon (and I probably don't care too much if it happens via information cascade or via good epistemics). The question is about when I'm wrong.

That is to say, it is very clear that this is a newsletter, and that your opinion differs from that of the authors of the papers. This goes a long way to preventing the kind of uncritical agreement that typifies information cascades.

Journalism has the same property, but I do see uncritical agreement with things journalists write. Admittedly the uncritical agreement comes from non-experts, but with the newsletter I'm worried mostly about insufficiently critical agreement from researchers working on different areas, so the analogy kinda sorta holds.

Finally, I expect this field and the associated communities are unusually sensitive to information cascades as a problem, and therefore less likely to fall victim to them.

Agreed that this is very helpful (and breaks the analogy with journalism), and it's the main reason I'm not too worried about information cascades right now. That said, I don't feel confident that it's enough.

I think overall I agree with you that they aren't a major risk, and it's good to get a bit of information that at least you treat the opinion as an opinion.

[-]Rohin Shah7yΩ490

Comment thread for the question: What can I do to get more feedback on the newsletter on an ongoing basis (rather than having to survey people at fixed times)?

[-]digital_carver7y60

Simply including a line in the newsletter saying that feedback and comments are welcome can make a difference, letting people know that this is not just something for consumption but something they can discuss and voice opinions on. Possibly more effective might be to have a "feedback week" every month, asking people to give feedback on whatever forum they are reading it on - having a specific time to give feedback is more likely to lead to action than leaving a continuously open window for it (even if the window does continue to be open at other times).

[-]Rohin Shah7y20

Yeah, I like the idea of having specific times for feedback, it does seem more likely that people actually bother to give feedback in those cases.

[-]ryan_b7y40

This is a good method, by which I mean creating comment threads on LessWrong. It has the added advantage of letting you solicit specific feedback, instead of just collecting what bubbles to the surface from the commentariat.

[-]Ofer7y30

Explicitly saying that you'd like feedback on the newsletter (like you just did in this post) would probably help and as digital_carver suggested you can include a request for feedback in each newsletter. For example, the Import AI newsletter ends with "If you have suggestions, comments or other thoughts you can reach me at ... or tweet at me..."

[-]Rohin Shah7yΩ490

Comment thread for the question: How should I deal with the growing amount of AI safety research?

[-]ryan_b7y60

Additional contributors to share the load of summarizing and searching seem like a good course, particularly since a diversity of contributors weighs against an information cascade.

[-]digital_carver7y30

This can be another window for feedback, asking readers to add any other relevant papers that they think are missing from the newsletter. Then, if there are enough of sufficient quality, those can be linked to in the next newsletter, with acknowledgement to the commenter who posted them.

[-]Rohin Shah7yΩ370

Comment thread for the question: What is the value of the newsletter for other people?

[-]Ofer7y*30

However, Twitter has become worse over time, possibly because it has learned to show me non-academic stuff that is more attention-grabbing or controversial, despite me trying not to click on those sorts of things.

On Twitter you can create a list of relevant people (e.g. people who tend to tweet about relevant papers/posts) and then go over the complete "feed" of just that list, sorted chronologically.

[-]Rohin Shah7y20

Ooh, I might have to try this, it does sound better.
