Hey y'all. I have a hobby of collecting papers and videos that go over interesting research or subfields/topics that are less known in AI safety. But I generally reach far outside what I'm able to directly use. So, here are some things I found that are interesting to me. To make best use of this post, please turn on your mental skim mode. I have more interesting links with high impact on capabilities that I don't feel comfortable sharing widely right now, but which I will share with some labs; please DM me for those links. They're public papers, though, so my not sharing links barely has any impact.
(And for any who need the reminder: when reading papers, skim quickly until you've determined that a paper is relevant to topics you'd like to intervene on. https://www.semanticscholar.org/paper/How-to-read-a-paper-Keshav/15bd289b853f97eab5a5e930bfc27f3f803ce81d)
I'm going to post this and then edit it over the course of a few hours, so I stop leaving it unpublished. Update, +3 hours: have now added most of the links I'm going to. Next up, adding summaries and maybe a few more links. Will finish editing tomorrow. Another update, +9 days: I've been distracted; comment a word or two if you'd like a higher quality version of this post to exist.
edit: see also this post on finding things on the internet.
This is all stuff I haven't really deeply evaluated, but which looks cool. Each item is labeled with how likely I am to read it: done > definitely > probably > maybe > iffy
Thanks for sharing!