Columbia EA is starting an Advanced AI Safety Fellowship this summer. It will involve one 3-hour meeting per week with a group of people seriously considering careers in AI safety, all of whom have some background in both ML (at least a working knowledge of linear algebra, multivariable calculus, probability and statistics, and neural networks) and AI safety (usually through the Cambridge AGI Safety Fundamentals program or a similar reading group we run at Columbia). Each meeting will involve reading about a topic selected the previous week and discussing it, and may also include exercises (like these). While we do not have the capacity to invite people from elsewhere to join our meetings, we would like to invite others to follow along with our exploration of the field! You can find our running meeting notes here, where we will write down our meeting agendas, readings, and other notes (e.g., key takeaways, responses to exercises, and questions).
The hope is that this can serve as a small, easy win by motivating at least a few people to set aside time to dive deep into AI alignment topics, with little additional effort on our part. Participating in an alternative program with a similar premise, in which the organizers put more effort into making it a good experience for you, would likely be better than following along with us. If such a program already exists, please let me know! I’m not currently aware of any programs that satisfy all of the following properties:
Given that we do not satisfy property 3, you could dispense with “following along” entirely and structure your whole plan yourself if you feel that would work better for you. There are two main benefits we hope to provide for people who do follow along:
We don’t have any strong takes on what it would look like to follow along with us, but here is a suggested path:
Thanks to Berkan Ottlik for co-running the fellowship and providing suggestions for this post. All mistakes are my own.