I tried to organize a program where participants actually sincerely tried to solve the hard part of alignment, for up to 5 weeks. It went wrong a lot, largely due to fixable mistakes. Good things about the program: * I learnt a lot about alignment in doing the prep, interviewed...
I wrote this in DMs to Akshyae Singh, who's trying to start something to help bring people together to improve AI Safety communication. After writing it, I thought that it might be useful for others as well. I'd like to preface with the information that I'm not a marketing expert,...
[LessWrong Community event announcement: https://www.lesswrong.com/events/rRLPycsLdjFpZ4cKe/ai-safety-law-a-thon-we-need-more-technical-ai-safety] Many talented lawyers do not contribute to AI Safety, simply because they've never had a chance to work with AIS researchers or don’t know what the field entails. I am hopeful that this can improve if we create more structured opportunities for cooperation. And this...
My father is now saying to my little sister that if you want to be a doctor, you can only sleep 2 hours a day. He doesn't care about the truth being sacred. He will lie to himself, to others, to anyone. He has not seen the truth as sacred...
The Moonshot Alignment Program is a 5-week research sprint from August 2nd to September 6th, focused on the hard part of alignment: finding methods to get an AI to do what we want and not what don't want, which we have strong evidence will scale to superintelligence. You’ll join a...
Why we need more and better goalposts for alignment. Announcing an AI Alignment Evals Hackathon to help solve this. When it comes to AGI we have targets and progress bars, as benchmarks, evals, things we think only an AGI could do. They're highly flawed and we disagree about them a...
Overview Join us for the AI & Liability Ideathon, a two-week event on December 7, 2024, at 3:00 PM BST. https://lu.ma/sjd7r89v Join lawyers, researchers and developers to create solutions for AI Liability. Propose, develop and refine ideas with a team, ending in a presentation evening where you can share the...