We’re excited to announce that applications are now open for our 2025 Q1 Pivotal Research Fellowship, a 9-week program designed to enable promising researchers to produce impactful research and accelerate their careers in technical AI safety, AI governance, and biosecurity. About the Fellowship The Pivotal Research Fellowship is hosted in...
The Swiss Existential Risk Initiative (CHERI) is now called Pivotal Research, and the CHERI research fellowship is now the Pivotal Research Fellowship. Apply for the Pivotal Research Fellowship this summer in London to research global catastrophic risks (GCR) with experienced mentors on technical AI safety, AI governance, biosecurity & pandemic...
Overview * Solving the problem of mesa-optimization would probably be easier if we understood how models do search internally * We are training GPT-type models on the toy task of solving mazes and studying them in both a mechanistic interpretability and behavioral context. * This post lays out our model...
This post was written by Marius Hobbhahn and Tilman Räuker. Disclaimer: We have previously posted this piece on the EA forum. We now post it here because LW allows for polls, and we have worked in additional feedback. Over the last years, we have encountered stances on development within AI...