Summary: AI models are now improving on METR’s Time Horizon benchmark at about 10x per year, compared to ~3x per year before 2024. They will likely return to a slower pace in 2026 or 2027 once RL makes up a sizable fraction of compute spending, but the current rate is...
Thanks to Ashwin Acharya, David Schneider-Joseph, and Tom Davidson for extensive discussion and suggestions. Thanks to Aidan O’Gara, Alex Lintz, Ben Cottier, James Sanders, Jamie Bernardi, Rory Erlich, and Ryan Greenblatt for feedback.
Part 1: Executive Summary
There’s growing sentiment in the AI community that artificial general intelligence (AGI[1]) could...
A Long Time From Now, in a Galaxy Far, Far Away: Episode 1: The Phantom Mugger
One night, on your way home from work, you find yourself face to face with a mysterious shadowy figure. “Your money or your life,” he hisses through his teeth. The tip of a knife...
This is a post on OpenAI’s “AI and Compute” piece, as well as excellent responses by Ryan Carey and Ben Garfinkel, Research Fellows at the Future of Humanity Institute. (Crossposted on the EA Forum)
Intro: AI and Compute
Last May, OpenAI released an analysis on AI progress that blew me...