Mapping AI Capabilities to Human Expertise on the Rosetta Stone (Epoch Capabilities Index)
This is a crosspost from the General-Purpose AI Policy Lab research blog. The “Rosetta Stone for AI Benchmarks” paper by Epoch AI and Google DeepMind researchers, which underpins the Epoch Capabilities Index, gave us a great way to rank AI models and benchmarks on a common difficulty scale. But the...
Mar 9