Race to the Top: Benchmarks for AI Safety
This is an executive summary of a post from my personal blog, also cross-posted from the EA Forum. Read the full texts here.

Summary

Benchmarks support the empirical, quantitative evaluation of progress in AI research. Although benchmarks are ubiquitous in most subfields of machine learning, they are still rare in...
Dec 4, 2022