
Isabella Duan — LessWrong


Race to the Top: Benchmarks for AI Safety

This is an executive summary of a post from my personal blog, also cross-posted from the EA Forum. Read the full texts here.

Summary

Benchmarks support the empirical, quantitative evaluation of progress in AI research. Although benchmarks are ubiquitous in most subfields of machine learning, they are still rare in...

Dec 4, 2022