x

LESSWRONG

LW

Lizka — LessWrong

Lizka

Top postsTop post

Lizka

Message

I'm a researcher at Forethought.

Before that, I ran the non-engineering side of the EA Forum and worked on some other content-related tasks at CEA. [More about the Forum/CEA Online job.]

Most of my content (and a more detailed bio) is on my profile on the EA...

320

11

4

5y

Lizka

I'm a researcher at Forethought.

Before that, I ran the non-engineering side of the EA Forum and worked on some other content-related tasks at CEA. [More about the Forum/CEA Online job.]

Most of my content (and a more detailed bio) is on my profile on the EA...

Top postsTop post

AI benchmarking has a Y-axis problem

TLDR: People plot benchmark scores over time and then do math on them, looking for speed-ups & inflection points, interpreting slopes, or extending apparent trends. But that math doesn’t actually tell you anything real unless the scores have natural units. Most don’t. Think of benchmark scores as funhouse-mirror projections of “true” capability-space, which stretch some regions and compress others by assigning warped scores for how much accomplishing that task counts in units of “AI progress”. A plot on axes without canonical units will look very different depending on how much weight we assign to different bits of progress.[1] Epistemic status: I haven’t vetted this post carefully, and have no real background in benchmarking or statistics. Benchmark scores vs "units of AI progress" Benchmarks look like rulers; they give us scores that we want to treat as (noisy) measurements of AI progress. But since most benchmark score are expressed in quite squishy units, that can be quite misleading. * The typical benchmark is a grab-bag of tasks along with an aggregate scoring rule like “fraction completed”[2] * ✅ Scores like this can help us... * Loosely rank models (“is A>B on coding ability?”) * Operationalize & track milestones (“can a model do X yet?”) * Analyze this sort of data[3] * ❌ But they’re very unreliable for supporting conclusions like: * “Looks like AI progress is slowing down” / “that was a major jump in capabilities!” * “We’re more than halfway to superhuman coding skills” * “Models are on track to get 80% by EOY, which means...” * That's because to meaningfully compare score magnitudes (or interpret the shape of a curve), scores need to be proportional to whatever we're actually trying to measure * And grab-bag metrics don’t guarantee this: * Which tasks to include and how to weight them are often subjective choices that stretch or compress different regions of the scale * So a 10-point gain early on might reflec

Beware safety-washing

Which questions can’t we punt?

Design sketches for a more sensible world

Defense-favoured coordination design sketches

This post is part of a sequence. Previous post: Strategic awareness tools: design sketches Intro We think that near-term AI could make it much easier for groups to coordinate, find positive-sum deals, navigate tricky disagreements, and hold each other to account. Partly, this is because AI will be able to...

Which questions can’t we punt?

We think AI strategy researchers should prioritize questions related to earlier parts of the AI transition, even when that means postponing work on some questions that ultimately seem more important. In brief, our case for taking this “just-in-time” perspective is: * There are more open AI strategy questions than we...

Strategic awareness tools: design sketches

This post is part of a sequence. Previous post: Design sketches for angels-on-the shoulder | Next post: Defense-favoured coordination design sketches We’ve recently published a set of design sketches for tools for strategic awareness. We think that near-term AI could help a wide variety of actors to have a more...

Design sketches for a more sensible world

We don’t think that humanity knows what it’s doing when it comes to AI progress. More and more people are working on developing better systems and trying to understand what their impacts will be — but our foresight is just very limited, and things are getting faster and faster. Imagine...

Design sketches for angels-on-the-shoulder

This post is part of a sequence. Previous post: Design sketches: collective epistemics | Next post: Strategic awareness tools: design sketches We’ve recently published a set of design sketches for technological analogues to ‘angels-on-the-shoulder’: customized tools that leverage near-term AI systems to help people better navigate their environments and handle...

AI benchmarking has a Y-axis problem

TLDR: People plot benchmark scores over time and then do math on them, looking for speed-ups & inflection points, interpreting slopes, or extending apparent trends. But that math doesn’t actually tell you anything real unless the scores have natural units. Most don’t. Think of benchmark scores as funhouse-mirror projections of...

The first type of transformative AI?

AI risk discussion often seems to assume that the AI we most want to prepare for will emerge in a “normal” world — one that hasn’t really been transformed by earlier AI systems. I think betting on this assumption could be a big mistake. If it turns out to be...

Load More (7/15)