What success looks like

Marius Hobbhahn; MaxRa; JasperGeh; Yannick_Muehlhaeuser

What success looks like

by Marius Hobbhahn, MaxRa, JasperGeh, Yannick_Muehlhaeuser

2 min read28th Jun 20224 comments

19 Ω 8

AI GovernanceAI Risk Concrete StoriesAI Success ModelsAI

Frontpage

Crossposted from the AI Alignment Forum. May contain more technical jargon than usual.

This is a linkpost for https://forum.effectivealtruism.org/posts/AuRBKFnjABa6c6GzC/what-success-looks-like

TL;DR: We wrote a post on possible success stories of a transition to TAI to better understand which factors causally reduce the risk of AI risk. Furthermore, we separately explain these catalysts for success in more detail and this post can thus be thought of as a high-level overview of different AI governance strategies.

Summary

Thinking through scenarios where TAI goes well informs our goals regarding AI safety and leads to concrete action plans. Thus, in this post,

We sketch stories where the development and deployment of transformative AI go well. We broadly cluster them like
1. Alignment won’t be a problem, …
  - Because alignment is easy: Scenario 1
  - We get lucky with the first AI: Scenario 4
2. Alignment is hard, but …
  - We can solve it together, because …
    - We can effectively deploy governance and technical strategies in combination together: Scenario 2
    - Humanity will wake up due to an accident: Scenario 3
    - The US and China will realize their shared interests: Scenario 5
  - One player can win the race, by …
    - Launching an Apollo Project for AI: Scenario 6
We categorize central points of influence that seem relevant for causing the success of our sketches. The categories with some examples are:
1. Governance: domestic laws, international treaties, safety regulations, whistleblower protection, auditing firms, compute governance and contingency plans
2. Technical: Red teaming, benchmarks, fire alarms, forecasting and information security
3. Societal: Norms in AI, publicity and field-building
We lay out some central causal variables for our stories in the third chapter. They include the level of cooperation, AI timelines, take-off speeds, size of the alignment tax, type of actors and number of actors

AI GovernanceAI Risk Concrete StoriesAI Success ModelsAI

Frontpage

19 Ω 8

New Comment

4 comments, sorted by

top scoring

Click to highlight new comments since: Today at 3:21 PM

[-]shminux2y20

What's a TAI? There is no definition of this acronym anywhere in this post or in the link, and google brings 3 different but apparently unrelated hits: threats in AI, IEEE Transactions on AI, and... Tentacular AI. I hope it's that last one.

Reply

[-]Erich_Grunewald2y47

I think usually Transformative AI.

Reply

[-]shminux2y20

Thanks :)

Reply

[-]Noosphere892y11

My mainline best case or median-optimistic scenario is basically partially number 1, where aligning AI is somewhat easier than today, plus acceleration of transhumanism and a multipolar world both dissolve boundaries between species and the human-AI divide, this by the end of the Singularity things are extremely weird and deaths are in the millions or tens of millions due to wars.

Reply

Moderation Log