18 Implication of AI timelines on planning and solutions

by JJ Hepburn

21st Aug 2021

2 min read

5

18

AI

Frontpage

18

New Comment

5 comments, sorted by

top scoring

Click to highlight new comments since: Today at 6:13 AM

[-]Tao Lin4y60

I think that if we saw the working AI alignment solution used in 2050 in a paper written in 2026, we wouldn't be confident it would work. That's because there are a lot of uncertainties about how hard the AI alignment problem is in the first place, how ML behaves when it's scaled up, ect. I think most plans for AI safety need to go like "we make the theory now, then we keep working on it as ML scales up and adapt accordingly".

Reply

[-]JJ Hepburn4y30

Yes, if you have a solution in 2026 it isn't likely to be relevant to something used in 2050. But 2026 is the planned solution date and 2050 is the median TAI date.

The numbers I used above a just to demonstrate the point thought. The broad idea is that coming up with a solution/theory to alignment takes longer than planned. Having a theory isn't enough, you still have some time to make it count. Then TAI might come at the early end of your probability distribution.

It's pretty optimistic to plan that TAI will come at your median estimate and that you won't run into the planning fallacy.

Reply

[-]Tao Lin4y10

What I'm trying to say is that it's much harder to do AI alignment research while models are still small, so TAI timelines somewhat dictate the progress of AI alignment research. If I wanted my 5 year plan to have the best chance at success, I would have "test this on a dog-intelligence-level AI" in my plan, even if I thought that probably wouldn't arrive by 2036, because that would make AI alignment research much easier.

Reply

[-]JJ Hepburn4y10

The plan and numbers I lay out above you actually finish friendly AI in 2036, which is the 10% point

Reply

[-]Tao Lin4y40

Here is an argument I've heard for why we shouldn't try to solve AI alignment super early:

If you aren't one of the top few AI safety researchers in the world, then you are far more likely to solve AI alignment if you spend some years to develop your skills first. Therefore most people in AI alignment should forsake some early timelines (like anything before 2040) and optimize for their impact once they're a senior researcher.

This would be false if either less experienced AI safety researchers were able to contribute to completing AI alignment in 5 years, or if they can develop skills nearly as well working on a 5 year alignment plan as they could just optimizing for learning. I think both of these are somewhat true, which weakens the argument for me.

Reply

Moderation Log

LESSWRONG
LW

LESSWRONG
LW

18

Implication of AI timelines on planning and solutions

18

18