I would also object to his implication that the only time-sensitive thing about AGI safety is to figure it out before AGI comes. The other thing is determining which of today's many ongoing research paths towards AGI is likeliest to lead to an AGI architecture that's amenable to being used safely and beneficially. ...And that kind of information is only useful if we get it WAY before AGI is around the corner. :-)
The sad/good thing is that this article represents progress. I recall that in Human Compatible Stuart Russell said that there was a joint declaration from some ML researchers that AGI is completely impossible, and it's clear from this article that Oren is at least thinking about it as a real possibility that isn't hundreds of years away. Automatically forming learning problems sounds a lot like automatically discovering actions, which is something Stuart Russell also mentioned in a list of necessary breakthroughs to reach AGI, so maybe there's some widespread agreement about what is still missing.
That aside, even by some of Oren's own metrics, we've made quite substantial progress - he mentions Winograd schemas as a good test of when we're approaching human-like language understanding and common sense, but what he may not know is that GPT-2 actually bridged a significant fraction of the gap on Winograd schema performance between the best existing language models and humans, which is a good object lesson in how the speed of progress can surprise you: accuracy went from 63% to 71%, with humans at about 92% according to DeepMind.
I find that Winograd schemas are more useful as a guide for adversarial queries to stump AIs than as an actual test. An AI reaching human-level accuracy on Winograd schemas would be much less impressive to me than an AI passing the traditional Turing test conducted by an expert who is aware of Winograd schemas and experienced in adversarial queries in general. The former is more susceptible to Goodhart's law due to the stringent format and limited problem space.
Oren Etzioni recently wrote a popular article titled "How to know if artificial intelligence is about to destroy civilization." I think it's a good idea to write publicly available responses to articles like this, in case interested people are googling around looking for one. If I'm lucky, perhaps Oren himself will come across this! Then we could have a proper discussion and we'd probably both learn things.
For most readers of LW, it's probably not worth reading further, since you probably already agree with what I'm saying.
Here is the full text of the article, interspersed with my comments:
It is unfair to claim that Bostrom is speculating whereas your question is concrete and empirical. For one thing, Bostrom and others you denigrate have already asked the same question you now raise; there is not yet a consensus on the answer, but plenty of arguments have been made -- see e.g. Yudkowsky's "There's No Fire Alarm for Artificial General Intelligence." More importantly, though, how is your question any less speculative than theirs? You too are making guesses about which future technologies will come when and in what order, and you too have zero historical data to draw from. Perhaps you could make arguments based on analogy to other technologies -- e.g. as Grace and Christiano have argued about "slow takeoff" and "likelihood of discontinuous progress" -- but funnily enough (looking ahead at the rest of the article) you don't even do that; you just rely on your intuition!
If you think "many computer scientists believe X about AI" is a good reason to take X seriously, then... well, you'll be interested to know that many computer scientists believe superintelligent AI will come in the next 15 years or so, and also many computer scientists believe AI is an existential risk.
I agree with you here; rapid progress in AI gaming is evidence, but not nearly conclusive evidence, that human-level AGI is coming soon.
What about GPT-2? There was relatively little spoon-feeding involved there; they just gave it pretty much the whole Internet and told it to start reading. For comparison, I received much more spoon-feeding myself during my education, yet I managed to reach human-level intelligence pretty quickly!
Anyhow, I agree that machines can't currently learn as well as humans can. But even you must admit that over the past few decades there has been steady progress in that direction, and that progress seems likely to continue.
I think I agree with you here, that we don't really have an impressive example of an AI organically identifying, framing, and shaping learning problems. Moreover I agree that this probably won't happen in the next ten years, based on my guesses about current rates of progress. And I agree that this would be a canary -- once we have impressive AI doing this organically, then human-level AGI is probably very close. But shouldn't we start preparing for human-level AGI before it is probably very close?
Again, I agree that we are far from being able to do that -- being a doctor requires a lot of very general intelligence, in the sense that you have to be good at a lot of different things simultaneously and also good at integrating those skills. And I agree that once we have AI that can do this, human-level AGI is not far away. But again, shouldn't we get started preparing before that point?
Yep. Same response.
First of all, this isn't an argument based on tiny probabilities of infinite costs. The probability that human-level AI will arrive soon may be small, but it is much, much higher than other probabilities that you regularly prepare for, such as the probability that you will be in a car accident today, or the probability that your house will burn down. If you think this is a Pascal's Wager, then you must think buckling your seatbelt and buying insurance are too.
Secondly, again, the rationale for preparing isn't just that AGI might arrive soon, it's also that it is good to start preparing before AGI is about to arrive. Suppose you are right and there is only a tiny tiny chance that AGI will arrive before these canaries. Still, preparing for AGI is an important and difficult task; it might take several -- even many! -- years to complete. So we should get started now.
What? No we won't -- you yourself said that human-level AGI will not be far away when these canaries start collapsing. Moreover, you seem to think here that preparation will be easy: all we need to do is design some off-switches and specify some red lines... This is really naive. You should read the literature, which contains lots of detailed discussion of why solutions like that won't work. I suggest starting with Bostrom's Superintelligence, one of the early academic works on the topic, and then branching out to skim the many newer developments that have arisen since then.
Here's another argument that has the same structure as your argument: "Some people think we should give funding to the CDC and other organizations to help prepare the world for a possible pandemic. But we have no idea when such a pandemic will arise--and moreover, we can be sure that there will be "canaries" that will start dying beforehand. For example, before there is a worldwide pandemic, there will be a local epidemic that infects just a few people but seems to be spreading rapidly. At that point, it makes sense to start funding the CDC. But anything beforehand is merely un-empirical speculation that distracts from addressing existing issues like how to stay safe from the flu. Oh, what's that? This COVID-19 thing looks like it might become a pandemic? OK sure, now we should start sending money to the CDC. But Trump was right to slash its budget just a few months earlier, since at that point the canary still lived."
Also, Andrew Ng, the concern is not that AI will turn evil. That's a straw man, as you would realize if you read the literature. Instead, the concern is that AI will turn competent. As leading AI expert Stuart Russell puts it, we are currently working very hard to build an AI that is smarter than us. What if we succeed? Well then, by the time that happens, we'd better have also worked hard to make it share our values. Otherwise we are in deep trouble.