Epistemic status: don’t know whether I actually believe all of this, but I think it’s worth considering. A “corrigible” agent, per the LW wiki, is: > …one that doesn’t interfere with what we would intuitively see as attempts to ’correct’ the agent, or ’correct’ our mistakes in building it; and...
A few days ago I saw this comic reposted, and I thought: wait! Unlike every prior time I have seen this comic, I actually know how to solve this now! One thing which I often find really cool, and which I don’t think comes up a lot in most people’s...
[Attention conservation notice: description of a social phenomenon which may be obvious to some people.] This post is partially inspired by Alexander Wales’ Majority and Minority, which I would recommend highly. I’m afraid of dogs. Like, pathologically afraid of dogs. I cross the street to avoid people walking tiny poodles;...
There is a common argument that AI development is dangerous that goes something like: 1. The “goal” of evolution is to make animals which replicate their genes as much as possible; 2. humans do not want to replicate their genes as much as possible; 3. we have some goal which...