As with all my blog posts, these are only very preliminary thoughts. I worry perhaps their scope gives them a sense of self-seriousness I don’t intend. This piece is about floating ideas that might be worth discussing, rather than strong claims to truth. Cate Hall tweeted: “In 2017 I was...
I am republishing this essay because of recent discussions about erratic, ‘emotional’ and aggressive behavior by Bing’s AI, Sydney. There has been some discussion about whether it’s ethical to run Sydney given that behavior. People are responding to such claims with “don’t be ridiculous, of course Sydney doesn’t feel anything,...
Cross-posting this from my blog, since it seems relevant. The case for GPT understanding language, by way of understanding the world There's a debate going on about whether or not language models similar to ChatGPT have the potential to be scaled up to something truly transformative. There's a group of...
I’ve been thinking about the control problem lately. The control problem, also called the AI alignment problem is, per Wikipedia: [A]spects of how to build AI systems such that they will aid rather than harm their creators. One particular concern is that humanity will have to solve the control problem...
I've written previously about an idea I call verbal parity, viz: A machine has verbal parity when, if given input in written form, it can give an appropriate output in written form as a response, as well as a human, for any input. Now this definition has many problems. For...
I’m putting my existing work on AI on Less Wrong, and editing as I go, in preparation to publishing a collection of my works on AI in a free online volume. If this content interests you, you could always follow my Substack, it's free and also under the name Philosophy...
Preface: I’m putting my existing work on AI on Less Wrong, and editing as I go, in preparation to publishing a collection of my works on AI in a free online volume. If this content interests you, you could always follow my Substack, it's free and also under the name...