Simon Goldstein — LessWrong

Will AI and Humanity Go to War?

[This post is the introduction to my full paper, available at https://philpapers.org/rec/GOLWAA. This post was partially inspired by a LW comment thread between @Matthew Barnett and @Wei Dai.] Abstract. This paper offers the first careful analysis of the possibility that AI and humanity will go to war. The paper focuses...

Oct 1, 2024 · 17

AI Rights for Human Safety

Just wanted to share a new paper on AI rights, co-authored with Peter Salib, that members of this community might be interested in. Here's the abstract: AI companies are racing to create artificial general intelligence, or “AGI.” If they succeed, the result will be human-level AI systems that can independently...

Aug 1, 2024 · 55

[Linkpost] A Case for AI Consciousness

by cdkg and Simon Goldstein

Just wanted to share a new paper on AI consciousness with Simon Goldstein that members of this community might be interested in. Here's the abstract: It is generally assumed that existing artificial systems are not phenomenally conscious, and that the construction of phenomenally conscious artificial systems would require significant technological...

Jul 6, 2024 · 22

AI Deception: A Survey of Examples, Risks, and Potential Solutions

By Peter S. Park, Simon Goldstein, Aidan O’Gara, Michael Chen, and Dan Hendrycks [This post summarizes our new report on AI deception, available here] Abstract: This paper argues that a range of current AI systems have learned how to deceive humans. We define deception as the systematic inducement of false...

Aug 29, 2023 · 54

Shutdown-Seeking AI

This is a draft written by Simon Goldstein, associate professor at the Dianoia Institute of Philosophy at ACU, and Pamela Robinson, postdoctoral research fellow at the Australian National University, as part of a series of papers for the Center for AI Safety Philosophy Fellowship's midpoint. Abstract: We propose developing AIs...

May 31, 2023 · 50

Language Agents Reduce the Risk of Existential Catastrophe

by cdkg and Simon Goldstein

This post was written by Simon Goldstein, associate professor at the Dianoia Institute of Philosophy at ACU, and Cameron Domenico Kirk-Giannini, assistant professor at Rutgers University, for submission to the Open Philanthropy AI Worldviews Contest. Both authors are currently Philosophy Fellows at the Center for AI Safety. Abstract: Recent advances...

May 28, 2023 · 39

The Polarity Problem [Draft]

by Dan H, cdkg, and Simon Goldstein

This is a draft written by Cameron Domenico Kirk-Giannini, assistant professor at Rutgers University, and Simon Goldstein, associate professor at the Dianoia Institute of Philosophy at ACU, as part of a series of papers for the Center for AI Safety Philosophy Fellowship's midpoint. Dan helped post to the Alignment Forum....

May 23, 2023 · 24