Ofer — LessWrong

Book review: Architects of Intelligence by Martin Ford (2018)

Cross-posted from the EA Forum. The 2018 book Architects of Intelligence by Martin Ford is a collection of 23 interviews about progress in AI and future impacts thereof, including the prospect of developing AGI and existential risks. The interviewees include some of the most prominent and influential AI researchers, and...

Aug 11, 2020 · 15

The recent NeurIPS call for papers requires authors to include a statement about the potential broader impact of their work

NeurIPS (formerly NIPS) is a top conference in machine learning and computational neuroscience. The recently published call for papers for NeurIPS 2020 includes the following (which did not appear in previous years): > In order to provide a balanced perspective, authors are required to include a statement of the potential...

Feb 24, 2020 · 12

ofer's Shortform

Nov 26, 2019 · 4

A probabilistic off-switch that the agent is indifferent to

Edit: I no longer think this post deserves attention. Abstract: This post presents a setup with an off-switch that is defective with probability of almost 0. The agent is indifferent to being terminated in worlds where the off-switch works. Also, the agent doesn't try to find out whether the off-switch...

Sep 25, 2018 · 11

Looking for AI Safety Experts to Provide High Level Guidance for RAISE

The Road to AI Safety Excellence (RAISE) initiative aims to allow aspiring AI safety researchers and interested students to get familiar with the research landscape effectively, thereby hopefully increasing the number of researchers who contribute to the field. To that end, we (the RAISE team) are trying to build a...

May 6, 2018 · 17

A Safer Oracle Setup?

Edit: I no longer think this post deserves attention. Edit: Invoking a black-box agent is probably a very bad idea unless there’s a consensus in the AI safety community it should be done. This post describes a setup that allows asking a super-intelligent agent a question, and getting an answer...

Feb 9, 2018 · 5