Cullen — LessWrong

Thoughts on "The Offense-Defense Balance Rarely Changes"

Personal Views Only Introduction I thought Maxwell Tabarrok’s The Offense-Defense Balance Rarely Changes was a good contribution to the discourse, and appreciated his attempt to use various lines of data to “kick the tires” on the case that an AI could in principle defeat all humans by inventing technologies that...

Feb 12, 202446

Polio Lab Leak Caught with Wastewater Sampling

I hadn't seen this discussed here: > The near complete eradication of wildtype polioviruses (WPV) means that strict containment by facilities for essential work with infectious WPV is required. In the Netherlands, we have implemented environmental surveillance around all poliovirus essential facilities (PEFs) premises to monitor possible breaches of containment....

Apr 7, 202382

Tracking Compute Stocks and Flows: Case Studies?

Posted in my personal capacity The AGI governance community has recently converged on compute governance[1] as a promising lever for reducing existential risks from AI. One likely building block for any maximally secure compute governance regime is stock and flow accounting of (some kinds of) compute: i.e., requiring realtime accurate...

Oct 5, 202211

Law-Following AI 4: Don't Rely on Vicarious Liability

This post is written in my personal capacity, and does not necessarily represent the views of OpenAI or any other organization. Cross-posted to the Effective Altruism Forum. Image by OpenAI's DALL·E If an agent A causes some harm while intending to benefit a principal P, what is P's liability? The...

Aug 2, 20225

Law-Following AI 3: Lawless AI Agents Undermine Stabilizing Agreements

This post is written in my personal capacity, and does not necessarily represent the views of OpenAI or any other organization. Cross-posted to the Effective Altruism Forum In the previous post of this sequence, I argued that intent-aligned AIs would, by default, have incentives to break the law. This post...

Apr 27, 20222

Law-Following AI 2: Intent Alignment + Superintelligence → Lawless AI (By Default)

This post is written in my personal capacity, and does not necessarily represent the views of OpenAI or any other organization. Cross-posted to the Effective Altruism Forum. In the first post of this sequence, I defined "law-following AI" ("LFAI") and "intent alignment." In this post, I will begin to motivate...

Apr 27, 20225

Law-Following AI 1: Sequence Introduction and Structure

This post is written in my personal capacity, and does not necessarily represent the views of OpenAI or any other organization. Cross-posted to the Effective Altruism Forum. This sequence of posts will argue that working to ensure that AI systems follow laws is a worthwhile way to improve the long-term...

Apr 27, 202218