Jack O'Brien's Shortform

Jack O'Brien

LESSWRONG
LW

Jack O'Brien's Shortform

1 min read1st Dec 20222 comments

This is a special post for quick takes by Jack O'Brien. Only they can create top-level comments. Comments here also appear on the Quick Takes page and All Posts page.

Jack O'Brien's Shortform

1st Dec 2022

2Jack O'Brien

2Isabella Barber

2 comments, sorted by

top scoring

Click to highlight new comments since: Today at 10:05 AM

[-]Jack O'Brien2y20

Let's be optimistic and prove that an agentic AI will be beneficial for the long-term future of humanity. We probably need to prove these 3 premises:

Premise 1: Training story X will create an AI model which approximates agent formalism A
Premise 2: Agent formalism A is computable and has a set of alignment properties P
Premise 3: An AI with a set of alignment properties P will be beneficial for the long-term future.

Aaand so far I'm not happy with our answers to any of these.

[-]Isabella Barber2y20

maybe there is no set of properties p that can produce alignment hmm

Moderation Log