5

8th Jan 2019

1 min read

A

1 4

5

Frontpage

5

New Answer

New Comment

1 Answers sorted by
top scoring

Gordon Seidoh Worley

Jan 08, 2019

10

Well, I don't know that we know enough to say what is most promising, but I'm most excited to explore is my own approach that suggests we need to investigate ways to get the content of AI and human thought aligned along preference ordering. I don't think this is by any means easy, but I don't really see another practical framework in which to approach this. This framework of course admits many possible techniques, but I think it's useful to keep in mind and not get confused (as often happens in existing imitation learning papers) about how much we can know about the values of humans and AIs.

3 comments, sorted by

top scoring

Click to highlight new comments since: Today at 3:13 PM

[-]habryka7y40

Mod edit note: Made this into a question for you. You created it as an ordinary post.

Reply

[-]Ben Pace7y20

Do you mean approach for building it or general alignment research avenue? For example, agent foundations is not an approach to building aligned AGI, it's an approach to understanding intelligence better than may later significantly help in building aligned AGI.

Reply

[-]Chris_Leong7y20

This question is specifically about building it, but that's a worthwhile clarification.

Reply

Moderation Log

LESSWRONG
LW

LESSWRONG
LW

5

[ Question ]

Which approach is most promising for aligned AGI?

5

5

1 Answers sorted by
top scoring

Jan 08, 2019

5

[ Question ]

Which approach is most promising for aligned AGI?

5

5

1 Answers sorted by top scoring

Jan 08, 2019

1 Answers sorted by
top scoring