A theory of AGI safety based on constraints and affordances.
I've got this proto-idea of what's missing in much public discussion and action on AI safety. I'm hoping that by sharing it here, the hive-mind might come together and turn it into something useful.
Effective control of AI risk requires a broader approach than those taken so far. Efforts to date have largely gravitated toward two camps: value alignment and governance. Value alignment aims to design AI systems that reliably act in the best interests of humans. Governance efforts aim to constrain the people who develop, deploy, or use AI so that they do so in ways that ensure the AI doesn't cause unacceptable harm.
These two...
Thanks! I might link to your excellent post in my next effort, if that's ok...?