LESSWRONG
LW

AgencyAgent FoundationsAI Safety Public MaterialsFuture of Life InstituteAI
Frontpage

8

Can AI agents learn to be good?

by Ram Rachum
29th Aug 2024
1 min read
0

8

This is a linkpost for https://futureoflife.org/ai-research/can-ai-agents-learn-to-be-good/
AgencyAgent FoundationsAI Safety Public MaterialsFuture of Life InstituteAI
Frontpage

8

New Comment
Moderation Log
More from Ram Rachum
View more
Curated and popular this week
0Comments

Hi everyone!

My name is Ram Rachum and I'm working on AI Safety research. I want to elicit social behavior in RL agents and use it to achieve AI Safety goals such as alignment, interpretability and corrigibility.

I made a guest post on the Future of Life Institute's blog: https://futureoflife.org/ai-research/can-ai-agents-learn-to-be-good/

This isn't specifically about my research, as it's mostly geared towards the public so it's pretty basic. I do have a plug for my latest paper at the bottom. This is my first public writing on AI Safety, so I'd appreciate any comments or corrections.

I'm currently raising funding for my research. If you know of relevant funders, I'd appreciate a connection.