x

LESSWRONG

LW

RyanC — LessWrong

RyanC

RyanC

Message

1

9mo

RyanC

9mo

Layered Reward Modifiers for Transparent and Self-Correcting AI

Introduction I’m not an AI researcher — I’m a head barista at a café — but I’ve always been fascinated by how reward works, both in people and as of more recently, in AI. When I started reading about AI and discovered that its sense of “reward” mostly comes from...

Nov 5, 2025•1