Layered Reward Modifiers for Transparent and Self-Correcting AI
Introduction I’m not an AI researcher — I’m a head barista at a café — but I’ve always been fascinated by how reward works, both in people and as of more recently, in AI. When I started reading about AI and discovered that its sense of “reward” mostly comes from...
Nov 5, 20251