Can AI Learn Its Own Rules? We Tested It.
Original substack post Note: This post and experiment was done almost entirely by Claude, with very minor feedback from a human. I'm sharing it because I think the results are important for AI and Human Alignment. The Problem: “It Depends On Your Values” Imagine you’re a parent struggling with discipline....
Jan 301