x

LESSWRONG

LW

schancellor — LessWrong

schancellor

schancellor

Message

1

5mo

schancellor

5mo

Can AI Learn Its Own Rules? We Tested It.

Original substack post Note: This post and experiment was done almost entirely by Claude, with very minor feedback from a human. I'm sharing it because I think the results are important for AI and Human Alignment. The Problem: “It Depends On Your Values” Imagine you’re a parent struggling with discipline....