x
This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
LW
Login
fakeanalyst — LessWrong
fakeanalyst
fakeanalyst
Subscribe
Message
2
2
1y
All
⚙
Buck's Shortform
fakeanalyst
10mo
3
0
The usefulness of interpretability research
Reply
Generating the Funniest Joke with RL (according to GPT-4.1)
fakeanalyst
1y
1
0
Goodhart's law!
Reply
The usefulness of interpretability research