This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
LW
Login
1143
Daniel Lee — LessWrong
Daniel Lee
Posts
Sorted by New
Wikitag Contributions
Comments
Sorted by
Newest
Open Thread Summer 2024
Daniel Lee
1y
8
0
Hi, excited to learn more about Mech Int!
Reply
54
Finding Features Causally Upstream of Refusal
10mo
5
28
Investigating Sensitive Directions in GPT-2: An Improved Baseline and Comparative Analysis of SAEs
1y
0
Hi, excited to learn more about Mech Int!