LESSWRONG

208
HanneWhitt
62000

AI Safety Technical Research Manager at Meridian Research, Cambridge UK. Background in AI Control (MARS, LASR)

Posts

76 · Unfaithful Reasoning Can Fool Chain-of-Thought Monitoring · Ω · 3mo · 17