LESSWRONG
LW

LAThomson
90000
Message
Dialogue
Subscribe

4th-year undergrad Computer Science and Philosophy student at Oxford, and part-time (hopefully full-time in future!) AI Safety researcher :)

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
No Comments Found
59Towards shutdownable agents via stochastic choice
1y
11
49Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
Ω
2y
Ω
0