LESSWRONG
LW

181
Patrik Bartak
39000
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
49Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
Ω
2y
Ω
0
No wikitag contributions to display.
No Comments Found