LESSWRONG
LW

138
Peter Jordan
9100
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No Comments Found
No wikitag contributions to display.
10Inverting the Most Forbidden Technique: What happens when we train LLMs to lie detectably?
7h
0