This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Tags
LW
Login
Truthful AI
Edit
History
Subscribe
Discussion
(0)
Help improve this page
Edit
History
Subscribe
Discussion
(0)
Help improve this page
Truthful AI
Random Tag
Contributors
Posts tagged
Truthful AI
Most Relevant
2
31
A tension between two prosaic alignment subgoals
Alex Lawsen
8mo
8
2
12
Truthfulness, standards and credibility
Ω
Joe_Collman
2y
Ω
2
1
45
Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
Ω
Felix Hofstätter
,
Francis Rhys Ward
,
HarrietW
,
LAThomson
,
Ollie J
,
patrik-bartak
,
Sam F. Brown
20d
Ω
0