This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Tags
LW
Login
Truthful AI
•
Applied to
How do LLMs give truthful answers? A discussion of LLM vs. human reasoning, ensembles & parrots
by
Owain_Evans
1mo
ago
•
Applied to
Benchmark Study #2: TruthfulQA (Task, MCQ)
by
Bruce W. Lee
4mo
ago
•
Applied to
Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
by
Felix Hofstätter
6mo
ago
•
Applied to
A tension between two prosaic alignment subgoals
by
Ruby
1y
ago
•
Applied to
Truthfulness, standards and credibility
by
Ruby
2y
ago
•
Created by
Ruby
at
2y