This website requires javascript to properly function. Consider activating javascript to get access to all site functionality.
LESSWRONG
Tags
LW
Login
Truthful AI
•
Applied to
Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs
by
Adi Simhi
1d
ago
•
Applied to
How do LLMs give truthful answers? A discussion of LLM vs. human reasoning, ensembles & parrots
by
Owain_Evans
4mo
ago
•
Applied to
Tall Tales at Different Scales: Evaluating Scaling Trends For Deception In Language Models
by
Felix Hofstätter
9mo
ago
•
Applied to
A tension between two prosaic alignment subgoals
by
Ruby
1y
ago
•
Applied to
Truthfulness, standards and credibility
by
Ruby
2y
ago
•
Created by
Ruby
at
2y