How truthful is GPT-3? A benchmark for language models — LessWrong