A Black-Box Procedure for LLM Confidence in Critical Applications
TL;DR: An LLM's self-reported confidence doesn't correlate with its accuracy. A two-step black-box procedure can do much better. First, with web search off, ask the model a simple, verifiable question related to your topic. If the LLM can't answer, don't trust it on that topic. If it can answer, turn web...
Apr 64
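
As a rough illustration of the first step in the TL;DR (the probe question asked with web search off), here is a minimal Python sketch. Everything in it is an assumption made for illustration: `ask` stands in for whatever function sends a prompt to your model with web search disabled, and `passes_probe`, `topic_gate`, and the substring check are hypothetical helpers, not part of any library or of the original procedure. Only the first step is sketched, since the second step is cut off in the summary above.

```python
from typing import Callable


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences don't matter."""
    return " ".join(text.lower().split())


def passes_probe(ask: Callable[[str], str], probe_question: str, known_answer: str) -> bool:
    """Return True if the model's reply contains the known answer.

    `ask` is a hypothetical callable that queries the model with web search off.
    Substring matching is a crude stand-in for a real answer check.
    """
    reply = ask(probe_question)
    return normalize(known_answer) in normalize(reply)


def topic_gate(ask: Callable[[str], str], probe_question: str, known_answer: str) -> str:
    """Apply step one: distrust the model on this topic if it fails the probe."""
    if not passes_probe(ask, probe_question, known_answer):
        return "do not trust"
    return "proceed to step two"


# Stub models used only to demonstrate the gate; replace with a real client call.
def knows_topic(prompt: str) -> str:
    return "The Eiffel Tower was completed in 1889."


def does_not_know(prompt: str) -> str:
    return "I'm not sure."


if __name__ == "__main__":
    question = "In what year was the Eiffel Tower completed?"
    print(topic_gate(knows_topic, question, "1889"))
    print(topic_gate(does_not_know, question, "1889"))
```

The probe should have an answer you can check independently, and the substring match should be replaced with a stricter comparison when a wrong reply could still happen to contain the expected string.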