Calibrate words, not just probabilities — LessWrong