No, really, it predicts next tokens. — LessWrong