Drivers of large language model diffusion: incremental research, publicity, and cascades — LessWrong