Are language models good at making predictions? — LessWrong