Testing for parallel reasoning in LLMs — LessWrong