A Proposed Test to Determine the Extent to Which Large Language Models Understand the Real World — LessWrong