Putting multimodal LLMs to the Tetris test — LessWrong