x
Why Cai et al.'s Framework Fails to Measure Spatial Understanding — LessWrong