Some recent research that applies: https://journals.plos.org/mentalhealth/article?id=10.1371/journal.pmen.0000145
You might want to have a look at Microsoft's TrueSkill. An ELO like rating for online team games. It was a good early (there's probably newer and better ones) answer for how to rank an individual when teamed randomly together with others.
You might be right - but more experimentation is needed. For example, the pivot (I think) is the two lines "This is theory of mind. This is self-awareness." What happens if you:
1. Omit these two lines?
2. Change everything prior to those two lines to somethings else. "1 + 1 = 2" for example.