Toward a taxonomy of cognitive benchmarks for agentic AGIs — LessWrong