New Capabilities, New Risks? - Evaluating Agentic General Assistants using Elements of GAIA & METR Frameworks
by Tej Lander FCCT

Above: LLMs vs Agentic Assistants - a big step forward? (Image created by DALL.E via GPT4o)

Overview
Abstract
1: Why are Agentic Systems a ‘hot topic’?
2: What makes a system ‘agentic’?
2.1 Taxonomy of Agenticness from Shavit et al. (2023, OpenAI)
2.2 Taxonomy of Agenticness...