LLM Psychometrics and Prompt-Induced Psychopathy
This post contains experimental results and personal takes from my participation in the July 2024 edition of the BlueDot Impact AI Safety Fundamentals course.

TL;DR:
* Psychopaths are willing to manipulate and deceive. Psychometrics try to measure this with standardized tests.
* AI models express different levels of psychopathy depending...
Oct 18, 2024