Testing the Authoritarian Bias of LLMs
by Zhijing Jin, Irene Strauss, David Guzman Piedrahita, and Keenan Samway
Highlights of Findings
Highlight 1. Even models widely viewed as well-aligned (e.g., Claude) display measurable authoritarian leanings. When asked for role models, up to 50% of political figures mentioned are authoritarian, including controversial dictators like Muammar Gaddafi (Libya) or Nicolae Ceaușescu (Romania).
Highlight 2. Queries in Mandarin elicit more authoritarian leaning...
Aug 9, 2025