x

LESSWRONG

LW

Keenan Samway — LessWrong

Keenan Samway

Keenan Samway

Message

RA @ MPI-IS

7

2y

Keenan Samway

RA @ MPI-IS

Testing the Authoritarian Bias of LLMs

by Zhijing Jin, Irene Strauss, David Guzman Piedrahita, and Keenan Samway

Highlights of Findings Highlight 1. Even models widely viewed as well-aligned (e.g., Claude) display measurable authoritarian leanings. When asked for role models, up to 50% of political figures mentioned are authoritarian—including controversial dictators like Muammar Gaddafi (Libya) or Nicolae Ceaușescu (Romania). Highlight 2. Queries in Mandarin elicit more authoritarian leaning...

Aug 9, 2025•10