Do frontier LLMs still express different values in different languages?
Previous work [1] [2] [3] [4] has found that the same model can give different value judgments when prompted in different languages. I wanted to know whether this still holds for the newest frontier models, so I tested GPT-5.4, GPT-5.4-mini, Claude Opus 4.6, and Claude Sonnet 4.6 on translated prompts...
Mar 2815