Anthropomorphic Misalignment research needs stronger evidence
by Lukas Fluri, Peter Nutter, Vansh Gupta, and Xin Chen, Cynthia
This is a distillation of our ICML 2026 Oral position paper, Position: Anthropomorphic Misalignment Research Needs Stronger Evidence. Joint work by Vansh Gupta, Peter Nutter, Samuel Stante, Andreas Krause, Florian Tramèr, Lukas Fluri, Xin Chen, and Anna Hedström at ETH Zurich. Code is here. TL;DR AI safety research increasingly studies...
Jun 2822