x

LESSWRONG

LW

Xin Chen, Cynthia — LessWrong

Xin Chen, Cynthia

Xin Chen, Cynthia

Message

47

1

1

8y

Xin Chen, Cynthia

47

8y

Anthropomorphic Misalignment research needs stronger evidence

by Lukas Fluri, Peter Nutter, Vansh Gupta, and Xin Chen, Cynthia

This is a distillation of our ICML 2026 Oral position paper, Position: Anthropomorphic Misalignment Research Needs Stronger Evidence. Joint work by Vansh Gupta, Peter Nutter, Samuel Stante, Andreas Krause, Florian Tramèr, Lukas Fluri, Xin Chen, and Anna Hedström at ETH Zurich. Code is here. TL;DR AI safety research increasingly studies...

The Vitalik Buterin Fellowship in AI Existential Safety is open for applications!

This is a linkpost for https://grants.futureoflife.org/ Epistemic status: Describing the fellowship that we are a part of and sharing some suggestions and experiences. The Future of Life Institute is launching its 2023 cohort of PhD and postdoctoral fellowships to study AI existential safety: that is, research that analyzes the most...

Oct 13, 2022•21