LESSWRONG
LW

Quantitative cruxes and evidence in Alignment

Feb 10, 2024 by Martín Soto

Sometimes researchers talk past each other because their intuitions disagree on underlying quantitative variables. Getting better estimates of these variables is action-guiding. This is an epistemological study to bring them to the surface, as well as discover which sources of evidence can inform our predictions, and which methodologies researchers can and do use to deal with them.

Work done during my last two weeks of SERI MATS 3.1. Thanks to Vivek Hebbar, Filip Sondej, Lawrence Chan and Joe Benton for related discussions.

19Quantitative cruxes in Alignment
Ω
Martín Soto
2y
Ω
0
20Sources of evidence in Alignment
Ω
Martín Soto
2y
Ω
0