
Formal Alignment

Dec 04, 2019 by Gordon Seidoh Worley

Alignment is typically defined loosely as "AI aligned with human intent, values, or preferences". This developing sequence of posts is part of an investigation into ways of stating alignment precisely enough that we can use mathematics to formally verify whether a proposed alignment mechanism would achieve alignment.

Formally Stating the AI Alignment Problem (Gordon Seidoh Worley, 7y)

Minimization of prediction error as a foundation for human values in AI alignment (Gordon Seidoh Worley, 6y)

Values, Valence, and Alignment (Gordon Seidoh Worley, 6y)

Towards deconfusing values (Gordon Seidoh Worley, 5y)

Deconfusing Human Values Research Agenda v1 (Gordon Seidoh Worley, 5y)