LessWrong
Pretraining on Aligned AI Data Dramatically Reduces Misalignment—Even After Post-Training
by RogerDearnaley