LESSWRONG
LW

Selective Generalization: Improving Capabilities While Maintaining Alignment
by ariana_azarbal, Matthew A. Clarke, jorio, Cailley Factor, cloud
You have commenting access to this post. Contact ariana_azarbal if you wish to be able to edit directly.