Alignment Fine-Tuning: Lessons from Operant Conditioning — LessWrong