Instruction Following without Instruction Tuning — LessWrong