x
An Empirical Study of Methods for SFTing Opaque Reasoning Models — LessWrong