On model weight preservation: Anthropic's new initiative — LessWrong