How model editing could help with the alignment problem — LessWrong