Causal representation learning as a technique to prevent goal misgeneralization — LessWrong