IRL 3/8: Mitigating degeneracy: feature matching — LessWrong