Goodhart's Law Example: Training Verifiers to Solve Math Word Problems — LessWrong