How does a toy 2 digit subtraction transformer predict the sign of the output? — LessWrong