Interpreting a matrix-valued word embedding with a mathematically proven characterization of all optima — LessWrong