What’s the backward-forward FLOP ratio for Neural Networks? — LessWrong