Why Gradients Vanish and Explode — LessWrong