x
Why Gradients Vanish and Explode — LessWrong