FordO

Posts

Sorted by New

Comments

Everyday Lessons from High-Dimensional Optimization

Do you count gradient descent as a blackbox optimization, and isn't backpropagation guided by gradient (at least in ANN)?

Everyday Lessons from High-Dimensional Optimization

Care to elaborate how did you get to the formula O(1/sqrt(n))?