True Stories of Algorithmic Improvement

[-][anonymous]4y40

Another great example of this is Striped Smith-Waterman, which takes advantage of SIMD instructions to achieve a 2-8 speed-up (potentially much more on modern CPUs though) for constructing sequence local alignments.

[-]ryan_b4y40

there’s a common narrative in which AI progress has come mostly from throwing more and more compute at relatively-dumb algorithms.

Is this context-specific to AI? This position seems to imply that new algorithms come out of the box at only a factor 2 above maximum efficiency, which seems like an extravagant claim (if anyone were to actually make it).

In the general software engineering context, I understood the consensus narrative to be that code has gotten less efficient on average, due to the free gains coming from Moore's Law permitting a more lax approach.

Separately, regarding the bitter lesson: I have seen this come up mostly in the context of the value of data. Some example situations are the supervised vs. unsupervised learning approaches; AlphaGo's self-play training; questions about what kind of insights the Chinese government AI programs will be able to deliver with the expected expansion of surveillance data, etc. The way I understand this is that compute improvements have proven more valuable than domain expertise (the first approach) and big data (the most recent contender).

My intuitive guess for the cause is that compute is the perspective that lets us handle the dimensionality problem at all gracefully.

[-]ryan_b4y40

Reflecting on this, I think I should have said that algorithms are the perspective that lets us handle dimensionality gracefully, but also that algorithms and compute are really the same category, because algorithms are how compute is exploited.

Algorithm vs compute feels like a second-order comparison in the same way as CPU vs GPU, or RAM vs Flash, or SSD vs HDD, just on the abstract side of the physical/abstraction divide. I contrast this with compute v. data v. expertise, which feel like the first-order comparison.

Chris Rackauckas as an informal explanation for algorithm efficiency which I always think of in this context. The pitch is that your algorithm will be efficient in line with how much information about your problem it has, because it can exploit that information.

[-]Shmi4y40

Another source of speedup is making the right approximations. Ages ago I coded a numerical simulation of a neuromusular synaptic transmission, tracking 50k separate molecules bumping into each other, including release, diffusion, uptake etc that ended up modeling the full process faithfully (as compared with using a PDE solver) after removing irrelevant parts that took compute time but did not affect the outcome.

[-]Measure4y30

Compared to 2012, it now takes 44 times less compute to train a neural network to the level of AlexNet (by contrast, Moore’s Law would yield an 11x cost improvement over this period). Our results suggest that for AI tasks with high levels of recent investment, algorithmic progress has yielded more gains than classical hardware efficiency.

If 11x of the 44x total speedup is from hardware, doesn't that leave just 4x from software?

[-]Trinley Goldenberg4y100

Moore's law simply means that the 44x less compute is 11x cheaper, right? Moore's law doesn't make algorithms need less compute, just lowers the cost of that compute.

[-]Measure4y10

Makes sense.

[-]Alex K. Chen (parrot)2y20

Now AlphaTensor - https://deepmind.google/discover/blog/discovering-novel-algorithms-with-alphatensor/

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

92

True Stories of Algorithmic Improvement

92

92

Rewrite In C

Big-O Speedups

The Point