A Primer on Matrix Calculus, Part 3: The Chain Rule — LessWrong