Last week I stumbled over Dimensional Analysis which is not only useful for applied fields (physics, biology, economics), but also for math (Why did no one tell me that you can almost always think of df/dx as having "the type of f"/"the type of x"? The fact that exponents always have to be unit-less etc.? It had never occurred to me to make this distinction. In my mind, f(x)=x went from the reals to the reals, just like did.

One example of a question that before I would have had to think slowly about: What is the type of the standard deviation of a distribution? What is the type of the z-score of a sample?

Answer

The standard deviation has the unit of the random variable X, while the z-score is unitless

If you found the above interesting, I recommend reading (or skimming) this lecture, it's excellent!

It had never occurred to me that I could use linear algebra to literally solve for an equation. Yes, avoiding type errors in a typed programming language is also great, but what's different there is that often it really doesn't matter if something is a dict or a list, it just has to fit with the rest of your code. Searching a type that correctly maps to the territory across scales is a more tricky business than efficient code for a simple programming problem. If you keep not being able to distinguish "weight" and "mass" you will keep getting lost. If you notice your confusion between the difference of the act of weighing and gravity, it might get you far.

New Comment
5 comments, sorted by Click to highlight new comments since:

think of df/dx as having "the type of f"/"the type of x"

I expect you learned calculus the wrong way, in a math class instead of in physics. That's the point the notation, and the key reason it's an improvement over something like  or !

Yep! When I was in high school I was self-learning physics and was learning calculus via Khan academy, which made me learn that "sometimes", "somehow", you can for example solve a differential equation by getting all the dx and x's on one side and dy's and y's on the other. I was always expecting when I would study this stuff more rigorously in university, this would be explained at some point. To my great disappointment, the mandatory class on this subject for my CS degree "real analysis", did not in fact clear up why this works in the slightest!? I don't know if every real analysis class is this bad (My linear algebra courses were excellent in comparison), but mine was mostly focusing on making students adopt "rigor"[1], presenting definition after definition and theorem after theorem (for short theorems proofs were included, but with zero sub-text what the motivation or idea of the proof was. Complicated proofs were just skipped). Starting studying during Corona and not being able to ask professors questions like that directly certainly didn't help. Somehow I forgot, though, that I never got these questions answered that I had hoped to get an answer to. Also, the quotient in df/dx still confuses me, and it might be due to my confusion about division? Looking at the wikipedia article you link, maybe I should take a look at differential forms?

As my secondary subject for my bachelor, I chose physics, but that mostly involved applying Euler-Lagrange to particular systems. I remember becoming really confused why this can work at this point and noticing that I can't fit the explanation in my head (I am in fact still confused and would love if you can point me to useful resources!). I remember thinking why can you differentiate with respect to speed and treat acceleration etc. as not dependent on speed, weird. Clearly, when framing them all as a function of time, it made no sense (and still doesn't). I was looking it up on Wikipedia and apparently "Differential Geometry" was the answer to my questions, so I got 2–3 textbooks on Differential Geometry (for physics), but just skimming, I couldn't find something that made this click.

Probably I should have looked more aggressively for people to tutor me. Now with language models it feels worth taking another stab at this though. For example when I told GPT4-o the rough areas I am confused about (link to full chat): there are these people talking about scaling laws in the context of "scaling laws" in engineering, and it seems like I might be missing some prerequisites, because I've bounced off this topic a few times even though this seems extremely interesting. Which led it to introduce dimensionless analysis to me, and then I asked more questions and ultimately gave me some textbook recommendations: "Street fighting mathematics", I had already heard praise for in "Biology by the numbers", so then I checked it out and discovered this gold.

Another question I am still confused by: how does your choice of units affect what types of dimensionless quantities you discover? Why do we have Ampere as a fundamental unit instead of just M,L,T? What do I lose? What do I lose if I reduce the number of dimensions even further? Are there other units that would be worth adding under some circumstances? What makes this non-arbitrary? Why is Temperature a different unit from Energy?

Also, I notice all the formalizations of dimensional analysis that Terence Tao mentions here strike me as ugly? Is that me being stupid, thinking that there must exist something nicer?


  1. Which I found mostly annoying and trivial, but made me glade I had spent ~60h before university just practicing proving things. How to proove it is a great book! ↩︎

Another question I am still confused by: how does your choice of units affect what types of dimensionless quantities you discover? Why do we have Ampere as a fundamental unit instead of just M,L,T? What do I lose? What do I lose if I reduce the number of dimensions even further? Are there other units that would be worth adding under some circumstances? What makes this non-arbitrary? Why is Temperature a different unit from Energy?

It’s pretty arbitrary, I tried to explain this point via a short fictional story here.

Gaussian units only has M,L,T base units, with nothing extra for electromagnetism.

There are practical tradeoffs involved in how many units you use—basically adding units gives you more error-checking at the expense of more annoyance. See the case of radians that I discuss here.

Not quite what you asked, but I found this video visualizing Lagrange multipliers quite helpful. Plausibly it'll help clarify Euler–Lagrange as well.

Your units question is easy: you get the same dimensionless quantities whatever units you choose. Instead of thinking of units as dimensions, I'd think of them as basis vectors in a five-dimensional space  (length, mass, time, current, temperature[1]): you should have exactly five of them, and they need to be "linearly" independent, but beyond that, you can choose any set you like: you could instead have something like the natural units of (speed, gravitationality[2], angular momentum, entropy, charge). In this conception, this 5D space is fundamental: you need at least five dimensions (the ones spanned by c, G, h, k and e), and if you want more, you need to find some new as-yet-unknown independent dimension (maybe baryon/lepton number counts?). 

  1. ^

    Of the seven SI units, the other two are Candela, which is some "human visual perception" bullshit masquerading as fundamental, and amount of stuff/Avogadro's constant, which I don't think meaningfully constitutes a "dimension."

  2. ^

    ykwim: something with the units of G. If you don't like this, just choose mass instead.

If you notice your confusion between the difference of the act of weighing and gravity

This description seems imprecise/confusing to me. It's rather that you need to notice that you need an extra assumption for inertial_mass=gravitational_mass, and then you can embark on finding a theory where those are identical by thinking stuff like "can i frame it as earth actually just accelerating up?".

Curated and popular this week