Categories: models of models

I'm really not convinced by this framing in terms of "objects doing things to other objects".

Let's take a typical example of a morphism: let's say $f : Z_{> 0} \to R$ (note for non-mathematicians: that is, $f$ is a function that takes a positive integer and gives you a real number) given by $f (n) = \sqrt{n}$ . How is it helpful to think about this as $Z_{> 0}$ doing something to $R$ ? How is it even slightly like "Alice pushes Bob"? You say "Every model is ultimately found in how one object changes another object" -- are you saying here that the integers change the real numbers? Or vice versa? (After that's done, what have the integers or the real numbers become?)

The only thing here that looks to me like something changing something else is that $f$ (the morphism, not either of the objects) kinda-sorta "changes" an individual positive integer to which it's applied (an element of one of the objects, again not either of the objects) by replacing it with its square root.

But even that much isn't true for many morphisms, because they aren't all functions and the objects of a category don't always have elements to "change". For instance, there's a category whose objects are the positive integers and which has a single morphism from $x$ to $y$ if and only if $x \leq y$ ; when we observe that $5 \leq 9$ , is 5 changing 9? or 9 changing 5? No, nothing is changing anything else here.

So far as I can see, the only actual analogy here is with the bare syntactic structure: you can take "A pushes B" and "A has a morphism f to B" and match the pieces up. But the match isn't very good -- the second of those is a really unnatural way of writing it, and really you'd say "f is a morphism from A to B", and the things you can do with morphisms and the things you can do with sentences don't have much to do with one another. (You can say "A pushes B with a stick", and "A will push B", and so forth, and there are no obvious category-theoretic analogues of these; there's nothing grammatical that really corresponds to composition of morphisms; if A pushes B and B eats C, there really isn't any way other than that to describe the relationship between A and C, and indeed most of us wouldn't consider there to be any relationship worth mentioning between A and C in this situation.)

[-]Said Achmiz6y70

Conceptual question:

In real life, i.e., when dealing with the physical world, there are usually many ways to generalize any given thing or phenomenon.

For example, a tomato is a fruit, but it’s also a vegetable; that is, it belongs to a botanical grouping, but also to a culinary grouping. Neither classification is more ‘real’ or ‘true’ than the other^[1]; and indeed there are many other possible categories within which we can put tomatoes (red things, throwable things, round things, soft things, etc.).

Is this also the case in category theory? That is: for anything which we might be tempted to generalize with the aid of category theory, are there multiple ways to generalize it, dictated only by convenience and preference? Or, is there necessary some single canonical generalization for any given mathematical… thing? If the former: how and by what criteria are generalizations selected? If the latter: what pitfalls does this create when using real-world-based analogies to understand category theory?

Recall that taxonomic classifications aren’t written in the heavens somewhere, but are merely a useful way for humans to classify organisms (namely, by putting them into groups arranged by common descent). This is useful for various reasons, but by no means unambiguous or necessary, nor dictated by reality—as “in truth there are only atoms and the void.” ↩︎

[-]Gurkenglas6y20

Math certainly has ambiguous generalizations. As the image hints, these are also studied in category theory. Usually, when you must select one, the one of interest is the least general one that holds for each of your objects of study. In the image, this is always unique. I'm guessing that's why bicentric has a name. I'll pass on the question of how often this turns out unique in general.

[-]countedblessings6y10

One of the reasons for my own interest in category theory is my interest in the question you raise. I'm hoping that we'll explore the idea that universal properties offer an "objective" way of defining "subjective" categories.

Maybe a more direct answer is that in the very next post in the series, we'll see that sets can be considered the objects of the category of sets and functions, and also the objects of the category of sets and binary relations. Functions are binary relations, so that's not a perfect answer, but yes, you can think of an individual category as a context of sorts through which you view the objects, like how you can view a tomato as a fruit or vegetable depending on the context.

[-]gilch6y60

I think it should be possible to embed images in the post instead of just linking to them.

[-]habryka6y50

It's totally possible. In the post-editor, just select some empty space and press the "image" button in the toolbar. Or use markdown syntax.

Happy to edit the above post to have images in the post, as opposed to just links to images.

[-]Gordon Seidoh Worley6y50

This continues to be a slyly gentle series that has you in to something before you know it. Well done!

As a side note, maybe you or the admins can set these posts up as a sequence so they are linked together.

[-]countedblessings6y70

Thank you for the positive feedback. (A very underrated thing in terms of encouraging free content production.) I can go back to each post and add a link to the next one. I am concerned that I may want to add, rearrange, or even delete individual posts at some point, but I suppose that's no reason not to add in the links right now for convenience's sake.

[-]avturchin6y10

Thanks for this sequence.

[-]Viliam6y20

What you learn to do is take a bunch of nouns—1, 2, 3, etc.—and a bunch of verbs—addition, subtraction—and make sentences. “1 + 2 = 3.”

I still have no idea how to express this in a picture of objects and arrows. I suppose that 1, 2, and 3 are objects. Is the addition an arrow? But an arrow has only one start and one end...

More meta: You have already provided the readers "motivation" in the two introductory articles. It is not necessary to add more hype in each article. Yes, I already heard that you can do everything in category theory, and I am willing to suspend disbelief. Now I am curious how specifically it can be done.

[-]philh6y20

It's possible to construct a category where numbers are objects and where the arrows are "plus zero" (identity), "plus one", "plus two" and so on. ("Numbers" here might look like it stands in for "natural numbers". But actually, as described, it would work just as well with "real numbers", "complex numbers", "integers greater than three", "numbers whose fractional part is the same as the fractional part of e to five decimal places"... formally, any set which is "closed under addition of natural numbers". Unless you pick a different way to operationalize "and so on".)

Then the objects in "1 + 2 = 3" are in and three, and the arrow is "plus two".

(If you picked "numbers" above to be "natural numbers", then there's a one-to-one correspondence between objects and "arrows from this object", for any object. But I'm not sure if that's important.)

More normally, "the set of numbers" would be an object all by itself, and the arrows would be the same as above, but all pointing from this one object to itself.

Neither of these sounds like what OP was trying to describe, but I don't have an answer that does.

[-]Viliam6y20

But then there would be no obvious connection between the number "two" and the arrow "plus two". Also, no obvious connection between the "plus two" arrow doing from 1 to 3, and the "plus two" arrow going from 6 to 8. That feels like we can make a diagram that somehow represents the addition of integers, but we can't derive new insights about addition from looking at the diagram, because most information is lost in the translation.

I guess what I meant was: I have no idea how to express 1+2=3 in a useful picture of objects and arrows.

[-]Slider6y10

Knowing that haskell I think the pattern to turn multiparty relations to two place relations is R(a,b,c,d,e,f,g) -> R(S(b,c,d,e,f,g)) -> R(S(T(d,e,f,g)) ... R(S(T(U(V(X(Z(g)))))))

The connection between "+2" and 2 would then be a function of +(2)="+2". You migth also need =(3)="=3" and then you can have =3(+2(2)) = "2+2=3" and maybe a T?("2+2=3")=False. In another style you would set it up that only true equations could be derived. Then one of the findings would be that any instance of +2(2) could be replaced with 4 and the mappings would still hold (atleast on the T? level). Mind you "2+2" could be a different object from "4"

[-]Said Achmiz6y20

For example, say you want to grow new kinds of fruit that have never existed. Having a concept of fruit is necessary to conceiving of that idea. Life’s not going to give you examples of fruit that have never existed! You have to explore the conceptual space of all fruit.

It’s not, actually. See this old comment of mine:

Note that under this interpretation, no “general” or “extended” version of the concept is ever created (the template is anonymous, and is discarded as soon as it “goes out of scope”—which is to say, as soon as it has been used to create the new concept). There is thus no need to ask the questions of what this new, “general”/“extended” concept means, to what else it may or may not apply, how to differentiate between uses of it and any specific version, etc.

[-]Gurkenglas6y20

Not every way to model reality defines identity and composition. You can start with a category-without-those G (a quiver) and end up at a category C by defining C-arrows as chains of G-arrows (the quiver's free category), but it doesn't seem necessary or a priori likely to give new insights. Can you justify this rules choice?

[-]countedblessings6y10

Honestly my real justification would be "adjoint functors awesome, and you need categories to do adjoint functors, so use categories." More broadly...as long as it's free to create a category out of whatever you're studying, there's clearly no harm. The question is whether anything's lost by treating the subject as a category, and while I fully expect that there are entire universes of mathematics and reality out there where categories are harmful, I don't think we live in one like that. Categories may not capture everything you can think of, but they can capture so much that I'd be stunned if they didn't yield amazing fruit eventually. I'd acknowledge that novel, groundbreaking theorems are still forthcoming.

[-]gjm6y50

Let's take a somewhat-concrete example. Your post mentions birds. OK, so let's consider e.g. a model of birds flying in a flock, how they position themselves relative to one another, and so on. You suggest that we consider the birds as objects: so far, so good. And then you say "they do stuff like fly, tweet, lay eggs, eat, etc. I.e., verbs (morphisms)." For the purpose of a flocking model, the most relevant one of those is flying. How are you going to consider flying as a morphism in a category of birds? If A and B are birds, what is this morphism from A to B that represents flying? I'm not seeing how that could work.

In the context of a flocking model, there are some things involving two birds. E.g., one bird might be following another, tending to fly toward it. Or it might be staying away from another, not getting too close. Obviously you can compose these relations if you want. (You can compose any relations whose types are compatible.) But it's not obvious to me that e.g. "following a bird that stays away from another bird" is actually a useful notion in modelling flocks of birds. It might turn out to be, but I would expect a number of other notions to be more useful: you might be interested in some sort of centre of mass of a whole flock, or the density of birds in the flock; you might want to consider something like a velocity field of which the individual birds' velocities are samples; etc. None of these things feel very categorical to me (though of course e.g. velocities live in a vector space and there is a category of vector spaces).

Maybe flocking was a bad choice of example. Let's try another: let the birds be hens on a farm, kept for breeding and/or egg-laying. We might want to understand how much space to give them, what to feed them, when to collect their eggs, whether and when to kill them, and so on. Maybe we're interested in optimizing taste or profit or chicken-happiness or some combination of those. So, according to your original comment, the birds are again objects in a category, and now when they "lay eggs, etc., etc." these are morphisms. What morphisms? When a bird lays an egg, what are the two objects the morphism goes between? When are we going to compose these morphisms and what good will it do us?

How does it actually help anything to consider birds as objects of a category?

Here's the best I can do. We take the birds, and their eggs, and whatever else, as objects in a category, and we somehow cook up some morphisms relating them. The category will be bizarre and jury-rigged because none of the things we care about are really very categorical, but its structure will somehow correspond to some of the things about the birds that we care about. And then we make whatever sort of mathematical or computational model of the birds we would have made without category theory. So now instead of birds and eggs we have tuples (position, velocity, number of eggs sat on) or objects of C++ classes or something. Now since we've designed our mathematical model to match up, kinda, to what the birds actually do, maybe we can find a morphism between these two jury-rigged categories corresponding to "making a mathematical model of". And then maybe there's some category-theoretic thing we can do with this model and other mathematical models of birds, or something. But I gravely doubt that any of this will actually deliver any insight that we didn't ourselves put into it. I'd be intrigued to be proved wrong.

[-]Gurkenglas6y20

That a construction is free doesn't mean that you lose nothing. It means that if you're going to do some construction anyway, you might as well use the free one, because the free one can get to any other. (Attainable utility anyone?)

Showing that your construction is free means that all you need to show as worthwhile is constructing any category from our quiver. Adjunctions are a fine reason, though I wish we could introduce adjunctions first and then show that we need categories to get them.

LESSWRONG
LW

LESSWRONG
LW

53

Categories: models of models

53

53