Category Theory
Zulip Server
Archive

You're reading the public-facing archive of the Category Theory Zulip server.
To join the server you need an invite. Anybody can get an invite by contacting Matteo Capucci at name dot surname at gmail dot com.
For all things related to this archive refer to the same person.

Stream: theory: applied category theory

Topic: properties of objects (in context of imaging)

David Egolf (Jan 03 2022 at 21:51):

I wanted to share some of my recent thoughts on talking about "properties" in the context of imaging using category theory. This is in the hopes that:

Someone might find them useful, or
Someone might be interested in sharing/discussing related ideas, perhaps from a more general/abstract context.

In the context of imaging, we are often content to observe some properties of an object, when observing everything about an object isn't possible or necessary. So, having a clear notion of "property" in categorical language is probably important for talking about imaging using categories.

One way to think about a "property" is as an equivalence relationship. If $p$ is an equivalence relationship on some objects, we can say that two objects have the same property value with respect to $p$ if they are in the same equivalence class. Many statements about properties can be rephrased into the language of equivalence relationships - for example "The mass of this stone is 15 kg" can be rephrased to "This stone and this reference object labelled with '15 kg' have the same property value - fall in the same equivalence class - with respect to an equivalence relationship we call 'mass'." This description of "property" is nice in that it ignores some non-essential differences in language or notation. For example, two people using different languages would be able to agree which objects have the same mass, even if the name they give to "mass" is different.

I want to describe this concept of a "property" of some objects in terms of a category containing those objects. The discussion above suggests that using an equivalence relationship might be reasonable. Let us start with a discrete category $\mathsf{C}$ . Let us put an equivalence relationship $p$ on the objects of $\mathsf{C}$ , which we think of as describing a property these objects have. For example, if the objects in $\mathsf{C}$ are points $(x,y)$ in the plane $\mathbb{R^2}$ , then $p$ could place two points in the same equivalence class if they have the same $x$ -coordinate. We can form a new category $\mathsf{C}_p$ , I think, as follows: if objects $a$ and $b$ are in the same equivalence class with respect to $p$ , then create a pair of morphisms - one going from $a$ to $b$ and one going from $b$ to $a$ . Each morphism can just be an ordered pair $(a,b)$ or $(b,a)$ . In each such pair of morphisms, we require the morphisms to be isomorphisms, and to be the inverse of each other. In the category $\mathsf{C}_p$ , I think we have $a \cong b$ exactly when $a$ and $b$ lie in the same equivalence class with respect to $p$ . So, we have created a new category that describes a property on its objects, in terms of its isomorphisms.

I am hoping this perspective can be helpful when designing or analyzing an observation process on the objects of $\mathsf{C}$ . Notice that if an observation process $F$ is a functor $F: \mathsf{C}_p \to D$ , then if $F(a)$ and $F(b)$ are not isomorphic in $D$ , then $a$ and $b$ are not isomorphic in $\mathsf{C}_p$ and so they have different values of the property $p$ . This is a way of talking about how our observations inform us about the properties of the objects we are observing. Notice also, that by designing the category $\mathsf{C}_p$ and asking our observation from $\mathsf{C}_p$ to be functorial, we put constraints on the nature of the observation process, including the category it maps to.

Any thoughts are welcome! A related question would be: How to model the fact that objects generally have more than one property?

Jason Erbele (Feb 08 2023 at 07:51):

David Egolf said:

I am hoping this perspective can be helpful when designing or analyzing an observation process on the objects of $\mathsf{C}$ . Notice that if an observation process $F$ is a functor $F: \mathsf{C}_p \to D$ , then if $F(a)$ and $F(b)$ are not isomorphic in $D$ , then $a$ and $b$ are not isomorphic in $\mathsf{C}_p$ and so they have different values of the property $p$ .

I assume $D$ is the category of "values" that property $p$ can take on, where "value" is whatever is the meaningful in measuring $p$ . I'm not sure what you gain from taking $F$ to be a functor $F \colon \mathsf C_p \to D$ rather than $F \colon \mathsf C \to D$ , where you can use the fiber of $F$ on the observed value of $p$ to get the collection of objects in $C$ that have that observed value of the property when using that observation process. Using the fiber approach, you can still directly access the objects of inquiry.
Moreover, if your measurements of $p$ have any margin of error, equivalence classes might be too "sharp" to be useful. I.e., you could have two objects that can be distinguished as having clearly different values of $p$ that end up in the same equivalence class because there is a sequence of indistinguishables between them. In the (very likely) extreme case, all your equivalence classes would collapse into a single equivalence class.

As a possible alternative, you could say two objects $a$ and $b$ are "the same" when measured by property $p$ , as observed by process $F$ , if there is a reference object $c$ such that you get a cospan $a \to c \leftarrow b$ . You can build something like an equivalence class of all objects that can form a cospan using a particular reference object. A key difference with these equivalence-class-like things is there can be overlap between two of them. So there still may be sequences of indistinguishable objects between two objects that can be distinguished, but the reference object would have to vary along the way, breaking the "sameness", avoiding the issue noted above. Hopefully you can get some mileage out of these thoughts if I haven't misunderstood you too badly.

A related question would be: How to model the fact that objects generally have more than one property?

For conciseness, I will write $p(X) = a$ for "the value of $X$ when measured according to (some observation process that measures) property $p$ is $a$ ." Then for the two-property case we are looking at $p(X) = a$ AND $q(X) = b$ . That AND immediately makes me think there should be some kind of product going on, and my intuition would point to the "reference objects" in the cospans. But the arrows clash between the simplest kind of products and cospans – a binary product with its projection maps has the form of a span: $A \leftarrow A \times B \to B$ . So maybe spans would work better than cospans in the previous paragraph? Another issue is the possibility for extraneous cospans to appear (one side coming from $p$ , the other side from $q$ , i.e. the same reference object is used for both properties). Maybe a fancier product works better? Or maybe some fancier cospan-based gadgets can deal with it? I'm too tired to work out any more details tonight.

Neither here nor there, I think it would have been fitting (or at least amusing) if "The mass of this stone is 6.35 kg" had been used instead of 15 kg.

David Egolf (Feb 08 2023 at 15:55):

Thanks for sharing your thoughts @Jason Erbele !
Looking back, it feels like the thoughts I wrote up here are very rough. I am glad you found some interest in going through them at least.

I'm trying to recall my motivation for talking about $\mathsf{C}_p$ . I think part of the idea was to try and create a category that models the property of interest. However, you are right to point out that much the same information is contained in a map that just labels each object of $\mathsf{C}$ directly. I think by talking about a functor from $\mathsf{C}_p$ to $\mathsf{D}$ I was hoping to make it easy to talk about structure in the observations made (in $\mathsf{D}$ ) that relate to figuring out the values of the property for an unknown object. I think I was trying to model categorically - in terms of morphisms - the property of interest in $\mathsf{C}_p$ .

I agree there are problems with working with equivalence classes in the context of e.g. noise. Although I'm not sure I understand what you are saying about a "sequence of indistinguishables" though.

Thinking in terms of cospans with respect to different reference objects is very interesting! I suppose the case where we are considering an equivalence class generated by isomorphisms between the objects is a special case of this: if $a \cong b$ , then we have a cospan $a \to a \leftarrow b$ provided by the isomorphism.

If I understand the idea, the reference object can provide a sort of "local" reference useful for measuring a certain kind of similarity. For example, imagine we have a category where the objects are length 10 vectors of real numbers, and we put a morphism $a \to c$ if $\|a-c\|$ is small enough (hmmm, although maybe this doesn't define a category - the composite of two morphisms might not always exist... hopefully the intuition expressed here is still relevant). Then a cospan $a \to c \leftarrow b$ says that $a$ and $b$ are both close enough to $c$ , and therefore are somewhat close to eachother. In this case, the equivalence-class-like-things are spheres of vectors, and they can overlap (unlike actual equivalence classes) because a given vector can be in many spheres.

David Egolf (Feb 08 2023 at 16:08):

While I'm thinking about this, I had another complaint about the modelling approach I used above. I talked about creating a category where we put a pair of isomorphisms between two objects $a$ and $b$ if $a$ and $b$ have the same value of some property. This results in pretty boring morphisms, distinguished entirely by their source and target. It feels like we are not using the fact that we can have more interesting morphisms between objects.

We can try and take more seriously the idea that an object is determined by its relationship to all others in our modelling. There is still a relationship between two objects with different values of a property, provided that the property has some kind of structure.

For example, say our property $p$ takes on values that naturally form a group. Then if object $A$ has value $p_A$ and object $B$ has value $p_B$ , then there is some kind of equation $p_A = p_B \circ q$ in the group corresponding to the property. To describe this relationship between $A$ and $B$ in terms of the property $p$ , we might try creating a category where there is a morphism from $A$ to $B$ labelled by $q$ . So we keep track of the difference in the property values between the objects, under this modelling approach.

Maybe this could be combined with the cospan approach above?
Anyways, it's fun to think about this again, a little!

David Egolf (Feb 08 2023 at 16:13):

I guess my overall thought on this at the moment is that it's hard to assess how good a given approach for modelling is without some specific test cases. I'm also unsure how I would assess a given modelling approach even if I picked out a specific thing I wanted to model. I suppose these are some of the challenges of trying to apply math, and I assume I would need to learn a lot more to get anywhere with this! Still, it's fun to think about.

Jason Erbele (Feb 08 2023 at 18:20):

Sorry about my vague "sequence of indistinguishables" comment. To give a concrete example of what I had in mind, suppose you can measure lengths with (for simplicity) a constant noise level of 0.01 units. Then an object that is observed to be 3 units long is clearly distinct from an object that is observed to be 4 units long. But you could find a sequence of objects that are observed to be 3.005, 3.01, 3.015, ..., 3.995 units long. Because of the noise level, consecutive observed objects in the sequence could not be distinguished as having different lengths. If we were using equivalence classes, $3 \cong 3.005 \cong 3.01 \cong 3.015 \cong \dots \cong 3.995 \cong 4$ when we should have $3 \not\cong 4$ .

If I understand the idea, the reference object can provide a sort of "local" reference useful for measuring a certain kind of similarity. For example, imagine we have a category where the objects are length 10 vectors of real numbers, and we put a morphism $a \to c$ if $\|a-c\|$ is small enough (hmmm, although maybe this doesn't define a category - the composite of two morphisms might not always exist... hopefully the intuition expressed here is still relevant). Then a cospan $a \to c \leftarrow b$ says that $a$ and $b$ are both close enough to $c$ , and therefore are somewhat close to eachother. In this case, the equivalence-class-like-things are spheres of vectors, and they can overlap (unlike actual equivalence classes) because a given vector can be in many spheres.

If you think of the reference objects as being some kind of Platonic ideal, they should only have arrows going to them, not going out (or maybe vice versa?). That was the intuition that motivated my cospan comments. I can see with your example here that we might run into problems if we treat real objects as Platonic ideals (with maybe some very limited exceptions, like the standard kilogram). So here's a very rough idea: let $\mathsf C$ be your discrete category of observable objects. Freely add "Platonic ideal objects" for property $p$ to get the objects of $\mathsf C_p$ (here we would distinguish the vector (6,8) from the Platonic ideal of (6,8), for instance). If $a$ is an observable object, $c$ is a Platonic ideal object, and $F \colon \mathsf C \to \mathsf C_p$ is an observation process that observes $p$ , then $a \to c$ is an arrow in $\mathsf C_p$ if $\|F(a) - c \|$ is smaller than the noise signal. Since the only nontrivial arrows in $\mathsf C_p$ are the ones pointing to Platonic ideal objects (and there are no arrows between different Platonic ideal objects), the composite of two morphisms is still a morphism. So $\mathsf C_p$ still defines a category.

This idea is still rough, but I think it can be fleshed out to deal with multiple properties and even with multiple observation methods that observe the same property, like measuring someone's height to the nearest centimeter and to the nearest inch to get more precision than either measurement alone. For noisy data, you might even assign probabilities to each arrow. I think an ambitious application along those lines might be to determine the likelihood that two CCTV images several hours apart depict the same individual.

David Egolf (Feb 08 2023 at 19:06):

Oh, I see - by only having arrows to the reference objects ("Platonic ideal objects" if you prefer), we avoid having to compose these arrows with one another. That enables us to introduce these arrows to model similarity to reference objects in terms of cospans, as we wanted above - and to get a category in the process. Very cool!

And thank you for clarifying your comment on sequences of indistinguishables. That makes a lot of sense!

John Baez (Feb 08 2023 at 19:21):

David Egolf said:

While I'm thinking about this, I had another complaint about the modelling approach I used above. I talked about creating a category where we put a pair of isomorphisms between two objects $a$ and $b$ if $a$ and $b$ have the same value of some property. This results in pretty boring morphisms, distinguished entirely by their source and target.

By the way, a category where all the morphisms are isomorphisms and there's at most one isomorphism from any object to any other is just the same as equivalence relation.

A category where all the morphisms are isomorphisms is called a [[groupoid]].

A category where there's at most one morphism from any object to any other object is called a [[preorder]].

So, a groupoid that's a preorder is an [[equivalence relation]].

John Baez (Feb 08 2023 at 19:21):

These are indeed a bit dull as categories go! Most people who use equivalence relations don't even bother to treat them as categories. But it can be useful to do so!