Category Theory
Zulip Server
Archive

You're reading the public-facing archive of the Category Theory Zulip server.
To join the server you need an invite. Anybody can get an invite by contacting Matteo Capucci at name dot surname at gmail dot com.
For all things related to this archive refer to the same person.


Stream: learning: reading & references

Topic: Lawvere, Introduction Linear Categories and Applications


view this post on Zulip Moby-Dick (Mar 03 2026 at 09:39):

In this thread, I would like to share my progress reading Lawvere's Introduction Linear Categories and Applications, including some attempts to solve the exercises. Anyone interested in this material is welcome to contribute.

view this post on Zulip Morgan Rogers (he/him) (Mar 03 2026 at 09:50):

I like the little algebra exercises, this should be fun. How far have you gotten by yourself, @Moby-Dick ?

view this post on Zulip Moby-Dick (Mar 03 2026 at 09:56):

Let's start by the Foreword. Lawvere begins by considering a set whose elements are either non-negative real numbers or the positive infinity. He introduces the operations:
a + b := minimum of a and b,
a * b := ordinary sum of a and b.

view this post on Zulip Moby-Dick (Mar 03 2026 at 09:57):

Using this notation, we will look like crazy to the outsiders.

view this post on Zulip Moby-Dick (Mar 03 2026 at 09:58):

Lawvere's motivation is to find the shortest path between two cities.

view this post on Zulip Morgan Rogers (he/him) (Mar 03 2026 at 10:00):

Huh? That's not an example he gives as far as I've seen

view this post on Zulip Morgan Rogers (he/him) (Mar 03 2026 at 10:02):

He mentions "linear control systems", but I don't know what that precisely refers to without further context (which presumably may arise later)

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:04):

in page ii he says: "the elements of A are indices for certain states or products or locations, likewise B." I imagine these locations as cities.

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:11):

The cost of going from city a city to another is defined. For example, the cost can be a length or the cost in oil. Using matrix multiplication, he computes the minimum cost of going from city a in province A to city c in province C, passing through city b in province B.

view this post on Zulip Morgan Rogers (he/him) (Mar 03 2026 at 10:11):

Okay, but he doesn't talk about finding shortest paths (although I can't deny that for a particular choice of linear category that would be what is calculated!) so I don't think you could say that's his "motivation". Maybe I'm being too picky.

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:12):

minimum cost from a to c = minimum { cost from a to b + cost from b to c : for all b in B}

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:13):

the length is a particular kind of cost

view this post on Zulip Morgan Rogers (he/him) (Mar 03 2026 at 10:14):

Sure. It's a useful particular case to have in mind, at any rate!

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:17):

To fix ideas, let A = Quebec, B = Ontario, C = Manitoba,

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:18):

a = Montreal, c = Winnipeg

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:20):

A --- f ---> B is called the matrix (of cost for traveling between between the cities of Quebec and Ontario).

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:20):

the entry f(b/a) of the matrix f represents the cost of going from city a to city b

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:21):

B --- g ---> C is the matrix (of cost for traveling between between the cities of Ontario and Manitoba).
the entry g(c/b) of the matrix g represents the cost of going from city b to city c

view this post on Zulip Morgan Rogers (he/him) (Mar 03 2026 at 10:24):

That's unusual notation, right? One might be more accustomed to f(a,b)f(a,b) or fa,bf_{a,b} for the coefficients of matrices.

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:26):

time to compose!
A --- gf ---> C is the matrix of cost for traveling from the cities of Quebec to the cities of Manitoba, choosing the minimum cost strategy while crossing Ontario

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:26):

yes, his notation is not your friend

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:27):

I would prefer lower and upper index notation, like in tensors

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:28):

Explicitly, gf(c/a) = min_{b in B} ( g(c/b) + f(b/a) ).

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:29):

Or, in tropical notation, we have a matrix multiplication in a rig:
gf(c/a) = sum_{b in B} g(c/b)*f(b/a)

view this post on Zulip Morgan Rogers (he/him) (Mar 03 2026 at 10:32):

Moby-Dick said:

time to compose!
A --- gf ---> C is the matrix of cost for traveling from the cities of Quebec to the cities of Manitoba, choosing the minimum cost strategy while crossing Ontario

Looking at the geography of Canada, this seems a reasonable description for this case, but if you had chosen a smaller region that one doesn't have to pass through on a shortest path from A to C, it becomes clearer that the composite is more accurately "the shortest path from cities of Quebec to cities of Manitoba conditioned on passing through some city in Ontario". For instance, if there is some trail through Ontario that happens to be shorter than any path through a city, then we won't actually be calculating the matrix of shortest paths here.

view this post on Zulip Moby-Dick (Mar 03 2026 at 10:37):

We can also interpret A, B and C as Bachelor, Master an PhD, so that the elements of each set is a university in which this degree is obtained. f and g can be interpreted as the tuition cost.

view this post on Zulip Morgan Rogers (he/him) (Mar 03 2026 at 12:15):

While the general concept of category involves associativity of "multiplication", the additional special feature of linear categories is that the "maps" from any object A to any object B can also be added in a way that satisfies the distributive laws below (also known as bilinearity) [diagrams omitted]

To place this in some wider context, categories in which the homsets form abelian groups had been around for a long time before Lawvere gave these lectures. Another way of presenting his definition would be to say that linear categories are categories enriched in commutative monoids. People have since also studied variants where the addition distributes over composition only on one side or the other.

view this post on Zulip Simon Burton (Mar 03 2026 at 18:50):

What a nice book, thanks for sharing it

view this post on Zulip Moby-Dick (Mar 05 2026 at 14:19):

Screenshot 2026-03-05 at 08-51-56 Lawvere-LinearCategories.pdf.png
Reading this fragment in the foreword (below, on page iii), I was reminded of George Soros's theory of reflexivity. According to Soros, in social phenomena there are two functions: the cognitive function (used to understand reality) and the manipulative function (used to change reality). Reflexivity happens when the input of the cognitive function is the output of the manipulative function, and the input of the manipulative function is the output of the cognitive function. This is the explanation of self-referential social phenomena, such as the economic bubbles.

For Lawvere, instead of cognitive and manipulative functions there are two processes: one objective (x) and the other subjective (y). He says that y controls x if the equation x = b(y - a x) is satisfied, where a x is the observation of the objective process x and b is the controlling intervention. Here is what I think may be an example of a physical system illustrating this model of control: Inverted pendulum

I don't know to what extent Soros's reflexivity can be formalized within the framework of linear categories. I leave it as an open question. My favorite author concerning self-referential economic phenomena is Didier Sornette. Here is the Master’s Thesis of one of his students, in which the theory of reflexivity is explored:

Shintre, Devendra. "Modelling Forex Market Reflexivity using Self-Exciting Point Process and Ensemble Learning." (2019).

view this post on Zulip Morgan Rogers (he/him) (Mar 07 2026 at 11:37):

Reading into the first chapter, we get to Lawvere's definition of rig. Interestingly, Lawvere chooses the "four-variable combination of the commutative and associative laws" such as (a+b)+(c+d)=(a+c)+(b+d)(a+b)+(c+d) = (a+c)+(b+d) for addition, but I notice that he only makes explicit one side of the unit laws, such as 0+x=x0+x = x. It's not immediately obvious to me how to prove that x+0=xx+0 = x from here, which is needed to deduce associativity and commutativity from the four-variable axiom. Am I missing something?

view this post on Zulip Matteo Capucci (he/him) (Mar 07 2026 at 11:40):

can't you go x+0=(0+0)+(x+0)=(0+x)+(0+0)=xx+0 = (0+0)+(x+0) = (0+x)+(0+0) = x?

view this post on Zulip Morgan Rogers (he/him) (Mar 07 2026 at 11:46):

where does the last equation come from?

view this post on Zulip Clémence Chanavat (Mar 07 2026 at 13:12):

I put the axioms in prover9, let it run for some time, and it couldn't find a proof of x+0=0x + 0 = 0, so it's probably safe to say it's not provable, even though I don't have a counter model.
Here is my code for the record:

formulas(sos).

    0 + x = x.
    (a + b) + (x + y) = (a + x) + (b + y).
    1 * x = x.
    (a * b) * (x * y) = (a * x) * (b * y).
    a * 0 = 0.
    a * (x + y) = (a * x) + (a * y).

end_of_list.

formulas(goals).

  x + 0 = x.

end_of_list.

view this post on Zulip Zoltan A. Kocsis (Z.A.K.) (Mar 07 2026 at 13:33):

If the axioms listed by @Clémence Chanavat are all the axioms, then x.x+0=x\forall x. x + 0 = x is clearly not provable. The universal closure of each formula in sos is satisfied by the operations x+y:=yx + y := y and xy:=yx \cdot y := y.

But under that operation, when 010 \neq 1 one has 1+0=011 + 0 = 0 \neq 1.

view this post on Zulip Moby-Dick (Mar 07 2026 at 13:44):

Notice the typo
typo_rig.png

view this post on Zulip Morgan Rogers (he/him) (Mar 07 2026 at 22:24):

Ah the moral of the story for me is not to try to be too efficient with one's axiomatisations

view this post on Zulip Moby-Dick (Mar 08 2026 at 03:41):

As a statistician, my main motivation for studying rigs (also known as "semi-rings" in the French literature) is their thermodynamic interpretation (some rigs can be interpreted this way, not all):

Source: https://arxiv.org/pdf/1108.2874

view this post on Zulip Moby-Dick (Mar 08 2026 at 03:46):

(Of course, statistical mechanics and statistical inference are two different sciences. However, in statistical inference many models are mathematically equivalent to models in statistical mechanics. For example, the probability distribution underlying logistic regression corresponds to the Boltzmann distribution.)

view this post on Zulip Morgan Rogers (he/him) (Mar 08 2026 at 08:45):

Interesting! Do you already have some intuition of what the relationship between the above and linear categories could be?

view this post on Zulip Moby-Dick (Mar 08 2026 at 13:55):

To develop that intuition, I need to take the Lawvere's course and read Matilde Marcolli & Ryan Thorngren's paper "Thermodynamic Semirings". I just copied from the last paper what the dictionary could be.

view this post on Zulip Moby-Dick (Mar 08 2026 at 13:56):

In other words, I am in the process of developing that intuition, but I guess it will take a year.