You're reading the public-facing archive of the Category Theory Zulip server.
To join the server you need an invite. Anybody can get an invite by contacting Matteo Capucci at name dot surname at gmail dot com.
For all things related to this archive refer to the same person.
In this thread, I would like to share my progress reading Lawvere's Introduction Linear Categories and Applications, including some attempts to solve the exercises. Anyone interested in this material is welcome to contribute.
I like the little algebra exercises, this should be fun. How far have you gotten by yourself, @Moby-Dick ?
Let's start by the Foreword. Lawvere begins by considering a set whose elements are either non-negative real numbers or the positive infinity. He introduces the operations:
a + b := minimum of a and b,
a * b := ordinary sum of a and b.
Using this notation, we will look like crazy to the outsiders.
Lawvere's motivation is to find the shortest path between two cities.
Huh? That's not an example he gives as far as I've seen
He mentions "linear control systems", but I don't know what that precisely refers to without further context (which presumably may arise later)
in page ii he says: "the elements of A are indices for certain states or products or locations, likewise B." I imagine these locations as cities.
The cost of going from city a city to another is defined. For example, the cost can be a length or the cost in oil. Using matrix multiplication, he computes the minimum cost of going from city a in province A to city c in province C, passing through city b in province B.
Okay, but he doesn't talk about finding shortest paths (although I can't deny that for a particular choice of linear category that would be what is calculated!) so I don't think you could say that's his "motivation". Maybe I'm being too picky.
minimum cost from a to c = minimum { cost from a to b + cost from b to c : for all b in B}
the length is a particular kind of cost
Sure. It's a useful particular case to have in mind, at any rate!
To fix ideas, let A = Quebec, B = Ontario, C = Manitoba,
a = Montreal, c = Winnipeg
A --- f ---> B is called the matrix (of cost for traveling between between the cities of Quebec and Ontario).
the entry f(b/a) of the matrix f represents the cost of going from city a to city b
B --- g ---> C is the matrix (of cost for traveling between between the cities of Ontario and Manitoba).
the entry g(c/b) of the matrix g represents the cost of going from city b to city c
That's unusual notation, right? One might be more accustomed to or for the coefficients of matrices.
time to compose!
A --- gf ---> C is the matrix of cost for traveling from the cities of Quebec to the cities of Manitoba, choosing the minimum cost strategy while crossing Ontario
yes, his notation is not your friend
I would prefer lower and upper index notation, like in tensors
Explicitly, gf(c/a) = min_{b in B} ( g(c/b) + f(b/a) ).
Or, in tropical notation, we have a matrix multiplication in a rig:
gf(c/a) = sum_{b in B} g(c/b)*f(b/a)
Moby-Dick said:
time to compose!
A --- gf ---> C is the matrix of cost for traveling from the cities of Quebec to the cities of Manitoba, choosing the minimum cost strategy while crossing Ontario
Looking at the geography of Canada, this seems a reasonable description for this case, but if you had chosen a smaller region that one doesn't have to pass through on a shortest path from A to C, it becomes clearer that the composite is more accurately "the shortest path from cities of Quebec to cities of Manitoba conditioned on passing through some city in Ontario". For instance, if there is some trail through Ontario that happens to be shorter than any path through a city, then we won't actually be calculating the matrix of shortest paths here.
We can also interpret A, B and C as Bachelor, Master an PhD, so that the elements of each set is a university in which this degree is obtained. f and g can be interpreted as the tuition cost.
While the general concept of category involves associativity of "multiplication", the additional special feature of linear categories is that the "maps" from any object A to any object B can also be added in a way that satisfies the distributive laws below (also known as bilinearity) [diagrams omitted]
To place this in some wider context, categories in which the homsets form abelian groups had been around for a long time before Lawvere gave these lectures. Another way of presenting his definition would be to say that linear categories are categories enriched in commutative monoids. People have since also studied variants where the addition distributes over composition only on one side or the other.
What a nice book, thanks for sharing it
Screenshot 2026-03-05 at 08-51-56 Lawvere-LinearCategories.pdf.png
Reading this fragment in the foreword (below, on page iii), I was reminded of George Soros's theory of reflexivity. According to Soros, in social phenomena there are two functions: the cognitive function (used to understand reality) and the manipulative function (used to change reality). Reflexivity happens when the input of the cognitive function is the output of the manipulative function, and the input of the manipulative function is the output of the cognitive function. This is the explanation of self-referential social phenomena, such as the economic bubbles.
For Lawvere, instead of cognitive and manipulative functions there are two processes: one objective (x) and the other subjective (y). He says that y controls x if the equation x = b(y - a x) is satisfied, where a x is the observation of the objective process x and b is the controlling intervention. Here is what I think may be an example of a physical system illustrating this model of control: Inverted pendulum
I don't know to what extent Soros's reflexivity can be formalized within the framework of linear categories. I leave it as an open question. My favorite author concerning self-referential economic phenomena is Didier Sornette. Here is the Master’s Thesis of one of his students, in which the theory of reflexivity is explored:
Shintre, Devendra. "Modelling Forex Market Reflexivity using Self-Exciting Point Process and Ensemble Learning." (2019).
Reading into the first chapter, we get to Lawvere's definition of rig. Interestingly, Lawvere chooses the "four-variable combination of the commutative and associative laws" such as for addition, but I notice that he only makes explicit one side of the unit laws, such as . It's not immediately obvious to me how to prove that from here, which is needed to deduce associativity and commutativity from the four-variable axiom. Am I missing something?
can't you go ?
where does the last equation come from?
I put the axioms in prover9, let it run for some time, and it couldn't find a proof of , so it's probably safe to say it's not provable, even though I don't have a counter model.
Here is my code for the record:
formulas(sos).
0 + x = x.
(a + b) + (x + y) = (a + x) + (b + y).
1 * x = x.
(a * b) * (x * y) = (a * x) * (b * y).
a * 0 = 0.
a * (x + y) = (a * x) + (a * y).
end_of_list.
formulas(goals).
x + 0 = x.
end_of_list.
If the axioms listed by @Clémence Chanavat are all the axioms, then is clearly not provable. The universal closure of each formula in sos is satisfied by the operations and .
But under that operation, when one has .
Notice the typo
typo_rig.png
Ah the moral of the story for me is not to try to be too efficient with one's axiomatisations
As a statistician, my main motivation for studying rigs (also known as "semi-rings" in the French literature) is their thermodynamic interpretation (some rigs can be interpreted this way, not all):
Source: https://arxiv.org/pdf/1108.2874
(Of course, statistical mechanics and statistical inference are two different sciences. However, in statistical inference many models are mathematically equivalent to models in statistical mechanics. For example, the probability distribution underlying logistic regression corresponds to the Boltzmann distribution.)
Interesting! Do you already have some intuition of what the relationship between the above and linear categories could be?
To develop that intuition, I need to take the Lawvere's course and read Matilde Marcolli & Ryan Thorngren's paper "Thermodynamic Semirings". I just copied from the last paper what the dictionary could be.
In other words, I am in the process of developing that intuition, but I guess it will take a year.