You're reading the public-facing archive of the Category Theory Zulip server.
To join the server you need an invite. Anybody can get an invite by contacting Matteo Capucci at name dot surname at gmail dot com.
For all things related to this archive refer to the same person.
The gluing lemma is the following statement, taken from Villani's monograph on optimal transport:
image.png
image.png
I wonder if a formalization in Markov categories with conditionals is known?
Yes! See for example here, sections 11 and 12.
(I think the construction is exactly the one in Definition 12.8, at least if you want the law of those random variables, which is what matters for optimal transport.)
Ah nice! The remark about the category of couplings is what I was looking for!
It's so good to have the necessary hypotheses for these results neatly figured out..!
Oh. Take a look at Definition 13.8 then.
(There is quite a big line of work on that category by the way, starting here.)
By the way, the fact that categories of couplings (i.e. joint distributions with specified marginals) and of transport plans (i.e. particular kernels/probabilistic relations) are one and the same thing, is a generalization of the very famous correspondence between regular categories and tabular allegories.
See for example this recent preprint and my talk at CT.
(There is for sure a double-categorical story as well.)
Paolo Perrone said:
By the way, the fact that categories of couplings (i.e. joint distributions with specified marginals) and of transport plans (i.e. particular kernels/probabilistic relations) are one and the same thing, is a generalization of the very famous correspondence between regular categories and tabular allegories.
See for example this recent preprint and my talk at CT.
(There is for sure a double-categorical story as well.)
Wow, that's neat. I haven't looked at those things yet. Do you find a general abstract result that includes those two correspondences as special cases?
Not quite but almost. But I think that general abstract result exists.
(The correspondence for now uses a strong version of 'maps', which in particular must be split epi. This is good enough e.g. for probability, but too strong for relations.)
Paolo Perrone said:
By the way, the fact that categories of couplings (i.e. joint distributions with specified marginals) and of transport plans (i.e. particular kernels/probabilistic relations) are one and the same thing, is a generalization of the very famous correspondence between regular categories and tabular allegories.
See for example this recent preprint and my talk at CT.
(There is for sure a double-categorical story as well.)
Oh, right :)
Actually, 'coupling' and 'transport plan' are usualy synonym (see e.g. [[coupling]]). Is the theorem that any coupling corresponds to a kernel , by disintegration?
There are two levels.
1) A joint distribution and an a.s. equivalence class of measure-preserving kernel amount to the same data (provided disintegrations exist). Hence they are both referred to as 'couplings' or 'transport plans' interchangeably.
2) There is a more general notion of coupling: a span of (eq. classes of) measure-preserving functions between probability spaces. (And the joint distribution is a terminal such span, in an appropriate sense.)
What we have here is that the category of kernels is equivalent to a category of spans of functions (=deterministic kernels). And this parallels how the category of relations is a category of spans of functions (=deterministic relations).
Nice!
The neat thing is that not only do kernels correspond to joint distributions, but also, and mostly, the composition of kernels corresponds to the 'gluing' of joint distributions (followed by marginalization on the middle variable).
This has to do with the fact that Markov kernels compose in a 'Markov' way (hence the name): in a composite kernel, the two probabilistic transitions happen independently of one another.
(This happens with relations too, and it's exactly what the Beck-Chevalley condition is about.)
image.png
I only just saw this paper, but I found this part very intersting. I recently had cause to use this fact (that a square is independent in this sense if and only if the square obtained after replacing the vertical maps with their Bayesian inverses is still commutative), in some work on stochastic processes. This condition is basically the one-step version of what probability theorists call an immersion of filtrations (formally this means that any martingale for the smaller filtration is still a martingale when considered with respect to the larger filtration). Very interesting stuff!
Eigil Rischel said:
I find this very fascinating too.
(By the way I recently gave a short talk focusing on just that: https://www.youtube.com/watch?v=mUPJEt3FeiU)