Category Theory
Zulip Server
Archive

You're reading the public-facing archive of the Category Theory Zulip server.
To join the server you need an invite. Anybody can get an invite by contacting Matteo Capucci at name dot surname at gmail dot com.
For all things related to this archive refer to the same person.


Stream: theory: categorical probability

Topic: characterization of disintegrations


view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:09):

Consider a probability space (B,db)(B, db) and a family of probability spaces (Eb,deb)(E_b, de_b) indexed by the points of bb. I think you can equip E=bBEbE = \sum_{b \in B} E_b with a measurable structure, generated by measurables of the form bUEb\sum_{b \in U} E_b for UBU \subseteq B measurable and of the form bBVb\sum_{b \in B} V_b for VbEbV_b \subseteq E_b measurable.
I think you can also equip EE with a measure dede, defined on generators as

de(bUEb)=db(U)de(bBVb)=bdeb(Vb)dbde\left(\sum_{b \in U} E_b\right) = db(U)\\ de\left(\sum_{b \in B} V_b\right) = \int_b de_b(V_b)\,db

In this way there is an obvious measure-preserving projection map p:EBp:E \to B.

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:12):

Two questions:

  1. Does this construction make sense in generality? Is there a more general one in which I don't need to work pointwise eith B?
  2. Does anyone know a characterization of the maps p:EBp:E \to B arising in this way? I know these are called 'disintegrations'. Do they have a lifting property of some kind?

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:14):

Is the space B countable or uncountable?

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:15):

Does it make a difference?

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:16):

In general, yes. (I know it shouldn't, but that's how measure theory works...)

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:16):

Uh, why?

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:21):

Basically, in the second equation, for the right-hand side to type-check and exist, we want the integrand to be measurable (= integrable, in this case) as a function of bb. This is automatic if BB is discrete, but not in general.

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:25):

I see, it is not a cardinality problem though

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:26):

I found this definition which seems to be more careful about that point but I don't get it. It seems to be defining disintegrations only for 'trivial' bundles X×YYX \times Y \to Y.
image.png

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:27):

Consider the (easier) case where all the EbE_b are equal, so that the total space is just a product. (But we allow the measures debde_b to vary, so that we still have a "dependent product" feeling.)
In this case we want debde_b to be measurable in bb in the sense that for every measurable AEbA\subseteq E_b, the function bdeb(A)b\mapsto de_b(A) is measurable. This is precisely saying that we have a Markov kernel, (equivalently, a regular conditional distribution, or a disintegration).

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:27):

It's not a cardinality issue, you are right. It's just an issue that becomes visible once we leave the discrete case.

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:28):

Measurability in bb is crucial.

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:28):

Yeah I see what you mean

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:28):

This is one of the main reasons why in categorical probability we are so fixated with Markov kernels. The good news is that once you do have that measurability, a lot of things work very well.

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:29):

Okok so I'll add that assumption

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:29):

In terms of Markov categories, this is precisely the idea of having conditionals.

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:30):

Now if the EbE_b are not all equal, it becomes tricky to state the measurability condition: who is AA?

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:32):

I guess you have to consider BB-indexed families of VbEbV_b \subseteq E_b and ask measurability for the measures bdeb(Vb)b \mapsto de_b(V_b)

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:33):

Though yeah it's easier said than done :thinking:

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:35):

Well, the good news is that measurable spaces are very "coarse", and a lot of nice measurable spaces turn out to be isomorphic. (For example, R\mathbb{R}, R2\mathbb{R}^2, S1S^1 and every separable Banach space of dimension >0 are isomorphic as measurable spaces!)
In particular, the only standard Borel spaces (which always admit disintegrations) are, up to iso, R\mathbb{R} and its retracts (which are N\mathbb{N} and the finite sets).
So, in very very many situations, it is not a real loss of generality to assume that the EbE_b are all the same (take them to be R\mathbb{R}).

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:36):

Bold words to write on a CT chatroom XD

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:36):

Even if you want some fibers to be discrete, you can take R\mathbb{R}, and as measure you take one supported, say, on NR\mathbb{N}\subseteq\mathbb{R}

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:36):

(That's at least what probabilists usually do.)

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:38):

I'm more interested in characterizing when such a situation is available than constructing one, i.e. my question (2). That is, suppose you have a measure-preserving map p:EBp:E \to B already, when can you disintegrate the measure on EE such that (b,e)φ(b,e)d(b,e)=beφ(b,e)dbdeb\int_{(b,e)} \varphi(b,e) d(b,e) = \int_b \int_e \varphi(b,e)d b de_b?

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:39):

That's an excellent question. That's precisely (one of the many version of) the disintegration theorem.

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:40):

One positive answer is: if EE is standard Borel, then you always can.

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:40):

(Abstractly, the Markov category of standard Borel spaces has all conditional distributions.)

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:41):

Paolo Perrone said:

(Abstractly, the Markov category of standard Borel spaces has all conditional distributions.)

Can you really phrase non-trivial disintegration as a Markov-categorical conditional?

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:41):

Yes!

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:41):

Uhm maybe you are saying that in that case, every such bundle is trivial so it doesn't matter

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:41):

Else can you show me how?

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:44):

Matteo Capucci (he/him) said:

Can you really phrase non-trivial disintegration as a Markov-categorical conditional?

Yes, that's one of the reasons which "sold me" to Markov categories.
(The other one being: these measurability issues are clearly necessary, but I find them very tricky to work with. Using Markov categories everything becomes string diagrams, and I find it much more suggestive and intuitive that way.)

view this post on Zulip Tobias Fritz (Jan 22 2025 at 14:45):

I'm just catching up on this, so I hope you don't mind me adding 2 cents. The fact that conditionals give you disintegrations is essentially Prop 11.7 in my 'synthetic' paper:
image.png
(Take A=IA = I to get your situation, with your pp in place of ff, which happens to be deterministic and surjective, but neither of these is actually relevant.)

Nowadays I guess we'd call this 'parametric Bayesian inversion' instead of disintegration.

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:45):

Matteo Capucci (he/him) said:

Uhm maybe you are saying that in that case, every such bundle is trivial so it doesn't matter

I don't think it's a matter of "triviality of the bundle" here. If we see the fibers as measure spaces, they are allowed to be very different, so our bundle is not even locally trivial. Still disintegrations exist.

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:47):

Tobias Fritz said:

The fact that conditionals give you disintegrations is essentially Prop 11.7 in my 'synthetic' paper

Exactly. The non-parametric case is drawn here on the nlab (which does not allow triangles in string diagrams):
image.png

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:49):

Ah, interesting!!

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:50):

The trick is to give the family of spaces (Eb,deb)(E_b, de_b) as a Markov kernel BbBEbB \to \sum_{b \in B} E_b, I see.

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:51):

In Tobias notation, my EE s are YY and BB is XX, with the assignment given by f:XYf:X \to Y and the base measure being s:IXs:I \to X

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:51):

Yes! So a Markov kernel, "morally", is a "dependent probability measure", with all the technicalities worked out.

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:51):

Matteo Capucci (he/him) said:

In Tobias notation, my $$E$$s are YY and BB is XX, with the assignment given by f:XYf:X \to Y and the base measure being s:IXs:I \to X

No, wait. Your total space is X and your B is Y.

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:52):

The confusing thing is that you never say f:BbEbf:B \to \sum_b E_b has to split the projection map. And I guess I understand why: you are going to condition on EbE_b anyway.

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:52):

That is, debde_b is actually f(b)Ebf(b)\vert_{E_b}

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:53):

Paolo Perrone said:

Matteo Capucci (he/him) said:

In Tobias notation, my $$E$$s are YY and BB is XX, with the assignment given by f:XYf:X \to Y and the base measure being s:IXs:I \to X

No, wait. Your total space is X and your B is Y.

Uh-oh did I get it backwards?

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:54):

Right... you specify a measure on EE and then show it disintegrates, what I describe is integration. Guess I am constructively minded :P

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:54):

Oh, you can prove using string diagrams (try!) that the following conditions are equivalent:

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:55):

Are you talking now about the disintegrating ff or the integrating ff? :sweat_smile:

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:55):

If you need a reference, see Proposition 2.5 here (but it's not where it was first proven)

view this post on Zulip Matteo Capucci (he/him) (Jan 22 2025 at 14:55):

I guess the disintegrating one, i.e. the 'bundle projection', right?

view this post on Zulip Paolo Perrone (Jan 22 2025 at 14:56):

f is the projection. Its Bayesian inverse/disintegration is the conditional