Category Theory
Zulip Server
Archive

You're reading the public-facing archive of the Category Theory Zulip server.
To join the server you need an invite. Anybody can get an invite by contacting Matteo Capucci at name dot surname at gmail dot com.
For all things related to this archive refer to the same person.


Stream: event: Categorical Probability and Statistics 2020 workshop

Topic: Jun 6: Paolo Perrone's talk


view this post on Zulip Paolo Perrone (Jun 04 2020 at 19:12):

Hey all,
This is the discussion thread of my talk, "Probability monads and stochastic dominance".
The talk, besides being on Zoom, is livestreamed here: https://youtu.be/kKDMCDUaxxE

view this post on Zulip Paolo Perrone (Jun 04 2020 at 19:24):

Date and time: Saturday, 6 Jun, 15h UTC.

view this post on Zulip Paolo Perrone (Jun 06 2020 at 14:30):

Hi! We start in 30 minutes.

view this post on Zulip Joscha Diehl (Jun 06 2020 at 15:57):

Great talk.
Can you say something about the bar construction mentioned in the abstract?

view this post on Zulip Joshua Meyers (Jun 06 2020 at 15:58):

Do these ideas have anything to say about deciding between two investments p, q with nonzero probability of both p<=q and p>q, on the basis of expected return and risk?

view this post on Zulip Peter Arndt (Jun 06 2020 at 15:59):

Yes, great talk!
Is there a formal formulation for this process of "moving the mass"? Maybe pullback along projetion and then pushforward?

view this post on Zulip Tomáš Gonda (Jun 06 2020 at 16:02):

I am wondering what is the main reason for considering order isomorphisms rather than homomorphisms for the complete metric space case? That is, why do we exclude coarse-grainings (of the order information)? Is (one) reason that we would be mixing up the first-order and second-order dominance (since the latter is related to coarse-grainings)?

view this post on Zulip Tomáš Gonda (Jun 06 2020 at 16:09):

A more exotic question: Is there a chance to have a unified treatment (which is still sufficiently expressive) of both first-order and second-order dominance at some level of abstraction? There are parallels between them, but also some important differences.

view this post on Zulip Paolo Perrone (Jun 06 2020 at 17:18):

Joscha Diehl said:

Great talk.
Can you say something about the bar construction mentioned in the abstract?

@Joscha Diehl Thanks! Yep. The maps EE and PeP e which give the equivalent characterization to second-order stochastic dominance are exactly the source and target maps of 1-simplices in the bar construction. This turns out to be quite structural!
There's quite a bit to talk about - if you want I can expand. For now, two references are Chapter 4 of my thesis, and my MFPS talk here: https://youtu.be/28EASeG1RBA (paper here: https://arxiv.org/abs/1810.06037)

view this post on Zulip Paolo Perrone (Jun 06 2020 at 17:18):

By the way, here's the link to my thesis: http://paoloperrone.org/phdthesis.pdf

view this post on Zulip Paolo Perrone (Jun 06 2020 at 17:24):

Joshua Meyers said:

Do these ideas have anything to say about deciding between two investments p, q with nonzero probability of both p<=q and p>q, on the basis of expected return and risk?

If I understand you correctly, yes. So one can prove that if we are over a PP-algebras, for example real numbers, then if pp is lower than qq in the stochastic order, pp has lower expectation value than qq - this expectation map is even strictly monotone.
For what concerns risk, the second-order stochastic dominance is precisely the "riskiness". One can also use the two orders together, and that's sometimes also called second-order stochastic dominance. I can expand on this, if you want. In any case if you want a reference, this is worked out in my thesis, the link is above.

view this post on Zulip Paolo Perrone (Jun 06 2020 at 17:25):

Peter Arndt said:

Yes, great talk!
Is there a formal formulation for this process of "moving the mass"? Maybe pullback along projetion and then pushforward?

Yes, but in general it's a random transport map, not a deterministic one. In the sense that the mass over a certain point may be split, and sent to two different points.

view this post on Zulip Paolo Perrone (Jun 06 2020 at 17:27):

Tomáš Gonda said:

I am wondering what is the main reason for considering order isomorphisms rather than homomorphisms for the complete metric space case? That is, why do we exclude coarse-grainings (of the order information)? Is (one) reason that we would be mixing up the first-order and second-order dominance (since the latter is related to coarse-grainings)?

Oh, we are not using only order isomorphisms, but homomorphisms. We can coarse-grain all we want. Only the unit of the monad (the Dirac delta) happens to be an order-embedding, everything else is not. Sorry if that wasn't clear!

view this post on Zulip Paolo Perrone (Jun 06 2020 at 17:28):

Tomáš Gonda said:

A more exotic question: Is there a chance to have a unified treatment (which is still sufficiently expressive) of both first-order and second-order dominance at some level of abstraction? There are parallels between them, but also some important differences.

Yep, there is, unfortunately I didn't have time to talk about that. It even satisfies a very nice universal property, it's a "lax codescent object". A way to obtain this composite order is to take the coinserter of the two maps E,Te:PAAE, Te:PA\to A. I can expand on this, if you want.

view this post on Zulip Tomáš Gonda (Jun 06 2020 at 17:32):

Paolo Perrone said:

Tomáš Gonda said:

I am wondering what is the main reason for considering order isomorphisms rather than homomorphisms for the complete metric space case? That is, why do we exclude coarse-grainings (of the order information)? Is (one) reason that we would be mixing up the first-order and second-order dominance (since the latter is related to coarse-grainings)?

Oh, we are not using only order isomorphisms, but homomorphisms. We can coarse-grain all we want. Only the unit of the monad (the Dirac delta) happens to be an order-embedding, everything else is not. Sorry if that wasn't clear!

hm, ok, so what was the extra property of morphisms in COMet besides being order-preserving and 1-Lipschitz?

view this post on Zulip Tomáš Gonda (Jun 06 2020 at 17:35):

Paolo Perrone said:

Tomáš Gonda said:

A more exotic question: Is there a chance to have a unified treatment (which is still sufficiently expressive) of both first-order and second-order dominance at some level of abstraction? There are parallels between them, but also some important differences.

Yep, there is, unfortunately I didn't have time to talk about that. It even satisfies a very nice universal property, it's a "lax codescent object". A way to obtain this composite order is to take the coinserter of the two maps E,Te:PAAE, Te:PA\to A. I can expand on this, if you want.

That sounds intriguing (although probably slightly different to what I actually had in mind). Does it give you the "combined" order (i.e. the intersection of the two order relations)? I would love to know more about that, is it in some of your articles?

view this post on Zulip Paolo Perrone (Jun 06 2020 at 18:44):

Tomáš Gonda said:

Paolo Perrone said:

Tomáš Gonda said:

I am wondering what is the main reason for considering order isomorphisms rather than homomorphisms for the complete metric space case? That is, why do we exclude coarse-grainings (of the order information)? Is (one) reason that we would be mixing up the first-order and second-order dominance (since the latter is related to coarse-grainings)?

Oh, we are not using only order isomorphisms, but homomorphisms. We can coarse-grain all we want. Only the unit of the monad (the Dirac delta) happens to be an order-embedding, everything else is not. Sorry if that wasn't clear!

hm, ok, so what was the extra property of morphisms in COMet besides being order-preserving and 1-Lipschitz?

That's not on the morphisms, it's on the orders. It's definition 4.1.1 in this paper: https://arxiv.org/abs/1808.09898
It says that 1-Lipschitz monotone functions are "enough to tell the order", the order is never too "steep". Example 4.1.2 there gives a space which does not satisfy that property.

view this post on Zulip Paolo Perrone (Jun 06 2020 at 18:45):

By the way, we called those spaces "L-ordered" in honor of Lawvere - I should have said that since he was in the audience!

view this post on Zulip Paolo Perrone (Jun 06 2020 at 18:47):

Tomáš Gonda said:

Paolo Perrone said:

Tomáš Gonda said:

A more exotic question: Is there a chance to have a unified treatment (which is still sufficiently expressive) of both first-order and second-order dominance at some level of abstraction? There are parallels between them, but also some important differences.

Yep, there is, unfortunately I didn't have time to talk about that. It even satisfies a very nice universal property, it's a "lax codescent object". A way to obtain this composite order is to take the coinserter of the two maps E,Te:PAAE, Te:PA\to A. I can expand on this, if you want.

That sounds intriguing (although probably slightly different to what I actually had in mind). Does it give you the "combined" order (i.e. the intersection of the two order relations)? I would love to know more about that, is it in some of your articles?

Oh, I think I see what you meant in your original question. I'd love to find a uniform treatment - so far we weren.t able to find it except pretty much for what I said during the talk. I suspect one can do this in Markov categories with conditionals though.
If you want to know more about the combined order, see Section 4.3 of my thesis (link above). Hopefully it will become a paper soon.

view this post on Zulip Tomáš Gonda (Jun 06 2020 at 18:55):

Paolo Perrone said:

That's not on the morphisms, it's on the orders. It's definition 4.1.1 in this paper: https://arxiv.org/abs/1808.09898
It says that 1-Lipschitz monotone functions are "enough to tell the order", the order is never too "steep". Example 4.1.2 there gives a space which does not satisfy that property.

oh, that makes a lot of sense, thanks! I must have had a brief attention glitch while you were explaining it :sleeping:

view this post on Zulip Tomáš Gonda (Jun 06 2020 at 20:37):

Paolo Perrone said:

Oh, I think I see what you meant in your original question. I'd love to find a uniform treatment - so far we weren.t able to find it except pretty much for what I said during the talk. I suspect one can do this in Markov categories with conditionals though.
If you want to know more about the combined order, see Section 4.3 of my thesis (link above). Hopefully it will become a paper soon.

I guess one way to interpret what I'd like to have from my (biased) perspective is finding a class of resource theories that would have enough structure to allow one to prove classification theorems about their resource orderings and which would include, as special cases, the first-order and second-order stochastic dominance among other preorders.

view this post on Zulip Joshua Meyers (Jun 06 2020 at 22:19):

Paolo Perrone said:

Joshua Meyers said:

Do these ideas have anything to say about deciding between two investments p, q with nonzero probability of both p<=q and p>q, on the basis of expected return and risk?

If I understand you correctly, yes. So one can prove that if we are over a PP-algebras, for example real numbers, then if pp is lower than qq in the stochastic order, pp has lower expectation value than qq - this expectation map is even strictly monotone.
For what concerns risk, the second-order stochastic dominance is precisely the "riskiness". One can also use the two orders together, and that's sometimes also called second-order stochastic dominance. I can expand on this, if you want. In any case if you want a reference, this is worked out in my thesis, the link is above.

Yes but if the events p<=q and p>q both have nonzero probability, then neither has first-order stochastic dominance over the other, as you have defined it. And second-order stochastic dominance doesn't even apply, as they are not about "the same data".

Yes p<=q in the stochastic order implies that E(p)<=E(q), but not conversely.

Update: I should have said that the events p<q and q<p both have nonzero probability, sorry for the confusion.

view this post on Zulip Paolo Perrone (Jun 06 2020 at 22:59):

Joshua Meyers said:

Paolo Perrone said:

Joshua Meyers said:

Do these ideas have anything to say about deciding between two investments p, q with nonzero probability of both p<=q and p>q, on the basis of expected return and risk?

If I understand you correctly, yes. So one can prove that if we are over a PP-algebras, for example real numbers, then if pp is lower than qq in the stochastic order, pp has lower expectation value than qq - this expectation map is even strictly monotone.
For what concerns risk, the second-order stochastic dominance is precisely the "riskiness". One can also use the two orders together, and that's sometimes also called second-order stochastic dominance. I can expand on this, if you want. In any case if you want a reference, this is worked out in my thesis, the link is above.

Yes but if the events p<=q and p>q both have nonzero probability, then neither has first-order stochastic dominance over the other, as you have defined it. And second-order stochastic dominance doesn't even apply, as they are not about "the same data".

Yes p<=q in the stochastic order implies that E(p)<=E(q), but not conversely.

Update: I should have said that the events p<q and q<p both have nonzero probability, sorry for the confusion.

Uhm, I think I don't understand. There are (at least) 2 orders that one can construct, first and second order stochastic dominance, and their composite order as well. What is the question exactly, possibly in other words?

view this post on Zulip Tobias Fritz (Jun 06 2020 at 23:08):

Perhaps Joshua is talking about the pointwise order on random variables that Paolo spoke about at the beginning? Joshua, what do you mean by the probability of the event p<=q?

If pp and qq are just probability measures on an ordered space, then it makes sense to try and compare them with respect to first-order stochastic dominance, even if no joint distribution is given, meaning that even if we cannot talk about the probability of "the event pqp \ge q". So you should think of it like this: if someone offers you a bet with 50:50 odds and someone else offers you one with 80:20 odds against you, then you'd rather choose the 50:50 one, right? (Assuming that the stakes are the same.) That's because the 50:50 is higher up in first-order stochastic dominance. And to make this decision, there's no need to know how the two bets are correlated or whether they're independent! It's merely a relation involving pairs of distributions.

view this post on Zulip Paolo Perrone (Jun 07 2020 at 01:31):

Hi all! Here's the video.
https://youtu.be/auIuhRjMokQ

view this post on Zulip Paolo Perrone (Jun 07 2020 at 02:35):

By the way, somebody was asking whether the stochastic order can be equivalently obtained by comparing quantiles: the answer is, yes if X = R, while in the general case of a partial order one cannot define quantiles. (Thanks to Sharwin Rezagholi for pointing that out!)

view this post on Zulip Joshua Meyers (Jun 07 2020 at 11:07):

I am thinking that pp and qq are random variables, perhaps pp is the return on investing in bananas and qq is the return on investing in oranges. Suppose that the sets {ωΩq(ω)>p(ω)}\{\omega\in\Omega | q(\omega)>p(\omega)\} and {ωΩp(ω)>q(ω)} \{\omega\in\Omega | p(\omega)>q(\omega)\} both have nonzero measure. Then by your definition, we cannot say that either pp or qq has first-order stochastic dominance over the other. (I am recalling that you define pp to have stochastic dominance over qq iff μ({ωΩp(ω)q(ω)})=1\mu(\{\omega\in\Omega | p(\omega)\geq q(\omega)\})=1.) Is there a way to still compare them in this case? (Tobias's example of the two uncorrelated bets is an example of this.)

view this post on Zulip Christoph Thies (Jun 07 2020 at 11:15):

Thank you very much for the effort to explain clearly and comprehensibly, @Paolo Perrone . As a non-expert I deeply appreciate that, and I really enjoyed your talk, though much of it is way over my head. At least I feel your work is not completely out of reach for me, and that makes me very happy!

Please take the following as a gentle suggestion and not as criticism. How about occasionally replacing examples using goods and gains by examples using, e.g., biological traits and fitness (these happen to be the constructs I'm mostly interested in :see_no_evil:)? Traits can be anything from a cell's production rate of a protein, to the kind of care given by a parent to its offspring, and to the ploidy of the genetic system of the organism (multiplicity of chromosomes) or the pinkness of unicorns. Fitness usually represents some sort of success in reproduction or persistence that, crucially, does not necessarily occur at the expense of other entities (I am aware this may be possible also in economic contexts). IMHO this setting provides much nicer stories. I apologise if this suggestion is inappropriate.

view this post on Zulip Paolo Perrone (Jun 07 2020 at 11:34):

Joshua Meyers said:

I am thinking that pp and qq are random variables, perhaps pp is the return on investing in bananas and qq is the return on investing in oranges. Suppose that the sets {ωΩq(ω)>p(ω)}\{\omega\in\Omega | q(\omega)>p(\omega)\} and {ωΩp(ω)>q(ω)} \{\omega\in\Omega | p(\omega)>q(\omega)\} both have nonzero measure. Then by your definition, we cannot say that either pp or qq has first-order stochastic dominance over the other. (I am recalling that you define pp to have stochastic dominance over qq iff μ({ωΩp(ω)q(ω)})=1\mu(\{\omega\in\Omega | p(\omega)\geq q(\omega)\})=1.) Is there a way to still compare them in this case? (Tobias's example of the two uncorrelated bets is an example of this.)

I see. If I understand correctly, if pp and qq come from random variables, and their joint assigns nonzero probability both to xyx \le y and yxy\le x, then they are not comparable as random variables. However, it could still be that there exists some other joint entirely supported on the order relation - in that case pp and qq are comparable as random variables (in the stochastic order). It turns out that in many cases (such as over the real line) the stochastic order is a partial order, so if there is a joint assigning probability 1 to xyx\le y then there is no joint assigning probability 1 to yxy\le x. (A reference for the latter, for example, is this paper of Tobias, https://arxiv.org/abs/1810.06771.)

view this post on Zulip Paolo Perrone (Jun 07 2020 at 11:36):

Christoph Thies said:

Thank you very much for the effort to explain clearly and comprehensibly, Paolo Perrone . As a non-expert I deeply appreciate that, and I really enjoyed your talk, though much of it is way over my head. At least I feel your work is not completely out of reach for me, and that makes me very happy!

Please take the following as a gentle suggestion and not as criticism. How about occasionally replacing examples using goods and gains by examples using, e.g., biological traits and fitness (these happen to be the constructs I'm mostly interested in :see_no_evil:)? Traits can be anything from a cell's production rate of a protein, to the kind of care given by a parent to its offspring, and to the ploidy of the genetic system of the organism (multiplicity of chromosomes) or the pinkness of unicorns. Fitness usually represents some sort of success in reproduction or persistence that, crucially, does not necessarily occur at the expense of other entities (I am aware this may be possible also in economic contexts). IMHO this setting provides much nicer stories. I apologise if this suggestion is inappropriate.

Thank you for the appreciation!
It would be nice to use examples from biology. Unfortunately I don't know enough about it though. Do you have a particular example in mind, that you'd like to explain to me?

view this post on Zulip Paolo Perrone (Jun 07 2020 at 11:38):

By the way, some people asked which program I used as virtual blackboard. It's called Xournal, http://xournal.sourceforge.net/.

view this post on Zulip Dario Stein (Jun 07 2020 at 12:44):

@Paolo Perrone You showed that conditional expectations on X can be phrased as partial evaluation/double distributions in the case where X is convex. Do you know how this could connect to other synthetic notions of conditioning (assuming our Markov category comes from a monad)? What if X is not convex, e.g. can we somehow express conditioning on boolean outputs?

view this post on Zulip Christoph Thies (Jun 07 2020 at 13:01):

Paolo Perrone said:

Christoph Thies said:

Thank you very much for the effort to explain clearly and comprehensibly, Paolo Perrone . As a non-expert I deeply appreciate that, and I really enjoyed your talk, though much of it is way over my head. At least I feel your work is not completely out of reach for me, and that makes me very happy!

Please take the following as a gentle suggestion and not as criticism. How about occasionally replacing examples using goods and gains by examples using, e.g., biological traits and fitness (these happen to be the constructs I'm mostly interested in :see_no_evil:)? Traits can be anything from a cell's production rate of a protein, to the kind of care given by a parent to its offspring, and to the ploidy of the genetic system of the organism (multiplicity of chromosomes) or the pinkness of unicorns. Fitness usually represents some sort of success in reproduction or persistence that, crucially, does not necessarily occur at the expense of other entities (I am aware this may be possible also in economic contexts). IMHO this setting provides much nicer stories. I apologise if this suggestion is inappropriate.

Thank you for the appreciation!
It would be nice to use examples from biology. Unfortunately I don't know enough about it though. Do you have a particular example in mind, that you'd like to explain to me?

Hm, I don't have a specific example. In fact, wrt your talk I think your examples were perfectly appropriate. It just occurred to me this might be a useful suggestion since biology instantiates these abstract matters in ways quite different from economy.

Personally, I am interested for example in biological scenarios in which taking the codomain of random variables that represent, say, traits to be the real numbers is not adequate so that classical statistics becomes sort of dodgy as a language for reasoning about them (and I am here in order to understand better what I mean by this and other things I don't quite understand).

view this post on Zulip Paolo Perrone (Jun 07 2020 at 14:06):

Dario Stein said:

Paolo Perrone You showed that conditional expectations on X can be phrased as partial evaluation/double distributions in the case where X is convex. Do you know how this could connect to other synthetic notions of conditioning (assuming our Markov category comes from a monad)? What if X is not convex, e.g. can we somehow express conditioning on boolean outputs?

That's a very good question! We are currently working on it actually (Tobias, Eigil, Tomáš and I), we suspect that you can.
For non-convex things, when one conditions, one is usually doing one of these two things: either conditioning a function on XX into a convex set (say, US states to political preferences), or, working on the free convex space PXPX.

view this post on Zulip Manuel Bärenz (Jun 07 2020 at 14:15):

@Paolo Perrone I don't understand your example in the beginning. Why is the white curve better than the yellow?

view this post on Zulip Paolo Perrone (Jun 07 2020 at 14:21):

(Following up from in-person conversation: the intuition is that the white curve is both "higher up", so that you are likely to gain more, and "more peaked", so you have a better idea of how much exactly you will gain - which can be useful for planning a strategy.)

view this post on Zulip Dario Stein (Jun 07 2020 at 17:23):

Paolo Perrone said:

Dario Stein said:

Paolo Perrone You showed that conditional expectations on X can be phrased as partial evaluation/double distributions in the case where X is convex. Do you know how this could connect to other synthetic notions of conditioning (assuming our Markov category comes from a monad)? What if X is not convex, e.g. can we somehow express conditioning on boolean outputs?

That's a very good question! We are currently working on it actually (Tobias, Eigil, Tomáš and I), we suspect that you can.
For non-convex things, when one conditions, one is usually doing one of these two things: either conditioning a function on XX into a convex set (say, US states to political preferences), or, working on the free convex space PXPX.

Fantastic, very interested by that. Sam's and my Beta-Bernoulli monad encodes essentially the conjugate prior relationship of the two distributions and nothing more; I'd be keen to work out if we can formalize conditioning on the Bernoulli flips in a synthetic way (the main difficulty so far was the central position of if/coproducts in the language, which Markov categories still don't reflect well).

view this post on Zulip Arthur Parzygnat (Jun 07 2020 at 19:46):

Dario Stein said:

Fantastic, very interested by that. Sam's and my Beta-Bernoulli monad encodes essentially the conjugate prior relationship of the two distributions and nothing more; I'd be keen to work out if we can formalize conditioning on the Bernoulli flips in a synthetic way (the main difficulty so far was the central position of if/coproducts in the language, which Markov categories still don't reflect well).

I'd like to understand what you mean in a little more detail, if you could elaborate or point me to a reference. For example, what exactly is the Beta-Bernoulli monad? Perhaps this paper of yours? And is what you're talking about related to Jacobs' work on conjugate priors [Ref: https://arxiv.org/abs/1707.00269]? From my understanding of his paper, he had a nice description of conjugate priors in the Markov category language.

view this post on Zulip Joscha Diehl (Jun 08 2020 at 07:16):

Paolo Perrone said:

Joscha Diehl said:

Great talk.
Can you say something about the bar construction mentioned in the abstract?

Joscha Diehl Thanks! Yep. The maps EE and PeP e which give the equivalent characterization to second-order stochastic dominance are exactly the source and target maps of 1-simplices in the bar construction. This turns out to be quite structural!
There's quite a bit to talk about - if you want I can expand. For now, two references are Chapter 4 of my thesis, and my MFPS talk here: https://youtu.be/28EASeG1RBA (paper here: https://arxiv.org/abs/1810.06037)

Thanks! The paper should be a good entry point for me to get what's going on. I was thinking of the bar construction in algebra, that builds a coalgebra out of an algebra. I guess the categorial bar construction is an abstraction of that?
More conceptually: is the hope in understanding 'your' bar construction, to get to third-order dominance, etc?

view this post on Zulip Paolo Perrone (Jun 08 2020 at 11:38):

Joscha Diehl said:

Paolo Perrone said:

Joscha Diehl said:

Great talk.
Can you say something about the bar construction mentioned in the abstract?

Joscha Diehl Thanks! Yep. The maps EE and PeP e which give the equivalent characterization to second-order stochastic dominance are exactly the source and target maps of 1-simplices in the bar construction. This turns out to be quite structural!
There's quite a bit to talk about - if you want I can expand. For now, two references are Chapter 4 of my thesis, and my MFPS talk here: https://youtu.be/28EASeG1RBA (paper here: https://arxiv.org/abs/1810.06037)

Thanks! The paper should be a good entry point for me to get what's going on. I was thinking of the bar construction in algebra, that builds a coalgebra out of an algebra. I guess the categorial bar construction is an abstraction of that?
More conceptually: is the hope in understanding 'your' bar construction, to get to third-order dominance, etc?

@Joscha Diehl I don't know if we get a coalgebra in this general case, it could be interesting! Anybody knows how this works?
In any case, here's roughly what happens. In the category of sets, the functor given by multiplying with a group (or monoid) XG×XX\mapsto G\times X has a natural monad structure, with unit and multiplication given by those of GG. It's called the action monad, or writer monad in computer science, https://ncatlab.org/nlab/show/action+monad
Algebra over this monad are GG-sets, i.e. sets with a GG-action. The very same holds in the category of Abelian groups if you take a ring RR and the functor given by tensoring with the ring. Algebras in this case are RR-modules.
Now it turns out that in order to define the bar construction, one does not need a group or a ring, the monad structure is enough. Outside the category of Abelian groups one gets a simplicial object, which in the Abelian case is "the same" as a chain complex by the Dold-Kan correspondence.
If you do that for probability monads, this simplicial object is such that 0-simplices are probability measures over a convex space (such as R), and 1-simplices are (basically) conditional expectations. Two measures are connected by an "edge" if one can be obtained by "concentrating" the other one (as we did in the US states example).

The question about third-order stochastic dominance is very interesting, I've always wondered how to obtain it in this framework, but I never figured it out. Do you have an idea in mind?

view this post on Zulip Dario Stein (Jun 08 2020 at 12:18):

Arthur Parzygnat said:

Dario Stein said:

Fantastic, very interested by that. Sam's and my Beta-Bernoulli monad encodes essentially the conjugate prior relationship of the two distributions and nothing more; I'd be keen to work out if we can formalize conditioning on the Bernoulli flips in a synthetic way (the main difficulty so far was the central position of if/coproducts in the language, which Markov categories still don't reflect well).

I'd like to understand what you mean in a little more detail, if you could elaborate or point me to a reference. For example, what exactly is the Beta-Bernoulli monad? Perhaps this paper of yours? And is what you're talking about related to Jacobs' work on conjugate priors [Ref: https://arxiv.org/abs/1707.00269]? From my understanding of his paper, he had a nice description of conjugate priors in the Markov category language.

@Arthur Parzygnat yes, I meant that paper -- we didn't frame things very categorically there, but this amounts to giving a commutative&affine monad on the functor category [Fin,Set] (Sam mentioned this approach in his talk). Its Kleisli category is thus a minimalistic combinatorial Markov category that encodes beta+bernoulli+their conjugate relationship. Thanks for the reference to Bart's paper, I'll have a look at it.

view this post on Zulip Joscha Diehl (Jun 08 2020 at 14:38):

@Paolo Perrone Thanks! Regarding third-order: no, I don't even know what third order dominance would mean. I'd hope it would pop out of the categorial view you have ..

view this post on Zulip Paolo Perrone (Jun 08 2020 at 14:40):

Joscha Diehl said:

Paolo Perrone Thanks! Regarding third-order: no, I don't even know what third order dominance would mean. I'd hope it would pop out of the categorial view you have ..

My intuition would suggest, "the measure is more concentrated on lower values and more spread on higher values". But that's as much as I can say for now, nothing precise. (Imagine, on R, assigning a larger integral to functions such as f(x)=x^3.)

view this post on Zulip Joscha Diehl (Jun 08 2020 at 14:42):

Paolo Perrone said:

Joscha Diehl said:

Paolo Perrone Thanks! Regarding third-order: no, I don't even know what third order dominance would mean. I'd hope it would pop out of the categorial view you have ..

My intuition would suggest, "the measure is more concentrated on lower values and more spread on higher values". But that's as much as I can say for now, nothing precise. (Imagine, on R, assigning a larger integral to functions such as f(x)=x^3.)

Is there something like this already in the literature? Also, you might have mentioned it, but it his obvious how first order and second order dominance are part of a 'ladder' of dominances?

view this post on Zulip Paolo Perrone (Jun 08 2020 at 14:43):

Joscha Diehl said:

Paolo Perrone said:

Joscha Diehl said:

Paolo Perrone Thanks! Regarding third-order: no, I don't even know what third order dominance would mean. I'd hope it would pop out of the categorial view you have ..

My intuition would suggest, "the measure is more concentrated on lower values and more spread on higher values". But that's as much as I can say for now, nothing precise. (Imagine, on R, assigning a larger integral to functions such as f(x)=x^3.)

Is there something like this already in the literature? Also, you might have mentioned it, but it his obvious how first order and second order dominance are part of a 'ladder' of dominances?

@Joscha Diehl Yep, but I don't understand them very well: https://en.wikipedia.org/wiki/Stochastic_dominance#Third-order