You're reading the public-facing archive of the Category Theory Zulip server.
To join the server you need an invite. Anybody can get an invite by contacting Matteo Capucci at name dot surname at gmail dot com.
For all things related to this archive refer to the same person.
We've been having a cool conversation on twitter (https://twitter.com/guillaume_bso/status/1397524819252584457) with @Cole Comfort @Jules Hedges and @Robin Piedeleu, let's continue here!
@cole_comfort @rwolffoot @mattecapu An interesting fact is that we can make transistors out of diodes and the aforementioned measurement-controlled sources, which are a linear element: http://ecee.colorado.edu/~bart/ecen3320/newbook/chapter5/ch5_3.htm#5_3_3 So adding diodes seems enough to get all of electronics
- Guillaume Boisseau (@guillaume_bso)We're asking what happens if we add a gadget to GAA that allows us to model diodes. So far it appears this is equivalent to extending the theory of polyhedra (https://arxiv.org/abs/2105.10946) with a gadget that gives the union of x=0 and y=0
We conjecture this gives us something like piecewise-linear subspaces. So far I've convinced myself that we can build arbitrary R->R piecewise linear functions (https://twitter.com/guillaume_bso/status/1397661569937518593) and the boundary of any polyhedron (https://twitter.com/guillaume_bso/status/1397704652087496709)
@rwolffoot @_julesh_ @cole_comfort @mattecapu Got it! From that I think we can build many piecewise linear stuff if we have the affine fragment. Not sire what we can do without it https://twitter.com/guillaume_bso/status/1397661569937518593/photo/1
- Guillaume Boisseau (@guillaume_bso)@rwolffoot @_julesh_ @cole_comfort @mattecapu Hahaha yeeee I can do it! We need to replace all the ≥s at the same time. Behold (the sideways triangle denotes ≥) https://twitter.com/guillaume_bso/status/1397704652087496709/photo/1
- Guillaume Boisseau (@guillaume_bso)@Robin Piedeleu has a conjecture for how to model those subspaces https://twitter.com/rwolffoot/status/1397870805275582466
Robin's twitter is private btw
oh right I forgot
well, his idea is based on this https://www.emis.de/journals/BAG/vol.43/no.1/b43h1ovc.pdf
I've found some resources on piecewise linear manifolds but it's all a mess of topology and simplical complexes
Cole Comfort said:
Robin's twitter is private btw
I've opened it now if that makes it easier for everyone to follow what we wrote.
(I may close it again in a while)
Beyond arbitrary PL functions , would we also need to express max/min: , with a view to exploit the representation of PL functions given above?
They certainly both look like PL relations, at least intuitively (as long as we have not pinned down the precise meaning of PL relation).
But then, I'm a bit worried about finding a rewriting procedure to normal form for completeness, because I remember things getting quickly undecidable with + and max together.
I think it's even worse: piecewise linear subspace means essentially the triangulation of an arbitrary manifold, right?
We can get things with holes
Which feels a lot stronger than min/max and a lot less normalizable
Robin Piedeleu said:
They certainly both look like PL relations, at least intuitively (as long as we have not pinned down the precise meaning of PL relation).
There is a definition of piecewise linear functions, and there is an obvious way to compose them. Then there is a category, so I guess we can check if this category is regular and then maybe this will be the right notion of piecewise linear relations? Linear relations arise in this way, and affine relations are just the full subcategory of nonempty such things...
Oh, cool!
I'm not following that last bit: affine relations can be empty
If you consider an affine subspace as possibly being empty, then affine relations can be characterized in terms of the full subcategory of relations of finite dimensional affine vector spaces where the objects are nonempty.
I see
Because essentially you are trying to take relations of affine matrices, but this doesn't exist because you can get the zero scalar which stops you from taking pullbacks. But if you allow affine spaces to be empty, zero can live as a subobject
do you have an analogue thing for cones and polyhedra?
to be more precise: cones arise when we add the comparison gadget to GLA. We get "conic relations" which are convex subspaces of R^n closed under positive addition
I have to think about this a bit more. But I think most of these relational calculi are just projections of categories of relations (which are multisorted props), down to props by considering a full subcategory thereof.
Is there a good notion of conic function that would give that?
why multisorted?
Like in the case of affine relations, it is multisorted, because there are empty objects as well as free ones. So you have forget the empty ones at the level of objects by taking a full subcategory.
For additive relations, there is a section of my thesis that shows it's a subcategory of a category of relations.
There must be a way of showing the same for conic relations.
Yes, I think it is a recurring theme
well, it's a subcategory of Rel. I assume you mean something stronger?
Something like a full subcategory of relations of additive (affine) monoids where the objects are freely generated?
Sorry, yes. I mean a subcategory of the category of relations (obtained via the standard construction) for the structured functions you started with.
For additive relations, the starting point is some subcategory of the category of commutative monoid homomorphisms.
hm interesting, so maybe cone homomorphisms?
My hope is that thinking of things this way will allow one to give a modular presentation of such categories. If certain conditions hold then there is such a modular presentation; however, it isn't always going to be nice enough to care about.
Guillaume Boisseau said:
hm interesting, so maybe cone homomorphisms?
If I recall correctly, in both the affine and the additive cases, the props of relations arise as a full subcategory of some larger category of relations, so I think it would be something similar
well, if that gives us something useful would be cool. But I personally would be fine with a crudely hand-constructed category too
Nothing wrong with an old-fashioned hand-crafted category! :)
I think it would be nice to know merely because then we will know what we are actually working with. In the case of affine relations, we are really working with an extra object which we just can't access directly, and this manifests itself in terms of the zero scalar. Maybe in more exotic cases this would reveal something enlightening
Cole Comfort said:
If I recall correctly, in both the affine and the additive cases, the props of relations arise as a full subcategory of some larger category of relations, so I think it would be something similar
Yes, you're right, I think it was indeed a full subcategory for the additive case.
Coming back to my initial worry about how expressive these things are, it would also mean that we can represent the ReLU function , commonly used in machine learning as an activation function for neural networks. Does this mean we can approximate arbitrary continuous functions by just throwing in this diode-like element?
I guess piecewise linear means we can have as many linear pieces as we want...It's a bit scary but does not necessarily preclude a normal form argument.
Robin Piedeleu said:
Coming back to my initial worry about how expressive these things are, it would also mean that we can represent the ReLU function , commonly used in machine learning as an activation function for neural networks. Does this mean we can approximate arbitrary functions by just throwing in this diode-like element?
As in, do you just get all of relations instead of some extension of linear relations
Pretty sure we can get a triangulation of an arbitrary manifold, which includes approximating any function
Also we can express transistors so arbitrary logical circuits
I was not even talking about relations, just continuous functions.
Which would imply the case of relations, I guess
also just continuous functions yeah
we can definitely approximate any continuous R->R function since we have all piecewise linear R->R fns
as in, I know how to draw an explicit diagram for an arbitrary piecewise linear R->R fn
also I showed how to get the absolute value function. From that we can get max(x,y)=(abs(x-y)+x+y)/2
what's a piecewise-linear subspace?
that's one of the things we're trying to figure out ^^
we want to generalize piecewise-linear functions to relations, and the result is probably piecewise-linear submanifolds of R^n
what those looks like is manifolds made from straight lines/polyhedra
probably
Do you actually want to restrict to manifolds? What about the following subset of the plane: three line segments with a common endpoint, like the letter Y.
is that not a manifold? cause you're right, I want that too
(I know very little about manifolds)
no, the joining point is not locally Euclidean.
You may want "stratified spaces"
Manifold means it should locally look like R^d (where d is the dimension of the manifold).
And I'm guessing you are only interested in closed subsets (because the graph of a function is closed exactly when the function is continuous)?
yep
To give more of a sense of what manifolds are like you can list all of them in dimensions 0, 1, and 2:
0: any discrete space, countably infinite at most
1: R, a circle, countable disjoint unions of those
2: R^2, sphere, torus, 2-hole torus, ..., n-hole torus, Klein bottle, projective plane, connected sums of those, countable disjoint unions of those
Did I get everything? Is there some redundancy in listing Klein bottle, projective plane, and connected sums?
also btw we want finitely-generated things, which should be easier than the general notion
So it sounds like you want to consider semilinear (= can be described by a formula of first-order logic involving , scalar multiplication, constants, and ) sets that are closed, which I think are the same thing as finite unions of what you call polyhedra (but beware that term has various slightly different meanings in related contexts).
yeah, we might just want "a union of some polyhedra that fit together in a nice way"
At least, when a closed semilinear set is bounded then I know it's a finite union of closed simplices; I'm less used to thinking about the unbounded case.
if you want to think in terms of formulas it's easier: the actual problem we have is what we get if we add a certain gadget to the graphical language of Graphical Affine Algebra. In terms of equations we have constants,+,multiplication by a constant,existential quantification,>= and the gadget which offers a limited form of alternation, namely "x=0 or y=0"
There actually aren't any non-nice ways for polyhedra to fit together, which is not obvious. For example if you take the union of the underlying sets of two simplicial complexes in R^n, then it is also the underlying set of a simplicial complex in R^n, but you might need to add more vertices and subdivide some of the original simplices, if they overlap awkwardly.
hm, that makes intuitive sense
nice!
our setting does seem to force connectedness though
Right, it is intuitive but not so easy to prove, and it would definitely not be true for the union of two submanifolds of R^n, say (because they could be wiggly near their intersection).
That's really cool, because "finite union of polyhedra" sounds like a simple enough notion
So let me see if I understand the problem. We have a bunch of basic relations on various , which you listed above. First, we want to understand all the relations that can be built from them by composition (which I guess correspond to the formulas that can be built from them using regular logic). Then we want to give a generators-and-relations presentation of the resulting category of relations. Is that a reasonable summary?
Can you build "x = 0 or y = 0 or z = 0"?
If so I think you can build any finite union of polyhedra, because for a polyhedron you can build a function which is zero precisely on .
Reid Barton said:
So let me see if I understand the problem. We have a bunch of basic relations on various , which you listed above. First, we want to understand all the relations that can be built from them by composition (which I guess correspond to the formulas that can be built from them using regular logic). Then we want to give a generators-and-relations presentation of the resulting category of relations. Is that a reasonable summary?
Yes, exactly!
Reid Barton said:
Can you build "x = 0 or y = 0 or z = 0"?
Yes: if R(x,y) is "x=0 or y=0", then "x = 0 or y = 0 or z = 0" iff "exists a,b, R(x,a) and R(y,b) and z=a+b"
Reid Barton said:
If so I think you can build any finite union of polyhedra, because for a polyhedron you can build a function which is zero precisely on .
Ohhh that's perfect! Indeed we can use one of the normal forms from this paper https://arxiv.org/abs/2105.10946 to get a total relation that gives a polyhedron when the output is set to zero :o
How do we depict finite unions of polyhedra in this syntax, @Guillaume Boisseau ? Is it just some white-black convolution operation (potentially guarded by some of the choice operation)?
gimme a sec
hm, can't send a pic from my phone
Reid Barton said:
If so I think you can build any finite union of polyhedra, because for a polyhedron you can build a function which is zero precisely on .
What if their intersection is empty? Isn't one invariant of our setting that all these subsets are connected?
turns out I was wrong about connectedness
if I could just figure out how to send a pic ><
as an example of non-connected thing, take for P1 and P2 the functions x-1 and x-2. The result is the set {1,2}
I see! Very cool
In the -linear (additive) case there was already a similar behaviour with the sum; affine constant diagram, that implemented a limited form of choice. This here seems to be arbitrary finite unions.
Also this diagram looks like a potential normal form :D
Absolutely!
I'm very excited :D
So we have to find equations that allow us to always place the choice generator in this position. Let's see:
It has the inequalities of abelian bicategories
Damn, I wish I had enough time to keep up with this..... and that I remembered how graphical affine algebra works :D
Yes! It's the interaction with addition that is more interesting.
It should be the colorswap of its interaction with copy
Yes, was going to write this! And it's self transpose
Indeed
And self-colorswap
It is invariant under colour swapping, so this is dual
I can't find something that would pin it down though, so far this seems generic
co-zero ; zero choice discard ; co-discard
somehow, it's precisely between those two in some sense
but that's maybe a consequence of something else
anything is between those two
right, i'm an idiot
:)
so far that's all just axioms of additive relations aka abelian bicats, except the part where we plug a zero or discard into it
ah, also if we plug a nonzero scalar into it, the scalar disappears
Wait, is this the same type of additive relations as in Robin's thesis?
I think Guillaume means polyhedral
I can check if you have a link
we're assuming white Frobenius here (abelian bicat.) so this is different
what I actually mean is frobenius theories and the colorswap of that
Is an abelian bicategory just an ab-encriched bicategory?
I've been wondering that
Do you have a reference for this notion, because I am a bit confused
I'm looking but got nothing, I only heard @Pawel Sobocinski use that term
I think it's at the end of Carboni and Walters original paper
ie: https://core.ac.uk/download/pdf/82513341.pdf
Ah I see, I remember this, just not the name for it
I can't follow most of that paper x) hope it answers the question
I thought it was something much more complicated, haha
afk to see a friend, I hope there's still something left for me to figure out when I come back ^^
I need to take a few days to read this paper on polyhedral relations. Looks very exciting
It's really cool!
Ok, I've got a property:
copy; (id choice) ; co-copy = co-zero ; zero
(and the color-swap)
does that follow from something else?
(am I being stupid again?)
Nice, not stupid
Actually the polyhedra paper has a very interesting axiom: totality of the order is an "or" kind of thing that's hard to express diagrammatically. The axiom that does that job is the spider axiom, which is frankly unexpected
I'm expecting we get similarly a very non-obvious axiom that captures something about this making choices
(I plan to tweet about the spider axiom in the coming days also)
By spider, do you mean the frobenius law?
Oh I see, it is this kind of bialgebra law.
"a spider with four legs"
what did you do to that poor spider's other four legs
It's alright, this generalizes to an arbitrary number of legs ^^
A lot of properties of orders can be nicely captured in the category of posets and monotone relations (relations such that , and , imply ). This is a bicategory of relations and its discrete objects (those for which copy, discard etc. are a Frobenius algebra) are those posets for which the order relation is equality. But there's much more: if the poset is a lattice, intersection is adjoint (in the 2-categorical sense) to copy. In fact you also get a dual operation and all these adjunctions:
image.png
And if the lattice is a Boolean algebra, then the white structure is a Frobenius algebra (but black is not)! So being a Boolean algebra is dual to being a set with equality in this sense.
There's a lot of interesting things that happen when moving to this category!
Oh: choice;choice=choice
Robin: oh, very interesting
x=-y is not very monotone tho
The idempotency of choice is definitely really important. As a partial function, it is a restriction idempotent, which is of particular interest to me.
Guillaume Boisseau said:
Actually the polyhedra paper has a very interesting axiom: totality of the order is an "or" kind of thing that's hard to express diagrammatically. The axiom that does that job is the spider axiom, which is frankly unexpected
Oh that's interesting. So I noticed at one point that if is a strict total order, then being a "nonempty dense linear order without endpoints" is equivalent to saying that for any and (we allow and/or ) then
which is typically the exact thing you need in quantifier elimination--here eliminating the quantifier . (Model theorists seem to prefer over for some reason, and also tend to implicitly assume their models are nonempty.)
But in fact, maybe you're saying that we could start with even less than the assumption that is a strict total order. For example, now I see that transitivity also follows from this condition (take ). But how do you prove the order is total?
Guillaume Boisseau said:
x=-y is not very monotone tho
Yes, it was just a side comment about characterising properties of orders in a diagrammatic context :)
One thing that I expect to hold, which I have not managed to calculate, is that it should commute with cocopying and copying. Is this an identity already?
Do you mean choice? It should not hold because it's not a function
Or, more precisely, it only holds laxely, like in any Cartesian bicategory
No I mean, not that it is copied, but that it commutes with the multiplication like copy; (1 x choice) = choice;copy
Hmmm wait, maybe I am wrong. It choice a subset of the identity?
No
scratch that, then
Cole Comfort said:
My hope is that thinking of things this way will allow one to give a modular presentation of such categories. If certain conditions hold then there is such a modular presentation; however, it isn't always going to be nice enough to care about.
What's a "modular presentation" of a category?
John Baez said:
Cole Comfort said:
My hope is that thinking of things this way will allow one to give a modular presentation of such categories. If certain conditions hold then there is such a modular presentation; however, it isn't always going to be nice enough to care about.
What's a "modular presentation" of a category?
Like in terms of distributive laws of props (seen as monads in Mon-Prof) as well as pushouts when necessary
Like in the style of composing props of Lack
Okay. "Modular" means so many things in math, from modular lattices to modular tensor categories.
Here is one more identity which may have already been written down.
copy;(choice choice);add = choice
John Baez said:
Okay. "Modular" means so many things in math, from modular lattices to modular tensor categories.
Maybe I should have said "compositionally." Like in the way that linear relations can be presented in terms of a distributive law.
Also if you conjugate copy with choice, then this relates all the legs to each other.
@Reid Barton ooh I'd like to understand this better. Transitivity does indeed follow. I can prove that the order is total but only by appealing to completeness of the axioms or in other words by using the normal form. I'd love to find a more direct way
Cole Comfort said:
Here is one more identity which may have already been written down.
copy;(choice choice);add = choice
that's also part of the cartesian bicategory structure
Cole Comfort said:
Also if you conjugate copy with choice, then this relates all the legs to each other.
what do you mean by this?
Guillaume Boisseau said:
Cole Comfort said:
Also if you conjugate copy with choice, then this relates all the legs to each other.
what do you mean by this?
choice; copy; (choice choice) = d;(d d )
where d is the discard.
ah nice
also oops, it's not idempotent:
Wow really, it just looks so much like it would be idempotent!
We can generalize this to any black spider surrounded by choices
I know!
Oh yes, I suppose this is the binary case of the thing that I just stated
Cole Comfort said:
Wow really, it just looks so much like it would be idempotent!
But being able to make independent choices breaks the correlation
hm, it appears we can't colorswap naively actually:
wait, shouldn't composing choice with a counit just always produce the discard instead of colour swapping
because the union of discard and cozero is discard
yeah indeed, that's what I just stated
Oh yes it is, haha. Maybe I have to re-check my other calculations, now haha
might have sth to do with the fact that colorswap also flips the 2-cells i.e. the inclusion order
shit we don't have abelian bicat axioms either, namely choice doesn't slide laxly along the white spider
which to be fair was rather surprising
in my head choice was the superposition of discard;cozero and zero;codiscard, so any property shared by those two things would hold for choice. But if I calculate that doesn't appear to be true
I think this means that the colour swap doesn't extend to a functor in this fragment, at least in the way I am thinking.
yeah
maybe if it mapped choice to something else?
Is there a generator which takes units to zero?
The transpose of which would be the obvious candidate if it exists...
Cole Comfort said:
Is there a generator which takes units to zero?
what do you mean by this?
Like that takes ,
where the action is just post-composition
But we don't want it to be something silly like
hm, good question
here's a cool one: ^
Guillaume Boisseau said:
"At least one leg must be zero"
from which I can deduce that if choice has a colorswap, it must be a subset of cozero;zero
from which I deduce that there is no extension of colorswap that includes choice, because cozero;zero doesn't respect all the colorswapped properties of choice :(
I am thinking of a potential way to prove completeness as opposed to going straight to a normal form.
First, prove completeness for the prop of matrices with this generator adjoined wrt to the interpretation into piecewise linear maps between f.d. vector spaces.
Second, show (hope) that this category is finitely complete, and show that the category of spans thereof arises in terms of a distributive law.
Third show (hope) it is regular and then take a quotient to get you from spans to relations.
I see, so go the same way as the Interacting Hopf Algebras approach
We also need the order generator, right? So we need not just matrices+choice.
that might work for just matrices plus this. But if we also want comparison I'm not sure it'll work then
yeah that
I'm still very curious what we'd get with IH+choice or matrices+choice
And the order and choice generators are both very non-function-like.
I feel like getting choice+affine would be very easy from choice+linear
Which leaves me with little hope that the category you get would be sufficiently well-behaved.
affine is generally easy yeah; comparison is hairier
Btw, have you looked at the interaction between the order and choice?
there's not much as far as I can tell
Yeah, couldn't see much; that's why I asked :)
yeah mixing the 2-cells with the union is confusing to me
2-cells are obviously like implication and union is like disjunction, but trying to compute these things is so hairy
oh, I understood that as choice+comparison, which is where I haven't found many interesting things
choice+2-cells gives the cartesian bicat things, but apart from that not much either
I mean, just trying to calculate things is harder in this case
Also there is one obvious identity I think we have missed. Choice eats up scalars
ah yeah there's that too
Actually choice+2-cells is the only one that seems to somewhat pin down what choice does, cause we have and the symmetric
Can we somehow express that choice is the minimal such thing?
Many axioms are compatible with choice=codiscard;discard. The main one that isn't is where self-connected choice gives zero, that Robin mentioned a while back
Yeah, intuitively those two things together seem to pin choice down: it must accept one or the other input being zero, and if both are equal they must be zero. Can we derive the other things from them?
@Guillaume Boisseau Perhaps totality of the order actually lies outside the scope of the completeness theorem? After all, as a logical formula, is not in the fragment of regular logic; and the graph of the function isn't a polyhedron.
I can prove something that looks close to totality, namely that for a relation expressible in the language, iff
so I can case-split on whether y is positive or negative, but instead of having a real "or" I sum the corresponding x's
Is this in the linear or the affine setting?
that's in the linear+comparison setting, i.e. cones
in the affine/polyhedra setting I think only the colorswap of this still holds, which is iff . In the linear/cones setting both hold.
I see. And I assume could be any number of variables, for example zero, in which case it says that any can be written as the sum of something nonnegative and something nonpositive.
absolutely
but that in itself doesn't give totality, right?
Right, for example if we give R^2 the componentwise partial order, then it's still true (since we can write each component that way)
only if we could ensure that one of the two is zero, which we can with the choice gadget! So maybe choice can express totality more directly
here's totality expressed using the choice operator ^
namely that every x can be written as the sum of a positive thing and a negative one, one of which must be zero
But there's a subtlety in that this only really expresses a disjunction " or " to the extent that the choice operator has the intended semantics that either the first argument is zero or the second argument is zero
indeed. That would be if we were adding comparison to an established setting that includes choice I guess
For example, I could give R^2 the componentwise choice relation, sending to "( or ) and ( or )"
nice example! yeah we'd need other axioms to pin down choice to what we want first
I think you cannot rule out this kind of componentwise construction as long as you have only axioms that can be expressed in regular logic... though I'm not very confident about this.
oh, that would rule out our entire endeavour :o EDIT: wait, maybe not
So, I think it's probably best to accept that totality might fail "externally" but it still holds "internally" in the sense that there is a function which satisfies all the expected relations.
It's just that, from the external point of view, might not equal either or .
By the way, I should mention these slides of Marra and Menni: https://math.unice.fr/tacl/assets/2019/contributed/s3/4/4-marra-menni.pdf
What they are doing is at least one or two steps removed from what you are doing, but there's clearly some relationship there.
I see. We can live with that
Is adding max directly as a generator instead of the choice generator equally expressive?
I think equally: using max we can build the absolute value function, and by rotating the absolute value we can recover the initial diode-like generator we had, from which we can build choice
Ok, this seems much more standard then. We should look for results that might rule out what we're trying to do. For example, is there a quantifier elimination result for max,+ (aka tropical) arithmetic?
what's the link between quantifier elimination and what we're trying to do?
I claim the following describe the same class of subsets of R^n:
(a) Sets definable (in full first-order logic with quantifiers) using , scalar multiplication, constants and , that are closed in R^n.
(b) Finite unions of polyhedra.
(c) Sets cut out by a formula of the form , where is a function built from and affine linear combinations.
The sets of the form (b) are closed under projection, because the projection of an individual polyhedron is one (I think?)
It's actually a bit peculiar to the piecewise linear setting that this works for closed subsets (as opposed to closed+bounded subsets). For example, suppose we also allow multiplication as an operation. Then the set has projection , which is not closed.
Guillaume Boisseau said:
what's the link between quantifier elimination and what we're trying to do?
Its not entirely clear to me either but it might provide clues. So would (un)decidability of the corresponding first-order theory. As I understand it, quantifier elimination is kind of the standard model-theoretic version of the sort of normal form argument we use to prove completeness for these diagrammatic theories (which are restricted to the regular logic fragment). For example, for polyhedra, i.e. the logic of the reals as an ordered abelian group, quantifier elimination is proved by Fourier-Motzkin elimination, which is also what you use for the diagrammatic proof. One way to see it is that it gives a procedure to go from conjunctions of equations with some existentially quantified variables (span form) and an equivalent formula without existential quantifiers (cospan form). This is a bit hand-wavy but maybe someone more knowledgeable could make the connection more precise.
@Reid Barton if I'm following right, (b) closed under projection means we have quantifier elimination? That's cool!
@Robin Piedeleu I see, thanks
ah, so we do know that the theory of polyhedra is closed under projection. So we can deduce that the theory of union of polyhedra is too
ah, that's not quite quantifier elimination: it only says that plugging a black dot into the diagram for some union of polyhedra still gives some union of polyhedra. We still need to prove (c), namely that any union of polyhedra can be described by a quantifier-free formula. Presumably we know that for a single polyhedron?
(I'm just untangling what you guys said because that stuff is new to me)
good news: the axioms we've mentioned so far are enough to pin down choice if we know we're interpreting into R^n. It's not too hard to see: using the scalar and self-loop axioms, if . And we also have as an axiom.
that doesn't mean they're enough to derive everything we want diagrammatically. In fact they don't look enough
"Closed under projection" is the same as existential quantifier elimination, since the effect of projection on a subset is to produce a new subset defined with an existential quantifier.
A famous fact about subsets of defined using inequalities ( and ), boolean connectives () and polynomial functions is that they are closed under projection. This is called the Tarski-Seidenberg theorem.
Also the complement of any such subset is again such a subset.
So, if you have any subset of defined using inequalities ( and ), polynomial functions, boolean connectives and quantifiers, you can eliminate all the quantifiers and define your set using just inequalities ( and ), polynomial functions and boolean connectives!
One spinoff is that if you compose two relations between 's defined using inequalities, polynomial functions and boolean connectives, you get another such relation.
I used this fact in my work with Blake Pollard on black-boxing reaction networks.
The result @Reid Barton is claiming should be good for proving some similar theorems.
ah you're right we only eliminate existential quantifiers that way. The good thing is, our graphical syntax only allows existential quantifiers anyways. I wonder if universal quantifiers would allow more subsets to be described
here's one approach to axiomatizing this mess: remember we defined SMITs to be like SMTs except axioms could be inclusions instead of equalities. We could define SMUTs similarly, where axioms can include the union operator. Then we don't even need to add choice, and I expect that natural axioms for how union works should be enough to prove completeness of the SMUT of polyhedra
Once we have that, we can still study the choice operator within normal (in)equational theories. We'd only have to define the union in terms of it and check that it respects the relevant axioms. The same way we can have equational theories that have a natural order arising from a Frobenius structure, I expect we'd get an inequational theory that has a natural union arising from a choice structure
Would that work?
Guillaume Boisseau said:
here's one approach to axiomatizing this mess: remember we defined SMITs to be like SMTs except axioms could be inclusions instead of equalities. We could define SMUTs similarly, where axioms can include the union operator. Then we don't even need to add choice, and I expect that natural axioms for how union works should be enough to prove completeness of the SMUT of polyhedra
Sounds reasonable---there's plenty of diagrammatic calculi out there that exploit semiring structure on the homsets and use some form of sum of diagrams.
Guillaume Boisseau said:
Once we have that, we can still study the choice operator within normal (in)equational theories. We'd only have to define the union in terms of it and check that it respects the relevant axioms. The same way we can have equational theories that have a natural order arising from a Frobenius structure, I expect we'd get an inequational theory that has a natural union arising from a choice structure
I'm not sure I understand what you mean here. You'd want to show some sort of equivalence between two categories, one defined as a SMUT, the other a standard SMT with choice? Wouldn't that be equivalent to axiomatising choice directly as an SMT?
Is this a bimonoidal category?
If so, you could use their proof sheets: https://arxiv.org/pdf/2010.13361.pdf
However, in the case when both tensor products are symmetric, this isn't proven to be complete
Robin Piedeleu said:
Guillaume Boisseau said:
I'm not sure I understand what you mean here. You'd want to show some sort of equivalence between two categories, one defined as a SMUT, the other a standard SMT with choice? Wouldn't that be equivalent to axiomatising choice directly as an SMT?
Yeah that's equivalent to axiomatizing choice as a SMT (probably SMIT actually), since that's what we want to do. But I'm hoping going through the SMUT would make completeness a lot nicer
Cole Comfort said:
Is this a bimonoidal category?
Really looks like it
assuming I have a well-behaved union operator, I can easily make unions of polyhedra. A naive normal form would be just to independently take the normal form of each polyhedron in the union. That doesn't feel enough. Is there some canonical way to split our subset into polyhedra? Simplical complexes maybe?
in my head there should be a somewhat unique triangulation with minimal number of simplices, is that remotely the case?
yeah no obviously not, the square has two non-obviously equal triangulations
Can't even show that [1,2]U[2,3]=[1,3] :exhausted:
ah, yeah I can: not hard to reduce it to the case of , which itself is just totality
This generalizes! Assume A and B are subspaces separated by a hyperplane . Then using totality I can case-split on which side of the hyperplane we are, and reduce to and . Now if we can reduce any union of polyhedra to a union where the polyhedra don't intersect too much, then that should give us completeness! So new normal form is: either a simple polyhedron, or a union of normal forms where A and B are separated by a hyperplane.
Yeah pretty sure we can do all that diagrammatically. The only axioms I used are those of the theory of polyhedra, obvious generic axioms about union, and one axiom that uses the union: totality
Guillaume Boisseau said:
This generalizes! Assume A and B are subspaces separated by a hyperplane . Then using totality I can case-split on which side of the hyperplane we are, and reduce to and . Now if we can reduce any union of polyhedra to a union where the polyhedra don't intersect too much, then that should give us completeness! So new normal form is: either a simple polyhedron, or a union of normal forms where A and B are separated by a hyperplane.
That's exciting! The intersection might be non-convex, but we can always express it as a finite union of convex regions by cutting it further with auxilary hyperplanes. Then I suppose starting from the union of A and B, we can always obtain a decomposition in which the component polyhedra intersect pariwise along a single hyperplane by cutting the intersection into smaller polyhedra using these auxilary hyperplanes. This is entirely geometric intuition for the moment, though.
So the thing to do is find the equation for these extra hyperplanes.
a convex polyhedron is defined by a bunch of hyperplane boundaries, and we can always choose one of those to cut along
they are easy to read out from the normal form too
What I'm saying is that to obtain your normal form for the union of two polyhedra, we night need more, to make sure we separate the pieces into polyhedra that intersect at most along a single hyperplane. Wasn't that what you were trying to say?
yes I agree. I was only pointing out that it's easy to find the equation for the hyperplanes
But not necessarily of these extra hyperplanes.
why not?
if two polyhedra intersect non-trivially, there is a face of say A that has some B on both sides. Cut B along the hyperplane corresponding to that face. Repeat
I see, that makes sense.
my geometric intuition says that works. I wouldn't know how to make this precise though
actually this still works if A and B are themselves unions of polyhedra. We can still choose a face of A to split B
so to normalize AUB, find a face of A that cuts B into B1 and B2, then take normalize(A U B1) U normalize(B2) as a normal form. Does this terminate?
It should? A face of A is given by a hyperplane and you have only finitely many hyperplanes that describe A and B. The same face cannot be used twice because in A U Bi the face of A you used does not cut each Bi further, right?
Yeah that sounds right :)
So if we're right this gives a complete SMUT for union of polyhedra. To get a complete SMT, we could just define a union operator using choice and assert as axioms that it behaves properly! So we also get a complete SMT! Only remains to pick prettier axioms
Guillaume Boisseau said:
So if we're right this gives a complete SMUT for union of polyhedra. To get a complete SMT, we could just define a union operator using choice and assert as axioms that it behaves properly! So we also get a complete SMT! Only remains to pick pretty axioms
I think that a complete "SMUT" could still be a ways away from yielding a complete "SMT."
Consider the case of the bimonoidal category of matrices over a ring. The bimonoidal theory can be described in terms of the embedding of the monoidal theory generated the bicommutative bialgebra which generates the additive fragment. However, a symmetric monoidal theory for the multiplicative tensor is much more complicated.
So the union operator we get from choice is the one that operates on the underlying polyhedral normal form (where you replace the co-zero with a sum of choices gadget)? We need a proper union operation that does not refer to this form, no?
Hmm...maybe that does not matter actually
What we want is an equivalence between the SMUT and the SMT (well the free things quotiented by these theories). The union operator is a way of describing one direction of this equivalence, SMUT SMT. The other direction is the decomposition of choice as a union of two lines. Is this the strategy?
@Cole Comfort indeed I don't think you can always get a complete SMT from a complete SMUT. But we're in a lucky case because we know how to reify union as an operation on diagrams
@Robin Piedeleu Yeah something like that
Robin Piedeleu said:
What we want is an equivalence between the SMUT and the SMT (well the free things quotiented by these theories). The union operator is a way of describing one direction of this equivalence, SMUT SMT. The other direction is the decomposition of choice as a union of two lines. Is this the strategy?
Ah, I see
A bit more precisely, let be the free (semi)lattice-enriched prop, quotiented by the SMUT of polyhedral relations and let be the free prop quotiented by the SMT for unions of polyhedra (that we still have to figure out in full). Let be the identity on all generators except choice, which is mapped to (co-zero; co-discard) U (discard ; zero). Let be the mapping induced by the union operator that Guillaume constructed here. We'll need the right axioms on choice to check that this functorial and monoidal.
Then is the identity, again by what Guillaume showed here which requires going through the normal form for polyhedral relations.
For , we just check it is the identity for all generators of (which is trivial except for choice)?
Guillaume Boisseau said:
here's one approach to axiomatizing this mess: remember we defined SMITs to be like SMTs except axioms could be inclusions instead of equalities. We could define SMUTs similarly, where axioms can include the union operator. Then we don't even need to add choice, and I expect that natural axioms for how union works should be enough to prove completeness of the SMUT of polyhedra
Allowing inclusion axioms as opposed to equations sounds a lot like allowing sequents as axioms in a logical theory rather than logical equivalences. In that case, though, we can always replace a sequent with an equivalent equivalence . This makes the process of building a quotient theory easier.
Can we do something similar here? If not, what goes wrong, and how do you get around it to define the quotient SMIT?
Yes, we can do something similar not for all SMITs but for Frobenius theories, i.e. those with a well-behaved Frobenius algebra. More details here: https://arxiv.org/pdf/1711.08699.pdf . The Frob algebra allows the definition of a join and then we can recover the order like you mentioned
and indeed, I'm trying to do the corresponding thing for SMUTs: somehow find a way to express the extra structure (here the semilattice) equationally
the quotient SMIT is described is at the beginning of part 3 in the paper I just linked. Tbh there aren't a lot of details
Ah. I've read & cited this paper before :upside_down:
Here we're leaning on the enrichment in Pos to interpret inequalities. Naively, this looks like a different strategy from the approach I described above, which gets rid of the inequalities/sequents altogether. I'm still curious whether there's a connection.
The logical equivalence I wrote down only makes sense in a semicartesian setting, since the implication is just discarding on the right.
Assuming that is given, then the content of the equivalence is replacing a generator (switching to monoidal notation), by another , recovering by discarding on the left.
Naively, I would expect this to form a section/retraction pair: followed by discarding on the right is the identity, but thinking proof-relevantly I wouldn't expect the reverse to be true.
the problem is that you require some cartesian-like structure right? Unless you can get it for free somehow I don't expect this to work in general
Btw, I figured out how to define union without talking about normal forms. This is sooo much more convenient x)
Kinda obvious in retrospect
interestingly, it seems the many-legged choice thing can be defined from the one-legged one in the theory of cones/polyhedra, but not in plain GLA/GAA. I expect we can get the SMUT for GLA/GAA/cones/polyhedra equationally in one swoop by axiomatizing the many-legged version
or maybe even axiomatizing this new thing that's the union of the identity and the unit, since that requires only a single Frobenius algebra
help I need notation for the new thing and I'm out of quick-to-draw shapes :cry:
Much better. I have axioms on this new choice thing that guarantee (in the linear case) that the corresponding union is indeed a union wrt the order.
Only problem is distributivity of choice over composition: the weird part is that the last equality only holds when b and c don't have outgoing wires. That's not a pretty axiom but I can't find better things to replace it
Guillaume Boisseau said:
interestingly, it seems the many-legged choice thing can be defined from the one-legged one in the theory of cones/polyhedra, but not in plain GLA/GAA. I expect we can get the SMUT for GLA/GAA/cones/polyhedra equationally in one swoop by axiomatizing the many-legged version
Which part of the definition of union above requires polyhedral relations specifically? It does not use the comparison generator as far as I can see, just the one-legged choice, copy and (co)addition. There's something I must be missing...
Guillaume Boisseau said:
Should ?
Guillaume Boisseau said:
or maybe even axiomatizing this new thing that's the union of the identity and the unit, since that requires only a single Frobenius algebra
Yes, that seems like a very interesting thing to try to pin down axiomatically!
n and m can be different
The thing that requires polyhedral relations is defining many-legged choice from one-legged choice, because it requires a 1 to n diagram such that plugging a unit gives n units and plugging a zero gives n zeros. Afaik that's impossible in GLA
It is interesting that this generator is the union of the state preparation in both complementary observables.
Guillaume Boisseau said:
The thing that requires polyhedral relations is defining many-legged choice from one-legged choice, because it requires a 1 to n diagram such that plugging a unit gives n units and plugging a zero gives n zeros. Afaik that's impossible in GLA
This is not possible in GLA because of a variation of the no cloning theorem
Oh interesting!
Do you know how to prove that?
Guillaume Boisseau said:
@Robin Piedeleu this definition of union uses many-legged choice, hence the issue I just mentioned
Guillaume Boisseau said:
This older definition of union subtly hides the number of legs. You can read it as using many-legged choice too, even though I had intended only one-legged. You can make it work with one-legged by merging all the legs into one with a black spider just before the choice, but that only works because all the Pi end with a comparison operator
Then I'm confused about the construction you give: the diagram you give for union seems to not be well typed if because copy nodes are connected to addition nodes.
And, if , can't you define a choice with copies of the basic choice generator, such that the union diagram you gave implements the union of two arbitrary diagrams and as in your picture?
hm, which n and m are you talking about actually?
Guillaume Boisseau said:
I mean in this picture (sorry discussions can get confusing on this platform, especially about diagrams).
if I use many copies of the 1->1 choice, I won't end up with only the two options I want, I'll also get plenty of mixtures with some zeros and some black dots on each side
What are the options you want? the union of a bunch of things of the same colour?
Aaah ok, I get it!
But, for the union operator, you're still only using the case, right?
@Cole Comfort I want the union of "all zeros on the left and all units on the right" and "same thing but flipped"
Robin Piedeleu said:
But, for the union operator, you're still only using the case, right?
I am
ok, all clear for me, thanks!
I like this new generator more because it feels like it fits more naturally in the Lagrangian picture. Indeed you could just add it as a generator, and then add another type of undoubled wires
and it is almost like a form of measurement
oh you mean using it to pass from doubled to single wire?
such a weird operation, it's like measuring one of the wires but you don't know which
yes exactly
This diverges from the quantum picture where you can only measure X or Z, but doing so discards the other side.
can you have an ordering on diagrams in the quantum picture?
Yes, but only for mixed states. In the pure case, everything has the same dimension so it is iso
(the pure things are precisely for an odd prime)
I am thinking of considering the multicoloured prop of affine lagrangian relations with these undoublings, and then proving completeness for it
so this generator would fit nicely into this picture
I wonder if it has an operational interpretation in terms of electrical circuits.
I strongly doubt it x)
I don't know, it feels kind of nondeterministic. Like the computation branches on the result of a Z or X measurement.
that's not what we get for electrical circuits: the whole point was to get diodes but they're not nondeterministic
I know, but even though diodes are deterministic, this isn't a diode, despite them being derived.
sure, but it's not more nondeterministic than other relational things
like, I just want to dispel the intuition that there's somehow a new layer of superposition happening here. It's just a normal relation that happens to be made of multiple linear pieces instead of a single one
I am not claiming that there is somehow a "new layer of superposition," but rather that it appears that such a circuit relates elements iff replacing this generator with Z or X measurements on the same wire results in a circuit which relates these elements. So it is not that something new is being introduced, but rather that the condition that the symplectic form vanishes is no longer constraining the system.
Not all relations have reasonable operational interpretations for eletrical circuits nor stabilizer circuits.
what's a stabilizer circuit?
In short: stabilizer circuits (in the odd prime dimensional case) are the affine Lagrangian relations over , corresponding to a very important (but non universal) fragment of quantum circuits. Since changing the field between real rational functions and finite prime fields changes the semantics from electrical circuits to this fragment of quantum circuits, I am trying to use the intuition from both cases.
Because on the quantum side of things, these circuits have been studied lots, even from a diagrammatic perspective (stabilizer ZX calculi).
oh cool
@Robin Piedeleu hm you were right about hyperplanes being hard to find actually, my geometric intuition misled me.
If two polyhedra are intersecting I still think we can choose one face and extend it into a cutting hyperplane. But if they don't intersect we kinda want to find a hyperplane in between them and I don't know how to choose one
And there might be none of course ><
I guess the question is in which case is it clear that cutting there will make the procedure terminate. I don't want to be cutting intervals in half forever
I see. My immediate and naive reaction was that we could arbitrarily pick one face of either of them as the separating hyperplane, but that clearly does not work!
Could we instead proceed as in the proof of the separating hyperplane theorem? For and two disjoint polyhedra, let be the shortest vector of . Then there exists such that and for all and . Here, the separating hyperplane is given by . I guess we need an effective way to find and diagrammatically?
Anyway, for the point in that's the closest to and the point in that's the closest to . Then or something like that.
Ohh that sounds very good
I don't think we need to find anything diagrammatically, we can use the semantics to find what we need
The issue is that we'll want to find a hyperplane between not just polyhedra but unions of them, and that's not convex
For everyone else, here's the problem statement: we have X and Y diagrams for unions of polyhedra. We know in the semantics (i.e. as subsets of R^n), and we want to prove that fact diagramatically. Wlog we can assume X to be a single polyhedron, because
The theory for single polyhedra is complete, so if Y is a single polyhedron we're done. The idea I had to make progress in the general case is to choose a hyperplane H in between A and B to reduce to and . Of course there will not always be such a hyperplane, but maybe we can still make progress this way
hm, here's a different angle: we know that . Since is a polyhedron, this means we only need to care about unions of polyhedra that equal a single polyhedron. So an alternative problem statement is to prove diagrammatically hat knowing that this holds in the semantics. Hopefully that's simpler because is a single polyhedron
The prototypical example is . To prove that , the way I did it was to cut along the hyperplane , which reduces the proof to two cases of reflexivity
Guillaume Boisseau said:
The issue is that we'll want to find a hyperplane between not just polyhedra but unions of them, and that's not convex
Good point. But the non-convexity of these things is less wild than the arbitrary kind so we should be able to tame it. Is it enough, for a finite union of polyhedra to find the hyperplane separating and the convex polyhedron enveloping , and then repeat for , etc.?
why should the convex hull of the be disjoint from , even if they're pairwise disjoint? (even in 1D, if we take between and on the line, there's a problem!)
You're right.
Then we need to consider all pairwise separating hyperplanes? Let be the separating hyperplane between and . Define , the polyhedron defined by all separating hyperplanes between and all other . Then, for do we have ?
That assumes that the polyhedra are pairwise disjoint, in which case can't we just pick ?
Ah that would be missing the point
Hm that's really cool, you're partitioning R^n into cells that contain a unique polyhedron. I think that works yeah, again assuming the polyhedra are disjoint
I thought the problem was for non-intersecting polyhedra
There's an edge case where the polyhedra touch on their faces
(there's a pun somewhere here)
My hunch is that your thing still works for that edge case, but not confident
That's my hunch too, because the separating hyperplane can include a face of both polyhedra (this is the case where the minimal distance between both is zero).
(But my intuition has been consistently rubbish about all this so I wouldn't trust myself.)
Can we recover the whole space as a union of the ?
Not necessarily
Hmm... actually I don't know, maybe
Maybe not after all. I think if the intersection of all the is nonempty you leave out some chunk of the whole space. To see this, you can draw a picture with three convex shapes in the plane and the three separating hyperplanes that leave a whole in the middle.
Guillaume Boisseau said:
The theory for single polyhedra is complete, so if Y is a single polyhedron we're done. The idea I had to make progress in the general case is to choose a hyperplane H in between A and B to reduce to and . Of course there will not always be such a hyperplane, but maybe we can still make progress this way
Is it allowed more generally to choose any hyperplane and reduce to and ?
@Reid Barton yes absolutely, that's the general idea
@Robin Piedeleu Ah yeah that's unfortunate. There should still be a way to recover the whole space from the and a bunch of other polyhedra defined from the . Presumably these other polyhedra will contain none of the ones in Y
The reason I want this is that if we have such that is always a single polyhedron and the only overlap on their boundaries, then we can reduce to which are single polyhedra. So then the proof reduces to showing diagrammatically. Hopefully for a clever choice of that's easier than the initial problem
Hmm, so I thought you could just split along all the hyperplanes that make up the faces of polyhedra whose union is , but now I realize it doesn't always work. For instance if you have two segments in the plane that meet at a common endpoint, then you also need to include a line that separates the two other endpoints.
yeah, polyhedra of dimension lower than the space have faces that aren't hyperplanes, so we kinda want to extend those into hyperplanes but that's not easy
Here's a different angle: we have kind of a notion of complement. If H is a hyperplane, and are kind of complements. So if we write , we have a "complement" . Then isn't empty but should be the boundary of , which is of lower dimension.
Now, since , I can show . I think this is good, because should be lower dimension than . So we reduced the problem to one of lower dimension, and we can iterate until Y is a union of polyhedra of dimension 0 i.e. the empty set. And we know we can prove diagrammatically
Hm, the complement is not well-defined actually: by this definition, which is not the same as :(
Well, it can still be a valid procedure to find a such that and , which is all we really need
hm that doesn't work as well as I'd like. If we're in R^n and X is a polyhedron of dimension n, then yeah . But if then is the full space . We could try to save things by going to the smallest subspace that contains each and complementing there, but then we lose the fundamental property that . So we can't take the complement of a union in a way that always reduces dimension :(
Back to @Robin Piedeleu's hyperspace idea. Write such that the have pairwise separating hyperspaces . This means and . Also .
We make a cell by choosing a direction for each hyperspace and intersecting all the . We should get: the union of all cells is ; for any cell c, is a single polyhedron. If that's true then we're done
grmbl that doesn't work. Take a small square that touches a bigger square on one side . Take the line where they touch. It's not the case that is a single polyhedron: it's union a side of
I said earlier we could assume wlog that Y equals a single polyhedron in the semantics. In other words, it's convex. This should rule out annoying cases like the above, but I'm not sure how.
Grmbl that might be enough but it's far from trivial: we could have all sorts of weird intersections with other things filling the gaps to make the whole convex.
Reid Barton said:
Hmm, so I thought you could just split along all the hyperplanes that make up the faces of polyhedra whose union is , but now I realize it doesn't always work. For instance if you have two segments in the plane that meet at a common endpoint, then you also need to include a line that separates the two other endpoints.
Actually maybe that works too. The normal form makes sure we get all those hyperplanes I think. For example the normal form for a line segment in 2D will be something like
So instead of picking a set of separating hyperplanes, let's try to take the set of all hyperplanes in Y. If we're careful to include each hyperplane twice (once for each orientation), then the intersection of two cells is always another cell (unless empty). Now each can be written as the union of some of those cells. In fact this is a new normal form for Y: we write Y as a union of polyhedra that are all defined using the same set of hyperplanes. I think that's pretty cool.
If is one of the cells in Y, then which makes it easy to show . We're left with the case where is not in . I'm hoping convexity of Y means is always a cell but I'm not sure.
My intuition is that since is semantically a polyhedron and also is described a union of polyhedra made using a known set of hyperplanes, then it should be the case that can be described as a single polyhedron using those same hyperplanes. And hopefully that derivation can be done diagrammatically. If so, then we have a pretty neat proof of completeness, cause this whole thing about cells is quite elegant diagrammatically
Or, maybe this normal form is a good enough? Then we'd add the hyperplanes of X to our set, write both and in terms of the same hyperplanes, and maybe to prove it suffices to check that and use the same set of cells? We can read off which cells directly from the normal forms
Guillaume Boisseau said:
We're left with the case where is not in . I'm hoping convexity of Y means is always a cell but I'm not sure.
Wait, I'm confused about notation here: is not necessarily convex but we can assume wlog that is, right?
We can assume wlog that both X and Y are convex, surprisingly enough
I mean that their semantics are convex sets. Also wlog X is described as a single polyhedron. Y is still described as a union of course, otherwise we'd be done
I might have been a bit imprecise in distinguishing between Y as a term and the semantics of Y as a subset of R^n
Guillaume Boisseau said:
We're asking what happens if we add a gadget to GAA that allows us to model diodes. So far it appears this is equivalent to extending the theory of polyhedra (https://arxiv.org/abs/2105.10946) with a gadget that gives the union of x=0 and y=0
Okay, I've only been vaguely following this topic, because I didn't know what GAA was and I was too lazy to find out, but now that I've had a look at the paper you referenced, can I get some clarification on basics here? For example, we've been talking about polyhedra as if they're the usual "shapes in n-dimensional Euclidean geometry", but that's not how they're defined in that paper, which is as sets of the form . How exactly are we identifying these with ordinary polyhedra?
Those sets are exactly the convex polyhedra, and that's what I've been meaning by "polyhedra" so far in this chat
But it's good timing, I was about to make a big summary
Guillaume Boisseau said:
Those sets are exactly the convex polyhedra, and that's what I've been meaning by "polyhedra" so far in this chat
In one dimension, these are sets of the form , which is a ray starting at or a point or the empty set if . I can see how you might argue this is somehow a polygon (or whatever the name for 1D polyhedra is), but there are clearly plenty of polyhedra (e.g. closed intervals) missing.
A doesn't need to be square. In other words, the number of equations can be larger than the dimension of the space. You can easily get all closed intervals with two equations
Aha! That clarifies a lot. Still, I'm not sure one would normally assume that polyhedra could include infinite subsets of Euclidean space, so I'm glad I clarified that too.
Ah fair, I forgot about those cases but yeah we also want them
I would have to understand a lot more about the diagrammatic stuff to engage with the question regarding whether one can prove that one polyhedron is a union of others diagrammatically; maybe I'll come back to that :wink:
I'll try to explain that somewhat in my summary. Coming soon after my lunch ^^
We have a sound and complete diagrammatic theory of convex polyhedra (that I will call simply "polyhedra") here: https://arxiv.org/abs/2105.10946 . Polyhedron here also includes polyhedral-like shapes that go to infinity, for example the top-right quadrant . Terms in this theory are diagrams made from some basic generators, see there. An diagram in this language is interpreted as a polyhedron in .
What we're trying to do is extend this to arbitrary unions of polyhedra. To do that, we add to our language a union operator on diagrams of the same type, and some axioms. So far I think we want:
union and the intersection already present in the language form a distributive
lattice
(aka totality)
The semantics functor (call it ) now maps a term in this language to a union of polyhedra in . I believe the axioms mentioned above are enough to get completeness, namely that when then is provable in the theory.
Single polyhedra (i.e. terms that don't feature the union operator) have a normal form, described in the paper above. The normal form expresses a polyhedron as the intersection of a bunch of half-spaces, where a half-space is for some nonzero affine function . I sometimes write or for the hyperplane , and tend to call itself a hyperplane too.
Given a general term , we can move all the unions to the top-level to get an explicit union of diagrams without unions aka polyhedra. We then normalize all those subpolyhedra. We end up with a form where is the union of intersections of half-spaces. Moreover, is provable, so we can add some half-spaces to a given subpolyhedron, so that in the end is a union of intersections of half-spaces, and each term in the union mentions the same set of hyperplanes.
Take and such that . First normalize both X and Y. From the lattice axioms , so wlog we can assume that doesn't use the union operator, i.e. is a single polyhedron.
Now . So for completeness it suffices to be able to prove when is a single polyhedron and is convex. The direction reduces to inclusions between single polyhedra, which we can prove by completeness of the theory of single polyhedra. So remains only the case where is a single polyhedron and is convex.
The final reduction, which might not be the right approach, is as follows. Take all the hyperplanes in the normal form of and consider all the possible intersections of the corresponding half-spaces. Call each of those a cell. From our axioms we can show that the union of all cells is the whole space. From this and lattice axioms we get where ranges over the cells. Remember that is written as a union of some of those cells. I'm hoping this makes it easy to express as a single polyhedron, which would entail completeness.
Ah, there's also a simpler approach that doesn't involve all the reductions: take all the hyperplanes in X and Y, and write X and Y in normal form using all of those hyperplanes. Then they are both unions of some of the cells defined from those hyperplanes. I suspect that when then they use essentially the same set of cells. We just have to figure out what "essentially" means here ^^
This all feels incredibly like the linear case of Cylindrical Algebra Decomposition (CAD). For which there is a huge amount of theory, (implemented) algorithms, normal forms, etc. Also, the convex optimization people have done lots of work on representing polyhedra. Lastly, the Abstract Interpretation people have delved into polyhedral domains too.
First example of an entire extended abstract written on zulip!
Jacques Carette said:
This all feels incredibly like the linear case of Cylindrical Algebra Decomposition (CAD). For which there is a huge amount of theory, (implemented) algorithms, normal forms, etc. Also, the convex optimization people have done lots of work on representing polyhedra. Lastly, the Abstract Interpretation people have delved into polyhedral domains too.
Hahaha, I'm sure I'm reinventing the wheel. Do you have references for those things by any chance?
ok wow, this indeed looks very similar to https://en.wikipedia.org/wiki/Cylindrical_algebraic_decomposition . I see an important difference: most of the pain we have is that we can't find a nice decomposition into cells that are mutually disjoint, becaues we can only represent (topologically) closed cells. In other words we don't have negation.
Suppose you could find a collection of hyperplanes such that, when we form all the cells they define as described in your last paragraph, the following two properties hold:
(a) is (provably/by definition) a union of some of those cells.
(b) The intersection of with one of those cells is again one of those cells (or empty).
Is that enough?
CAD doesn't quite have negation either, but because it has inequalities, it has trichotomy. So that gives you a positive description of everything. So you get a complete 'cellular' decomposition of space. All 3 of the domains that I mentioned have enormous literature, and I'm not an expert in any of them (so I don't really know the modern introductions).
I think then the equations of part (b) must be provable. Because say and we know that for some cells and . The equation is provable because both sides are polyhedra. So we just need to prove that , that is, . I want to try to arrange this by making one of the .
Or at least, should be a subset (a face) of one of the , and I think that's automatic from the way the "cells" were defined.
Now if is a polyhedron then to prove that we can prove that for all . And we know we can provably rewrite to a polyhedron (another cell), so now we are left with a true inclusion between polyhedra and it is provable. Does that seem right so far? So the key is to arrange for (b) above to be satisfied.
But here is nevertheless an attempt:
Reid Barton said:
Suppose you could find a collection of hyperplanes such that, when we form all the cells they define as described in your last paragraph, the following two properties hold:
(a) is (provably/by definition) a union of some of those cells.
(b) The intersection of with one of those cells is again one of those cells (or empty).
Is that enough?
We already have a way to do this, but I don't think this is enough. I think it's enough if we add the following extra assumption: for three cells a, b and c,
Reid Barton said:
Now if is a polyhedron then to prove that we can prove that for all . And we know we can provably rewrite to a polyhedron (another cell), so now we are left with a true inclusion between polyhedra and it is provable. Does that seem right so far? So the key is to arrange for (b) above to be satisfied.
yeah absolutely! That's where I'm stuck too
Guillaume Boisseau said:
Reid Barton said:
Suppose you could find a collection of hyperplanes such that, when we form all the cells they define as described in your last paragraph, the following two properties hold:
(a) is (provably/by definition) a union of some of those cells.
(b) The intersection of with one of those cells is again one of those cells (or empty).
Is that enough?We already have a way to do this, but I don't think this is enough. I think it's enough if we add the following extra assumption: for three cells a, b and c,
If "cell" means any subset formed by, for each of the chosen hyperplanes , selecting one of , , , and intersecting all of these, I think this is automatic. Or at least I can't think of a counterexample.
The problem I know about is that the intersection of with a cell might not be a single cell.
Which kinds of people use CAD?
I assume it has nothing to do with CAD/CAM, though it could. :upside_down:
So I wanted to suggest a partial solution to (b). Suppose is bounded. Enclose it in a big box, and write the box as a simplicial complex in such a way that form a subcomplex. If we take all the hyperplanes that make up the faces of the simplices of this complex, it's not good enough to achieve (b), as the example with two segments joined at an endpoint shows. Instead, form the barycentric subdivision of this complex--I claim that that one works.
The point is that if is a simplicial complex, then a subcomplex of could meet a simplex of in some arbitrary union of faces. But the subdivision of can only meet a simplex of the subdivision of in very specific ways; in particular, in a single face or not at all.
Reid Barton said:
The problem I know about is that the intersection of with a cell might not be a single cell.
That's kind of the same question I think: since is a union of cells, so is . Then I'm hoping that to prove I can always find that one of the cells of contains
Oh I see. Maybe this is easier. Isn't this true (provided we fix a single choice of hyperplanes, and thus a notion of "cell", for both and )?
Reid Barton said:
The point is that if is a simplicial complex, then a subcomplex of could meet a simplex of in some arbitrary union of faces. But the subdivision of can only meet a simplex of the subdivision of in very specific ways; in particular, in a single face or not at all.
I think we already have a property like that for our subdivision into cells, since an intersection of cells is always a cell. Does the simplicial stuff help more?
Reid Barton said:
Oh I see. Maybe this is easier. Isn't this true (provided we fix a single choice of hyperplanes, and thus a notion of "cell", for both and )?
Yeah, I'm pretty sure this is true. Just unclear how to prove it, and I'm losing track of what needs to be proven diagrammatically and what normally ><
I was trying to arrange that the intersection was not merely a union of cells, but actually a single cell.
If is the union of the nonnegative x and y axes, for example, then you can see from a picture that we need to include a hyperplane that cuts through the origin that isn't either of the axes.
ah, well that's not the case in general, though I suspect it might be true in the convex case. Counter-example: choose to be the union of the top-left and bottom-right quadrants of the plane. The two relevant hyperplanes are the two axes. Take to be the top-right quadrant, then isn't a single cell
hah, we had the same example in mind ^^
But this is fixable by adding more hyperplanes, which is what the subdivision is for.
ohh, now I understand a message of yours from a while ago
hm, how would we do that? Is it enough to add separating hyperplanes between any two of the initial Y_i?
no the might have non-trivial intersection. So we start by doing the subdivision using all the hyperplanes of Y, and _then_ we add pairwise separating hyperplanes? Is that enough?
Pairwise, meaning for each pair of cells?
yeah
but I'm really hoping we don't need to go this way and that the initial subdivision is enough
I think my whole scheme depended on the same hypothesis that the simpler approach you suggested does. Namely, if we fix some hyperplanes and use them to divide space into cells, then if one cell is contained in the union of some others, then it is already contained in one of the cells.
nice
well, I think all the pieces are here and we just need to put them together. I'm exhausted rn though, will try later
ah, "a polyhedron is called a polytope if it is bounded". It sounds like our terminology is standard
Reid Barton said:
Namely, if we fix some hyperplanes and use them to divide space into cells, then if one cell is contained in the union of some others, then it is already contained in one of the cells.
Do you mean that for cells (built from the same set of hyperplanes), if then or ? I'm not convinced that's true. What I find more convincing is that there exists another cell built from the same collection of hyperplanes (not necessarily or ) that contains . Is this what you meant? That would be enough for us, right?
Guillaume Boisseau said:
ah, "a polyhedron is called a polytope if it is bounded". It sounds like our terminology is standard
Yeah, I thought so too!
I also haven't worked out all the details, but currently I believe some argument along the following lines should work:
Robin Piedeleu said:
Reid Barton said:
Namely, if we fix some hyperplanes and use them to divide space into cells, then if one cell is contained in the union of some others, then it is already contained in one of the cells.
Do you mean that for cells (built from the same set of hyperplanes), if then or ? I'm not convinced that's true.
Yes, this is what I mean and I'm not 100% convinced it is true either.
What I find more convincing is that there exists another cell built from the same collection of hyperplanes (not necessarily or ) that contains . Is this what you meant? That would be enough for us, right?
Only if it is contained in as well.
I think this works, though. The intersection of any two cells is a cell. If then also , so we can assume that and are contained in . Then if neither one equals then they both have lower dimension than , and so their union cannot cover .
Picture as a triangle (defined by three halfspaces) within a larger triangle (say three other halfspaces given by parallel hyperplanes to the first three). Cut the larger triangle in half with another hyperplane such that it also cuts into two, and call the two sides of the larger triangle and (a picture would be better here). Isn't this a counterexample to the first statement?
If is cut in half then it isn't a cell.
For me a cell means we have to specify , or for every chosen hyperplane.
Right
yes, that's true
Robin Piedeleu said:
Reid Barton said:
Namely, if we fix some hyperplanes and use them to divide space into cells, then if one cell is contained in the union of some others, then it is already contained in one of the cells.
Do you mean that for cells (built from the same set of hyperplanes), if then or ? I'm not convinced that's true. What I find more convincing is that there exists another cell built from the same collection of hyperplanes (not necessarily or ) that contains . Is this what you meant? That would be enough for us, right?
it would be enough, but I still believe the statement to be true
@Reid Barton I'd be surprised that a dimension argument would work here. We can have nontrivial inclusions between polyhedra of the same dimension
Reid Barton said:
I think this works, though. The intersection of any two cells is a cell. If then also , so we can assume that and are contained in . Then if neither one equals then they both have lower dimension than , and so their union cannot cover .
To me, this seems to work.
You're not convinced @Guillaume Boisseau?
No: cut a square along the diagonal, call the two triangles b and c.
then , yet is equal to neither
But is not a cell
yeah but how do you use this fact in the proof?
I think this is where I was also confused earlier.
Here's the proof I had in mind: define cells from a set of hyperplanes. Take Y a union of cells and c a cell such that . Wlog we can assume , as mentioned above. This means that if any hyperplanes are set to 0 in , they are also set to 0 in all the cell of . So wlog we can ignore those. Now all hyperplanes in are set to or . Wlog (up to negating some hyperplanes) we can assume they're all set to . Now take a point in the interior of . So . Now . So there is a cell in that contains . Since , must have assigned to all hyperspaces, i.e. , which is what we wanted.
But we also cannot use that in the proof, right?
why not?
Lemme sketch this
The part using strict inequality
we can't do it diagrammatically, but we don't need to
Take and two terms. Express them as unions of cells using the same hyperplanes. We know we can reduce to the case of . So assume . Using the above, we know there's a syntactically mentioned in such that . Those are two polyhedra, so we can prove diagrammatically, from which we get .
The proof of existence of doesn't need to be diagrammatic. Once we know there's such a from the semantics side that's enough. Am I making sense?
Robin Piedeleu said:
Reid Barton said:
I think this works, though. The intersection of any two cells is a cell. If then also , so we can assume that and are contained in . Then if neither one equals then they both have lower dimension than , and so their union cannot cover .
To me, this seems to work.
Oh, actually I see the argument now: it relies on the fact that if two cells are then they're either equal or of different dimensions. I like that, it's clean. We'd need to prove it tho. I'd guess it relies on a similar argument to my more complex proof above.
I guess. But then what was the problem with @Reid Barton's proof above?
it was that I hadn't understood it ^^
Oh ok
and now it is that it relies on an unproven fact about our cells
Yes, there is exactly this point where you need to show that if a cell is strictly contained in another cell , then belongs to the boundary of (say) and therefore has lower dimension.
I didn't totally check it but your argument sounds plausible.
hm, I think I like your argument better than mine. If the hyperplanes are distinct, then I think , where the order is and . From that, I think we should be able to deduce your dimension argument
We might need the hyperplanes to be linearly independent or something like that
At some point one needs to make or use a real argument involving for instance the fact that the ordering on the field is dense, since otherwise might equal .
I think your argument comes down to the following thing. We can divide all the hyperplanes involved into two classes, those which contain (i.e. the associated function vanishes on all of ) and those for which there is with (WLOG). You need to know that you can find a point such that all of the second kind of satisfy at once.
Because then as you argued, whatever cell contains this has to contain all of .
yeah, I assume known that a nonempty polyhedron of dimension d has an interior of dimension d, in which we can find that x
Yes, this seems good
which relies on an implicit assumption that for a nice enough set of hyperplanes (I'd guess linearly independent or at least distinct), a polyhedron made from them really has dimension d
Well first we can pass to whatever affine subspace is defined by the intersection of those that vanish on all of , and call it
that's actually not quite trivial, since for example take a triangle and move the defining hyperplanes closer until your triangle is a point. It syntactically looks like we should get a 2d polyhedron with an interior but we don't
ah I see, I was thinking too syntactically your thing should work
or actually, here is a handy trick
We can assume WLOG that we never use inequalities of the form to define , so we know that on for every (although for some , the defining relation might actually be ).
So if we have a bunch of points on each of which a particular is positive, then at their centroid, all the are positive.
hm, doesn't that break in my triangle example?
or in a simpler example: take as two hyperplanes the two 2D axes. I can definitely find two points that each have one positive coordinate but whose midpoint has both coordinates negative. E.g. (-2,1) and (1, -2)
I mean as long as we place on the nonnegative side of each , then we average a bunch of nonnegative values and at least one positive value
John Baez said:
Which kinds of people use CAD?
I assume it has nothing to do with CAD/CAM, though it could. :upside_down:
One of the big uses of CAD is quantifier elimination. It's only double-exponential instead of tower-of-exponential for the 'generic' procedure, so much faster/better!
I've used it in a symbolic-probabilistic programming language for normalizing of region descriptions. The natural descriptions that come out of a probabilistic model often need to be "re-ordered" when doing conditioning. Since most models are naturally cut out by semi-algebraic sets, this was quite natural.
But basically: if you've got semi-algebraic sets around, CAD should be your go-to computational tool.
Reid Barton said:
I mean as long as we place on the nonnegative side of each , then we average a bunch of nonnegative values and at least one positive value
I think this still breaks in my triangle example: each half space has an interior but their intersection doesn't
If an individual function is zero on all of then we place it in the first class, even if the defining relation was a priori of the form .
hm. I'm still skeptical, this involves too much cleverness in the choice of each
lemme try to rephrase your argument: take the on which is not always 0. For each , pick an such that and . Then the midpoint of the is such that ? How am I sure I can pick each , and how to I know that taking the midpoint preserves positivity?
by convention we chose the sign so that on , so either it's constantly zero, or positive somewhere and nonnegative everywhere
then if we average the "somewheres", we make every which isn't constantly zero positive at once
now since that point is contained in one of the cells of , we know that that cell must be defined by for all the which aren't constantly zero on , and the ones that are constantly zero will be fine no matter what
I'm still not clear on why we can pick the . I'm getting tired, it'll probably make sense tomorrow
It's just that the negation of "" is "".
for a single hyperplane sure yeah
Right sorry, what I mean is either vanishes on or we can find a point in where is positive.
I don't care about anything outside
For the hyperspaces we've picked, we know for a given i: . But I also want constraints of the other hyperspaces on , that's what bugs me
Ah yeah I see it
you're right, I forgot we were choosing an in
niiice, I think that concludes the whole argument! Don't even need independent hyperplanes. This suffices to show that if then there's a syntactically in such that . That's what we were missing.
Here's a cool idea: we started all this by noticing that if we added the diode to our theory of convex polyhedra, we could build arbitrary unions of polyhedra. What happens if we try to add other non-convex gadgets? I suspect we always get everything, but there could also be an interesting in-between theory.
Currently trying to figure out what happens if we add the simplest thing I could think of:
pretty crazy: I can characterize the points of and diagrammatically! And from that it's instant that if for two cells c and d, then , which gives
image-c0210306-347c-4e16-8144-431de1794251.jpg
(none of the equivalences I wrote are evident, but they hold)
Huh, the convex hull of closed things might not be closed: take the hull of and . This doesn't contain but contains points infinitely close to it.
We can construct a hull diagrammatically, but it'll be the closure of the actual hull. It's neat that we can express that algebraically
Algebraically, we can describe it as taking the usual convex hull and adding the "0-spaces" of a and b, which are the conic subspaces obtained by replacing the affine "1" generator with "0" in the diagram for a polyhedron
image-f40f10ec-305f-440b-8365-6f4efba8a19a.jpg
I really like this diagram for the hull: the left part says: , so it really feels like we're building the appropriate convex combination . It's a bit more subtle because a and b are not maps, but that's the idea
We wrote this all up, and I4ll be presenting the results at SYCO this December! https://arxiv.org/abs/2111.03956