Category Theory
Zulip Server
Archive

You're reading the public-facing archive of the Category Theory Zulip server.
To join the server you need an invite. Anybody can get an invite by contacting Matteo Capucci at name dot surname at gmail dot com.
For all things related to this archive refer to the same person.


Stream: theory: category theory

Topic: "general position", linearity in category theory


view this post on Zulip Patrick Nicodemus (Jan 06 2025 at 18:08):

Hi folks. Recently I have been thinking about coherence conditions in category theory. I want to extend a well known theorem about Grothendieck fibrations to fibrations between bicategories. This should be straightforward and boring, so my mind drifted to the question of whether there was a way to get it easily by appealing to general coherence principles in categories and bicategories.

I noticed that many coherence conditions in category theory concern terms which are in "general position", i.e., they have the maximal number of free variables possible. For example, over the typing context
x0,x1,x2,x3,x4:Ob,f:x0x1,g:x1x2,h:x2x3,k:x3x4x_0,x_1,x_2,x_3, x_4: Ob , f : x_0 \to x_1, g : x_1 \to x_2, h : x_2 \to x_3, k : x_3\to x_4, all words in the language of category theory using each of f,g,h,kf,g,h,k are provably equal. However, over the context
x:Ob,f:xx,g:xxx : Ob, f : x \to x, g : x\to x, there are many words which are not provably equal, such as f2f^2, f3f^3, fgfg and gfgf.
One way we can explain this is that the context is not fully general: fgf\circ g is well defined over the context f:xy,g:yzf : x\to y, g : y \to z. We could also conjecture a linearity requirement - fff\circ f is not linear (the variable occurs twice) so it shouldn't appear in coherence conditions.

Similarly, in a symmetric monoidal category, not all diagrams commute, because do not require the coherence law
syma,a=idaa:aaaasym_{a,a} = id_{a \otimes a } : a \otimes a \cong a \otimes a, because this is not in "most general position" (the symmetry isomorphism can be stated more generally) or alternatively because aaa \otimes a is not linear.

This seems closely related to the observation that many algebraic theories can be presented using operads, which are more narrow/restrictive than Lawvere theories, and don't allow duplication or deletion of variables from a context. Even stronger are the theories presented by finitary Cartesian monads, in which variables cannot even be permuted.

Some questions:

  1. Broadly speaking, why do you think categorical coherence conditions tend to be of this form? Why does so much of category theory seem to fit in this "operad" fragment of algebra rather than the "Lawvere theory" fragment?
  2. Is there a satisfying explanation for why symmetric monoidal categories are the "right" notion of categorified commutative monoid and not a stronger one with syma,a=idaasym_{a,a} = id_{a \otimes a }?
  3. Is there a satisfying explanation for why a category is the "right" notion of a vertically categorified preorder, and we don't consider vertically categorified preorders with laws like fg=gffg=gf when this is valid?
  4. Can someone point me to a precise general definition of coherence for categorical structures along the lines of "all diagrams in general position commute" that is sufficiently general that it includes the coherence conditions for, say, a bicategory and a lax pseudofunctor between bicategories

view this post on Zulip Todd Trimble (Jan 06 2025 at 21:13):

Your first two paragraphs prompt the question: have you read the work of Max Kelly in this area? To start with, he and Eilenberg developed a graphical calculus for tracking compositions of extranatural transformations, and this calculus is an early precursor of the proof nets of linear logic. The paper by Kelly and Mac Lane, Coherence in Closed Categories, JPAA 1) refers this graphical calculus to state coherence theorems in their paper.

A coherence problem is formalized as the problem of deciding equality of morphisms in free structures, in this case, in free smc categories on categories CC (relative to being able to decide equality of morphisms in CC). A first result is that the free structure F(C)F(C) can be constructed as a certain "wreath product" F(1)CF(1) \wr C (along the lines of Max Kelly's [[clubs]], which were invented around the same time), so that it suffices to decide equality in F(1)F(1). The coherence theorem of Kelly and Mac Lane is a partial solution, which says that as long as objects X,YX, Y in F(1)F(1) are isomorphic to objects that do not involve the monoidal unit II in their syntactic construction, two morphisms from XX to YY are equal if and only if they have the same extranaturality graph.

Anyway, for anyone new to this area, I would recommend reading, in addition to these two papers, the early papers of Lambek

and

which inaugurated these attacks on non-trivial coherence problems, in particular showing how to adapt Gentzen cut elimination techniques to study coherence problems. For me, these papers were the real eye-opener; I found his description of the essential technique a lot clearer than the account given by Kelly and Mac Lane (who acknowledge his influence). There's also a book by Lambek's student Manfred Szabo, titled Algebra of Proofs, which unfortunately got a bad reputation because some of his technical claims about solutions to coherence problems were found to be overreaching and incorrect, but I think it's just fine in terms of laying out some basic ideas and strategies.

By the way, Lambek uses terminology very similar to yours, Patrick: he refers to proofs or morphisms [morphisms in a free smc structure, for instance, where these morphisms are viewed as equivalence classes of proofs] as being of "maximal generality". The Eilenberg-Kelly-Mac Lane graphs are a way of visualizing such maximally general morphisms, where caps and cups and lines connect variables ("atomic types", syntactically speaking) that are linked through instances of extranaturality or naturality of definable maps.

Not sure how followable this is, but I'm happy to discuss (this is the same area that I did my thesis on).

view this post on Zulip Patrick Nicodemus (Jan 06 2025 at 22:03):

Thanks very much, Todd. I really appreciate it. The term "general position" also occurs in a phd thesis on coherence in 2-categorical rewriting problems by Jonathan Asher Cohen. However I think Cohen slips up in a few instances and forgets this requirement and so his main result characterizing coherent rewriting systems appears to be incorrect.

I'll check out the sources you linked. Right now I'm refreshing myself on Eugenia Cheng's writeup of your "iterative operadic theory of n-categories"

view this post on Zulip Amar Hadzihasanovic (Jan 07 2025 at 08:43):

My hunch is that the "general position" thing is somewhat of a red herring:

Now I think that the "general position" is a shadow of "acyclicity", since acyclicity prevents a variable from appearing twice in a chain of compositions, just as a vertex cannot appear twice in a path in an acyclic graph.
So I'd believe that a characterisation of diagrams to which coherence applies that is based on "general position" would also fail to generalise to higher-dimensional coherence theorems.

view this post on Zulip Patrick Nicodemus (Jan 07 2025 at 23:40):

@Amar Hadzihasanovic I think I understand the gist of your comments. I am wondering if the alternate approach you suggest in your book is applicable to symmetric monoidal categories. I can imagine formally adjoining an edge from mmm \cdot m, where mm is a 1-cell from a 00-cell \bullet to itself, but would this be appropriately "contractible?"