Category Theory
Zulip Server
Archive

You're reading the public-facing archive of the Category Theory Zulip server.


Stream: theory: mathematics

Topic: Big Witt ring


view this post on Zulip John Baez (Oct 06 2024 at 17:09):

I'm trying to understand the "big Witt ring" of a commutative ring; there are various perspectives on it.

1) One slick abstract perspective is that the forgetful functor from [[lambda-rings]] to commutative rings has not only the expected left adjoint but also a right adjoint

W : \mathsf{CommRing} \to \lambda\mathsf{Ring}

and $W(R)$ is the big Witt ring. This perspective is helpful in proportion to how well you understand lambda-rings. There are several perspectives on lambda-rings, each of which should give a different outlook on the big Witt ring. My favorite is that when you decategorify a 2-rig by taking its Grothendieck group, you get not only a ring but a lambda-ring. But there is another important perspective based on number theory, where a lambda-ring is a commutative ring equipped with commuting [[Frobenius lifts]], one for each prime $p$.

view this post on Zulip John Baez (Oct 06 2024 at 17:14):

2) The underlying set of $W(R)$ is

\mathsf{CommRing}(\Lambda, R)

where $\Lambda$ is the underlying ring of the free lambda-ring on one generator. I will explain why later.

$\Lambda$ is a biring, i.e. a ring object in $\mathsf{CommRing}^{\text{op}}$, and the multiplication in $W(R)$ comes from the comultiplication in $\Lambda$.

view this post on Zulip John Baez (Oct 06 2024 at 17:23):

3) Since one can show the underlying commutative ring of the free lambda-ring on one generator is

\Lambda = \mathbb{Z}[\lambda_1, \lambda_2, \dots]

we have an isomorphism of sets

W(R) \cong 1 + t R[[t]] \subset R[[t]]

where a homomorphism

f : \mathbb{Z}[\lambda_1, \lambda_2, \dots] \to R

is mapped to

1 + \sum_{n = 1}^\infty f(\lambda_n) t^n

Then the challenge is to describe the addition and multiplication on $W(R)$ in these terms. People often do this with explicit formulas, which I tend to find cryptic. The addition in $W(R)$ corresponds to the multiplication in $1 + t R[[t]]$, which is why we use this description. The multiplication in $W(R)$ is more tricky. For any $a \in R$ we get an element of $W(R)$ called $(1 - a t)^{-1}$, defined by

(1 - a t)^{-1} = 1 + a t + a^2 t^2 + \cdots

and the multiplication $\cdot_W$ on $W(R)$ turns out to be determined once we know

(1 - a t)^{-1} \cdot_W (1 - b t)^{-1} = (1 - a b t)^{-1}

This formula turns out to be very useful but I don't have a good understanding of how it comes from 2).

view this post on Zulip John Baez (Oct 06 2024 at 17:31):

4) There's a ring isomorphism

W(R) \cong R \times R \times R \times \cdots

where $a \in W(R)$ is sent to $(g_1(a), g_2(a), \dots)$, and

g_i : W(R) \to R

sends $a \in W(R)$ to something called its $i$th ghost component. Here's an explicit formula for the ghost components that one often sees. Start by using the isomorphism

W(R) \cong 1 + t R[[t]]

from 3) to write $a \in W(R)$ as a power series in $t$. Then write

t \frac{d}{d t} \ln a = \sum_{i = 1}^\infty g_i(a) \, t^i

This is quite cryptic at first sight, but there has got to be some conceptual interpretation, probably involving [[Adams operations]].
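Here's a minimal sketch in Python (my own illustration - the truncation order N and all the names are made up) that computes ghost components as the coefficients of $t \frac{d}{dt} \ln f$ and checks numerically that the ghost map turns the multiplication rule for the elements $(1-at)^{-1}$ from 3) into componentwise multiplication:

```python
from fractions import Fraction

N = 6  # truncation order: we work with power series mod t^(N+1)

def mul(f, g):
    """Multiply truncated power series given as coefficient lists, f[0] the constant term."""
    h = [Fraction(0)] * (N + 1)
    for i, fi in enumerate(f):
        for j, gj in enumerate(g):
            if i + j <= N:
                h[i + j] += fi * gj
    return h

def inv(f):
    """Invert a power series with constant term 1, term by term."""
    g = [Fraction(0)] * (N + 1)
    g[0] = Fraction(1)
    for n in range(1, N + 1):
        g[n] = -sum(f[k] * g[n - k] for k in range(1, n + 1))
    return g

def ghost(f):
    """Ghost components g_1, ..., g_N of f = 1 + c_1 t + ...: coefficients of t f'(t)/f(t)."""
    tfp = [Fraction(0)] + [i * f[i] for i in range(1, N + 1)]  # t * f'(t)
    return mul(tfp, inv(f))[1:]

def geom(a):
    """(1 - a t)^{-1} = 1 + a t + a^2 t^2 + ... as a truncated series."""
    return [Fraction(a) ** n for n in range(N + 1)]

a, b = 2, 3
# the ghost vector of (1 - at)^{-1} is (a, a^2, a^3, ...):
assert ghost(geom(a)) == [Fraction(a) ** n for n in range(1, N + 1)]
# the ghost map sends Witt multiplication to componentwise multiplication, so the
# ghost vector of (1 - at)^{-1} *_W (1 - bt)^{-1} should be that of (1 - abt)^{-1}:
assert [x * y for x, y in zip(ghost(geom(a)), ghost(geom(b)))] == ghost(geom(a * b))
```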

view this post on Zulip John Baez (Oct 06 2024 at 17:44):

5) Back to conceptual interpretations, Cartier noticed that the big Witt ring $W(R)$ can be seen as the ring of all formal curves starting at $1$ in the multiplicative group of $R$... or something like that. This is explained here, where that ring of formal curves starting at $1$ is called $C(\mathbb{G}_m, R)$.

This is probably just another way of thinking about 3), but it connects the big Witt ring to formal group laws, and in particular the "multiplicative formal group law", and I believe this should ultimately clarify the following quartet of facts: a) the $K$-theory of any space is a $\lambda$-ring, b) $K$-theory is a complex oriented cohomology theory, c) such cohomology theories are classified by formal group laws, d) $K$-theory corresponds to the multiplicative formal group law.

(Cartier generalized the big Witt ring to an arbitrary formal group law.)

view this post on Zulip John Baez (Oct 06 2024 at 19:19):

It should be possible to clearly see how all these facts follow from the definition 1), but I'm not there yet!

Here's how 1) implies 2). We use the fact that the forgetful functor

U : \lambda\mathsf{Ring} \to \mathsf{CommRing}

has not only a left adjoint

F : \mathsf{CommRing} \to \lambda\mathsf{Ring}

but also a right adjoint

W : \mathsf{CommRing} \to \lambda\mathsf{Ring}

Let's use this to compute the underlying set of $W(R)$ for some commutative ring $R$.

The underlying set of any commutative ring $A$ is $\mathsf{CommRing}(\mathbb{Z}[x], A)$ since $\mathbb{Z}[x]$ is the free commutative ring on one generator.

Thus the underlying set of (the underlying commutative ring of) $W(R)$ is

\mathsf{CommRing}(\mathbb{Z}[x], U(W(R)))

but since $U$ has a left adjoint $F$ this is

\lambda\mathsf{Ring}(F(\mathbb{Z}[x]), W(R))

and since $W$ has a left adjoint $U$ this is

\mathsf{CommRing}(U(F(\mathbb{Z}[x])), R)

By general nonsense $F(\mathbb{Z}[x])$ is the free lambda-ring on one generator. The underlying commutative ring of this, $U(F(\mathbb{Z}[x]))$, is denoted $\Lambda$. So, from this we get:

The underlying set of $W(R)$ is isomorphic to the set of ring homomorphisms from $\Lambda$ to $R$.

It happens that $\Lambda$ is the ring of [[symmetric functions]], but we didn't need this in the above argument. It also happens that the ring of symmetric functions is the polynomial ring on the generators $\lambda^i$ called [[elementary symmetric functions]]. This is called the fundamental theorem of symmetric functions. We used this to see why 2) implies 3).

view this post on Zulip Morgan Rogers (he/him) (Oct 07 2024 at 05:50):

Where did this come from originally, and why is it named after Witt? I've never heard of it, so I'm curious how it connects to other things and why you're interested in understanding it now ;)

view this post on Zulip David Corfield (Oct 07 2024 at 08:09):

There was an exchange between you and James Borger back here which touched on Big Witt vectors. Jack Morava and David Ben-Zvi chip in and get to Adams operations. Maybe a conversation worth mining.

view this post on Zulip John Baez (Oct 07 2024 at 17:18):

Morgan Rogers (he/him) said:

Where did this come from originally, and why is it named after Witt? I've never heard of it, so I'm curious how it connects to other things and why you're interested in understanding it now ;)

Let me dig into the history....

Okay, it turns out Witt introduced a kind of "Witt vector" associated to an algebra over a field of characteristic $p$ back in 1936, in his paper Zyklische Körper und Algebren der Charakteristik $p$ vom Grad $p^n$. Struktur diskret bewerteter perfekter Körper mit vollkommenem Restklassenkörper der Charakteristik $p$. No, that's not the whole paper - it's just the title.

view this post on Zulip John Baez (Oct 07 2024 at 17:21):

Given a commutative ring and a prime $p$ we can make up a "$p$-typical Witt ring" containing these Witt vectors. But then someone noticed you can combine all these $p$-typical Witt rings in a single "big Witt ring", and that's what I'm talking about.

People use these for various mysterious number-theoretic tasks, which I would like to understand someday.

view this post on Zulip John Baez (Oct 07 2024 at 17:28):

But here's why I'm actually interested!

The big Witt ring is now best understood as the cofree lambda-ring on a commutative ring. Lambda-rings are important in representation theory, where they let us study things like exterior powers and symmetric powers of representations. But also, surprisingly, they're important in number theory, where they let us study things like "Frobenius lifts" - lifting the Frobenius endomorphism of a field to commutative algebras over that field.

view this post on Zulip John Baez (Oct 07 2024 at 17:30):

For a long time I've been interested, just as a hobby, in understanding why people think the Riemann Hypothesis is connected to the "field with one element". So I was very interested when lambda-rings and the big Witt ring became important in James Borger's approach to the field with one element:

His idea is that an algebra over the field with one element is a lambda-ring.

Digging into this, I've come to understand fairly well how lambda-rings arise from decategorifying concepts from representation theory. I've always liked representation theory, and @Joe Moeller and @Todd Trimble and I have shown that categories of representations tend to be 2-rigs, and the Grothendieck group of any 2-rig is a lambda-ring.

view this post on Zulip John Baez (Oct 07 2024 at 17:32):

But I'm less comfortable with how lambda-rings are connected to number theory and Frobenius lifts. Mainly, it seems like a miracle that lambda-rings show up in two different contexts: representation theory in characteristic zero, and number theory in characteristic $p$!

They can't really be different; they must be deeply connected, so I want to understand this.

view this post on Zulip John Baez (Oct 07 2024 at 17:36):

One way for me to start understanding this is to take the big Witt ring of a ring $R$, which doesn't seem to involve number theory or primes - it's just the cofree lambda-ring on $R$ - and see how it's built from $p$-typical Witt rings, one for each prime.

view this post on Zulip John Baez (Oct 07 2024 at 17:38):

For years I've felt I don't have the intelligence to fully understand the big Witt ring - so I wished there were some sort of "half-Witt ring" to practice on. But it's gradually starting to make sense.

view this post on Zulip Morgan Rogers (he/him) (Oct 07 2024 at 18:11):

That sounds like fun! Someone suggested to me at some point that I should try and apply some of the stuff I've done with monoids to Lambda-rings, so I'll be keeping an eye on this topic.

view this post on Zulip John Baez (Oct 07 2024 at 18:34):

What could you do with lambda-rings, speculatively speaking?

view this post on Zulip John Baez (Oct 07 2024 at 18:38):

David Corfield said:

There was an exchange between you and James Borger back here which touched on Big Witt vectors. Jack Morava and David Ben-Zvi chip in and get to Adams operations. Maybe a conversation worth mining.

Yes! One difficulty I've had is connecting the 'big picture' ideas to the nitty-gritty computations with lambda-rings and the big Witt ring. So, I've been digging into the nitty-gritty and now maybe I'll be better able to connect it to the big picture.

view this post on Zulip John Baez (Oct 07 2024 at 18:39):

There's a nice connection between Adams operations and Frobenius operators which is starting to make sense to me.

view this post on Zulip Josselin Poiret (Oct 08 2024 at 08:27):

I haven't thought about this in a bit, but I remember that the explanation on the nLab was very useful for understanding why lambda-rings are interesting.

view this post on Zulip Morgan Rogers (he/him) (Oct 08 2024 at 16:20):

John Baez said:

What could you do with lambda-rings, speculatively speaking?

It's been just long enough that I can't remember; the general idea was to treat them as internal somethings in a topos of suitably chosen monoid actions, but that much is not exactly profound.

view this post on Zulip John Baez (Oct 08 2024 at 18:59):

Josselin Poiret said:

the explanation on the nLab was very useful for understanding why lambda-rings are interesting

Yes, thanks! It seems to have improved since I last read it, or maybe I just know more now. This is about what Borger calls the heterodox interpretation of a lambda-ring as a ring with a family of commuting Frobenius lifts, one for each prime.

view this post on Zulip John Baez (Oct 08 2024 at 19:22):

Equivalently it can be seen as a ring with commuting [[p-derivations]] - a generalization of the concept of derivation. This allows us to develop a number-theoretic generalization of the concept of Taylor series, building on the known analogy between the ring of formal power series $\mathbb{Z}[[x]]$ and the ring of $p$-adic integers, which gives a geometrical interpretation of localizing at a prime. If we go down this fascinating road, it's good to treat a lambda-ring as a [[Joyal delta-ring]].

view this post on Zulip John Baez (Oct 08 2024 at 19:25):

But I understand a lot more about what Borger calls the orthodox interpretation of lambda-rings: a ring equipped with $\lambda$-operations. The idea here is that if we have a category with well-behaved "exterior power" operations, like a category of vector bundles or group representations, these will endow its Grothendieck ring with $\lambda$-operations making it into a lambda-ring.

view this post on Zulip John Baez (Oct 08 2024 at 22:23):

I find this to be a great intro to the 'heterodox' interpretation of lambda-rings and the big Witt ring:

Ignore the scary title and read the nice review material!

view this post on Zulip Todd Trimble (Oct 12 2024 at 04:07):

Let me get back to you later.

view this post on Zulip John Baez (Oct 12 2024 at 05:01):

Todd and I were talking about how multiplication in the big Witt ring $W(R)$ of a commutative ring $R$ arises from comultiplication in the biring of symmetric functions $\Lambda$, via the formula

W(R) = \mathsf{CRing}(\Lambda, R)

Recall that as a commutative ring,

\Lambda = \mathbb{Z}[\lambda^1, \lambda^2, \dots]

which we could conceivably write as

\Lambda = \mathbb{Z}[\lambda^1(x), \lambda^2(x), \dots]

to emphasize that $\Lambda$ is the free $\lambda$-ring on one generator $x$. In fact Todd uses such a notation for the free $\lambda$-ring on two generators $x$ and $y$, which is $\Lambda \otimes \Lambda$.

view this post on Zulip John Baez (Oct 12 2024 at 05:06):

Since comultiplication

\mu : \Lambda \to \Lambda \otimes \Lambda

is a ring homomorphism, it will be determined once we know what it does to each of the generators $\lambda^i$. I think Todd showed me how it works for $\lambda^1$ and I asked how it worked for $\lambda^2$, or something like that. Here was his reply - we've decided to talk about this here.

view this post on Zulip John Baez (Oct 12 2024 at 05:07):

You asked what the comultiplication

\mu : \Lambda \to \Lambda \otimes \Lambda

applied to $\lambda^2$ looks like. Short answer:

(\mu(\lambda^2))(x, y) = (\lambda^1(x))^2 \lambda^2(y) + \lambda^2(x)(\lambda^1(y))^2 - 2\lambda^2(x)\lambda^2(y)

where we think of $\Lambda \otimes \Lambda$ as the free lambda-ring on two generators $x, y$. This can also be written more nicely as $\sigma^2(x) \lambda^2(y) + \lambda^2(x) \sigma^2(y)$.

Longer answer: use the splitting principle, which guarantees that the 2-rig map $\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes n}$, sending the generator $x$ to a sum $x_1 \oplus \ldots \oplus x_n$ of $n$ independent bosonic subline objects, is an extension when restricted to polynomial functors of degree $\leq n$. Since $\lambda^2$ is degree $2$, this means in effect that we can pretend the generator $x$ of $\overline{k\mathsf{S}}$ is a sum $x_1 + x_2$ of two bosonic sublines. Then the 2-rig comultiplication $\overline{k\mathsf{S}} \to \overline{k\mathsf{S}} \boxtimes \overline{k\mathsf{S}}$, taking $x$ to $x \boxtimes y$ per our first paper, induces the map of lambda-rings $\Lambda \to \Lambda \otimes \Lambda$ that takes $x = x_1 + x_2$ to $x \otimes y = (x_1 + x_2)(y_1 + y_2) = x_1y_1 + x_1y_2 + x_2y_1 + x_2y_2$. Since this lambda-ring map preserves the $\lambda^2$ operation, we calculate

\mu(\lambda^2 x) = \lambda^2(\mu x) = \lambda^2(x_1y_1 + x_1y_2 + x_2y_1 + x_2y_2)

and use the exponential law for $\lambda^2$ plus vanishing of $\lambda^2(x_i y_j)$ to write this out long-hand. Sparing some gory details, this gives the answer in the short identity.
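For the record, here is how those gory details can go (my reconstruction - using the exponential law $\lambda^2(u + v) = \lambda^2(u) + \lambda^1(u)\lambda^1(v) + \lambda^2(v)$, the vanishing $\lambda^2(x_i y_j) = 0$, $\lambda^1 = \mathrm{id}$, and $\lambda^1(x) = x_1 + x_2$, $\lambda^2(x) = x_1 x_2$, likewise for $y$):

\lambda^2\Big(\sum_{i,j} x_i y_j\Big) = \sum_{\text{unordered pairs of distinct terms}} (x_i y_j)(x_k y_l)

= (x_1^2 + x_2^2)\, y_1 y_2 + x_1 x_2\, (y_1^2 + y_2^2) + 2\, x_1 x_2\, y_1 y_2

= \big(\lambda^1(x)^2 - 2\lambda^2(x)\big)\lambda^2(y) + \lambda^2(x)\big(\lambda^1(y)^2 - 2\lambda^2(y)\big) + 2\lambda^2(x)\lambda^2(y)

= \lambda^1(x)^2 \lambda^2(y) + \lambda^2(x)\lambda^1(y)^2 - 2\lambda^2(x)\lambda^2(y)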

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:35):

I wonder how much of this needs to be explained further?

To my taste, the symbols $\lambda^i$ and $\sigma^i$ refer primarily to certain functors, and secondarily to symmetric functions; ultimately these two points of view are united via the all-powerful splitting principle. More precisely, the ring $\Lambda$ is the Grothendieck group or ring based on isomorphism classes of Schur functors, which are essentially those endofunctors on the category of vector spaces (let's say over $\mathbb{C}$) that you can build up using $\otimes$ (together with its symmetric monoidal structure), coproducts, and splitting of idempotents. For example, $V \mapsto V \otimes V$ is a Schur functor. If $\sigma: V \otimes V \to V \otimes V$ denotes the symmetry that swaps tensor factors, then the endomorphisms

\frac{1 + \sigma}{2} : V \otimes V \to V \otimes V, \qquad \frac{1 - \sigma}{2} : V \otimes V \to V \otimes V

are idempotent maps, and we can split these idempotents to obtain the symmetric square $S^2(V)$ and exterior square $\Lambda^2(V)$. Likewise, the symmetric power and exterior power functors, $V \mapsto S^n(V)$ and $V \mapsto \Lambda^n(V)$, are Schur functors. Tensor products and coproducts of Schur functors are again Schur functors, and Schur functors are closed under composition.
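As a concrete numerical illustration (mine, with an arbitrary choice of dimension $d$; all the names are made up), one can build the swap map $\sigma$ on $V \otimes V$ as a permutation matrix and check that the two idempotents split off subspaces of the expected dimensions $d(d+1)/2$ and $d(d-1)/2$:

```python
import numpy as np

d = 3                                    # dim V; any small d works
I = np.eye(d * d)

# sigma: V (x) V -> V (x) V, the symmetry e_i (x) e_j |-> e_j (x) e_i,
# as a permutation matrix in the basis e_i (x) e_j <-> index i*d + j.
sigma = np.zeros((d * d, d * d))
for i in range(d):
    for j in range(d):
        sigma[j * d + i, i * d + j] = 1

sym = (I + sigma) / 2                    # idempotent whose image is S^2(V)
alt = (I - sigma) / 2                    # idempotent whose image is Λ^2(V)

assert np.allclose(sym @ sym, sym) and np.allclose(alt @ alt, alt)
# the rank of an idempotent equals its trace:
assert round(np.trace(sym)) == d * (d + 1) // 2
assert round(np.trace(alt)) == d * (d - 1) // 2
```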

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:35):

The symbols $\lambda^i$ and $\sigma^i$ can be read as the isomorphism classes $[\Lambda^i]$ and $[S^i]$, regarded as elements of the ring $\Lambda$ (whose addition is induced from taking coproducts, and whose multiplication is induced from taking tensor products). It turns out -- and this is by no means trivial -- that as a ring, $\Lambda$ is isomorphic to the polynomial algebra $\mathbb{Z}[\lambda^1, \lambda^2, \ldots]$. It is also the polynomial algebra $\mathbb{Z}[\sigma^1, \sigma^2, \ldots]$: both sets $\{\lambda^i\}$ and $\{\sigma^i\}$ serve as polynomial bases. There are other famous bases as well, which I won't mention right now, but you can read about them in famous texts such as Representation Theory by Fulton and Harris, and Symmetric Functions and Hall Polynomials by Macdonald.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:36):

But there is so much more to $\Lambda$! People go gaga over the richness of its structure and its interplay with the rest of mathematics. I'll try to indicate some main features of this structure by first pointing to similar features of a far simpler structure, namely the polynomial algebra $\mathbb{Z}[x]$, which represents the forgetful functor

U : \mathsf{CRing} \to \mathsf{Set}

in the sense of a natural isomorphism $U \cong \mathsf{CRing}(\mathbb{Z}[x], -)$. Now, we can remind the forgetful functor of its (tautological) ring object structure, by pointing at natural transformations $a: U \times U \to U$ (whose components are the addition functions $U(R) \times U(R) \to U(R)$) and $m: U \times U \to U$ (multiplication). At the level of the representing object $\mathbb{Z}[x]$, these transformations are induced by ring homomorphisms

\alpha : \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x], \qquad \mu : \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x]

Here we should pause to note that $A \otimes B$ is the coproduct of commutative rings $A, B$, which means, by the universal property of coproducts, that $\mathsf{CRing}(A \otimes B, -) \cong \mathsf{CRing}(A, -) \times \mathsf{CRing}(B, -)$. Thus $\mathbb{Z}[x] \otimes \mathbb{Z}[x]$ is the representing object of $U \times U$.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:36):

In any category with coproducts, we can define a notion of co-ring object, dual to the notion of a ring object in a category with products. So what we can say is that according to the above, $\mathbb{Z}[x]$ is a co-ring object in the category of rings. We call it a biring. I'll leave it as a semi-advanced exercise in applying the Yoneda lemma to figure out explicit formulas for the co-ring structure on $\mathbb{Z}[x]$ that we are talking about here.

Stepping back a little: any time a representable functor $\mathsf{CRing}(A, -): \mathsf{CRing} \to \mathsf{Set}$ has ring object structure -- or in other words lifts up through the forgetful functor $U: \mathsf{CRing} \to \mathsf{Set}$ to give an endofunctor $G: \mathsf{CRing} \to \mathsf{CRing}$ -- the representing object $A$ becomes a biring. It can be shown that such lifts of representable functors are necessarily limit-preserving, or better yet they are right adjoints. Thus right adjoint endofunctors on $\mathsf{CRing}$ are equivalent to biring structures. The case of the biring $\mathbb{Z}[x]$ corresponds to the identity endofunctor on $\mathsf{CRing}$ (which of course is a right adjoint).

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:36):

But wait, there's more! In our example, $\mathbb{Z}[x]$ carries another binary operation $\mathbb{Z}[x] \times \mathbb{Z}[x] \to \mathbb{Z}[x]$, namely polynomial composition $(p, q) \mapsto p \circ q$ (replacing the $x$ in $p(x)$ by $q(x)$). If you'll allow me to abuse language and write the identity functor as

\mathsf{CRing}(\mathbb{Z}[x], -) : \mathsf{CRing} \to \mathsf{CRing}

for the biring $\mathbb{Z}[x]$ sitting in the contravariant slot, then the polynomial composition $\mathbb{Z}[x] \times \mathbb{Z}[x] \to \mathbb{Z}[x]$ can be read as contravariantly inducing a transformation in the other direction,

\mathsf{CRing}(\mathbb{Z}[x], -) \to \mathsf{CRing}(\mathbb{Z}[x], \mathsf{CRing}(\mathbb{Z}[x], -))

(these manipulations might seem a tad puzzling, and indeed it takes some fancy footwork to put it just right, but I'm going to skip over that -- you can read Schur Functors and Categorified Plethysm for details). I'll just say that the polynomial composition on $\mathbb{Z}[x]$ corresponds to the identity transformation $\mathrm{Id} \to \mathrm{Id} \circ \mathrm{Id}$ that is the comultiplication for the tautological comonad structure on $\mathrm{Id}$.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:37):

More generally, if $A$ is a biring, and if there is a comonad structure on the lifted endofunctor $\mathsf{CRing}(A, -): \mathsf{CRing} \to \mathsf{CRing}$, then the comultiplication transfers over to an operation (a function) $A \times A \to A$ that behaves similarly to polynomial composition. This operation is called plethysm, and a biring equipped with a plethysm operation is called a plethory. So in summary, giving a plethory is equivalent to giving a right adjoint comonad on $\mathsf{CRing}$; the plethory is the representing object for that comonad.

So now I can tell you that the main structural feature of $\Lambda$ is that it carries a plethory structure. A rather complicated plethory structure that is still very far away from being fully understood.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:37):

Where does that plethory structure come from? Well, John mentioned at the top of this thread that the forgetful functor $U$ from the category of lambda-rings to the category of commutative rings has both a left adjoint $F$ (expected for abstract nonsense reasons) and a right adjoint $G$ (a much more specialized circumstance). So $G$ and also $U$ are right adjoints, hence the comonad $G \circ U$ is also a right adjoint. The plethory $\Lambda$ is the representing object for that right adjoint comonad.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:38):

This likely still sounds very mysterious, because for example, what are lambda-rings? Instead of saying directly what they are, I'll say in a different way where the comonad $GU$ comes from. In our paper Schur Functors and Categorified Plethysm, we explain that it is the result of decategorifying a 2-comonad that is much easier (conceptually) to understand. Namely, what we do is categorify the story I was telling, about how $\mathbb{Z}[x]$ with its plethory structure represents the identity comonad on commutative rings. In this particular categorification, commutative rings are replaced by 2-rigs, which are symmetric monoidal $\mathsf{Vect}$-enriched categories which have coproducts and idempotent splittings. The forgetful functor $\mathsf{CRing} \to \mathsf{Set}$ is replaced by the forgetful (2-)functor $\textbf{2Rig} \to \mathsf{Cat}$. This forgetful functor is representable, by a category (rather, 2-rig) of abstract Schur functors, so this category $\mathrm{Schur}$ is the replacement of $\mathbb{Z}[x]$ at the categorified level, and it is a 2-birig that represents the identity functor on the 2-category of 2-rigs. It carries a 2-plethory structure, corresponding to the tautological 2-comonad structure on this identity functor.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:38):

The air up here on this categorified mountaintop is clean and clear; the view is simple and beautiful. But then we descend, from $\mathrm{Schur}$ down to its set of isomorphism classes, or really I mean its Grothendieck ring $\Lambda$. That descent process is called 'decategorification'. And it's a little tricky. It took us months of study to make sure of our footing and the right path down to the valley. After all, we start with the "easiest" 2-comonad in the world, the identity functor on 2-rigs, and somehow this decategorifies down to an extremely non-trivial comonad on commutative rings, namely this right adjoint comonad $GU$ I mentioned. Then we can (and do) define lambda-rings to be the coalgebras of this comonad.

Okay, I've just given a thumbnail sketch of our first paper.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:38):

To return to the topic, though: John quoted my reply to a question he asked about a specific calculation, about the biring $\Lambda$. As I said, $\Lambda$, even just as a biring let alone a plethory, is a pretty complicated beast, and I believe not completely grasped in terms of giving explicit formulas for the comultiplication (co-addition is rather easier to deal with). But some small calculations you can do by hand, and I was describing to him how to go about calculating what the comultiplication $\mu: \Lambda \to \Lambda \otimes \Lambda$ does to the element $\lambda^2$, by exploiting the subject of our second paper, the splitting principle [set in the context of 2-rigs].

All this is related to the second and third posts at the top of this thread, where John is again in search of ways to wrap one's head around the comultiplication, and he was quoting some stuff he saw in a paper by Niranjan Ramachandran which gives some hints. I can come around to discussing that as well, but in order for others to be able to follow along, I thought it would help to give some background, hence this string of ear-bending comments.

view this post on Zulip John Baez (Oct 12 2024 at 17:49):

Thanks, Todd! I was distracted last night and didn't quite know the best way to kick off the conversation.

view this post on Zulip John Baez (Oct 12 2024 at 17:55):

I just noticed a small notational point that might puzzle and perturb novices:

Todd Trimble said:

To my taste, the symbols $\lambda^i$ and $\sigma^i$ refer primarily to certain functors, and secondarily to symmetric functions; ultimately these two points of view are united via the all-powerful splitting principle. [...] If $\sigma: V \otimes V \to V \otimes V$ denotes the symmetry that swaps tensor factors, then the endomorphisms

\frac{1 + \sigma}{2} : V \otimes V \to V \otimes V, \qquad \frac{1 - \sigma}{2} : V \otimes V \to V \otimes V

are idempotent maps, and we can split these idempotents to obtain the symmetric square $\Sigma^2(V)$ and exterior square $\Lambda^2(V)$. Likewise, the symmetric power and exterior power functors, $V \mapsto S^n(V)$ and $V \mapsto \Lambda^n(V)$, are Schur functors.

The point is that the symmetric functions $\sigma^i$ are not powers of the "switch" map $\sigma: V \otimes V \to V \otimes V$ - instead there is a map sending Schur functors to symmetric functions, and the $\sigma^i$ are the symmetric functions corresponding to the "$i$th symmetrized tensor power" functors $V \mapsto S^i(V)$.

In the study of the symmetric group, people like lots of things named $S$, or $\mathsf{S}$, or $\Sigma$, or $\sigma$, and sometimes we get carried away and the notations conflict!

view this post on Zulip Todd Trimble (Oct 12 2024 at 17:56):

Yeah, I edited to change notation in that comment from $\Sigma^2(V)$ (which is one tradition) to $S^2(V)$, which we seem to favor.

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:00):

Also, I did not explain how Schur functors give rise to symmetric functions. I wouldn't think it was completely common knowledge, how that goes. But I mentioned that the connection arises through this splitting principle that keeps coming up.

view this post on Zulip John Baez (Oct 12 2024 at 18:03):

I also have another slightly less picayune comment. If we went ahead and computed $(\mu(\lambda^n))(x, y)$, this formula:

(\mu(\lambda^2))(x, y) = (\lambda^1(x))^2 \lambda^2(y) + \lambda^2(x)(\lambda^1(y))^2 - 2\lambda^2(x)\lambda^2(y)

would become the 2nd term in a sequence of similar expressions: polynomials in the $\lambda^i$ where the total degree of the $n$th polynomial is $2n$, it seems, if we count $\lambda^i$ as having degree $i$.

Are these polynomials we should recognize? Are they famous?

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:12):

Mm, I imagine they're quite famous, and also pretty intensively studied, but my impression is that we ("we" being the community of mathematicians, including the experts) don't know them all yet. I wish I knew what people called them by name. Atiyah and Tall give the lackluster notation $P_n$ to these polynomials (page 258), and I think that notation might be pretty common. But your posts at the top of the thread are all about these polynomials!

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:32):

So for example, the Encyclopedia of Mathematics uses the same notation $P_n$.

I've been scanning Hazewinkel for notation, and I guess it's true that the formulas can be made much more explicit if you use a different polynomial basis, like the power sums basis. He gives a number of such formulas around page 46. So maybe I'll eat my words a little, but anyway I don't know if the explicit list of polynomials purely in terms of the $\lambda^i$ is completely known. Maybe someone else can say.

(I suppose I should know Hazewinkel's article a lot better than I do, because there's evidently a lot of great stuff in it. I'm a little put off by his crapola typesetting job, but I guess that'll be more on me than on him.)
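If I recall the standard statement correctly (so treat this as a claim to check against Hazewinkel rather than something I'm deriving here), the power sum basis $p_n$ is exactly where both comultiplications become as simple as possible:

\alpha(p_n) = p_n \otimes 1 + 1 \otimes p_n, \qquad \mu(p_n) = p_n \otimes p_n

which is one precise sense in which the $P_n$ are only cryptic in the $\lambda^i$ coordinates; the price is that converting back from the $p_n$ to the $\lambda^i$ introduces denominators.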

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:36):

And now that I'm looking at Hazewinkel further, I see that he brings up quasisymmetric functions in section 11, which are supposed to be really important in this biz. Joachim Kock gave an interesting talk about quasisymmetric functions at the CT 2024 conference. I should be looking more into these things.

view this post on Zulip John Baez (Oct 12 2024 at 18:40):

Do you have any hint as to why quasisymmetric functions are important? I don't know any conceptual explanation of them, so I sometimes have cynically wondered if it's a case of "what are you going to do when you're an expert on symmetric functions and you run out of good ideas? Invent quasisymmetric functions and generalize all the theorems to those!" It's probably not true.

(Lusztig once said "Some people like to take theorems about groups and generalize them to quantum groups. I like to find theorems about quantum groups that aren't like theorems about groups." He came up with some amazing results....)

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:49):

Eh, I don't yet. But Hazewinkel says at the beginning of section 11, "When looking at various universality properties of the Witt vectors and Symm (which is the topic of the next section) one rapidly stumbles over a (maximally) non commutative version, NSymm, and a (maximally) non cocommutative version, QSymm. This section is devoted to a brief discussion of these two objects. Somehow a good many things become easier to see and to formulate in these contexts (including certain explicit calculations). As I have said before, e.g. in [200], p. 56; [199], Ch. H1, p. 1, once one has found the right non commutative version, things frequently become more transparent, easier to understand, and much more elegant."

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:54):

There's a certain amount of hardcore algebraic combinatorics to all this. Another buzzphrase that seems relevant and important to me here is this so-called Cauchy identity; see around pages 457-458 of Fulton and Harris. (Perhaps I'm jotting this down as a reminder just to myself to come back to it -- it would be very boring to the casual onlooker.)

view this post on Zulip John Baez (Oct 12 2024 at 21:15):

Thanks - I'll try to find out if anyone has a name for these polynomials $P_n$. That could unlock a lot of wisdom - or at least piles of cryptic and unmotivated identities that we might find conceptual explanations for. :upside_down:

view this post on Zulip John Baez (Oct 12 2024 at 21:16):

For other people listening in, let me try to give a conceptual explanation of these polynomials based on algebraic topology. Todd already knows this, at least implicitly, but I feel like saying it.

view this post on Zulip John Baez (Oct 12 2024 at 21:19):

The operations of direct sum and tensor product can be applied to matrices, and in particular to unitary matrices, so if $U(n)$ is the group of $n \times n$ unitary matrices then we get Lie group homomorphisms

\oplus : U(m) \times U(n) \to U(m + n)

\otimes : U(m) \times U(n) \to U(m \times n)

We can work with all $m, n$ simultaneously if we use the obvious inclusions

U(n) \to U(n+1)

to define the colimit of topological (or even 'smooth') groups

U = \lim_{\longrightarrow} U(n)

view this post on Zulip John Baez (Oct 12 2024 at 21:22):

Then we get binary operations which are group homomorphisms

\oplus : U \times U \to U

\otimes : U \times U \to U

in addition to the group operation, which is another binary operation. (By the way, I believe books on K-theory use an Eckmann-Hiltonesque argument to show that $\oplus$ is homotopy equivalent to the group operation, and even better.)

view this post on Zulip John Baez (Oct 12 2024 at 21:24):

These maps induce maps on the classifying space for stable complex vector bundles, $BU$:

\oplus : BU \times BU \to BU

\otimes : BU \times BU \to BU

view this post on Zulip John Baez (Oct 12 2024 at 21:25):

and thus we get maps on K-theory going backward, which we can call coaddition and comultiplication:

\alpha : K(BU) \to K(BU) \otimes K(BU)

\mu : K(BU) \to K(BU) \otimes K(BU)

view this post on Zulip John Baez (Oct 12 2024 at 21:32):

Since $K$ of any space also has a ring structure, these wind up making $K(BU)$ into a 'biring'. But this biring is just our friend the free $\lambda$-ring on one generator, which @Todd Trimble has been explaining. This is called $\Lambda$. As commutative rings we have

K(BU) \cong \Lambda \cong \mathbb{Z}[\lambda^1, \lambda^2, \dots]

Here $\lambda^i$, thought of as a symmetric function, is the $i$th elementary symmetric polynomial. But

K(BU) \cong H^{\text{even}}(BU, \mathbb{Z})

and thought of as an element of the integral cohomology of $BU$, $\lambda^i$ is called the $i$th Chern class. It's a cohomology class of degree $2i$.

view this post on Zulip John Baez (Oct 12 2024 at 21:49):

So now let's think about what comultiplication

\mu : K(BU) \to K(BU) \otimes K(BU)

does to $\lambda^i$, in these terms!

I'll use the fact I hinted at: for a paracompact Hausdorff space $X$, the set of homotopy classes $[X, BU]$ is isomorphic to the set of stable complex vector bundles over $X$: that is, equivalence classes of vector bundles over $X$, where we count two as equivalent if they become isomorphic after summing with the same complex vector bundle.

Using this and the definition of K-theory we get

K(X) \cong [X, BU]

After all, $K(X)$ is defined to be the set of stable complex vector bundles over $X$, made into a commutative ring using $\oplus$ and $\otimes$.

view this post on Zulip John Baez (Oct 12 2024 at 21:57):

Thus, $K(BU) \cong [BU, BU]$ is a somewhat self-referential entity: it's the commutative ring of stable vector bundles on $BU$. It's a ring because of the operations $\oplus, \otimes : BU \times BU \to BU$ acting on the covariant argument in $[BU, BU]$, and a coring because of these operations acting on the contravariant argument in $[BU, BU]$.

view this post on Zulip John Baez (Oct 12 2024 at 21:59):

I'm sort of meandering, but from all this we get yet another interpretation of the elements

\lambda^i \in \mathbb{Z}[\lambda^1, \lambda^2, \dots] \cong \Lambda \cong K(BU)

Namely, they must come from stable vector bundles on $BU$!

view this post on Zulip John Baez (Oct 12 2024 at 22:01):

I should describe these stable vector bundles, but I won't now. Instead, I just want to say what the map

\mu : K(BU) \to K(BU) \otimes K(BU) \cong K(BU \times BU)

does to each element $\lambda^i$.

view this post on Zulip John Baez (Oct 12 2024 at 22:09):

It works like this: we can take any stable vector bundle on $BU$ and pull it back along the tensor product map

\otimes : BU \times BU \to BU

and get a stable vector bundle on $BU \times BU$. This induces a map

K(BU) \to K(BU \times BU) \cong K(BU) \otimes K(BU)

and this is just our friend the comultiplication $\mu$.

So what does $\mu(\lambda^i)$ mean? We start with the stable vector bundle $\lambda^i$ on $BU$, and pull it back along $\otimes: BU \times BU \to BU$. I believe every stable vector bundle on $BU \times BU$ is an integral linear combination of tensor products of stable vector bundles $\lambda^j \boxtimes \lambda^k$, where $\boxtimes$ is the 'external' tensor product of stable vector bundles: if you've got one on some space $X$ and one on some space $Y$, you can tensor them and get one on $X \times Y$.

view this post on Zulip John Baez (Oct 12 2024 at 22:13):

If so, we should be able to take $\lambda^i$, pull it back along $\otimes$, and write the result $\mu(\lambda^i)$ in terms of the $\lambda^j \boxtimes \lambda^k$. And I believe Todd's calculation is an example of this. He wrote

(\mu(\lambda^2))(x, y) = (\lambda^1(x))^2 \lambda^2(y) + \lambda^2(x)(\lambda^1(y))^2 - 2\lambda^2(x)\lambda^2(y)

but in my current notation I believe this means

\mu(\lambda^2) = (\lambda^1)^{\otimes 2} \boxtimes \lambda^2 + \lambda^2 \boxtimes (\lambda^1)^{\otimes 2} - 2\lambda^2 \boxtimes \lambda^2

view this post on Zulip John Baez (Oct 12 2024 at 22:17):

The first term here is not manifestly a tensor product of stable vector bundles $\lambda^j \boxtimes \lambda^k$, but it actually is: it's

(\lambda^1 \boxtimes \lambda^2) \otimes (\lambda^1 \boxtimes 1)

where $1$ is the trivial line bundle (the identity in the ring $K(BU)$).

view this post on Zulip John Baez (Oct 12 2024 at 22:19):

So, in short, the polynomial people call $P_i$ answers this question:

Given a stable vector bundle $\lambda^i$ on $BU$, and pulling it back along $\otimes : BU \times BU \to BU$, how can we express it in terms of the stable vector bundles $\lambda^j \boxtimes \lambda^k$?

view this post on Zulip John Baez (Oct 12 2024 at 22:19):

We could also frame this in terms of Chern classes.

All this 'fluffy' stuff doesn't help us compute the polynomials $P_i$ - for that, Todd already showed one way. It simply says why we should care about these polynomials.

view this post on Zulip John Baez (Oct 16 2024 at 01:24):

@Todd Trimble had written to me some more about the big Witt ring, in which he starts analyzing some formulas from here:

In particular, this paper discusses a formula for multiplication in the big Witt ring which I mentioned earlier:

John Baez said:

3) Since one can show the underlying commutative ring of the free lambda-ring on one generator is

\Lambda = \mathbb{Z}[\lambda_1, \lambda_2, \dots]

we have an isomorphism of sets

W(R) \cong 1 + t R[[t]] \subset R[[t]]

where a homomorphism

f : \mathbb{Z}[\lambda_1, \lambda_2, \dots] \to R

is mapped to

1 + \sum_{n = 1}^\infty f(\lambda_n) t^n

Then the challenge is to describe the addition and multiplication on $W(R)$ in these terms. People often do this with explicit formulas, which I tend to find cryptic. The addition in $W(R)$ corresponds to the multiplication in $1 + t R[[t]]$, which is why we use this description. The multiplication in $W(R)$ is more tricky. For any $a \in R$ we get an element of $W(R)$ called $(1 - a t)^{-1}$, defined by

(1 - a t)^{-1} = 1 + a t + a^2 t^2 + \cdots

and the multiplication $\cdot_W$ on $W(R)$ turns out to be determined once we know

(1 - a t)^{-1} \cdot_W (1 - b t)^{-1} = (1 - a b t)^{-1}

This formula turns out to be very useful but I don't have a good understanding of how it comes from 2).

view this post on Zulip John Baez (Oct 16 2024 at 01:26):

The first question is: why should we care about these elements $(1-at)^{-1}$? And the second is: what does the above formula for a product of them mean? And the third is: does it really determine the product on all of the big Witt ring $W(R)$?

Todd was making progress on all these questions.

view this post on Zulip Todd Trimble (Oct 16 2024 at 01:29):

Yes, sorry, I was going to say something about that! But let me collect my thoughts.

view this post on Zulip Todd Trimble (Oct 16 2024 at 01:32):

The paper of Ramachandran that John linked to mentioned that there are several reasonable choices for the (big) Witt ring multiplication. This has a lot to do with how there are various reasonable choices for a nice polynomial basis of $\Lambda$.

view this post on Zulip Todd Trimble (Oct 16 2024 at 01:36):

So going back to the message at the top of the thread: one conceptual way to define $W(R)$ is that it is the hom-set $\mathsf{CRing}(\Lambda, R)$. Thanks to the rich plethory structure on $\Lambda$ that I was sketching earlier, the hom-set acquires a commutative ring structure and even a lambda-ring structure, and indeed furnishes the right adjoint to the forgetful functor from lambda-rings to commutative rings, as John mentioned earlier.

view this post on Zulip Todd Trimble (Oct 16 2024 at 01:42):

Now it seems that a lot of sources introduce $W(R)$ as consisting of formal power series with constant coefficient $1$, i.e., elements in $1 + tR[[t]]$. So there is an isomorphism $\mathsf{CRing}(\Lambda, R) \to 1 + tR[[t]]$, and John showed how this might go: using the polynomial basis $\lambda^i$, we can define this isomorphism as sending $f: \Lambda \to R$ to $1 + \sum_{i \geq 1} f(\lambda^i) t^i$.

Another possibility is to use the polynomial basis $\sigma^i$, and define the isomorphism so as to send $f$ to $1 + \sum_{i \geq 1} f(\sigma^i) t^i$.
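(These two isomorphisms determine each other, by the way: I believe the classical identity $\sum_{i+j=n} (-1)^i \lambda^i \sigma^j = 0$ for $n \geq 1$, applied along a homomorphism $f: \Lambda \to R$, says exactly that

\Big(\sum_{j \geq 0} f(\sigma^j)\, t^j\Big) \Big(\sum_{i \geq 0} (-1)^i f(\lambda^i)\, t^i\Big) = 1

so the $\sigma$-version of the power series attached to $f$ is the reciprocal of the $\lambda$-version evaluated at $-t$.)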

view this post on Zulip Todd Trimble (Oct 16 2024 at 01:48):

Each of these has its uses. But before tackling what any of this has to do with those formulas John mentioned, it might not be a bad exercise to look for a moment at how addition works in the Witt ring $1 + tR[[t]]$. (No, it is not ordinary addition of power series!) It turns out that the same formula will work whether you use the $\lambda^i$ basis or the $\sigma^i$ basis, and it's based on the co-addition on $\Lambda$ that was lightly alluded to. Maybe I'll pause a moment.

view this post on Zulip John Baez (Oct 16 2024 at 02:04):

This sounds like a good plan! Please pause all night long if you want... I'm about to have dinner, and it's 3 hours later for you.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:53):

In fact, I can take this opportunity to go a smidge deeper into our first paper. First, how does co-addition work, from first principles? I plan to be very methodical about this, which might make it look heavy in places -- I'll try to ameliorate that by surrounding some of the main conclusions by extra white space, so that readers can skip ahead to get the main points.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:53):

The plan is to see how co-addition works in the toy example of $\mathbb{Z}[x]$, and then categorify that. In discussion above I said that the explicit formula for co-addition $\alpha: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x]$ can be derived as a "semi-advanced exercise" in using the Yoneda lemma, so I'll start with that. I'll use the notation $[A, B]$ to denote hom-sets (usually hom-sets that acquire extra structure). The Yoneda lemma is about representable functors. Here we have $\phi: [\mathbb{Z}[x], -] \cong U$ where $U: \mathsf{CRing} \to \mathsf{Set}$ is the forgetful functor; evaluated at a ring $R$, the isomorphism takes $f: \mathbb{Z}[x] \to R$ to $f(x) \in U(R)$. Similarly we have $[\mathbb{Z}[x] \otimes \mathbb{Z}[x], -] \cong U \times U$, instantiated by

[\mathbb{Z}[x] \otimes \mathbb{Z}[x], -] \overset{(\pi_1, \pi_2)}{\longrightarrow} [\mathbb{Z}[x], -] \times [\mathbb{Z}[x], -] \overset{\phi \times \phi}{\longrightarrow} U \times U

where the first product projection $\pi_1$ is induced by the first coproduct coprojection $i_1: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x] : x \mapsto x \otimes 1$, and $\pi_2$ is induced by the second coproduct coprojection $i_2: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x] : x \mapsto 1 \otimes x$. Taking $R = \mathbb{Z}[x] \otimes \mathbb{Z}[x]$, chase the identity element $1_{\mathbb{Z}[x] \otimes \mathbb{Z}[x]}$ through the sequence

[\mathbb{Z}[x] \otimes \mathbb{Z}[x], R] \overset{(\pi_1, \pi_2)}{\longrightarrow} [\mathbb{Z}[x], R] \times [\mathbb{Z}[x], R] \overset{\phi \times \phi}{\longrightarrow} U(R) \times U(R) \overset{+_R}{\longrightarrow} U(R) \overset{\phi^{-1}}{\longrightarrow} [\mathbb{Z}[x], R],

à la the proof of the Yoneda lemma. We get

1_{\mathbb{Z}[x] \otimes \mathbb{Z}[x]} \mapsto (x \overset{i_1}{\mapsto} x \otimes 1,\ x \overset{i_2}{\mapsto} 1 \otimes x) \mapsto (x \otimes 1, 1 \otimes x) \overset{+}{\mapsto} x \otimes 1 + 1 \otimes x \mapsto (x \mapsto x \otimes 1 + 1 \otimes x).

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:54):

  \;

In other words, the co-addition $\alpha: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x]$ is the unique ring map taking $x$ to $x \otimes 1 + 1 \otimes x$.

  \;

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:54):

The same type of calculation shows that the comultiplication $\mu: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x]$ is the unique map taking $x$ to $(x \otimes 1) \cdot (1 \otimes x) = x \otimes x$.

(One could simply guess these formulas and check that they work, but I think it's nice to know how the Yoneda lemma removes any guesswork.)

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:55):

If one uses $\mathbb{Z}[x] \otimes \mathbb{Z}[x] \cong \mathbb{Z}[x, y]$, in effect identifying $x \otimes 1$ with $x \in \mathbb{Z}[x, y]$ and $1 \otimes x$ with $y \in \mathbb{Z}[x, y]$, then the co-addition becomes simply the ring map $\mathbb{Z}[x] \to \mathbb{Z}[x, y]$ taking $x$ to $x + y$, which makes everything look simple and obvious. The comultiplication takes $x$ to $xy$.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:55):

Moving in the opposite direction, suppose given a (commutative, cocommutative) biring $B$. The addition on $[B, R] = \mathsf{CRing}(B, R)$ is retrieved from the co-addition $\alpha: B \to B \otimes B$ as a composite

[B, R] \times [B, R] \overset{\sim}{\longrightarrow} [B \otimes B, R] \overset{[\alpha, 1_R]}{\longrightarrow} [B, R] \qquad (1)

where the isomorphism obtains by the universal property of $B \otimes B$ as a coproduct. To be explicit, this isomorphism takes a pair of homomorphisms $(f: B \to R,\ g: B \to R)$ to the composite

B \otimes B \overset{f \otimes g}{\longrightarrow} R \otimes R \overset{\nabla}{\longrightarrow} R

where the codiagonal $\nabla$ is precisely the multiplication $m: R \otimes R \to R$.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:56):

  \;

Hence the composite $(1)$ takes a pair of morphisms $(f, g)$ to the composite

B \overset{\alpha}{\to} B \otimes B \overset{f \otimes g}{\to} R \otimes R \overset{m}{\to} R \qquad (2)

and this defines $f + g$ in the ring $[B, R]$.

  \;

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:56):

Replacing the co-addition $\alpha$ by comultiplication $\mu$, the same construction as in $(2)$ produces the product $f \cdot g$ in $[B, R]$.
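To check this against the toy example: for $B = \mathbb{Z}[x]$, where $[\mathbb{Z}[x], R] \cong U(R)$ via $f \mapsto f(x)$, chasing $x$ through $(2)$ gives

(f + g)(x) = m((f \otimes g)(x \otimes 1 + 1 \otimes x)) = f(x) + g(x), \qquad (f \cdot g)(x) = m((f \otimes g)(x \otimes x)) = f(x)\, g(x)

so the recipe recovers ordinary addition and multiplication on $U(R)$, as it should for the identity comonad.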

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:56):

Now categorify all this. Replace $\mathbb{Z}[x]$, the free commutative ring on one generator, with the free 2-rig on one generator, which is the (additive) Cauchy completion of the $k$-linearization of the free symmetric monoidal category $\mathsf{S}$ on one generator. (To get anywhere interesting in categorifying commutative rings, you should add in some limits/colimits, and Cauchy completeness is a good place to start.) We write this as $\overline{k\mathsf{S}}$.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:57):

I think I mentioned before that this $\overline{k\mathsf{S}}$ is the representing 2-rig for the forgetful functor $\mathbf{2Rig} \to \mathsf{Cat}$, and on those grounds alone, through abstract nonsense, one can copy over (or categorify) the development above for $\mathbb{Z}[x]$, to derive a 2-birig structure on $\overline{k\mathsf{S}}$, with a categorified co-addition given by the unique (up to isomorphism) 2-rig map

\alpha : \overline{k\mathsf{S}} \to \overline{k\mathsf{S}(x, y)}

(where the codomain is the free 2-rig on two generators $x, y$) that sends the generator $x$ of $\overline{k\mathsf{S}}$ to the formal coproduct $x \oplus y$ in $\overline{k\mathsf{S}(x, y)}$.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:58):

It is interesting to watch what $\alpha$ does to objects like $S^n$ and $\Lambda^n$ in $\overline{k\mathsf{S}}$. I'll start with $S^n$, the $n$th symmetric power. For any 2-rig $\mathcal{R}$, there is a 2-rig $\mathcal{R}[\mathbb{N}]$ of graded $\mathcal{R}$-objects (whose symmetric monoidal tensor is given by Day convolution, coming from $\mathbb{N}$), and one way we can view the symmetric power $S^n(r) = r^{\otimes n}/S_n$ for an object $r$ of $\mathcal{R}$, which I sometimes like to write as $r^{\otimes n}/n!$, is that it is the $n$th homogeneous component of a symmetric algebra construction, which I will write as

\exp(r) = \sum_{n \geq 0} \frac{r^{\otimes n}}{n!} = \sum_{n \geq 0} S^n(r).

Here I'm thinking of the object $r$ as sitting in grade $1$ in $\mathcal{R}[\mathbb{N}]$, so that $S^n(r)$ sits in grade $n$. This $\exp(r)$ is the free commutative monoid on $r$, and the category of commutative monoid objects has $\otimes$ as its coproduct. This free functor gives a (partially defined) left adjoint [I say "partial" because if for example $r'$ is in degree $0$, then so would be $\exp(r')$, but maybe the 2-rig $\mathcal{R}$ we started with doesn't have infinite coproducts like $\sum_{n \geq 0} S^n(r)$ -- there's a better chance of success if the coproduct is spread across grades]. Since left adjoints preserve coproducts, we deduce an isomorphism

\exp(r \oplus s) \cong \exp(r) \otimes \exp(s).

view this post on Zulip Todd Trimble (Oct 17 2024 at 19:00):

  \;
Whereupon, focusing on one grade at a time, we further deduce

S^n(r \oplus s) \cong \sum_{j + k = n} S^j(r) \otimes S^k(s),

an identity which holds as a natural transformation between Schur functors of type $\mathcal{R} \to \mathcal{R}$. This holds in particular when $\mathcal{R} = \overline{k\mathsf{S}(x, y)}$.

  \;
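(A quick sanity check of this identity, not needed for the argument: for $\mathcal{R} = \mathsf{Vect}_k$ with $r = k^a$ and $s = k^b$, taking dimensions turns it into the convolution identity

\binom{a+b+n-1}{n} = \sum_{j + k = n} \binom{a+j-1}{j} \binom{b+k-1}{k}

which is the coefficientwise form of $(1-t)^{-a} (1-t)^{-b} = (1-t)^{-(a+b)}$.)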

view this post on Zulip Todd Trimble (Oct 17 2024 at 19:01):

Decategorifying this "identity" by taking isomorphism classes, i.e., by taking Grothendieck rings, this implies that the induced co-addition $\alpha: \Lambda \to \Lambda \otimes \Lambda$ satisfies

\alpha(\sigma^n) = \sum_{j + k = n} \sigma^j(x) \otimes \sigma^k(y)

where the right side is visibly a convolution product.

view this post on Zulip Todd Trimble (Oct 17 2024 at 19:02):

Consider now the induced addition on $[\Lambda, R]$ ($R$ a commutative ring), sending a pair $(f, g)$ of homomorphisms $\Lambda \to R$ to the homomorphism

\Lambda \overset{\alpha}{\longrightarrow} \Lambda \otimes \Lambda \overset{f \otimes g}{\longrightarrow} R \otimes R \overset{m}{\longrightarrow} R.

This composite takes $\sigma^n$ to $\sum_{j + k = n} f(\sigma^j) g(\sigma^k)$. In other words, Witt ring addition is defined by

(f +_W g)(\sigma^n) = \sum_{j + k = n} f(\sigma^j) g(\sigma^k)

and if we set up an isomorphism $[\Lambda, R] \overset{\sim}{\longrightarrow} 1 + tR[[t]]$ by $f \mapsto \sum_{n \geq 0} f(\sigma^n) t^n$, then the induced Witt ring addition on $1 + tR[[t]]$ is given by multiplying power series.
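Indeed, since $f(\sigma^0) = g(\sigma^0) = 1$, the check is one line:

\sum_{n \geq 0} (f +_W g)(\sigma^n)\, t^n = \sum_{n \geq 0} \Big( \sum_{j+k=n} f(\sigma^j)\, g(\sigma^k) \Big) t^n = \Big(\sum_{j \geq 0} f(\sigma^j)\, t^j\Big) \Big(\sum_{k \geq 0} g(\sigma^k)\, t^k\Big).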

view this post on Zulip Todd Trimble (Oct 17 2024 at 19:02):

It turns out the same is true if we use instead the isomorphism $[\Lambda, R] \overset{\sim}{\longrightarrow} 1 + tR[[t]]$ given by $f \mapsto \sum_{n \geq 0} f(\lambda^n) t^n$. At the beginning of that story, we have the identity

\Lambda^n(r \oplus s) \cong \sum_{j + k = n} \Lambda^j(r) \otimes \Lambda^k(s)

as a natural transformation between Schur functors on any 2-rig $\mathcal{R}$. To see this, we replace the 2-rig $\mathcal{R}[\mathbb{N}]$ of graded objects, with its "vanilla" symmetry $\sigma(u \otimes v) = v \otimes u$ for objects $u$ in grade $p$ and $v$ in grade $q$, with the more sophisticated symmetry that introduces a sign factor, $\sigma(u \otimes v) = (-1)^{pq}\, v \otimes u$. Otherwise, the entire story (symmetric algebra construction as free commutative monoid, etc.) remains the same, mutatis mutandis, hence we get this "exponential identity". Therefore, Witt addition on formal power series is again multiplication, even if we opt for this other identification with $1 + tR[[t]]$.

view this post on Zulip Todd Trimble (Oct 17 2024 at 19:08):

Now that I've fully explained how addition on the big Witt ring works, I can turn my attention to how multiplication works. Multiplication of $f, g: \Lambda \to R$ is defined by the composite

\Lambda \overset{\mu}{\longrightarrow} \Lambda \otimes \Lambda \overset{f \otimes g}{\longrightarrow} R \otimes R \overset{m}{\longrightarrow} R

where this time we have to figure out how comultiplication $\mu$ on $\Lambda$ works. It is, of course, gotten by decategorifying (taking isomorphism classes) the unique (up to isomorphism) 2-rig map

\overline{k\mathsf{S}} \to \overline{k\mathsf{S}(x, y)}

that takes the generator $x$ to the formal tensor product $x \otimes y$.

Working this out in detail will involve the "splitting principle" for 2-rigs, which John will be discussing at the upcoming Octoberfest meeting, but perhaps I'll pause to take a break from this longish series of comments.

view this post on Zulip John Baez (Oct 18 2024 at 18:06):

Todd Trimble said:

Therefore, Witt addition on [Λ,R][\Lambda, R] again corresponds to multiplication of formal power series, even if we use this second identification [Λ,R]1+tR[[t]][\Lambda, R] \cong 1 + tR[[t]].

That's great! Maybe someday we should write a little exposition of the big Witt ring. (Better than
a big exposition of the little Witt ring, explaining stuff like this. :upside_down:)

It might be fun to see what Witt addition looks like if we use the basis for Λ\Lambda given by power sum symmetric functions.

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:46):

Yes, all these seem like good suggestions!

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:46):

I'm going to push on with this series of posts; the next topic will be this splitting principle that I keep banging on about.

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:47):

For us, the splitting principle is the idea that to establish isomorphisms between Schur functors, it is permissible to pretend that they can be decomposed as coproducts of "line objects". It is analogous to the splitting principle in K-theory, where equations are verified by acting as if vector bundles split as coproducts of line bundles.

This sounds a little flaky perhaps, so I'll put it more precisely in a moment, but first I want to say that the situation reminds me of how those Italian mathematicians from the late Renaissance who developed the cubic formula -- Cardano, Tartaglia, etc. -- made the bold move to act as if i=1i = \sqrt{-1} were a thing. Even when all the roots of the cubic polynomial were real, they were arrived at by making use of imaginary elements. (True, they were fairly uncomfortable with the situation, and it took a few centuries before mathematicians felt generally at home with splitting extensions of fields.) The analogy is apt: the coefficients of a polynomial are symmetric functions of their roots, and the roots are analogous to the line objects we are about to discuss. Don't believe the analogy? That's okay. Humor me anyway by considering a linear transformation T:VVT: V \to V on a vector space. By passing to an extension of the scalar field if need be, i.e., adjoining roots, we can split VV into a coproduct of eigenspaces (which generically are lines), by splitting the characteristic polynomial into linear factors.

(At the risk of too much self-indulgence: this also reminds me of splitting light into a spectrum, where the energy levels of photons are given by eigenvalues of a suitable operator.)

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:47):

Okay, now let me state the splitting principle more precisely. For the purposes of this thread, define a line object in a 2-rig R\mathcal{R} to be a (nonzero) object rr satisfying Λ2(r)0\Lambda^2(r) \cong 0. (In our paper, we say instead "bosonic subline object".) Another way of saying it is that the canonical quotient rrS2(r)r \otimes r \to S^2(r) is an isomorphism, or equivalently that the symmetry σ:rrrr\sigma: r \otimes r \to r \otimes r that transposes factors equals the identity. Or equivalently still, that the tautological action of SnS_n on rnr^{\otimes n} is trivial for all nn. Finally, this last condition is equivalent to saying that for the symmetric powers SnS^n, we have Sn(r)rnS^n(r) \cong r^{\otimes n} for all nn.

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:48):

Just as kS\overline{k\mathsf{S}} with its generator xx is initial among 2-rigs equipped with an object, so kN\overline{k\mathbb{N}} (the linear Cauchy completion of the linearized discrete symmetric monoidal category N\mathbb{N}) is initial among 2-rigs equipped with a line object. Here kN\overline{k\mathbb{N}} is equivalent to the category of N\mathbb{N}-graded vector spaces of finite total dimension, which we in our paper denote as A\mathsf{A}, and the line object in this universal case is a 1-dimensional vector space concentrated in grade 11. Likewise, the walking 2-rig on nn line objects L1,,LnL_1, \ldots, L_n, denoted as An\mathsf{A}^{\boxtimes n}, is equivalent to the category of Nn\mathbb{N}^n-graded vector spaces of finite total dimension.

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:48):

Assume the ground field kk is of characteristic zero. Suppose given Schur objects F,G:SVectF, G: \mathsf{S} \to \mathsf{Vect}, also known as polynomial functors, say of degree nn or less; the latter means they are valued in finite-dimensional vector spaces, and that their restrictions to the symmetric groups SmS_m are zero for m>nm > n. Thus FF is given by (let's say right) linear representations F[j]F[j] of SjS_j for j=0,,nj = 0, \ldots, n. The polynomial or Schur functor itself takes a finite-dimensional vector space VV to F~(V):=j=0nF[j]SjVj\tilde{F}(V) := \sum_{j = 0}^n F[j] \otimes_{S_j} V^{\otimes j}. In fact this formula for the Schur functor F~\tilde{F} makes sense for any 2-rig, taking VV to be an arbitrary object in the 2-rig. In particular, it can be applied to V=L1LnV = L_1 \oplus \cdots \oplus L_n in An\mathsf{A}^{\boxtimes n}.
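As a sanity check on this formula (an illustrative sketch of my own, assuming Python; in characteristic zero the dimension of the S_j-coinvariants can be computed by averaging characters, with chi_{V^{(x)j}}(g) = d^{c(g)} for V = k^d, c(g) the number of cycles of g):

```python
# Sketch: dim(F[j] tensored over S_j with V^{(x)j}) for V = k^d, char 0,
# computed as (1/j!) * sum over g in S_j of chi_{F[j]}(g) * d^{c(g)}.
from itertools import permutations
from math import comb, factorial

def num_cycles(p):
    """Number of cycles of a permutation given in one-line notation."""
    seen, count = set(), 0
    for i in range(len(p)):
        if i not in seen:
            count += 1
            while i not in seen:
                seen.add(i)
                i = p[i]
    return count

d, j = 5, 3
perms = list(permutations(range(j)))
triv = sum(d ** num_cycles(p) for p in perms) // factorial(j)   # F[j] trivial
sgn = sum((-1) ** (j - num_cycles(p)) * d ** num_cycles(p)
          for p in perms) // factorial(j)                       # F[j] = sgn_j
assert triv == comb(d + j - 1, j)   # dim S^j(k^d)
assert sgn == comb(d, j)            # dim Lambda^j(k^d)
```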

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:49):

The splitting principle we use concerns properties of the 2-rig map kSAn\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes n} that takes FF to F~(L1Ln)\tilde{F}(L_1 \oplus \cdots \oplus L_n). In the form that we use it here, it states that the restriction of this 2-rig map to the subcategory of Schur objects or polynomial functors of degree at most nn is essentially injective, i.e., if

F~(L1Ln)G~(L1Ln)\tilde{F}(L_1 \oplus \cdots \oplus L_n) \cong \tilde{G}(L_1 \oplus \cdots \oplus L_n)

for F,GF, G of degree at most nn, then FGF \cong G. (Actually we prove more: that this restricted functor is faithful and conservative as well. But the essential injectivity property just stated is the one that is really key.)

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:49):

We can play with this a bit. For example, we can calculate

Λj(L1Ln)\Lambda^j(L_1 \oplus \cdots \oplus L_n)

(the Schur functor Λj\Lambda^j is F~\tilde{F} in the case where F=sgnjF = \mathrm{sgn}_j, the sign representation of SjS_j), by exploiting the exponential identity mentioned earlier:

Λj(L1Ln)j1++jn=jΛj1(L1)Λjn(Ln).\Lambda^j(L_1 \oplus \cdots \oplus L_n) \cong \bigoplus_{j_1 + \ldots + j_n = j} \Lambda^{j_1}(L_1) \otimes \cdots \otimes \Lambda^{j_n}(L_n).

For a line object LL, Λp(L)=0\Lambda^p(L) = 0 for all p>1p > 1, as is easily shown by induction (since Λp+1(r)\Lambda^{p+1}(r) is a retract of rΛp(r)r \otimes \Lambda^p(r) in any 2-rig). Thus the only summands that survive on the right in the last display line are the ones where all the indices j1,,jnj_1, \ldots, j_n are 00 or 11 (and add up to jj). Thus

Λj(L1Ln)1i1<<ijnLi1Lij.\Lambda^j(L_1 \oplus \cdots \oplus L_n) \cong \bigoplus_{1 \leq i_1 < \ldots < i_j \leq n} L_{i_1} \otimes \cdots \otimes L_{i_j}.

Letting xix_i denote the isomorphism class [Li][L_i], the Grothendieck ring of An\mathsf{A}^{\boxtimes n} is isomorphic to the polynomial ring Z[x1,,xn]\mathbb{Z}[x_1, \ldots, x_n], and the isomorphism class of the coproduct on the right is

1i1<<ijnxi1xi2xij\sum_{1 \leq i_1 < \ldots < i_j \leq n} x_{i_1} x_{i_2} \cdots x_{i_j}

which is precisely the jthj^{th} elementary symmetric polynomial ej(x1,,xn)e_j(x_1, \ldots, x_n) as defined by the identity

j=0nej(x1,,xn)tj=i=1n(1+xit).\sum_{j = 0}^n e_j(x_1, \ldots, x_n) t^j = \prod_{i=1}^n (1 + x_i t).
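Here is a quick machine check of this identity (a sketch assuming Python with sympy; illustrative only):

```python
# Sketch: the coefficient of t^j in prod_i (1 + x_i t) is e_j(x_1, ..., x_n).
from itertools import combinations
import sympy as sp

n = 4
t = sp.symbols('t')
xs = sp.symbols(f'x1:{n + 1}')   # x1, ..., x4

def e(j):
    """j-th elementary symmetric polynomial: sum over j-element subsets."""
    return sum(sp.Mul(*c) for c in combinations(xs, j))

P = sp.expand(sp.Mul(*[1 + x * t for x in xs]))
for j in range(n + 1):
    assert sp.expand(P.coeff(t, j) - e(j)) == 0
```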

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:50):

Similarly, the exponential identity for the symmetric powers yields

Sj(L1Ln)j1++jn=jSj1(L1)Sjn(Ln)S^j(L_1 \oplus \cdots \oplus L_n) \cong \bigoplus_{j_1 + \ldots + j_n = j} S^{j_1}(L_1) \otimes \cdots \otimes S^{j_n}(L_n)

and using the fact noted earlier that for line objects LL we have Sn(L)LnS^n(L) \cong L^{\otimes n}, this may be rewritten as

Sj(L1Ln)j1++jn=jL1j1Lnjn.S^j(L_1 \oplus \cdots \oplus L_n) \cong \bigoplus_{j_1 + \ldots + j_n = j} L_1^{\otimes j_1} \otimes \cdots \otimes L_n^{\otimes j_n}.

The isomorphism class of the expression on the right is

j1++jn=jx1j1xnjn\sum_{j_1 + \ldots + j_n = j} x_1^{j_1} \cdots x_n^{j_n}

which is precisely hj(x1,,xn)h_j(x_1, \ldots, x_n), the jthj^{th} complete homogeneous polynomial in nn variables, as defined by the identity

j=0hj(x1,,xn)tj=i=1n11xit.\sum_{j = 0}^\infty h_j(x_1, \ldots, x_n) t^j = \prod_{i=1}^n \frac1{1 - x_i t}.
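The same kind of check works for the hjh_j, truncating the power series (again a sketch of my own, assuming sympy):

```python
# Sketch: the coefficient of t^j in prod_i 1/(1 - x_i t) is h_j(x_1, ..., x_n).
from itertools import combinations_with_replacement
import sympy as sp

n, N = 3, 4
t = sp.symbols('t')
xs = sp.symbols(f'x1:{n + 1}')

def h(j):
    """j-th complete homogeneous polynomial: sum over j-element multisets."""
    return sum(sp.Mul(*c) for c in combinations_with_replacement(xs, j))

series = sp.Mul(*[1 / (1 - x * t) for x in xs]).series(t, 0, N).removeO()
for j in range(N):
    assert sp.expand(series.coeff(t, j) - h(j)) == 0
```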

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:51):

Letting Λn\Lambda_n be the Grothendieck group of the category of Schur objects of degree at most nn, the splitting principle implies that the induced map

ΛnZ[x1,,xn],\Lambda_n \to \mathbb{Z}[x_1, \ldots, x_n],

taking [F][F] to [F~(L1Ln)][\tilde{F}(L_1 \oplus \cdots \oplus L_n)], is an injection. Of course [F~(L1Ln)][\tilde{F}(L_1 \oplus \cdots \oplus L_n)] is manifestly invariant under permutations of the elements xi=[Li]x_i = [L_i], so that elements in the image of this map are symmetric polynomials in the xix_i (of degree no more than nn). Recall that every symmetric polynomial in variables x1,,xnx_1, \ldots, x_n is uniquely a polynomial p(e1,,en)p(e_1, \ldots, e_n) in the elementary symmetric polynomials ej=ej(x1,,xn)e_j = e_j(x_1, \ldots, x_n). Provided that the total degree of p(e1,,en)p(e_1, \ldots, e_n) is no more than nn (where the degree of eje_j is of course jj), we have

p(λ1,,λn)Λn.p(\lambda^1, \ldots, \lambda^n) \in \Lambda_n.

Since the map ΛnZ[x1,,xn]\Lambda_n \to \mathbb{Z}[x_1, \ldots, x_n] carries λj\lambda^j to ej(x1,,xn)e_j(x_1, \ldots, x_n), we deduce that every class [F]Λn[F] \in \Lambda_n is a polynomial p(λ1,,λn)p(\lambda^1, \ldots, \lambda^n) of total degree no more than nn, which is already a nontrivial theorem. It says that every polynomial functor of degree less than or equal to nn is isomorphic to a suitable coproduct of tensor powers of exterior power functors.
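For a concrete low-degree instance of this (a brute-force check of my own, assuming sympy): the complete homogeneous polynomials, rewritten in the elementary ones, begin h1=e1h_1 = e_1, h2=e12e2h_2 = e_1^2 - e_2, h3=e132e1e2+e3h_3 = e_1^3 - 2e_1e_2 + e_3, which in the lambda-ring notation expresses σ1,σ2,σ3\sigma^1, \sigma^2, \sigma^3 as polynomials in the λj\lambda^j.

```python
# Sketch: express h_1, h_2, h_3 in the elementary symmetric polynomials
# and verify by brute expansion (four variables suffice for degree <= 3).
from itertools import combinations, combinations_with_replacement
import sympy as sp

xs = sp.symbols('x1:5')

def e(j):
    return sum(sp.Mul(*c) for c in combinations(xs, j))

def h(j):
    return sum(sp.Mul(*c) for c in combinations_with_replacement(xs, j))

assert sp.expand(h(1) - e(1)) == 0
assert sp.expand(h(2) - (e(1)**2 - e(2))) == 0
assert sp.expand(h(3) - (e(1)**3 - 2*e(1)*e(2) + e(3))) == 0
```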

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:52):

We are beginning to come full circle. Passing to suitable limits, explained in our paper, we can summarize one of our consequences of the splitting principle by the intuitive formula

1+λ1t+λ2t2+=i=1(1+[Li]t)1 + \lambda^1 t + \lambda^2 t^2 + \cdots = \prod_{i=1}^\infty (1 + [L_i]t)

so that we are splitting an "infinite polynomial" into linear factors, à la splitting extensions in the sense of Galois theory, and the coefficients λi\lambda^i are thereby manifestly identified with symmetric functions in the xi=[Li]x_i = [L_i], which play a role similar to roots in the splitting extension.

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:52):

Similarly, we may write

1+σ1t+σ2t2+=i=1(1[Li]t)1.1 + \sigma^1 t + \sigma^2 t^2 + \cdots = \prod_{i=1}^\infty (1 - [L_i]t)^{-1}.

This goes at least some distance towards answers to some of John's questions at the top of this thread, although we still have some ways to go. I'm going to take a break for the moment.

view this post on Zulip John Baez (Oct 19 2024 at 04:25):

I will take this opportunity to digress a bit and ponder Todd's suggestion that this formula arising from the splitting principle

1+λ1t+λ2t2+=i=1(1+[Li]t)1 + \lambda^1 t + \lambda^2 t^2 + \cdots = \prod_{i=1}^\infty (1 + [L_i]t)

amounts to splitting an "infinite polynomial" into linear factors. This analogy seems extremely strong. We can take any monic polynomial p(t)p(t) of degree nn, factor it as

p(t)=i=1n(t+ai) p(t) = \prod_{i=1}^n (t + a_i)

and then write the coefficients of this polynomial as

p(t)=i=0nei(a1,,an)tni p(t) = \sum_{i=0}^n e_i(a_1, \dots, a_n) t^{n-i}

where eie_i is a degree ii polynomial in a1,,ana_1, \dots, a_n. If I'm doing it right, eie_i is the ithi^{th} elementary symmetric polynomial in nn variables. E.g.

e0=1e_0 = 1

e1=a1++ane_1 = a_1 + \cdots + a_n

e2=1j<knajake_2 = \sum_{1 \le j < k \le n} a_j a_k

and so on.

But the λi\lambda^i, thought of as symmetric functions, are very similar! They're the elementary symmetric functions, which are essentially elementary symmetric polynomials in infinitely many variables.

What exactly is the relation between what I just said and what you explained about

1+λ1t+λ2t2+=i=1(1+[Li]t)1 + \lambda^1 t + \lambda^2 t^2 + \cdots = \prod_{i=1}^\infty (1 + [L_i]t) ?

There are some differences in convention, e.g. I was talking about monic polynomials while you're looking at a 'comonic' power series (where the coefficient of the constant term is 1), and correspondingly I've got factors of t+ait + a_i while you've got factors of 1+[Li]t1 + [L_i] t. Those can presumably be fiddled so things match up better. But what is this business about factoring a formal power series into linear factors?

In my case I think there's really a Galois extension lurking around: if we treat the aia_i as formal variables, the field k(a1,,an)k(a_1, \dots, a_n) generated by the roots of the polynomial pp is an extension of the field k(e1,,en)k(e_1, \dots, e_n) generated by the coefficients, and the Galois group is SnS_n. But in your case we seem to be doing a field extension that's not Galois, whose 'Galois group' (group of automorphisms over the base field) is something like SS_\infty.

view this post on Zulip Todd Trimble (Oct 19 2024 at 13:49):

I think what you wrote, p(t)=i=1n(t+ai)=i=0nei(a1,,an)tnip(t) = \prod_{i=1}^n (t + a_i) = \sum_{i=0}^n e_i(a_1, \ldots, a_n) t^{n-i}, and what I wrote at 49 past the hour, j=0nej(x1,,xn)tj=i=1n(1+xit)\sum_{j=0}^n e_j(x_1, \ldots, x_n) t^j = \prod_{i=1}^n (1 + x_i t), are basically the same thing. "My p(t)p(t)" is tnt^n times your p(1/t)p(1/t), is the way I would match things up.
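In code, this matching of conventions is a one-line check (a trivial sketch of my own, assuming sympy):

```python
# Sketch: prod_i (t + a_i) = t^n * prod_i (1 + a_i * (1/t)), here n = 3.
import sympy as sp

t = sp.symbols('t')
a = sp.symbols('a1:4')
monic = sp.Mul(*[t + ai for ai in a])
flipped = t**3 * sp.Mul(*[1 + ai / t for ai in a])
assert sp.simplify(monic - flipped) == 0
```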

As I've mentioned to you and Joe in private conversation, I very much have in mind that either way, Q(x1,,xn)\mathbb{Q}(x_1, \ldots, x_n) is a splitting field extension of its subfield Q(e1,,en)\mathbb{Q}(e_1, \ldots, e_n), with Galois group SnS_n. When the textbooks talk about unsolvability of the quintic and so on, that's really the formal framework of what they're discussing (maybe replace here Q\mathbb{Q} by any field). So we start with indeterminates e1,,ene_1, \ldots, e_n that have no special meaning attached to them, and pass to the splitting field of p(t)=tn+e1tn1++enp(t) = t^n + e_1 t^{n-1} + \cdots + e_n (alternate those terms if you like) and basically wind up with Q(x1,,xn)\mathbb{Q}(x_1, \ldots, x_n) which is abstractly isomorphic to the original field, but which (now thinking geometrically) fibers over it differently, with fibers given by SnS_n-torsors.

[I'll say again that I perceive a kind of unity between the various uses of the word "splitting" (splitting a polynomial, splitting into eigenspaces, splitting into line bundles, etc., even splitting field in the sense of representation theory, although I would need to recall the story of why I thought that's similar).]

But anyway, this fibering reminds me of configuration spaces, how nn-tuples of distinct points in C\mathbb{C}, say, fiber over nn-element subsets of C\mathbb{C}. Jim D. sometimes talks about this sort of thing, too.

In my write-up here, I'm thinking of Λ\Lambda as a ring filtered by the Λn\Lambda_n, but one can argue that really what we should be thinking of is that the quotient ring Λ/(λn+1)\Lambda/(\lambda^{n+1}) is Z[e1,,en]\mathbb{Z}[e_1, \ldots, e_n]. This would correspond to a lambda-ring generated by Young diagrams with nn rows or fewer, whereas the filtration component Λn\Lambda_n is not a ring, and corresponds to Young diagrams with nn boxes or fewer.

We didn't quite get to a full explanation of the Λ/(λn+1)\Lambda/(\lambda^{n+1}) picture in our paper, which we certainly hold (without proving this) to be the Grothendieck ring of the 2-rig of algebraic representations of the multiplicative monoid hom(V,V)\hom(V, V) where VV is an nn-dimensional vector space. Same as the monoid MnM_n of n×nn \times n matrices. We denote this 2-rig of algebraic representations as

Rep(Mn)\mathsf{Rep}(M_n)

and the "splitting principle pretense" of acting as if matrices can be diagonalized, split into 1-dimensional eigenspaces, is formalized by considering the pullback functor

Rep(Mn)Rep(kn)\mathsf{Rep}(M_n) \to \mathsf{Rep}(k^n)

where knk^n is the multiplicative submonoid of diagonal matrices, and showing that this pullback functor satisfies our trio of conditions: faithful, conservative, and essentially injective. (For the readers out there, this is one of our main results.) We also observe that Rep(kn)\mathsf{Rep}(k^n) is equivalent to An\mathsf{A}^{\boxtimes n}, the walking 2-rig on nn line objects.

Anyway, what I am leading up to here is that maybe in some ways it's better to think of kS\overline{k\mathsf{S}} not as a colimit or union of its filtered pieces kSn\overline{k\mathsf{S}_{\leq n}}, as we do in the paper, but as a limit of 2-rig quotients kS/(Λn+1)\overline{k\mathsf{S}}/(\Lambda^{n+1}). Not a limit in a "naive" 2-rig sense, but in a graded 2-rig sense. This is analogous to how we typically treat the cohomology algebra H(BU)H^\ast(BU): not as an inverse limit of H(BU(n))Z[e1,,en]H^\ast(BU(n)) \cong \mathbb{Z}[e_1, \ldots, e_n] in the category of rings (now interpreting the eie_i as Chern classes!), but as an inverse limit in the category of graded rings, leading to the polynomial algebra Λ=Z[e1,e2,]\Lambda = \mathbb{Z}[e_1, e_2, \ldots] in infinitely many variables.

I'm thinking that this inverse limit perspective on Λ\Lambda, placing it in the same neighborhood as how we treat A\mathsf{A}^{\boxtimes \infty} as an inverse limit in a graded sense, might lead to a more harmonious picture of what is going on at the level of Galois groups. For example, it might clarify whether we are thinking of this SS_\infty as the full permutation group, or just the union of the SnS_n (I'm thinking the former is more appropriate).

view this post on Zulip John Baez (Oct 19 2024 at 22:11):

All this is really exciting. It's good to sort out what are limits and what are quotients. As far as the 'splitting fields' Q[x1,,xn]\mathbb{Q}[x_1, \dots, x_n] go, if we want homomorphisms between them, the maps clearly must go this way:

Q[x1,,xn]Q[x1,,xn,xn+1] \mathbb{Q}[x_1, \dots, x_n] \to \mathbb{Q}[x_1, \dots, x_n, x_{n+1}]

view this post on Zulip John Baez (Oct 19 2024 at 22:13):

We have similar homomorphisms for the fields these are extending

Q[e1,,en]Q[e1,,en,en+1]\mathbb{Q}[e_1, \dots, e_n] \to \mathbb{Q}[e_1, \dots, e_n, e_{n+1}]

and it seems all this induces inclusions of Galois groups

SnSn+1 S_n \to S_{n+1}

view this post on Zulip Todd Trimble (Oct 19 2024 at 22:22):

Right, if you're using fields, you have to go this direction (maybe you want to use parentheses instead of square brackets for fields). It might be in fact that fields are awkward. Maybe there's an okay sense of speaking of Z[x1,,xn]\mathbb{Z}[x_1, \ldots, x_n] as a "Galois extension" of Z[λ1,,λn]\mathbb{Z}[\lambda^1, \ldots, \lambda^n], though, with Galois group SnS_n?

The reason I bring this up is that we go the "bad" direction A(n+1)An\mathsf{A}^{\boxtimes (n+1)} \to \mathsf{A}^{\boxtimes n} in our paper. Here "bad" means the wrong direction if we consider fields of fractions. (Of course, the fields of fractions construction is not functorial. I guess it is functorial however on the category of integral domains and injective maps between them.)

view this post on Zulip John Baez (Oct 19 2024 at 22:26):

Back in week261 of This Week's Finds, I wrote a lot about this stuff in the special case n = 3. I explained how this is related to thinking of S3S_3 as a Coxeter group, and I hint at generalizations of the theory of symmetric polynomials to other Dynkin diagrams:

Imagine we're trying to solve a cubic equation. We can always divide by the coefficient of the cubic term, so it's enough to consider equations like this:

z^3 + Az^2 + Bz + C = 0

If we could solve this and find the roots a, b, and c, we could write it as:

(z - a)(z - b)(z - c) = 0

But this means

A = -(a + b + c)
B = ab + bc + ca
C = -abc

Note that A, B, and C don't change when we permute a, b, and c. So, they're called "symmetric polynomials" in the variables a, b, and c.

You see this directly, but there's also a better explanation: the coefficients of a polynomial depend on its roots, but they don't change when we permute the roots.

I can't resist mentioning a cool fact, which is deeply related to the trefoil: every symmetric polynomial of a, b, and c can be written as a polynomial in A, B, and C - and in a unique way!

In fact, this sort of thing works not just for cubics, but for polynomials of any degree. Take a general polynomial of degree n and write the coefficients as functions of the roots. Then these functions are symmetric polynomials, and every symmetric polynomial in n variables can be written as a polynomial of these - and in a unique way.

But, back to our cubic. Note that -A/3 is the average of the three roots. So, if we slide z over like this:

x = z + A/3

we get a new cubic equation for which the average of the three roots is zero. This new cubic equation will be of this form:

x^3 + Bx + C = 0

for some new numbers B and C. In other words, the "A" in this new cubic is zero, since we translated the roots to make their average zero.

So, to solve cubic equations, it's enough to solve cubics like x^3 + Bx + C = 0. This is a great simplification. When you first see it, it's really exciting. But then you realize you have no idea what to do next! This must be why it's called a "depressed cubic".

In fact, Scipione del Ferro figured out how to solve the "depressed cubic" shortly after 1500. So, you might think he could solve any cubic. But, NEGATIVE NUMBERS HADN'T BEEN INVENTED YET. This prevented him from reducing any cubic to a depressed one!

It's sort of hilarious that Ferro was solving cubic equations before negative numbers were worked out. It should serve as a lesson: we mathematicians often work on fancy stuff before understanding the basics. Often that's why math seems hard! But often it's impossible to discover the basics except by working on fancy stuff and getting stuck.

Here's one trick for solving the depressed cubic x^3 + Bx + C = 0. Write

x = y - B/(3y)

Plugging this in the cubic, you'll get a quadratic equation in y^3, which you can solve. From this you can figure out y, and then x.

Alas, I have no idea what this trick means. Does anyone know? Ferro and Tartaglia used a more long-winded method that seems just as sneaky. Later Lagrange solved the cubic yet another way. I like his way because it contains strong hints of Galois theory.

You can see all these methods here:

6) Cubic function, Wikipedia,
http://en.wikipedia.org/wiki/Cubic_equation

So, I won't say more about solving the cubic now. Instead, I want to explain the "discriminant". This is a trick for telling when two roots of our cubic are equal. It turns out to be related to the trefoil knot.

For a quadratic equation ax^2 + bx + c = 0, the two roots are equal precisely when b^2 - 4ac = 0. That's why b^2 - 4ac is called the "discriminant" of the quadratic. The same idea works for other equations; let's see how it goes for the cubic.

Suppose we were smart enough to find the roots of our cubic

x^3 + Bx + C = 0

and write it as

(x - a)(x - b)(x - c) = 0

Then two roots are equal precisely when

(a - b)(b - c)(c - a) = 0

The left side isn't a symmetric polynomial in a, b, and c; it changes sign whenever we switch two of these variables. But if we square it, we get a symmetric polynomial that does the same job:

D = (a - b)^2 (b - c)^2 (c - a)^2

This is the discriminant of the cubic! By what I said about symmetric polynomials, it has to be a polynomial in B and C (since A = 0). If you sweat a while, you'll see

D = -4B^3 - 27C^2
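If you'd rather let a computer do the sweating, here is a quick check (a sketch assuming sympy; this code is of course not part of the original week261 text):

```python
# Sketch: for the depressed cubic with roots a, b, c = -a - b,
# verify (a-b)^2 (b-c)^2 (c-a)^2 = -4B^3 - 27C^2
# where B = ab + bc + ca and C = -abc.
import sympy as sp

a, b = sp.symbols('a b')
c = -a - b
B = a*b + b*c + c*a
C = -a*b*c
D = (a - b)**2 * (b - c)**2 * (c - a)**2
assert sp.expand(D - (-4*B**3 - 27*C**2)) == 0
```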

So, here's the grand picture: we've got a 2-dimensional space of cubics with coordinates B and C. Sitting inside this 2d space is a curve consisting of "degenerate" cubics - cubics with two roots the same. This curve is called the "discriminant locus", since it's where the discriminant vanishes:

4B^3 + 27C^2 = 0

If we only consider the case where B and C are real, the discriminant locus looks like this:

                   |C
           o       |
            o      |
               o   |
       -----------o-------------
               o   |           B
            o      |
           o       |
                   |

It's smooth except at the origin, where it has a sharp point called a "cusp".

Now here's where the trefoil knot comes in. The equation for the discriminant locus:

4B^3 + 27C^2 = 0

view this post on Zulip John Baez (Oct 19 2024 at 22:28):

should remind you of the equation for the trefoil:

u^2 = v^3

Indeed, after a linear change of variables they're the same! But, for the trefoil we need u and v to be complex numbers. We took them to be unit complex numbers, in fact.

So, the story is this: we've got a 2-dimensional complex space of complex cubics. Sitting inside it is a complex curve, the discriminant locus. In our new variables, it's this:

u^2 = v^3

If we intersect this discriminant locus with the torus

|u| = |v| = 1

we get a trefoil knot. But that's not all!

Normal folks think of knots as living in ordinary 3d space, but topologists often think of them as living in a 3-sphere: a sphere in 4d space. That's good for us. We can take this 4d space to be our 2d complex space of complex cubics! We can pick out spheres in this space by equations like this:

|u|^2 + |v|^3 = c (c > 0)

These are not round 3-spheres, thanks to that annoying third power. But, they're topologically 3-spheres. If we take any one of them and intersect it with our discriminant locus, we get a trefoil knot!

This is clear when c = 2, since then we have

|u|^2 + |v|^3 = 2

and

u^2 = v^3

which together imply

|u| = |v| = 1

But if you think about it, we also get a trefoil knot for any other c > 0. This trefoil shrinks as c -> 0, and at c = 0 it reduces to a single point, which is also the cusp here:

                     |u
                     |      o
                     |     o
                     |   o
          -----------o-------------
                     |   o        v
                     |     o
                     |      o
                     |

We don't see trefoil knots in this picture because it's just a real 2d slice of the complex 2d picture. But, they're lurking in the background!

Now let me say how the group of permutations of three things gets into the game. We've already seen the three things: they're the roots a, b, and c of our depressed cubic! So, they're three points on the complex plane that add to zero. Being a physicist at heart, I sometimes imagine them as three equal-mass planets, whose center of mass is at the origin.

The space of possible positions of these planets is a 2d complex vector space, since we can use any two of their positions as coordinates and define the third using the relation

a + b + c = 0

So, there are three coordinate systems we can use: the (a,b) system, the (b,c) system and the (c,a) system. We can draw all three coordinate systems at once like this:

                b
                 \       /
                  \     /
                   \   /
                    \ /
             --------o--------a
                    / \
                   /   \
                  /     \
                 /       \
                c

The group of permutations of 3 things acts on this picture by permuting the three axes. Beware: I've only drawn a 2-dimensional real vector space here, just a slice of the full 2d complex space.

Now suppose we take this 2d complex space and mod out by the permutation symmetries. What do we get? It turns out we get another 2d complex vector space! In this new space, the three coordinate axes shown above become just one thing... but this thing is a curve, like this:

              o
               o
                  o
                     o
                  o
               o
              o

Look familiar? Sure! It's just the discriminant locus we've seen before.

Why does it work this way? The explanation is sitting before us. We've got two 2d complex vector spaces: the space of possible ordered triples of roots of a depressed cubic, and the space of possible coefficients. There's a map from the first space to the second, since the coefficients are functions of the roots:

B = ab + bc + ca
C = -abc

These functions are symmetric polynomials: they don't change when we permute a, b, and c. And, it follows from what I said earlier that we can get any symmetric polynomial as a function of these - under the assumption that a+b+c = 0, that is.

So, the map where we mod out by permutation symmetries of the roots is exactly the map from roots to coefficients.

The lines in this picture are places where two roots are equal:

            c=a
              \       /
               \     /
                \   /
                 \ /
          --------o-------- b=c
                 / \
                /   \
               /     \
              /       \
            a=b

So, when we apply the map from roots to coefficients, these lines get mapped to the discriminant locus:

                   |
           o       |
            o      |
               o   |
       -----------o-------------
               o   |
            o      |
           o       |
                   |

You should now feel happy and quit reading... unless you know a bit of topology. If you do know a little topology, here's a nice spinoff of what we've done. Though I didn't say it using so much jargon, we've already seen that the space of nondegenerate depressed cubics is C^2 minus a cone on the trefoil knot. So, the fundamental group of this space is the same as the fundamental group of S^3 minus a trefoil knot. This is a famous group: it has three generators x,y,z, and three relations saying that:

x conjugated by y is z
y conjugated by z is x
z conjugated by x is y

On the other hand, we've seen this space is the space of triples of distinct points in the plane, centered at the origin, mod permutations. The condition "centered at the origin" doesn't affect the fundamental group. So, this fundamental group is another famous group: the "braid group on 3 strands". This has two generators:

\ /  |
 /   |          X
/ \  |

and

|  \ /
|   /           Y
|  / \

and one relation, called the "Yang-Baxter equation" or "third Reidemeister move":

\ /  |        |  \ /
 /   |        |   /
/ \  |        |  / \
|  \ /        \ /   |
|   /     =    /    |           XYX = YXY
|  / \        / \   |
\ /  |        |  \ /
 /   |        |   /
/ \  |        |  / \

So: the 3-strand braid group is isomorphic to the fundamental group of the complement of the trefoil! You may enjoy checking this algebraically, using generators and relations, and then figuring out how this algebraic proof relates to the geometrical proof.

I find all this stuff very pretty...

... but what's really magnificent is that most of it generalizes to any Dynkin diagram, or even any Coxeter diagram! (See "week62" for those.)

Yes, we've secretly been studying the Coxeter diagram A_2, whose "Coxeter group" is the group of permutations of 3 things, and whose "Weyl chambers" look like this:

                 \       /
                  \     /
                   \   /
                    \ /
             --------o--------
                    / \
                   /   \
                  /     \
                 /       \

Let me just sketch how we can generalize this to A_{n-1}. Here the Coxeter group is the group of permutations of n things, which I'll call n!.

Let X be the space of n-tuples of complex numbers summing to 0. X is a complex vector space of dimension n-1. We can think of any point in X as the ordered n-tuple of roots of some depressed polynomial of degree n. Here "depressed" means that the leading coefficient is 1 and the sum of the roots is zero. This condition makes polynomials sad.

The permutation group n! acts on X in an obvious way. The quotient X/n! is isomorphic (as a variety) to another complex vector space of dimension n-1: namely, the space of depressed polynomials of degree n. The quotient map

X -> X/n!

is just the map from roots to coefficients!

Sitting inside X is the set D consisting of n-tuples of roots where two or more roots are equal. D is the union of a bunch of hyperplanes, as we saw in our example:

                 \       /
                  \     /
                   \   /
                    \ /
             --------o--------
                    / \
                   /   \
                  /     \
                 /       \

Sitting inside X/n! is the "discriminant locus" D/n!, consisting of degenerate depressed polynomials of degree n - that is, those with two or more roots equal. This is a variety that's smooth except for some sort of "cusp" at the origin:

              o
               o
                  o
                     o
                  o
               o
              o

The fundamental group of the complement of the discriminant locus is the braid group on n strands. The reason is that this group describes homotopy classes of ways that n points in the plane can move around and come back to where they were (but possibly permuted). These points are the roots of our polynomial.

On the other hand, the discriminant locus is topologically the cone on some higher-dimensional knot sitting inside the unit sphere in C^{n-1}. So, the fundamental group of the complement of this knot is the braid group on n strands.

This relation between higher-dimensional knots and singularities was investigated by Milnor, not just for the A_n series of Coxeter diagrams but more generally:

7) John W. Milnor, Singular Points of Complex Hypersurfaces, Princeton U. Press, 1969.

The other Coxeter diagrams give generalizations of braid groups called Artin-Brieskorn groups. Algebraically you get them by taking the usual presentations of the Coxeter groups and dropping the relations saying the generators (reflections) square to 1.

view this post on Zulip Todd Trimble (Oct 19 2024 at 22:39):

Thanks for recalling all this! I've never read this carefully, but now it seems I shall.

This gives a much more elaborated picture (for n=3n = 3) of what I was waving my hands at earlier, when I spoke of configuration spaces of nn-tuples of distinct points mapping down onto the space of nn-element subsets (or is the latter called the configuration space?). The fundamental group of the space below is the full braid group, and the fundamental group of the space above is the pure braid group, which is the kernel of the quotient BnSnB_n \to S_n.

view this post on Zulip John Baez (Oct 19 2024 at 22:58):

Now I really want to find an analogous story for the BCn\text{BC}_n series of Coxeter groups and their Artin-Brieskorn braid groups. (The Bn\text{B}_n Dynkin diagrams have the same Coxeter groups as the Cn\text{C}_n, so I'm lumping them together, especially since BnB_n also has a completely different meaning in what you just wrote!)

The BCn\text{BC}_n Coxeter group is the symmetry group of the nn-cube (I hope I'm getting the numbers to match correctly here), or if you prefer, the nn-dimensional 'orthoplex', the nn-dimensional generalization of an octahedron. I like to think of it as the full rotation-reflection symmetry group of the nn coordinate axes in Rn\mathbb{R}^n.

This is the symmetry group of nn pairs of things, where the pairs can be permuted, and the two roots within each pair can be switched, but the roots within each pair are "joined at the hip". It's the wreath product of SnS_n and Z/2\mathbb{Z}/2.

To get this group as the Galois group of something, maybe I should be looking for a polynomial with 2n2n roots that come in pairs, something like

Q(x)=i=1n(x2ai2)Q(x) = \prod_{i=1}^n (x^2 - a_i^2)
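A quick check on this guess (a sketch of my own, assuming sympy): the coefficients of Q(x)Q(x) are, up to sign, elementary symmetric polynomials in the squares ai2a_i^2, hence invariant under permuting the aia_i and flipping any of their signs, exactly the wreath product symmetry.

```python
# Sketch: Q(x) = prod_i (x^2 - a_i^2) has coefficient (-1)^j e_j(a_1^2, ..., a_n^2)
# on x^{2(n-j)}, and no odd powers of x at all.
from itertools import combinations
import sympy as sp

n = 3
x = sp.symbols('x')
a = sp.symbols(f'a1:{n + 1}')
Q = sp.expand(sp.Mul(*[x**2 - ai**2 for ai in a]))
squares = [ai**2 for ai in a]
for j in range(n + 1):
    e_j = sum(sp.Mul(*c) for c in combinations(squares, j))
    assert sp.expand(Q.coeff(x, 2 * (n - j)) - (-1)**j * e_j) == 0
for k in range(1, 2 * n, 2):
    assert Q.coeff(x, k) == 0   # only even powers of x appear
```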

view this post on Zulip Todd Trimble (Oct 19 2024 at 23:12):

Let's see, in the case of the Coxeter group SnS_n, down below, you localize away from (geometrically, take the complement of) the discriminant locus; up above, you remove some hyperplanes where two coordinates are equated.

So I guess in your proposal, the space above would consist of pairs (a1,a1),,(an,an)(a_1, -a_1), \ldots, (a_n, -a_n), which are all distinct from each other and their negatives, and also nonzero, and down below, the space of possible coefficients of this polynomial, which again can be given by localizing away from the locus of a suitable variant of a discriminant (so as to forbid any ai=0a_i = 0 from being a root; I guess it's the usual discriminant times a12an2a_1^2 \cdots a_n^2).

view this post on Zulip John Baez (Oct 20 2024 at 00:15):

Hmm. I'm confused about a lot of things, but people have written about "symplectic symmetric functions", and I'm hoping these should give a way of working with the cohomology ring H(BSp)H^\ast(\mathrm{BSp}) just as the usual symmetric functions can be identified with elements of H(BU)H(BGL)H^\ast(\mathrm{BU}) \cong H^\ast(\mathrm{BGL}).

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:20):

Very interesting idea! (Did I say something wrong or confusing in my last message?) Oh well, we can certainly discuss this.

view this post on Zulip John Baez (Oct 20 2024 at 00:20):

There's a paper that might be relevant:

I haven't read it, but the MathSciNet review says:

The authors extend the use of Young diagram methods from the case of GL(n) to the case of the other classical groups SO(2n+1), Sp(2n) and SO(2n). They establish a number of propositions attesting to the fact that the character rings of rational representations of all the classical groups are polynomial rings over certain irreducible characters associated with totally symmetric and totally antisymmetric irreducible representations. Various determinantal expansions and generating functions for the characters of arbitrary irreducible representations of each of the classical groups [H. Weyl, The classical groups, their invariants and representations, second edition, Princeton Univ. Press, Princeton, N.J., 1946; D. E. Littlewood, The theory of group characters and matrix representation of groups, second edition, Clarendon Press, Oxford, 1950] are discussed and the connection between characters of GL(n) and both Young diagrams and Young tableaux is noted. The authors then introduce the universal character ring, Λ [I. G. Macdonald, Symmetric functions and Hall polynomials, Clarendon Press, New York, 1979; MR0553598], and define bases {χGL(λ)}λ, {χO(λ)}λ and {χSp(λ)}λ from which the characters of irreducible representations of each of the classical groups may be obtained by means of appropriate specialization homomorphisms. The relationship between the bases is established and the branching rules for the restriction from GL(n) to SO(n) and Sp(n) are determined as well as the rules for decomposing the tensor product of representations of both SO(n) and Sp(n).
  The results are not new; those not already given in the works of Littlewood [op. cit.] and F. D. Murnaghan [The theory of group representations, Johns Hopkins Press, Baltimore, Md. 1938; Jbuch 64, 964; The unitary and rotation groups.

view this post on Zulip John Baez (Oct 20 2024 at 00:23):

I'd read in Weyl's book about generalizations of Young diagram techniques for orthogonal and symplectic groups, but for some reason I'd never thought about that stuff in conjunction with symmetric functions. Macdonald's anemic index doesn't contain the words "symplectic" or "orthogonal", so I'm having trouble finding about these various bases. It sounds like he's somehow embedding the rings of orthogonal and symplectic characteristic classes into the ring of characteristic classes for BGL\mathrm{BGL}, i.e. the usual ring Λ\Lambda of symmetric functions. I would like to start by treating them separately, as their own independent rings.

view this post on Zulip John Baez (Oct 20 2024 at 00:25):

Todd Trimble said:

(Did I say something wrong or confusing in my last message?)

I just don't have a clear idea of what's going on. I guess you're probably right in what you said.

view this post on Zulip John Baez (Oct 20 2024 at 00:28):

My mind is just a bit blown by the bigger picture we seem to be stumbling on. It gets even bigger if we remember Bott periodicity and thus that BO\mathrm{BO} looped 4 times gets you BSp\mathrm{BSp}, which looped 4 times gets you back to BO\mathrm{BO}.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:38):

Let me now pick up where I left off in the general exposition. I promised to get into some nitty-gritties involving the comultiplication μ:ΛΛΛ\mu: \Lambda \to \Lambda \otimes \Lambda, considering Λ\Lambda as a biring. Now that we have the splitting principle in hand, we're in a position to do that.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:39):

By the way, for anyone reading these posts who is relatively new to the general topic of lambda rings, I might pose an exercise at this point: what can you say about line objects in kS\overline{k\mathsf{S}}; alternatively, what can you say about elements \ell in Λ\Lambda such that λ2()=0\lambda^2(\ell) = 0? I'll reveal the answer later.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:39):

Here's another exercise we can do right now: in any 2-rig, prove that the tensor product of two line objects is also a line object. (Let's drop the nonzero requirement for that one. Actually, in our paper, we don't impose that condition. I only put it in there, parenthetically, as a kind of nod to what is done elsewhere in the literature, but morally it's better to leave it out. That's why in our paper we use the term "subline" by the way, because for example 00 satisfies λ2(0)0\lambda^2(0) \cong 0, but it isn't at all 1-dimensional. Nonetheless, here in this series, I'll keep saying "line" because it's short.)

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:39):

For this exercise, one should decide which formulation of line object is the most convenient. Our choices are: (1) Λ2(L)0\Lambda^2(L) \cong 0; (2) the canonical quotient LLS2(L)L \otimes L \to S^2(L) is an isomorphism; (3) the symmetry σ:LLLL\sigma: L \otimes L \to L \otimes L is the identity.

If you chose the third as the most convenient, then I think you chose well, because for one thing you don't need all the infrastructure of a 2-rig to make sense of it; you only use the symmetric monoidal structure. (Recall that in our paper we call these objects "bosonic subline objects", as distinguished from "fermionic subline objects". The terminology is meant to recall the mathematics of supersymmetry, which actually plays an important role, especially in categorified calculations that have anything to do with negatives -- see for example section 7 of our first paper, where we need to transition from a rig-plethory Λ+\Lambda_+ to the ring-plethory Λ\Lambda.)

Having made this choice, the exercise becomes fairly straightforward. For example, it's easy to give a proof via string diagrams (go ahead and try it!).

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:40):

That exercise is important for what we do. In the spirit of the splitting principle, given objects r,rr, r' in a 2-rig that are finite coproducts of line objects, rL1Lmr \cong L_1 \oplus \cdots \oplus L_m and rL1Lnr' \cong L_1' \oplus \cdots \oplus L_n', their tensor product is also a coproduct of line objects,

rri,jLiLj,r \otimes r' \cong \bigoplus_{i, j} L_i \otimes L_j',

according to that exercise.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:40):

In a moment we'll actually apply the splitting principle to understand comultiplication μ\mu, but first I want to extend it to say that not only is our canonical kSAn\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes n} (to the category of Nn\mathbb{N}^n-graded spaces) essentially injective when restricted to polynomial functors of degree no more than nn, so are 2-tensor products of such 2-rig maps, i.e.,

kS(x,y)kSkSAmAn\overline{k\mathsf{S}(x, y)} \simeq \overline{k\mathsf{S}} \boxtimes \overline{k\mathsf{S}}\to \mathsf{A}^{\boxtimes m} \boxtimes \mathsf{A}^{\boxtimes n}

is essentially injective when restricted to polynomial functors of degree no more than mm in xx and nn in yy.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:40):

On to comultiplication. Abstractly, we understand μ:ΛΛΛ\mu: \Lambda \to \Lambda \otimes \Lambda. Its effect on isomorphism classes [F][F] of a Schur object or polynomial functor (polynomial species) FF is to take it to [F~(xy)][\tilde{F}(x \otimes y)], where F~(xy)\tilde{F}(x \otimes y) lives in kS(x,y)kSkS\overline{k\mathsf{S}(x, y)} \simeq \overline{k\mathsf{S}} \boxtimes \overline{k\mathsf{S}}. But now we want to understand how to calculate explicitly this map in terms of a chosen polynomial basis for Λ\Lambda, say

ΛZ[λ1,λ2,]\Lambda \cong \mathbb{Z}[\lambda^1, \lambda^2, \ldots]

and for that it suffices to know how to calculate λn(xy)\lambda^n(x \otimes y) as an element in

Λ(x,y)Z[λ1(x),λ2(x),;λ1(y),λ2(y),].\Lambda(x, y) \cong \mathbb{Z}[\lambda^1(x), \lambda^2(x), \ldots; \lambda^1(y), \lambda^2(y), \dots].

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:41):

I'm not going to do it for all nn. I just want to explain how I would go about it for small nn if I were locked in a room with pen and paper and no internet or books. Experts in the area would know how to do it efficiently (as far as the state of the art allows), but that's not the point here; the modest point is simply to understand it.

In fact I'm just going to recall how it goes for n=2n = 2 (which we will need later anyway), and wave my hands a little at the case for higher nn. In fact, I can just quote John quoting me:

John Baez said:

You asked what does the comultiplication

μ:ΛΛΛ\mu: \Lambda \to \Lambda \otimes \Lambda

applied to λ2\lambda^2 look like. Short answer:

(μ(λ2))(x,y)=(λ1(x))2λ2(y)+λ2(x)(λ1(y))22λ2(x)λ2(y)(\mu(\lambda^2))(x, y) = (\lambda^1(x))^2 \lambda^2(y) + \lambda^2(x)(\lambda^1(y))^2 - 2\lambda^2(x)\lambda^2(y)

where we think of ΛΛ\Lambda \otimes \Lambda as the free lambda-ring on two generators x,yx, y. This can also be written more nicely as σ2(x)λ2(y)+λ2(x)σ2(y)\sigma^2(x) \lambda^2(y) + \lambda^2(x) \sigma^2(y).

Longer answer: use the splitting principle, which guarantees that the 2-rig map kSAn\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes n}, sending the generator xx to a sum x1xnx_1 \oplus \ldots \oplus x_n of nn independent bosonic subline objects, is essentially injective when restricted to polynomial functors of degree n\leq n. Since λ2\lambda^2 is degree 22, this means in effect that we can pretend the generator xx of kS\overline{k\mathsf{S}} is a sum x1+x2x_1 + x_2 of two bosonic sublines. Then the 2-rig comultiplication kSkSkS\overline{k\mathsf{S}} \to \overline{k\mathsf{S}} \boxtimes \overline{k\mathsf{S}}, taking xx to xyx \boxtimes y per our first paper, induces the map of lambda-rings ΛΛΛ\Lambda \to \Lambda \otimes \Lambda that takes x=x1+x2x = x_1 + x_2 to xy=(x1+x2)(y1+y2)=x1y1+x1y2+x2y1+x2y2x \otimes y = (x_1 + x_2)(y_1 + y_2) = x_1y_1 + x_1y_2 + x_2y_1 + x_2y_2. Since this lambda-ring map preserves the λ2\lambda^2 operation, we calculate

μ(λ2x)=λ2(μx)=λ2(x1y1+x1y2+x2y1+x2y2)\mu(\lambda^2 x) = \lambda^2 (\mu x) = \lambda^2(x_1y_1 + x_1y_2 + x_2y_1 + x_2y_2)

and use the exponential law for λ2\lambda^2 plus vanishing of λ2(xiyj)\lambda^2(x_i y_j) to write this out long-hand. Sparing some gory details, this gives the answer in the short identity.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:42):

I'll just add a few notes to this explanation:

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:42):

Anyhow, to sum it all up, what you do to compute the λn(xy)\lambda^n(xy) is expand 1i,jn(1+xiyjt)\prod_{1 \leq i, j \leq n} (1 + x_i y_j t) and then write the coefficients as a polynomial in the elementary symmetric polynomials ei(x),ej(y)e_i(x), e_j(y).
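Mechanizing that recipe for n=2n = 2 recovers the formula for μ(λ2)\mu(\lambda^2) quoted above (a sketch of my own, assuming sympy):

```python
# Sketch: lambda^2(xy) = coefficient of t^2 in prod_{i,j} (1 + x_i y_j t),
# rewritten in e_1(x), e_2(x), e_1(y), e_2(y).
import sympy as sp

t, x1, x2, y1, y2 = sp.symbols('t x1 x2 y1 y2')
prods = [x1*y1, x1*y2, x2*y1, x2*y2]

P = sp.expand(sp.Mul(*[1 + p * t for p in prods]))
lam2_xy = P.coeff(t, 2)   # e_2 of the four products x_i y_j

e1x, e2x = x1 + x2, x1 * x2   # lambda^1(x), lambda^2(x)
e1y, e2y = y1 + y2, y1 * y2
claimed = e1x**2 * e2y + e2x * e1y**2 - 2 * e2x * e2y
assert sp.expand(lam2_xy - claimed) == 0
```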

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:43):

Finally: the answer to the exercise: there are no nonzero line objects in kS\overline{k\mathsf{S}}!

One might hastily think that the generator xx itself is surely a line object: as a functor SVect\mathsf{S} \to \mathsf{Vect}, it vanishes at nn for every n1n \neq 1; at n=1n = 1, its value is the 1-dimensional (trivial) representation of S1S_1. The trouble is that the tensor product on kS\overline{k\mathsf{S}} is not the pointwise one (that would be the Hadamard product, as some people say); it's the Day convolution one (aka the Cauchy product). Tensor powers using the Day convolution "spread out", and their retracts are also spread out.

Or, one might think that you can get line objects in kS\overline{k\mathsf{S}} by pulling back the canonical line object NVect\mathbb{N} \to \mathsf{Vect} (in kN\overline{k\mathsf{N}}) along the symmetric monoidal quotient SN\mathsf{S} \to \mathbb{N}. The trouble is that the resulting pullback functor kNkS\overline{k\mathsf{N}} \to \overline{k\mathsf{S}} is only lax monoidal; we require strong monoidality for our 2-rig maps. It's pushforward (i.e., left Kan) along this symmetric monoidal quotient functor that is strong monoidal.

view this post on Zulip Todd Trimble (Oct 20 2024 at 01:03):

John Baez said:

My mind is just a bit blown by the bigger picture we seem to be stumbling on.

Yes, I get a little sense of the enormity as well.

It gets even bigger if we remember Bott periodicity and thus that BO\mathrm{BO} looped 4 times gets you BSp\mathrm{BSp}, which looped 4 times gets you back to BO\mathrm{BO}.

Yes, well, I'm not ready to think about that! I suppose if I were trying to return my mind back to real Bott periodicity, I would at first take a Clifford algebra approach, which I probably never properly learned in the first place.

The amount of latent geometry we began to stumble on in the second paper is itself pretty daunting. In the long series of comments I just posted, it seems we're running into the Segre embedding again, induced by

km×knkmnk^m \times k^n \overset{\otimes}{\longrightarrow} k^{mn}

((x1,,xm),(y1,,yn))(xiyj)1im,1jn.((x_1, \ldots, x_m), (y_1, \ldots, y_n)) \mapsto (x_i y_j)_{1 \leq i \leq m, 1 \leq j \leq n}.

This appeared around the point that we were describing 2-rigs of algebraic representations, but you thought it would be better to remove the words "Segre embedding", so as to not scare off readers. :-)

view this post on Zulip Todd Trimble (Oct 20 2024 at 16:03):

I said:

Finally: the answer to the exercise: there are no nonzero line objects in kS\overline{k\mathsf{S}}!

I lied. The species that takes 0S0 \in \mathsf{S} to kk and otherwise nn to 00, i.e., the monoidal unit, is a nonzero line object. (The monoidal unit in any 2-rig is a line object.) But that's the only one, up to isomorphism.

view this post on Zulip John Baez (Oct 20 2024 at 21:52):

Todd Trimble said:

John Baez said:

It gets even bigger if we remember Bott periodicity and thus that BO\mathrm{BO} looped 4 times gets you BSp\mathrm{BSp}, which looped 4 times gets you back to BO\mathrm{BO}.

Yes, well, I'm not ready to think about that! I suppose if I were trying to return my mind back to real Bott periodicity, I would at first take a Clifford algebra approach, which I probably never properly learned in the first place.

I got to know Bott periodicity pretty well blogging and giving talks about the tenfold way, which unifies real and complex Bott periodicity. For some reason I never noticed until now that our friend Λ\Lambda arises from U\mathrm{U}, which is one of the ten infinite loop spaces in the tenfold way, and that others should give 'mutant' versions of symmetric functions. But I will resist derailing this thread with that.

On with the big Witt ring!

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:11):

It's now time to look at the big Witt ring W(R)=[Λ,R]W(R) = [\Lambda, R], for a commutative ring RR. This on the other hand has lots of line objects, I mean line elements! But we will need to familiarize ourselves with how its lambda-ring structure works.

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:12):

There are two abstract ways to define lambda-rings: either as coalgebras of the right adjoint comonad [Λ,]:CRingCRing[\Lambda, -]: \mathsf{CRing} \to \mathsf{CRing}, or as algebras of its left adjoint monad, typically denoted Λ:CRingCRing\Lambda \odot -: \mathsf{CRing} \to \mathsf{CRing}. A standard abuse of language is that \odot may be thought of either as the action constraint of an actegory structure where birings act on rings, or as a monoidal product on the category of birings, but abstractly it is easy to understand: if BB is a biring, then [B,]:CRingCRing[B, -]: \mathsf{CRing} \to \mathsf{CRing} is a right adjoint, and we define the functor B:CRingCRingB \odot -: \mathsf{CRing} \to \mathsf{CRing} to be its left adjoint. If B,CB, C are two birings, then the composition of right adjoints [C,[B,]]=[C,][B,][C, [B, -]] = [C, -] \circ [B, -] is again a right adjoint endofunctor on CRing\mathsf{CRing}, and therefore is of the form [D,][D, -] for some new biring DD. We define BCB \odot C to be this biring, so that [BC,R][C,[B,R]][B \odot C, R] \cong [C, [B, R]], naturally in rings RR. This defines a monoidal (just monoidal, not symmetric monoidal) product on the category of birings, and by abuse of language there is an isomorphism

(BC)RB(CR)(B \odot C) \odot R \cong B \odot (C \odot R)

for birings B,CB, C and a (commutative) ring RR. A plethory may be defined in at least three ways: as a monoid in the monoidal category of birings under \odot; equivalently, as a biring BB together with a comonad structure on the right adjoint [B,]:CRingCRing[B, -]: \mathsf{CRing} \to \mathsf{CRing}; or equivalently again, as a biring BB together with a monad structure on the left adjoint B:CRingCRingB \odot -: \mathsf{CRing} \to \mathsf{CRing}.

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:12):

The ur-example, perhaps the original example of a plethory, is Λ\Lambda, but there is a plethora of plethories. I'll just mention one class of examples quickly. If CC is a cocommutative Z\mathbb{Z}-coalgebra, i.e., a cocommutative comonoid in the category of abelian groups under tensor product, then for any ring RR, the abelian group of additive homomorphisms Ab(UC,UR)\mathsf{Ab}(UC, UR) between underlying abelian groups carries a commutative ring structure, where multiplication of homomorphisms f,g:UCURf, g: UC \to UR is given by the expected formula

UCδUCUCfgURURmUR.UC \overset{\delta}{\longrightarrow} UC \otimes UC \overset{f \otimes g}{\longrightarrow} UR \otimes UR \overset{m}{\longrightarrow} UR.

The functor Ab(UC,U):CRingCRing\mathsf{Ab}(UC, U-): \mathsf{CRing} \to \mathsf{CRing} is a right adjoint endofunctor. Since the symmetric Z\mathbb{Z}-algebra construction S:AbCRingS: \mathsf{Ab} \to \mathsf{CRing} is left adjoint to U:CRingAbU: \mathsf{CRing} \to \mathsf{Ab}, we have that the right adjoint endofunctor is represented by SUCSUC, which is thereby a biring. But if moreover CC carries a cocommutative Z\mathbb{Z}-bialgebra structure, then the multiplication μ:UCUCUC\mu: UC \otimes UC \to UC induces a comonad structure on Ab(UC,U)\mathsf{Ab}(UC, U-), and in this way SUCSUC becomes a plethory. This type of plethory is called a linear plethory.

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:12):

Anyway, for technical reasons we chose in our first paper to emphasize the first point of view, defining a lambda-ring as a coalgebra of [Λ,][\Lambda, -], or of the big Witt ring comonad WW. Of course W(R)W(R) itself would be the cofree lambda-ring cogenerated by RR. Its coalgebra structure is given by the comonad comultiplication δR:WRWWR\delta R: WR \to WWR, which is a map of type [Λ,R][Λ,[Λ,R]][\Lambda, R] \to [\Lambda, [\Lambda, R]]. If hom(R,S)\hom(R, S) denotes the ordinary hom-set of functions between rings R,SR, S, then [Λ,R][\Lambda, R] sits inside hom(Λ,R)\hom(\Lambda, R), and [Λ,[Λ,R]][\Lambda, [\Lambda, R]] sits inside

hom(Λ,hom(Λ,R))hom(Λ×Λ,R):Φ((r,s)Φ(r)(s))\hom(\Lambda, \hom(\Lambda, R)) \cong \hom(\Lambda \times \Lambda, R): \Phi \mapsto ((r, s) \mapsto \Phi(r)(s))

and it turns out that the comonad comultiplication [Λ,][Λ,[Λ,]][\Lambda, -] \to [\Lambda, [\Lambda, -]] is a restriction of a map hom(Λ,)hom(Λ×Λ,)\hom(\Lambda, -) \to \hom(\Lambda \times \Lambda, -) induced by an operation Λ×ΛΛ\Lambda \times \Lambda \to \Lambda. This map is closely related to "plethystic multiplication", except that one has to be careful to get the order right. Given a pair of isomorphism classes ([τ],[ρ])Λ×Λ([\tau], [\rho]) \in \Lambda \times \Lambda of polynomial functors, the map Λ×ΛΛ\Lambda \times \Lambda \to \Lambda takes this pair to their composition (aka substitution) ρτ\rho \circ \tau as polynomial functors; considered as species, the formula is

ρτ=nρ[n]kSnτn\rho \circ \tau = \sum_n \rho[n] \otimes_{kS_n} \tau^{\otimes n}

where the tensorial exponent refers to the Day convolution on kS\overline{k\mathsf{S}}. (See our first paper, top of page 35.) The class [ρτ][\rho \circ \tau] is denoted [ρ][τ][\rho] \bullet [\tau].

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:13):

We can now say that the comonad comultiplication on the big Witt ring W(R)=[Λ,R]W(R) = [\Lambda, R] is given by the map

(f: \Lambda \to R) \mapsto ([\tau] \mapsto ([\rho] \mapsto f([\rho] \bullet [\tau])))

If we want to compute the value λi(f)\lambda^i(f) for an element f:ΛRf: \Lambda \to R of the lambda-ring W(R)W(R), this formula tells us what to do: it's the homomorphism λi(f):ΛR\lambda^i(f): \Lambda \to R defined by

[λi(f)](λj):=f(λjλi).[\lambda^i(f)](\lambda^j) := f(\lambda^j \bullet \lambda^i).
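For instance, since $\lambda^1 = x$ is the unit for plethystic composition, taking $i = 1$ gives $[\lambda^1(f)](\lambda^j) = f(\lambda^j \bullet \lambda^1) = f(\lambda^j)$, i.e., $\lambda^1(f) = f$, matching the lambda-ring axiom that $\lambda^1$ is the identity.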

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:16):

Now, very much in keeping with our ruminations on the connection between the splitting principle and splittings of polynomials into linear factors 1+xit1 + x_i t, we can at least guess what line elements should look like in W(R)W(R). First, I ought to define a "line element" in a general lambda-ring RR. The definition I'll adopt (and I believe something like this appears in the literature; I need to check up on that) is that it's an element rRr \in R such that λn(r)=0\lambda^n(r) = 0 for all n2n \geq 2.

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:17):

The statement is that line elements in $W(R)$ are homomorphisms $\Lambda \to R$ that take $\lambda^1$ to some element $a \in R$, and $\lambda^n$ to $0$ for $n > 1$. In other words, under the identification $[\Lambda, R] \cong 1 + tR[[t]]$ taking $f$ to $1 + \sum_{n \geq 1} f(\lambda^n) t^n$, the line elements in $1 + tR[[t]]$ are of the form $1 + at$. In the first place, it is necessary that a line element $f$ be of this form, because if $\lambda^n(f) = 0$ for all $n \geq 2$, then evaluation at $\lambda^1$ yields

0=[λn(f)](λ1)=f(λ1λn)=f(λn)0 = [\lambda^n(f)](\lambda^1) = f(\lambda^1 \bullet \lambda^n) = f(\lambda^n)

for all n2n \geq 2. For sufficiency, we must show that if f(λn)=0f(\lambda^n) = 0 for all n2n \geq 2, then λm(f)=0\lambda^m(f) = 0 for all m2m \geq 2, or that f(λiλm)=0f(\lambda^i \bullet \lambda^m) = 0 for all i1i \geq 1 and m2m \geq 2. Since ff is a ring homomorphism, it is enough to see that λiλm\lambda^i \bullet \lambda^m belongs to the ideal (λ2,λ3,)(\lambda^2, \lambda^3, \ldots). For now I am going to leave this as an exercise in using the splitting lemma. The basic point is that

e_i(\ldots,\ x_{i_1} x_{i_2} \cdots x_{i_m},\ \ldots),

where the arguments of $e_i$ range over all the monomials $x_{i_1} x_{i_2} \cdots x_{i_m}$ with $i_1 < \ldots < i_m$, when written as a polynomial in the symmetric functions $e_j$, contains no terms of type $e_1^k$: for $m \geq 2$, every monomial in its expansion in the $x_i$ involves at least two distinct variables, so no monomial of type $x_1^k$ can occur.

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:17):

Thus, we have identified line elements in 1+tR[[t]]1 + tR[[t]] with elements of type 1+at1 + at, aRa \in R, under the identification [Λ,R]1+tR[[t]][\Lambda, R] \cong 1 + tR[[t]] using the λi\lambda^i polynomial basis. What do these elements look like when we use the identification [Λ,R]1+tR[[t]][\Lambda, R] \cong 1 + tR[[t]] using the σi\sigma^i basis?

I claim that if f(λ1)=af(\lambda^1) = a and f(λn)=0f(\lambda^n) = 0 for all n2n \geq 2, then f(σn)=anf(\sigma^n) = a^n for all nn. One way to see this is by induction on nn. It is true for n=1n = 1, since σ1=λ1\sigma^1 = \lambda^1. For n>1n > 1, use the following beautiful identity that holds in Λ\Lambda:

σnσn1λ1+σn2λ2+(1)nλn=0.\sigma^n - \sigma^{n-1} \lambda^1 + \sigma^{n-2}\lambda^2 - \cdots + (-1)^n \lambda^n = 0.

(John and Joe and I have discussed this many times; it has a relatively short conceptual proof at the 2-rig level, using ideas of superalgebra. I may include a proof a little later in these notes.) Applying the ring homomorphism f:ΛRf: \Lambda \to R to this identity, and the assumption that f(λn)f(\lambda^n) vanishes for n2n \geq 2, we see that

f(σn)=f(σn1)f(λ1)=f(σn1)af(\sigma^n) = f(\sigma^{n-1}) \cdot f(\lambda^1) = f(\sigma^{n-1}) a

and the induction goes through.
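In fact, under the splitting principle this identity is just the classical relation between elementary and complete homogeneous symmetric functions, $\sum_{i=0}^n (-1)^i h_{n-i} e_i = 0$ for $n \geq 1$, which is easy to spot-check with sympy (a finite check in four variables up to order 5, with ad hoc names, not a proof):

```python
from sympy import symbols, expand, Mul

t = symbols('t')
xs = symbols('x1:5')   # spot check in four variables
N = 6                  # truncate generating functions past t^{N-1}

E = expand(Mul(*[1 + x * t for x in xs]))                                  # sum_i e_i t^i
H = expand(Mul(*[1 / (1 - x * t) for x in xs]).series(t, 0, N).removeO())  # sum_i h_i t^i

e = lambda i: E.coeff(t, i)   # e_i plays the role of lambda^i
h = lambda i: H.coeff(t, i)   # h_i plays the role of sigma^i

for n in range(1, N):
    lhs = sum((-1) ** i * h(n - i) * e(i) for i in range(n + 1))
    assert expand(lhs) == 0   # sigma^n - sigma^{n-1} lambda^1 + ... + (-1)^n lambda^n = 0
print("identity verified for n = 1, ...,", N - 1)
```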

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:18):

Thus, when written in the σn\sigma^n basis, a line element in W(R)W(R) necessarily has the form

1+at+a2t2+=11at.1 + at + a^2 t^2 + \cdots = \frac1{1 - at}.

Finally, if $1 + at, 1 + bt$ are line elements [again in the $\lambda^i$ basis], I claim that their Witt product is $1 + abt$; this can be deduced again by exploiting the splitting principle. As a consequence, in the $\sigma^i$ basis, the Witt product of $\frac1{1-at}$ and $\frac1{1-bt}$ must be $\frac1{1 - abt}$.

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:18):

Incidentally, there is a famous ring involution ω:ΛΛ\omega: \Lambda \to \Lambda that takes λi\lambda^i to σi\sigma^i and σi\sigma^i to λi\lambda^i. (We explain this in terms of 2-rig theory at the end of our second paper.) This induces a ring involution on 1+tR[[t]]1 + tR[[t]] that takes 1+at1 + at to 11at\frac1{1-at}. This allows us to deduce that

11atW11bt=11abt\frac1{1-at} \ast_W \frac1{1-bt} = \frac1{1 - abt}

even if we use the λi\lambda^i basis throughout!

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:22):

(It's possible I'm slipping up somewhere in this last part, but it's late where I am, and I'm going to bed.)

view this post on Zulip Todd Trimble (Oct 21 2024 at 18:21):

Yes, I retract the claim that 11atW11bt=11abt\frac1{1-at} \ast_W \frac1{1-bt} = \frac1{1 - abt} even if we use the λi\lambda^i-basis. If ϕ:[Λ,R]1+tR[[t]]\phi: [\Lambda, R] \to 1 + tR[[t]] sends f:ΛRf: \Lambda \to R to i0f(λi)ti\sum_{i \geq 0} f(\lambda^i) t^i, then certainly the composite

1+tR[[t]]ϕ1[Λ,R][ω,1][Λ,R]ϕ1+tR[[t]]1 + tR[[t]] \overset{\phi^{-1}}{\longrightarrow} [\Lambda, R] \overset{[\omega, 1]}{\longrightarrow} [\Lambda, R] \overset{\phi}{\longrightarrow} 1 + tR[[t]]

is an involution, and it sends 1+at1 + at to 11at\frac1{1-at}. But [ω,1][\omega, 1] isn't a ring involution itself, because ω:ΛΛ\omega: \Lambda \to \Lambda is not a biring involution; it's only a ring involution. (There should be a quick counterexample to show it's not a biring involution; maybe I'll cook one up later.)

view this post on Zulip Todd Trimble (Oct 21 2024 at 18:21):

So anyway, we do have the explicit formula

11atW11bt=11abt\frac1{1-at} \ast_W \frac1{1-bt} = \frac1{1 - abt}

under the σi\sigma^i-basis, i.e., transferring the god-given big Witt multiplication on [Λ,R][\Lambda, R] over to 1+tR[[t]]1 + tR[[t]] using the identification [Λ,R]1+tR[[t]][\Lambda, R] \to 1 + tR[[t]] given by fi0f(σi)tif \mapsto \sum_{i \geq 0} f(\sigma^i) t^i. But I actually like this formula less than the corresponding explicit formula using the λi\lambda^i-basis, which is

(1+at)W(1+bt)=1+abt.(1 + at) \ast_W (1 + bt) = 1 + abt.

This is because I want to think in terms of good old-fashioned product formulas

1+i1aiti=n1(1+bnt)1 + \sum_{i \geq 1} a_i t^i = \prod_{n \geq 1} (1 + b_n t)

and liken these to splitting principles, like $1 + \sum_{i \geq 1} \lambda^i t^i = \prod_{n \geq 1} (1 + x_n t)$, which comes from the 2-rig map $\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes \infty}$ that sends the generator $x = \Lambda^1$ to the sum of line objects $L_1 \oplus L_2 \oplus \cdots$.
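As a sanity check on the $\sigma^i$-basis formula, here's a small sympy computation: the ghost components (the coefficients of $t \frac{d}{dt} \log$, as earlier in this topic) multiply pointwise under the Witt product, which is valid over $\mathbb{Q}$ and pins down $\frac1{1-at} \ast_W \frac1{1-bt} = \frac1{1-abt}$. The truncation order and helper names below are ad hoc:

```python
from sympy import symbols, log, exp, diff, integrate, simplify

t, a, b = symbols('t a b')
N = 6  # everything below is computed mod t^N

def ghost(f):
    """Ghost components of f in 1 + tR[[t]]: t*(log f)' = sum_{n>=1} g_n t^n."""
    g = (t * diff(log(f), t)).series(t, 0, N).removeO()
    return [g.coeff(t, n) for n in range(1, N)]

def unghost(gs):
    """Invert the ghost map: f = exp(integral of (sum_n g_n t^n) dt / t)."""
    s = sum(g * t ** n for n, g in enumerate(gs, start=1))
    return exp(integrate(s / t, t)).series(t, 0, N).removeO()

def witt_mul(f, g):
    """Witt product in the sigma-basis picture: multiply ghosts pointwise."""
    return unghost([u * v for u, v in zip(ghost(f), ghost(g))])

lhs = witt_mul(1 / (1 - a * t), 1 / (1 - b * t))
rhs = (1 / (1 - a * b * t)).series(t, 0, N).removeO()
print(simplify(lhs - rhs))  # prints 0
```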

view this post on Zulip Todd Trimble (Oct 21 2024 at 18:22):

Ramachandran remarks that the explicit formulas (whether in terms of 1+at1 + at or 11at\frac1{1-at}) plus functoriality of WW are enough to pin down the big Witt multiplication on 1+tR[[t]]1 + tR[[t]]. It would be nice to understand this point better. I'm thinking there must be a splitting principle type of explanation. It might be something like this. A general element in 1+tR[[t]]1 + tR[[t]], say i0aiti\sum_{i \geq 0} a_i t^i where a0=1a_0 = 1, may be written as i0f(λi)ti\sum_{i \geq 0} f(\lambda^i) t^i where f:ΛRf: \Lambda \to R is the unique ring map sending λi\lambda^i to aia_i. In the category of commutative rings, consider the pushout RR' of the span

RfΛZ[[x1,x2,]]bdR \overset{f}{\longleftarrow} \Lambda \to \mathbb{Z}[[x_1, x_2, \ldots]]_{\mathrm{bd}}

where on the right we have power series of bounded degree (admitting formal infinite sums like $x_1^2 + x_2^2 + x_3^2 + \ldots$). This thing on the right is the Grothendieck ring $K_0(\mathsf{A}^{\boxtimes \infty})$, and the map on the right is obtained by applying $K_0$ to the canonical 2-rig map $\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes \infty}$. The idea is that, for example, the image of $a_2 = f(\lambda^2)$ under the pushout coprojection $i: R \to R'$ should match

f(x1x2+x1x3++x2x3+)f'(x_1 x_2 + x_1 x_3 + \cdots + x_2 x_3 + \cdots)

where f:Z[[x1,x2,]]bdRf': \mathbb{Z}[[x_1, x_2, \ldots]]_{\mathrm{bd}} \to R' is the other coprojection. Adapting the formal manipulations of Ramachandran to this framework, the idea is that

f(x1x2+x1x3++x2x3+)=f(x1)f(x2)+f(x1)f(x3)+f'(x_1 x_2 + x_1 x_3 + \cdots + x_2 x_3 + \cdots) = f'(x_1)f'(x_2) + f'(x_1)f'(x_3) + \cdots

is the coefficient of $t^2$ of a formal product $\prod_{i \geq 1} (1 + f'(x_i) t) \in 1 + tR'[[t]]$, and this formal infinite product is supposed to be (if I'm interpreting Ramachandran correctly) an infinite Witt sum of line elements, since Witt addition is given by multiplying power series, as we saw earlier. So, then, the lambda-ring map $W(i): W(R) \to W(R')$ takes $1 + \sum_{i \geq 1} a_i t^i$ to an infinite Witt sum

W[1+f(xi)t]\sum_W [1 + f'(x_i) t]

('WW' stands for 'Witt') and similarly would take some other element 1+i1biti1 + \sum_{i \geq 1} b_i t^i to some other formal infinite Witt sum

W[1+g(xi)t]\sum_W [1 + g'(x_i) t]

and the Witt product of these two elements (supposing for now they wind up in the same 1+tR[[t]]1 + tR'[[t]]) is going to be defined by a third infinite Witt sum

\sum_W [1 + f'(x_i) g'(x_j) t]

which can be expanded and rewritten back in 1+tR[[t]]1 + tR[[t]]. As you can see, it looks like a bunch of shenanigans, and there is the mild challenge of making honest sense of these formal manipulations, but that seems to be the idea.
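If one wants to see these shenanigans in action without worrying about infinitely many variables, the same trick works numerically with truncated power series: factor each truncation into linear factors $(1 + b_i t)$ over $\mathbb{C}$, take pairwise products of the $b_i$, and re-expand. A rough numerical sketch (the truncation order and helper names are ad hoc):

```python
import numpy as np
from numpy.polynomial import Polynomial

N = 4  # a series is its coefficient list [1, a_1, ..., a_N], taken mod t^{N+1}

def split(coeffs):
    """Factor 1 + a_1 t + ... + a_N t^N = prod_i (1 + b_i t) over C."""
    roots = Polynomial(np.array(coeffs, dtype=complex)).roots()
    return -1.0 / roots          # p(t) = prod_i (1 - t/t_i), so b_i = -1/t_i

def witt_mul(f, g):
    """Witt product, lambda^i-basis picture: pair up the linear factors."""
    t = Polynomial([0, 1])
    out = Polynomial([1.0 + 0j])
    for bi in split(f):
        for cj in split(g):
            out = out * (1 + bi * cj * t)
    c = out.coef[:N + 1].real    # truncate back mod t^{N+1}
    return np.round(np.pad(c, (0, N + 1 - len(c))), 10)

print(witt_mul([1, 2], [1, 3]))            # (1+2t) *_W (1+3t) = 1 + 6t
print(witt_mul([1, 2, 5, -1, 3], [1, 1]))  # 1 + t acts as the Witt unit
```

The second line checks that $1 + t$ acts as the unit, as the line-element formula $(1 + at) \ast_W (1 + t) = 1 + at$ plus distributivity over Witt sums suggests.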

view this post on Zulip John Baez (Oct 21 2024 at 23:13):

Now you've gotten to the stuff I really want to understand! It will take me a while to absorb this and reply.

view this post on Zulip Todd Trimble (Oct 22 2024 at 01:56):

The pushout of rings I mentioned at the end probably ought to be replaced by the pushout of the span

R(f,g)ΛΛZ[[x1,x2,;y1,y2,]]bdR \overset{(f, g)}{\longleftarrow} \Lambda \otimes \Lambda \to \mathbb{Z}[[x_1, x_2, \ldots; y_1, y_2, \ldots]]_{\mathrm{bd}}

where f,gf, g are the two elements we want to Witt-multiply.

view this post on Zulip John Baez (Oct 22 2024 at 21:23):

It's taking me a while to find enough time to think about this stuff, but please don't mistake that for lack of interest. I will get around to it.

view this post on Zulip Todd Trimble (Oct 22 2024 at 21:25):

No worries at all! I'm mulling some stuff over anyway, about ways of putting those manipulations of Ramachandran on solid ground.

view this post on Zulip Todd Trimble (Oct 22 2024 at 22:14):

So the more I think about this, the more I suspect what Ramachandran is doing is a kind of "joke" in the sense of Littlewood. (For anyone who doesn't know what is meant by this, see this MO post.)

My own favorite example of a Littlewood joke is the proof of a statement from spectral theory: if A,BA, B are operators on a vector space and 1AB1 - AB is invertible, then so is 1BA1 - BA. Proof: Write

(1BA)1=1+BA+BABA+BABABA+=1+B(1+AB+ABAB+)A=1+B(1AB)1A.(1 - BA)^{-1} = 1 + BA + BABA + BABABA + \cdots = 1 + B(1 + AB + ABAB + \cdots)A = 1 + B(1 - AB)^{-1} A.

The punchline of the joke is that 1+B(1AB)1A1 + B(1 - AB)^{-1} A is in fact the inverse of 1BA1 - BA, as one can easily verify. The telling of the joke is the stuff in the middle, which strictly speaking isn't really legitimate, but it doesn't really matter: it's just a vehicle for how to remember the punchline, which is all that is needed in a rigorous proof. (I'm reminded here of Abel's comment on Gauss's mathematical writing style: "He is like the fox, who effaces his tracks in the sand with his tail"; in other words, he never gives you the inner motivation, or explanations of how he arrived at his insights. Presumably Gauss would have written down 1+B(1AB)1A1 + B(1 - AB)^{-1} A without further commentary.)
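For the record, the punchline checks out in one line:

(1 - BA)\big(1 + B(1 - AB)^{-1}A\big) = 1 - BA + B(1 - AB)^{-1}A - BAB(1 - AB)^{-1}A = 1 - BA + B\big[(1 - AB)(1 - AB)^{-1}\big]A = 1 - BA + BA = 1,

and similarly on the other side.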

If I'm right, those infinite Witt sums in Ramachandran's proof are part of a Littlewood joke about how Witt products work, in a way that can be made rigorous but at the cost of using somewhat more roundabout expressions. I'll try to flesh out these thoughts soon.

view this post on Zulip Oscar Cunningham (Oct 23 2024 at 11:12):

(I don't think foxes actually do that.)

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 15:47):

(Sounds like expressing yourself in a very literary but logically and scientifically debatable way can be seen as cool when you're a serious math guy.

It reminds me of this extract from the Wikipedia page « Geometric group theory »:
« In the introduction to his book Topics in Geometric Group Theory, Pierre de la Harpe wrote: "One of my personal beliefs is that fascination with symmetries and groups is one way of coping with frustrations of life's limitations: we like to recognize symmetries which allow us to recognize more than what we can see. In this sense the study of geometric group theory is a part of culture, and reminds me of several things that Georges de Rham practiced on many occasions, such as teaching mathematics, reciting Mallarmé." »

This phenomenon can also be seen in the titles of math papers on the arXiv. Rarer words get used more than the most common ones, even when that makes less sense according to dictionaries. For instance the word « via » is used more than « by » or « through » in titles of the form « characterization of … via/by/through … ».)

view this post on Zulip Kevin Carlson (Oct 23 2024 at 17:03):

"Via" and "by" and "through" aren't syntactically interchangeable, to be sure. You shouldn't say "Characterization of a class A by theorem B", whereas "via" works well there. Instead you say "by application of theorem B", or something. This is usually the situation with rarer words; there are few direct substitutions in English or, I'm sure, in most languages. "Utilize" for "use" is a famous exception that proves the rule!

view this post on Zulip John Baez (Oct 23 2024 at 17:14):

A little point:

Todd Trimble said:

First, I ought to define a "line element" in a general lambda-ring RR. The definition I'll adopt (and I believe something like this appears in the literature; I need to check up on that) is that it's an element rRr \in R such that λn(r)=0\lambda^n(r) = 0 for all n2n \geq 2.

I think I see how to show any object in a 2-rig with Λ2x0\Lambda^2 x \cong 0 automatically has Λnx0\Lambda^n x \cong 0 for n>2n > 2. So I would hope that in any lambda-ring λ2(r)=0\lambda^2(r) = 0 implies λn(r)=0\lambda^n(r) = 0 for n>2n > 2. However, my ability to do computations in lambda-rings is not up to the job of proving this!

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:29):

Kevin Carlson said:

"Via" and "by" and "through" aren't syntactically interchangeable, to be sure. You shouldn't say "Characterization of a class A by theorem B", whereas "via" works well there. Instead you say "by application of theorem B", or something. This is usually the situation with rarer words; there are few direct substitutions in English or, I'm sure, in most languages. "Utilize" for "use" is a famous exception that proves the rule!

Ok for « by ». But isn't « through » better than « via »? If I remember what I found by looking on the internet, it was said that « through » can be interpreted as « by means of », whereas « via » is also interpreted as « by means of » today, but in the past was used more strictly, to mention a city through which you pass on a journey. I guess it is also used in the sense of « by means of » today because it is a Latin word and so sounds more poetic and sophisticated.

I also discovered that « cf. » is for « confer », which means « compare » in Latin. It was used in the past in a more restricted way, in particular in legal texts, to mean « compare with the other reference, which says something different ». But today it is used to mean « see », almost always to point to a reference where more details are given, so to support a point, which is almost the contrary of what it means in Latin.

At least for cf., it shows that people tend to use uncommon words just to sound cool, distorting their original meaning.

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:30):

(Sorry for this interruption which has nothing to do with the very cool math you were talking about.)

view this post on Zulip Todd Trimble (Oct 23 2024 at 17:32):

Responding to John here: frankly, I get confused about some things here myself, especially when it comes to dealing with negatives. A question that's been needling me is whether every lambda-ring comes from a 2-rig, i.e., whether we can cook up a 2-rig whose Grothendieck ring is the given lambda-ring. That would be a big help!

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:33):

A related observation (which can be made this time on the paper on the splitting principle!) is that people tend to say that two categories are equivalent when in fact they are even isomorphic, because « equivalent » sounds more cool.

view this post on Zulip Todd Trimble (Oct 23 2024 at 17:46):

I've noticed this thing about "cf." (some people write instead c.f., not knowing where the abbreviation comes from), and sometimes I have pangs of conscience about using it in this slightly looser way, but then again, language is constantly changing, so I waffle back and try not to worry about it much, feeling that most people reading would get the drift.

I wouldn't attribute "equivalence" to wanting to sound cooler, necessarily (and it sounds unkind to think so). For some personalities, it could be an act of hedging one's bets or playing it safe. It's a tricky business.

There is a spot in the splitting principles paper where we deal with a 1-limit in a 2-category (where we introduce A\mathsf{A}^{\boxtimes \infty}), which requires thinking up to isomorphism with very specific models in mind, but then other ways of referring to the object up to equivalence could be appropriate if one wishes to evoke other ways of thinking about it. But I'm actually not sure what you're talking about. Do you want to tell us more specifically what you had in mind?

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:50):

Sure. I was thinking about Lemma 5.2. I think the equivalence is actually an isomorphism here.

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:52):

Todd Trimble said:

I've noticed this thing about "cf." (some people write instead c.f., not knowing where the abbreviation comes from), and sometimes I have pangs of conscience about using it in this slightly looser way, but then again, language is constantly changing, so I waffle back and try not to worry about it much, feeling that most people reading would get the drift.

Ahah, almost the same for me. Now that I know precisely the original meaning, I feel bad about using it to mean "see" but I'm so used to that it's difficult to stop doing it. But I also think that we don't have to be so conservative about words and so it doesn't matter much.

view this post on Zulip Todd Trimble (Oct 23 2024 at 17:54):

Jean-Baptiste Vienney said:

Sure. I was thinking about Lemma 5.2. I think the equivalence is actually an isomorphism here.

Eh, I think I'll let John or Joe respond. (I have to leave the house!)

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:58):

Well, now I think that it is probably not an isomorphism :sweat_smile:. It is just that some category theory people defined the category of affine schemes as CommAlgop\mathrm{CommAlg}^{op} in talks. So I might be completely wrong.

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 18:00):

Hmm, but you actually define the category of affine schemes precisely like this ahah (as the opposite of CommAlg\mathbf{CommAlg}). So I'm not wrong about your lemma 5.2 finally.

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 18:02):

It must be that you want to give a simple definition but then you feel bad about it in the proof and statement of Lemma 5.2 because you know that this is not the usual definition and the usual definition is only equivalent and not isomorphic to your definition.

view this post on Zulip Joe Moeller (Oct 23 2024 at 18:09):

I think it is something like this, but maybe upside down. It's morally correct to treat them as equivalent, but it's also good manners to give a definition of things. So we take advantage of the equivalence to shortcut the definition. Then if we asked ourselves if these two things are at odds, we moved on from it quickly. I don't actually see the benefit of the isomorphism. I think of isomorphisms of categories as equivalences with a bijection on objects. Bijections are good for counting, but I don't want to count anything here.

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 18:36):

Equivalence of categories confuses me because I'm always thinking of the equivalence between the category of matrices over $k$ and the category of finite dimensional vector spaces over $k$, which feel quite different to me: the first one is a world where you think in terms of coordinates and need to make arbitrary choices to do so (if you start from abstract vector spaces); in the second one you think without coordinates. You can compute easily in the first one with a computer, but it would be more difficult to work with the second one on a computer. I tend to feel like category theory is too coarse here (like a coarser topology) and puts under the rug some subtleties (I mean, if you consider that two equivalent categories are more or less the same categories). But it's maybe just a psychological problem of mine. So when I read "equivalent" but it is in fact "isomorphic" (or when it is not even clear what it should be because of how things are written), it creates these questionings in my mind and distracts me from the real point of the work under consideration.

view this post on Zulip John Baez (Oct 23 2024 at 20:33):

Jean-Baptiste Vienney said:

Well, now I think that it is probably not an isomorphism :sweat_smile:. It is just that some category theory people defined the category of affine schemes as CommAlgop\mathrm{CommAlg}^{op} in talks. So I might be completely wrong.

This is an example of why I say "equivalence": to show you have an isomorphism of categories you have to check everything very carefully.

But also there's no advantage to doing so! If we are doing category theory in the usual style, isomorphisms between objects in a 2-category like Cat\mathbf{Cat} are considered no more useful than equivalences. They are merely distracting.

(I am avoiding the word [[evil]], because it's not politically correct. :upside_down:)

view this post on Zulip Todd Trimble (Oct 23 2024 at 20:33):

I think that saying "isomorphism" instead would also be distracting for many readers, because it would stop them in their tracks and make them think, "really, isomorphic?", when so much else is already going on. I agree with Joe that there's no real benefit to saying "isomorphism" here, and I expect most readers will go along with "equivalence" without a murmur (it's certainly not wrong to say equivalence). The distinction is not worth bothering about.

Jean-Baptiste Vienney said:

It must be that you want to give a simple definition but then you feel bad about it in the proof and statement of Lemma 5.2 because you know that this is not the usual definition and the usual definition is only equivalent and not isomorphic to your definition.

I assure you that I didn't "feel bad" about anything! Well, not here anyway. I feel bad that we didn't finish the proofs of more theorems! :-)

view this post on Zulip John Baez (Oct 23 2024 at 20:43):

I don't want to think about whether Lemma 5.2 could be stating an isomorphism - it's like thinking about how many threads of cotton are in my sock, when I just want to put on my sock.

Let me just tell lurkers what this lemma says.

This lemma says that the category of monoids internal to the category of affine schemes is equivalent to the category of commutative bialgebras. It's a triviality, but the two viewpoints have a different feel to them. In one case I picture a geometrical object, an affine scheme, which is equipped with a multiplication making it a monoid. In the other case I imagine a vector space which is equipped with a commutative multiplication and a comultiplication that get along with each other. So the first picture is 'geometrical' while the second is 'algebraic'.

For example in the first picture I might imagine the Lie group SL(2,R)\mathrm{SL}(2,\mathbb{R}), which is shaped like some sort of 3-dimensional hyperboloid in 4-space, but I'd view it as an algebraic variety equipped with a group structure. In the second picture I'd imagine the commutative algebra

k[a,b,c,d]/\langle ad - bc - 1 \rangle

and think about how to equip it with a comultiplication arising from the group structure of SL(2,R)\mathrm{SL}(2, \mathbb{R}) . I find this second picture a lot less intuitive, but it has the advantage that in the end we just have a vector space with some operations and co-operations - so our linear algebra skills become useful.
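To make the bialgebra structure concrete for lurkers: the comultiplication on generators is matrix multiplication read backwards, i.e., the standard coordinate-bialgebra formulas

\Delta a = a \otimes a + b \otimes c, \quad \Delta b = a \otimes b + b \otimes d, \quad \Delta c = c \otimes a + d \otimes c, \quad \Delta d = c \otimes b + d \otimes d,

dual to the four entries of a product of two $2 \times 2$ matrices.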

view this post on Zulip Morgan Rogers (he/him) (Oct 23 2024 at 21:21):

In a talk I'm giving to some algebraic topologists tomorrow I'm going to talk about the general process of externalization (of which the above amounts to a particular case). These different perspectives on objects can be wildly different and I'm confident there are many cases where the different perspectives have not been adequately exploited :star_struck:

view this post on Zulip Todd Trimble (Oct 23 2024 at 21:24):

I suspect that many people who will open that paper have never gotten seriously familiar with categories of comodules of coalgebras, or at least not to anything like the extent they are familiar with modules over algebras. I got an inkling of their importance for algebraic representations of algebraic monoids through conversations some years back with Jim Dolan, but even so it was really only while we were developing ideas for this paper that I really began getting my hands dirty with them. (That's just an expression; no shade on comodule theory, which is very clean and beautiful, and sometimes surprising!)

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:04):

Getting back to our topic --

The jokey aspect is that Ramachandran manipulates certain infinite Witt sums (= infinite products of formal power series with constant coefficient 11), but the language of commutative rings doesn't accommodate infinite products in general. So there's a faint odor of bullshit to what he's doing, even though his arguments are succinct and suggestive, in the style of a good Littlewood joke.

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:05):

Here is my attempt to put what Ramachandran does on grounds that seem more rigorous to me. Taking it from the top: let f,g:ΛRf, g: \Lambda \to R be two elements in the big Witt ring W(R)W(R), and let fgf \ast g denote their Witt product that we are trying to describe. Using the fact that ΛΛ\Lambda \otimes \Lambda is the coproduct of Λ\Lambda with itself in the category of commutative rings, there is an induced ring map that I'll denote as (f,g):ΛΛR(f, g): \Lambda \otimes \Lambda \to R. As mentioned earlier, this is the composite

ΛΛfgRRmR.\Lambda \otimes \Lambda \overset{f \otimes g}{\longrightarrow} R \otimes R \overset{m}{\longrightarrow} R.

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:05):

Next, the splitting principle gives a 2-rig extension $\phi: \overline{k \mathsf{S}} \to \mathsf{A}^{\boxtimes \infty}$, or better yet an extension of graded 2-rigs. Here $\mathsf{A}^{\boxtimes \infty}$ is defined to be the 2-rig consisting of functors $\mathbb{N}^{(\infty)} \to \mathsf{FinVect}$, where $\mathbb{N}^{(\infty)}$ is the commutative monoid of natural number sequences $(m_1, m_2, \ldots)$ whose sum $m_1 + m_2 + \cdots$ is finite, and we regard this commutative monoid as a discrete symmetric monoidal category. This is graded by the sum $m = m_1 + m_2 + \cdots$; the set of sequences with that sum is denoted $\mathbb{N}^{(\infty)}(m)$. The 2-rig extension takes $x \in \overline{k \mathsf{S}}$ to the functor that is constantly the ground field $k$ on elements of the component $m = 1$, and $0$ for other $m$. The elements of $\mathbb{N}^{(\infty)}(1)$ are $(1, 0, 0, \ldots)$, $(0, 1, 0, \ldots)$, $(0, 0, 1, \ldots)$, etc., and the $i^{th}$ line object $L_i$ takes the $i^{th}$ element of this list to $k$, and all other elements of $\mathbb{N}^{(\infty)}$ to $0$. Hence $\phi(x)$ is $L_1 \oplus L_2 \oplus \cdots$. Decategorifying this 2-rig map gives a commutative ring map, even a lambda-ring map $\Lambda = K(\overline{k\mathsf{S}}) \to K(\mathsf{A}^{\boxtimes \infty})$, sending $\lambda^j$ to the $j^{th}$ elementary symmetric function $e_j(x_1, x_2, \ldots)$ as we have discussed, where $x_i$ is the isomorphism class $[L_i]$. So far, this is all on solid ground.

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:06):

Applying WW to this ring map ϕ:ΛK(A)\phi: \Lambda \to K(\mathsf{A}^{\boxtimes \infty}), we get the assignment

i0λitin1(1+xnt)\sum_{i \geq 0} \lambda^i t^i \mapsto \prod_{n \geq 1} (1 + x_n t)

as long as we say to ourselves that the right side is suggestive shorthand for the well-founded expression j0ej(x1,x2,)tj\sum_{j \geq 0} e_j(x_1, x_2, \ldots) t^j, which makes sense in our context (yes, the coefficients eje_j are "infinite sums", but they make sense as elements in K(A)[N(),Z]K(\mathsf{A}^{\boxtimes \infty}) \cong [\mathbb{N}^{(\infty)}, \mathbb{Z}], just as ordinary formal power series in A[[t]]A[[t]], which ostensibly are infinite sums n0anxn\sum_{n \geq 0} a_n x^n, make perfect sense as sequences NA\mathbb{N} \to A of elements of AA).

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:06):

[By the way, if we interpret what Ramachandran is doing with his infinite Witt sums, we are led to write down infinite products of type

\prod_{i \geq 1} \frac1{1 - x_i t}

as an element of W(K(A))W(K(\mathsf{A}^{\boxtimes \infty})). Yes, one can make sense of this, but... I don't know about you, but taking infinite products of linear terms 1+xit1 + x_i t feels more familiar and comfortable to me than taking infinite products of geometric series. I suppose that's silly, since Euler didn't blink an eye writing down

p11ps=n1ns,\prod_p \frac1{1 - p^{-s}} = \sum_{n \geq 1} n^{-s},

but anyway I'll stick to how I'm setting this up here.]

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:07):

I said above that I want to consider the pushout i:RRi: R \to R', where we push out the injection ϕϕ:ΛΛK(A)K(A)\phi \otimes \phi: \Lambda \otimes \Lambda \to K(\mathsf{A}^{\boxtimes \infty}) \otimes K(\mathsf{A}^{\boxtimes \infty}) along the map (f,g):ΛΛR(f, g): \Lambda \otimes \Lambda \to R.

And one could hope, at least for the limited purpose of trying to put a gloss on what Ramachandran is doing, that this $i: R \to R'$ is also an injection. (There might be some really principled way of seeing that, but I don't know what it would be.) It would follow that the induced map $W(i): W(R) \to W(R')$, which by definition is $[\Lambda, i]$, is also an injection.

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:07):

In the pushout square, there's also a map coming "down",

K(A)K(A)R.K(\mathsf{A}^{\boxtimes \infty}) \otimes K(\mathsf{A}^{\boxtimes \infty}) \to R'.

So the plan is to Witt-multiply the elements i1(1+xit)\prod_{i \geq 1} (1 + x_i t) and j1(1+yjt)\prod_{j \geq 1} (1 + y_j t), where the first product sits in the image of

W(K(A))W(i1)W(K(A)K(A))W(K(\mathsf{A}^{\boxtimes \infty})) \overset{W(i_1)}{\hookrightarrow} W(K(\mathsf{A}^{\boxtimes \infty}) \otimes K(\mathsf{A}^{\boxtimes \infty}))

with i1i_1 being the first coproduct coprojection in the category of commutative rings, and the second product similarly sits in the image of W(i2)W(i_2). Their Witt-product is the simple-looking

i,j(1+xiyjt)W(K(A)K(A))\prod_{i, j} (1 + x_i y_j t) \in W(K(\mathsf{A}^{\boxtimes \infty}) \otimes K(\mathsf{A}^{\boxtimes \infty}))

which is again a shorthand for something more complicated-looking, but what is going on can be derived at the 2-rig level. (I am tempted to give this 2-rig level explanation now, but I'll resist.) Now push this element down to W(R)W(R'). The result is a corresponding Witt-product in W(R)W(R'). This gives the image in W(R)W(R') of the desired Witt-product in W(R)W(R); this desired element is uniquely determined, by injectivity of W(i):W(R)W(R)W(i): W(R) \to W(R').

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:08):

This joke is by now more like a shaggy dog story, and I think that's about as far as I'll take it for now. I'm hoping it will make sense to John at least, how this account fits with the verbiage set down in Ramachandran's paper. Somehow I imagine the cognoscenti reading his paper smiling and nodding knowingly at this passage, and with others having cartoon question marks popping out of their heads, because what he writes really is cryptic unless you already know (or until you figure out) the story.

Having unraveled what I think he was getting at, I think it makes matters harder than necessary, although all the ideas are there. But again, all we have to do is figure out how comultiplication μ:ΛΛΛ\mu: \Lambda \to \Lambda \otimes \Lambda works, then take the composite

\Lambda \overset{\mu}{\longrightarrow} \Lambda \otimes \Lambda \overset{f \otimes g}{\longrightarrow} R \otimes R \overset{m}{\longrightarrow} R.

You figure out how comultiplication works using the splitting principle, along the lines sketched way back here in a simple case, which was amplified further here.

view this post on Zulip John Baez (Oct 24 2024 at 15:27):

Thanks for getting to the bottom of multiplication in the big Witt ring, @Todd Trimble! It looks a lot simpler and less problematic in the $\lambda^i$ basis, which almost makes me wonder: why bother with the $\sigma^i$ basis?

I guess Ramachandran provides one answer to this question. As you pointed out, in the σi\sigma^i basis a line element in W(R)W(R) looks like

11at \displaystyle{\frac{1}{1 - at}}

Ramachandran wants to relate the big Witt ring to zeta functions over finite fields Fq\mathbb{F}_q; one of the simplest of these is the zeta function of the affine line, which is

Z(A1,t)=11qt \displaystyle{ Z(\mathbb{A}^1, t) = \frac{1}{1 - qt} }

I suspect the resemblance is no coincidence.

I just now noticed the double appearance of word "line" - line element versus affine line. That could be a coincidence: it's hard for me to connect these two kinds of line.

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:33):

Oh! Interesting observation. I'll/we'll have to ponder whether there's some reason that zeta functions would jibe better with the σi\sigma^i.

view this post on Zulip John Baez (Oct 24 2024 at 15:34):

It's all rather mysterious. But the zeta function of the affine line should not be too mysterious.

In general, the coefficient of tnt^n in the zeta function Z(X,t)Z(X,t) is defined to be the number of Fqn\mathbb{F}_{q^n}-points of the scheme XX.

The number of Fqn\mathbb{F}_{q^n}-points of the affine line is simply qnq^n. So we get

Z(A1,t)=1+qt+q2t2+ Z(\mathbb{A}^1, t) = 1 + qt + q^2 t^2 + \dots

or in other words

Z(A1,t)=11qt\displaystyle{ Z(\mathbb{A}^1, t) = \frac{1}{1 - qt} }

view this post on Zulip John Baez (Oct 24 2024 at 15:38):

The really big result Ramachandran proves in this area is that

Z(X×Y,t)=Z(X,t)WZ(Y,t) Z(X \times Y, t) = Z(X,t) \ast_W Z(Y,t)

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:39):

Curious. The σn\sigma^n refer to the class of SnS^n, and for a line object LL we have Sn(L)LnS^n(L) \cong L^{\otimes n}. But this is in contrast to the cartesian product LnL^n, whose size is qnq^n.

I'll have to peer at that paper some more.

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:40):

That's a very pretty formula for the zeta function!

view this post on Zulip John Baez (Oct 24 2024 at 15:41):

We can test Ramachandran's formula

Z(X×Y,t)=Z(X,t)WZ(Y,t) Z(X \times Y, t) = Z(X,t) \ast_W Z(Y,t)

in an example:

Z(\mathbb{A}^2, t) = \frac{1}{1 - qt} \ast_W \frac{1}{1 - qt} = \frac{1}{1 - q^2 t}

=1+q2t+q4t2+ = 1 + q^2 t + q^4 t^2 + \cdots

This is right, since the affine plane has $q^{2n}$ points over $\mathbb{F}_{q^n}$.

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:42):

Yes indeed.

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:46):

Ah, so the fleeting remark I made about the Euler product formula might not have been too far off the mark.

view this post on Zulip John Baez (Oct 24 2024 at 15:50):

Right! Actually I'm confused: rereading Ramachandran I think my claim that Z(X,t)Z(X,t) is just the generating function for the number of Fqn\mathbb{F}_{q^n}-points of XX is wrong, but weirdly all the specific computations I did seem to work.

view this post on Zulip John Baez (Oct 24 2024 at 15:51):

The top formula of his equation (11) is the right formula relating Z(X,t)Z(X,t) to numbers of points.

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:53):

Yes, that formula is familiar. You and Jim have those nice papers on the nLab, whose titles you will be able to recall more quickly than I can. One of them interprets the Hasse-Weil zeta function. The other is about zeta functions of Z-sets generally.

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:57):

The stuff on Euler characteristics is extremely interesting. (And the very coarse "cutting apart" of a scheme XX into YY and XYX \setminus Y reminds me very much of Schanuel's papers on negative sets and Euler characteristic. I mean like this one. I don't know if there are others particularly, except for "What is the length of a potato?" where the Euler characteristic plays a starring role. Here it is, courtesy of Tom Leinster's website -- thanks @Tom Leinster !)

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:01):

You were also telling me about motives some months back, which by my memory also involve this coarse cutting apart of schemes, reminiscent of how a projective space is a "sum"

kn+11k1=1+k++kn\frac{k^{n+1} - 1}{k - 1} = 1 + k + \ldots + k^n

where the left side is the quotient of punctured (n+1)(n+1)-space by the action of the multiplicative group kk^\ast, and the right side is a decomposition into Schubert cells.

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:06):

Hopefully you can remind me at some point of the things you were telling me about motives, if what I said rings any bells (or even if doesn't).

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:31):

I'll mention one pretty cool result from Schanuel's negative sets paper. It starts off with a reason that the open interval can be thought of as a negative set, indeed a proxy for $-1$. Take an open interval $x$, say $(0, 1)$, and divide it into three parts: $(0, 1/2) \cup \{1/2\} \cup (1/2, 1)$. Hence "$2x + 1 \sim x$". If we could cancel $x$, then "$x \sim -1$".

Next, consider the category of bounded polyhedra. A polyhedron is by definition a subset of some Euclidean space $\mathbb{R}^n$ contained in the smallest Boolean subalgebra of $P(\mathbb{R}^n)$ that contains loci of the form $L(x_1, \ldots, x_n) \geq 0$ where $L$ is an affine function. A bounded polyhedron is what you think it is. A morphism between polyhedra is a function between them whose graph is itself a polyhedron. The category of polyhedra $P$, or the subcategory of bounded polyhedra $P_0$, has some good properties, such as extensivity. The equivalence $\sim$ comes from an isomorphism in this category.

Let B(P0)B(P_0) be the Burnside rig of P0P_0

Theorem (Schanuel): The canonical rig map

N[x]/(x2x+1)B(P0)\mathbb{N}[x]/ (x \sim 2x + 1) \to B(P_0)

is an isomorphism. This is well worth pondering.

view this post on Zulip John Baez (Oct 24 2024 at 16:32):

Everything you say is ringing bells! :bell:

Ramachandran actually talks about zeta functions from a somewhat "motivic" point of view, but this is based on the low-budget approach to motives based on the "Grothendieck ring of varieties", as explained quite tersely on page 6. The idea is that you take the rig category of varieties, decategorify and group complete it to get a commutative ring, and then impose the extra relations

[X]=[XY]+[Y] [X] = [X-Y] + [Y]

whenever YY is a subvariety of XX. (I am simplifying a bit here.)

view this post on Zulip John Baez (Oct 24 2024 at 16:34):

The high-end, difficult approach to motives seeks instead to define a rig category of motives, rather than merely this commutative ring, which is intended to be some sort of decategorification of that dreamt-of category.

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:35):

Mm. I think in fact we touched recently on the fact that motives should form a 2-rig, no?

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:36):

Or something close to it!

view this post on Zulip John Baez (Oct 24 2024 at 16:37):

Umm, maybe!

Ramachandran uses GFKGF_K to denote the Grothendieck ring of varieties over a field KK , and intriguingly writes:

The genesis of $GF_K$ dates back to 1964 (it was considered by Grothendieck [8, p.174] in his letter (dated August 16, 1964) to J.-P. Serre; it is the first written mention of the word "motives"). The ring $GF_K$ is a shadow (decategorification) of the category of motives; some aspects of the yoga of motives are not seen at the level of $GF_K$.

Wow, he even says "decategorification".

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:38):

I mean, we maybe have to change kk-linearity to something else, but we should have a symmetric monoidal category with coproducts and idempotent splittings. Just a passing thought for the moment.

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:00):

I want to get back to the Schanuel paper I was just describing. It must be understood of course that morphisms in the category $P_0$, let's say isomorphisms, need not be continuous at all -- the graph of a function can be broken up into pieces, as we saw in the case $x \sim 2x + 1$.

Two very interesting definitions: (1) the Euler characteristic of a commutative rig $R$ is the universal quotient $R \to E(R)$ to a commutative rig that enjoys additive cancellation; (2) the dimension of a commutative rig $R$ is the quotient $R \to R/(1 + 1 \sim 1)$.

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:05):

The Euler characteristic of N[x]/(x2x+1)\mathbb{N}[x]/ (x \sim 2x + 1) is the expected quotient to Z\mathbb{Z} (easy to check I think).
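To spell out the check: in any additively cancellative quotient, $x \sim 2x + 1$ forces $0 = x + 1$, so $x$ becomes an additive inverse of $1$; hence

E(\mathbb{N}[x]/(x \sim 2x + 1)) \cong \mathbb{Z}, \qquad x \mapsto -1,

and the open interval has Euler characteristic $-1$, as promised.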

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:09):

I'll copy what Schanuel says about the dimension of N[x]/(x2x+1)\mathbb{N}[x]/ (x \sim 2x + 1):

Equally simple, if less familiar, is D(R)D(R): it is

D(R) = \{ 0 = d^{-\infty},\ 1 = d^0,\ d^1,\ d^2, \ldots \}

with $d^i d^j = d^{i+j}$ and $d^i + d^j = d^{\max(i, j)}$. The exponential notation is in keeping with the idea that multiplying polyhedra adds dimensions, while adding gives the maximum of the two dimensions.

view this post on Zulip John Baez (Oct 24 2024 at 17:20):

Todd Trimble said:

I mean, we maybe have to change kk-linearity to something else, but we should have a symmetric monoidal category with coproducts and idempotent splittings.

Yes, I believe motives form a 2-rig. I can't believe I didn't notice that. I often find myself thinking about two things and noticing only later that they're connected.

I was a bit confused about the kk-linearity but yes, the category of pure motives defined using 'numerical equivalence' is a kk-linear abelian category; people don't emphasize the symmetric monoidal structure so much but it should exist.

(There are potentially many categories of pure motives defined using different 'adequate equivalence relations' on cycles, but some of the Standard Conjectures say some of these equivalence relations are the same as numerical equivalence... let us not sink into this mire now!)

Indeed, something that confused me for a while (and apparently still does) is that the field $k$ we're talking about here is typically different from the field our varieties (or schemes) are defined over! You know how cohomology has 'coefficients'. Motives are like a universal cohomology theory for varieties defined over some field $\mathbb{F}$ with coefficients in some field $k$. Right now we're talking about the field of coefficients.

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:25):

Instead of typing this out, I can just refer you to page 382 for the demonstration of the theorem (that I just ascribed to Schanuel).

The same considerations apply to other structures, the so-called o-minimal structures. An archetypal example is where semialgebraic sets replace the semilinear sets that constitute the category $P$. These examples are the propositionally (or Boolean-)definable sets of a model $\mathbb{R}$ of a logical theory where the language for semialgebraic sets, say, would be given by $(R, 0, 1, +, -, \cdot, \leq)$. In the types of theories I have in mind, there is a quantifier elimination theorem (e.g., the Tarski-Seidenberg theorem) that says the image of a definable set under a linear projection is itself definable. And there is also an o-minimality condition, which says that the only definable subsets of the real line are finite unions of points and intervals. Model theorists, those clever devils, know how to tease out an incredible amount of geometric structure from these two conditions.

Anyway, I think the rough upshot is that Schanuel's theorem extends to such cases as semialgebraic sets.

view this post on Zulip John Baez (Oct 24 2024 at 17:30):

For anyone interested, we're talking about Schanuel's "What is the length of a potato?"

and

"Negative sets have Euler characteristic and dimension".

The latter, unfortunately, is paywalled: I hope the Kazakhs have liberated it.

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:30):

John Baez said:

Yes, I believe motives form a 2-rig. I can't believe I didn't notice that. I often find myself thinking about two things and noticing only later that they're connected.

I was a bit confused about the kk-linearity but yes, the category of pure motives defined using 'numerical equivalence' is a kk-linear abelian category; people don't emphasize the symmetric monoidal structure so much but it should exist.

Well, this is incredible. I sort of took a seat-of-the-pants guess there. I'm eager to learn more!

Where do you find this?

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:31):

(The Kazakhs are doing just fine, btw. I have a tab open.)

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:37):

By the way, I read about 25 minutes ago that the zeta functions under study are valued in W(Z)W(\mathbb{Z}). It should be mentioned that Z\mathbb{Z} is itself a lambda-ring, where the λi\lambda^i act on Z\mathbb{Z} by n(ni)n \mapsto \binom{n}{i}. These functions generate, as a ring, precisely the integer-valued polynomial functions ZZ\mathbb{Z} \to \mathbb{Z}.
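For instance, additivity of these operations is just the Vandermonde identity

\lambda^k(m + n) = \binom{m+n}{k} = \sum_{j=0}^{k} \binom{m}{j}\binom{n}{k-j} = \sum_{j=0}^{k} \lambda^j(m)\,\lambda^{k-j}(n),

and for $n \geq 0$ the value $\binom{n}{i} = \dim \Lambda^i(k^n)$, so this lambda-ring structure on $\mathbb{Z}$ is exactly what you get by decategorifying $\mathsf{FinVect}$.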

view this post on Zulip John Baez (Oct 24 2024 at 17:39):

Where do you find this?

I've read a lot about motives here and there. I believe you'd like this survey by Milne, which defines pure motives and states their known properties:

He also has more advanced papers but this covers everything I just said.

By the way, when he says the category of motives is not Tannakian, he (like many other people) is sort of crying wolf: they're not Tannakian with a certain bad symmetric monoidal structure, but they are with the good 'super' symmetric monoidal structure, where you stick in minus signs in the expected places.

(So I take it back about the symmetric monoidal structure being less discussed - I seem to have forgotten lots of stuff I knew.)

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:49):

I'm making a note for later about the occurrence of Chow rings on page 5 of Milne's paper (we were talking about these in connection with Grothendieck-Riemann-Roch).

view this post on Zulip John Baez (Oct 24 2024 at 18:07):

While I do want to learn algebraic geometry, I don't feel I have much of a chance doing anything new when it comes to Chow rings. To work with those, you need a good understanding of algebraic varieties. For example, to define the intersection map in equation (1) on Milne's page 5, you need Chow's moving lemma, which says that given two subvarieties you can move one a bit so that they're in general position. This is like an algebraic geometry version of the fact that given a $k$-dimensional submanifold and an $(n-k)$-dimensional submanifold of an $n$-manifold, you can isotope one of them so that they intersect in finitely many points. I suffered through learning the techniques for doing such things in differential topology, but I have no desire to go through it all again in the more rigid context of algebraic geometry! And this is just the basic stuff: it gets a lot worse. This is why the Standard Conjectures remain conjectures.

Where I think I might contribute is in figuring out how to distill some concepts from algebraic geometry, formulate them using category theory, and prove (easy) things about them.

view this post on Zulip John Baez (Oct 24 2024 at 18:18):

So, when it comes to Ramachandran's paper, I'm not seriously interested in proving anything about motives or the Grothendieck ring of varieties. But his proof that the zeta function of a variety obeys

Z(X×Y,t)=Z(X,t)WZ(Y,t) Z(X \times Y, t) = Z(X,t) \ast_W Z(Y,t)

is just a calculation which doesn't really use anything about varieties! It probably works for the Hasse-Weil zeta function of any functor from finite commutative rings to finite sets. (We think of such a functor as telling us the set of $R$-points of some gadget for each finite commutative ring $R$, but we don't have to say what this gadget is! The functor says it all.)

view this post on Zulip John Baez (Oct 24 2024 at 18:20):

So, the kind of question I'm really interested in now is what does multiplication in the big Witt ring really mean - and why does it make Ramachandran's identity hold?

view this post on Zulip John Baez (Oct 24 2024 at 18:20):

And that's what you've been explaining!

view this post on Zulip Todd Trimble (Oct 24 2024 at 18:36):

Slogan from Ramachandran (p. 15):

Motivic measures are invariants of algebraic varieties that behave like Euler characteristics.

view this post on Zulip Todd Trimble (Oct 24 2024 at 18:44):

To be honest, I'm having trouble identifying where Ramachandran's proof of Theorem 2.1 begins and ends. I'll continue reading and scanning (between this and the Milne paper you linked to).

view this post on Zulip John Baez (Oct 24 2024 at 18:57):

It probably starts at the bottom of page 10 where it says Proof (of Thm. 2.1). But it relies on the previous lemmas.

view this post on Zulip Todd Trimble (Oct 24 2024 at 18:59):

Oh, I glided right over that text. Thanks.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:00):

Oh I see, these ghost components are all about the Adams operations.

view this post on Zulip John Baez (Oct 24 2024 at 19:01):

Good god, it's Adams' ghost! :ghost:

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:02):

Well, this just looks ridiculously easy, I have to say. :-)

view this post on Zulip John Baez (Oct 24 2024 at 19:02):

Seriously, that's frigging amazing.

view this post on Zulip John Baez (Oct 24 2024 at 19:02):

It's another case of how two things I'm struggling to understand turn out to be related.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:04):

I mean, I think this is more or less how it goes. Give me a minute to think.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:10):

So I'm looking at my notes that I shared with you, split2rigs. (I want time to think whether I want those publicly shared yet.) Notation: σt=n0σntn\sigma_t = \sum_{n \geq 0} \sigma^n t^n. So then the Adams operations ψn\psi^n can be defined by a generating function

\psi_t = \sum_{n \geq 1} \psi^n t^n

where ψt\psi_t is this thing, tt times the logarithmic derivative of σt\sigma_t. We've talked about these things before. The Adams operations are of course ring homomorphisms.
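Concretely, on a sum of line elements, where $\sigma_t = \prod_i \frac1{1 - x_i t}$, this unwinds to

\psi_t = t \frac{d}{dt} \log \prod_i \frac1{1 - x_i t} = \sum_i \frac{x_i t}{1 - x_i t} = \sum_{n \geq 1} \Big( \sum_i x_i^n \Big) t^n,

so $\psi^n$ is the $n^{th}$ power sum $p_n$ -- which is exactly why the ghost components in Ramachandran's Lemma 2.3 are the Adams operations.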

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:12):

Well, "of course". I mean that of course you've seen me talk about this before. We recently went through a proof in that split2rigs, where I invoke the beautiful identity

σnσn1λ1+σn2λ2=0\sigma^n - \sigma^{n-1} \lambda^1 + \sigma^{n-2} \lambda^2 - \ldots = 0

for all nn.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:14):

What the hell, split2rigs.pdf.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:15):

Anyway, you see tt times the logarithmic derivative in the proof of Lemma 2.3 in Ramachandran.

view this post on Zulip John Baez (Oct 24 2024 at 19:17):

Yes, I think we talked about that. But are the ghost components simply the components of an element of W(R)W(R) in the ψn\psi^n basis? Is that what you're about to tell me?

view this post on Zulip John Baez (Oct 24 2024 at 19:18):

If so, the ring homomorphism property of the ψn\psi^n should do something good for these ghost components.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:19):

Todd Trimble said:

Well, "of course". I mean that of course you've seen me talk about this before. We recently went through a proof in that split2rigs, where I invoke the beautiful identity

σnσn1λ1+σn2λ2=0\sigma^n - \sigma^{n-1} \lambda^1 + \sigma^{n-2} \lambda^2 - \ldots = 0

for all nn.

I'll just mention that the "beautiful identity" can be written as σtλt=1\sigma_t \cdot \lambda_{-t} = 1, symbolically

σt=1λt\sigma_t = \frac1{\lambda_{-t}}

and that's where this whole connection with 11at\frac1{1 - at} being the σi\sigma^i basis analogue of the line elements 1+at1 + at comes into play.
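Explicitly: applying a line element $f$ (so $f(\lambda^1) = a$ and $f(\lambda^n) = 0$ for $n \geq 2$) coefficientwise, we get $f(\lambda_{-t}) = 1 - at$, hence

f(\sigma_t) = \frac1{f(\lambda_{-t})} = \frac1{1 - at},

recovering in one stroke the induction from earlier in this topic.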

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:20):

John Baez said:

Yes, I think we talked about that. But are the ghost components simply the components of an element of W(R)W(R) in the ψn\psi^n basis? Is that what you're about to tell me?

Yes, I think you could put it that way.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:30):

John Baez said:

If so, the ring homomorphism property of the ψn\psi^n should do something good for these ghost components.

Yes, that's the message! The ring homomorphism property of the ψn\psi^n gives the first line of the proof of Theorem 2.1.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:49):

So I'm speculating out loud here, really just playing around, but looking at Ramachandran's Remark 2.4, I get a sense that a zeta function $Z(X, t) \in W(\mathbb{Z})$ on motives $X$ might be derivable by taking advantage of the 2-rig structure on the category of motives (I'm following Milne and considering rational equivalence classes of algebraic cycles -- he uses the symbol $\sim$ to cover either the rational equivalence case or the numerical equivalence case). I'll let $M$ denote this 2-rig of motives. Then its Grothendieck ring $K(M)$ is a lambda-ring. That means we get a canonical $W$-coalgebra structure (which is a $W$-coalgebra map = lambda-ring map)

η:K(M)W(K(M))\eta: K(M) \to W(K(M))

and now I'm half-wondering whether composition with the lambda-ring map $W(d): W(K(M)) \to W(\mathbb{Z})$, where I'll describe $d$ in a moment, could morally be a zeta function

ζ:K(M)W(Z).\zeta: K(M) \to W(\mathbb{Z}).

Here I'm guessing that d:K(M)Zd: K(M) \to \mathbb{Z} could be a dimension function, maybe. Or better yet -- Euler characteristic?! It might itself come from a suitable 2-rig map MFinVectQM \to \mathsf{FinVect}_\mathbb{Q}. (Eh, maybe not.)

(As I say, this is playing around at the moment.)

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:00):

Ha ha ha, see just about the first line of the paper --

Steve Lichtenbaum’s philosophy [38, 37, 39] that special values of arithmetic zeta functions and motivic L-functions are given by suitable Euler characteristics.

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:02):

And also of course "Euler characteristic" appears in Remark 2.4. (But I need to think about this more slowly and carefully.)

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:03):

But my god, equation (15) is beautiful!

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:09):

There is a pretty pregnant comment on page 15:

• Theorem 2.1 says that XZ(X,t)X \mapsto Z(X, t) gives rise to a motivic measure Z:GKFqW(Z)Z : GK_{\mathbb{F}_q} \to W(\mathbb{Z}).

and now I remember the slogan I jotted down:

Todd Trimble said:

Slogan from Ramachandran (p. 15):

Motivic measures are invariants of algebraic varieties that behave like Euler characteristics.

(Sorry, this is a real jumble of half-baked thoughts...)

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:27):

On a different front: Ramachandran talks about the Grothendieck ring K(\mathrm{Var}_F) of schemes of finite type over a field F. He says this is a pre-lambda-ring. So that's a kind of poor cousin of an actual lambda-ring. A pre-lambda-ring R is given by a map

R \to W(R) = [\Lambda, R]: r \mapsto (\lambda^i \mapsto \lambda^i(r))

(so I'm defining the lambda operations in terms of the given structure map) satisfying an exponential law

\lambda^n(r + s) = \sum_{n = j + k} \lambda^j(r) \lambda^k(s)

together with the normalizations \lambda^0(r) = 1 and \lambda^1(r) = r, and I think that's about it. I'm roughly thinking that if K(\mathrm{Var}_F) is merely a pre-lambda-ring, and not a lambda-ring, that may be because \mathrm{Var}_F lacks good categorical properties (it's not a 2-rig, for instance). Working with a 2-rig of motives could address this. (?)
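(Editorial aside: on the initial example \mathbb{Z}, where \lambda^i(n) = \binom{n}{i}, the exponential law is exactly Vandermonde's identity \binom{r+s}{n} = \sum_j \binom{r}{j}\binom{s}{n-j}. A small sympy check, my own illustration rather than anything in the paper:)

```python
# My own sympy check: the exponential law for lambda^i(n) = binom(n, i) on Z
# is Vandermonde's identity, verified here symbolically for n = 0, ..., 5.
from sympy import symbols, binomial, expand

r, s = symbols('r s')
for n in range(6):
    lhs = expand(binomial(r + s, n).expand(func=True))
    rhs = expand(sum((binomial(r, j) * binomial(s, n - j)).expand(func=True)
                     for j in range(n + 1)))
    print(n, expand(lhs - rhs) == 0)   # True for each n
```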

view this post on Zulip John Baez (Oct 24 2024 at 20:28):

Indeed, all this stuff is great! What I'd really like to do is "devein" it - as people say of shrimp - and remove the stuff related to varieties, leaving pure 2-rig theory. Of course we may need a 2-rig with extra structure and properties to get various things to work.

view this post on Zulip John Baez (Oct 24 2024 at 20:29):

Todd Trimble said:

On a different front: Ramachandran talks about the Grothendieck ring K(VarF)K(\mathrm{Var}_F) of schemes of finite type over a field FF. He says this is a pre-lambda-ring.

I seem to recall somewhere he says it's an open question whether it's a lambda-ring. Let me see what he says about that...

view this post on Zulip John Baez (Oct 24 2024 at 20:30):

Yeah, at the bottom of page 18 he says there are 4 pre-lambda-ring structures on K(\mathsf{Var}_F), and says that for one of them it's not known whether it's a lambda-ring. I guess that one is the only pre-lambda-ring structure he actually discusses.

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:32):

I'm getting a little lost there...

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:33):

I'd want to study this ζμ^\hat{\zeta_{\mu}} thing.

view this post on Zulip John Baez (Oct 24 2024 at 20:34):

I edited my comment. I'm annoyed that he has a ζμ\zeta_\mu and a ζ^μ\hat{\zeta}_\mu running around and I don't see how he defines ζ^μ\hat{\zeta}_\mu.

view this post on Zulip John Baez (Oct 24 2024 at 20:35):

Oh, I see, it's defined by that factorization diagram near the bottom of page 18.

view this post on Zulip John Baez (Oct 24 2024 at 20:36):

Anyway, I don't understand this portion of the paper yet.

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:37):

Yeah, I'm still not getting it.

view this post on Zulip Todd Trimble (Nov 07 2024 at 21:52):

Looping back to this query:

John Baez said:

A little point:

Todd Trimble said:

First, I ought to define a "line element" in a general lambda-ring RR. The definition I'll adopt (and I believe something like this appears in the literature; I need to check up on that) is that it's an element rRr \in R such that λn(r)=0\lambda^n(r) = 0 for all n2n \geq 2.

I think I see how to show any object in a 2-rig with Λ2x0\Lambda^2 x \cong 0 automatically has Λnx0\Lambda^n x \cong 0 for n>2n > 2. So I would hope that in any lambda-ring λ2(r)=0\lambda^2(r) = 0 implies λn(r)=0\lambda^n(r) = 0 for n>2n > 2. However, my ability to do computations in lambda-rings is not up to the job of proving this!

I think I can show how to refute this by making use of the big Witt ring W(Q)=[Λ,Q]W(\mathbb{Q}) = [\Lambda, \mathbb{Q}]. The idea is to cook up an element f:ΛQf: \Lambda \to \mathbb{Q} for which λ3(f)W(Q)\lambda^3(f) \in W(\mathbb{Q}) can't possibly be zero (e.g., by arranging f(λ3)=1f(\lambda^3) = 1 in Q\mathbb{Q}), but then force λ2(f)W(Q)\lambda^2(f) \in W(\mathbb{Q}) to be zero by a kind of inductive procedure. Here λ2(f)\lambda^2(f) is defined by the formula

(λ2(f))(λi)=f(λiλ2)(\lambda^2(f))(\lambda^i) = f(\lambda^i \bullet \lambda^2)

where λiλ2\lambda^i \bullet \lambda^2 is some polynomial in λ1,λ2,\lambda^1, \lambda^2, \ldots, and what we need to do is define the rational numbers f(λk)f(\lambda^k) so as to satisfy an infinite sequence of equations f(λiλ2)=0f(\lambda^i \bullet \lambda^2) = 0. At the kthk^{th} step of the induction, we will have recursively defined f(λ1),f(λ2),,f(λ2k+1)f(\lambda^1), f(\lambda^2), \ldots, f(\lambda^{2k+1}) so as to make f(λiλ2)=0f(\lambda^i \bullet \lambda^2) = 0 true for i=1i = 1 up to i=ki = k.

So, assume as inductive hypothesis that we're good up to i=k1i = k-1. The "hard part" is to define f(λ2k)f(\lambda^{2k}) so that we pick up f(λkλ2)=0f(\lambda^k \bullet \lambda^2) = 0; then we define f(λ2k+1)f(\lambda^{2k+1}) to be anything you like (at the base case k=1k = 1, remember we chose f(λ3)=1f(\lambda^3) = 1). The key fact to notice is that when we write out λkλ2\lambda^k \bullet \lambda^2 as a polynomial in exterior powers λj\lambda^j, the highest jj that appears is j=2kj = 2k:

λkλ2=Ckλ2k+stuff involving exterior powers less than 2k\lambda^k \bullet \lambda^2 = C_k \lambda^{2k} + \text{stuff involving exterior powers less than } 2k

where C_k is some nonzero integer (nonzero is all the induction needs). So in other words, define

f(λ2k)=1Ckf(stuff involving exterior powers less than 2k)f(\lambda^{2k}) = -\frac1{C_k} f(\text{stuff involving exterior powers less than } 2k)

and you're done.

The plethysm calculation at the level of symmetric polynomials involves writing out the kth elementary symmetric polynomial

e_k(x_1x_2, x_1x_3, \ldots, x_1x_{2k}, x_2x_3, \ldots, x_2x_{2k}, \ldots, x_ix_j, \ldots)

where 1 \leq i < j \leq 2k, and the product x_1 x_2 \cdots x_{2k} picks up one contribution for each way of partitioning \{1, \ldots, 2k\} into k pairs, i.e., for each perfect matching. For example, when k = 2, there are 3 ways to get the monomial x_1x_2x_3x_4 when you write out e_2(x_1x_2, x_1x_3, x_1x_4, x_2x_3, x_2x_4, x_3x_4).

view this post on Zulip Todd Trimble (Nov 08 2024 at 02:37):

By the way, I think this CkC_k is one of these double factorial thingies: Ck=135(2k1)C_k = 1 \cdot 3 \cdot 5 \cdots (2k-1).
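(Editorial aside: the count is easy to check by brute force. The contributions of the squarefree monomial x_1 x_2 \cdots x_{2k} to e_k(x_ix_j : i < j) are exactly the perfect matchings of \{1, \ldots, 2k\}, and their number is the double factorial (2k-1)!!. A short script of my own confirming this for small k:)

```python
# My own brute-force check: the number of ways to choose k distinct pairs from
# {1, ..., 2k} whose product is the squarefree monomial x_1 ... x_{2k} (i.e. the
# number of perfect matchings) equals the double factorial 1*3*5*...*(2k-1).
from itertools import combinations

def monomial_count(k):
    pairs = list(combinations(range(2 * k), 2))
    return sum(1 for choice in combinations(pairs, k)
               if len({v for p in choice for v in p}) == 2 * k)

def double_factorial_odd(k):
    out = 1
    for m in range(1, 2 * k, 2):
        out *= m
    return out

for k in range(1, 5):
    print(k, monomial_count(k), double_factorial_odd(k))   # 1 1, 3 3, 15 15, 105 105
```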

view this post on Zulip John Baez (Nov 08 2024 at 18:42):

That's impressive, Todd! What made you choose to look for this example in the big Witt ring (instead of the free lambda-ring on one generator, or something)?

view this post on Zulip Todd Trimble (Nov 08 2024 at 20:48):

Partly it's that if there is an example in any lambda-ring R whatsoever, then there's going to be one in the cofree lambda-ring W(R), because a lambda-ring R comes equipped with a W-coalgebra structure \eta: R \to W(R) which by abstract nonsense is a W-coalgebra homomorphism, and it is also injective, so any inequalities (like \lambda^3(r) \neq 0) will be preserved. Therefore, you might as well look for examples in cofree lambda-rings.

But also, working with ring maps f:ΛRf: \Lambda \to R felt like an engineering job where you just have to twiddle the parameters f(λi)f(\lambda^i) to make something work, so the task was to see what that something was. Intuitively I felt like λkλ2\lambda^k \bullet \lambda^2, as a polynomial in λj\lambda^j s, was going to have λ2k\lambda^{2k} as its highest exterior power, opening the door to an inductive procedure. Then it was simply a matter of verifying that intuition, which turned out to be straightforward if you merely roll up your sleeves and calculate a few lines.

view this post on Zulip Todd Trimble (Nov 08 2024 at 23:01):

One simple way of seeing there are no higher exterior powers than λ2k\lambda^{2k} in λkλ2\lambda^k \bullet \lambda^2 is to see what this would entail for the initial commutative ring Z\mathbb{Z}, which is also the initial λ\lambda-ring. For Z\mathbb{Z}, the lambda operations λi:ZZ\lambda^i: \mathbb{Z} \to \mathbb{Z} are defined by

λi(n)=(ni)\lambda^i(n) = \binom{n}{i}

and so

(λkλ2)(n)=((n2)k)(\lambda^k \bullet \lambda^2)(n) = \binom{\binom{n}{2}}{k}

which is an integer-valued polynomial function in n of degree 2k. There can't be any lambda-operations \lambda^j occurring in the expansion \lambda^k \bullet \lambda^2 \in \mathbb{Z}[\lambda^1, \lambda^2, \ldots] with j greater than 2k, because otherwise the polynomial function n \mapsto \binom{n}{j} would contribute degree greater than 2k, and that would lead to a contradiction. (More carefully: plethysm preserves homogeneity, so \lambda^k \bullet \lambda^2 is homogeneous of weight 2k, and a monomial \lambda^{j_1} \cdots \lambda^{j_r} has weight j_1 + \cdots + j_r, which rules out any factor \lambda^j with j > 2k.)
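(Editorial aside: the degree claim is easy to verify symbolically. A last sympy check of my own that \binom{\binom{n}{2}}{k} is a polynomial in n of degree exactly 2k for small k:)

```python
# My own sympy check: binom(binom(n,2), k) expands to a polynomial in n of
# degree exactly 2k (with leading coefficient 1/(2^k k!)).
from sympy import symbols, binomial, expand, degree

n = symbols('n')
half = binomial(n, 2).expand(func=True)               # n*(n - 1)/2
for k in range(1, 6):
    p = expand(binomial(half, k).expand(func=True))   # binom(binom(n,2), k)
    print(k, degree(p, n))                            # prints 2, 4, 6, 8, 10
```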