Category Theory
Zulip Server
Archive

You're reading the public-facing archive of the Category Theory Zulip server.
To join the server you need an invite. Anybody can get an invite by contacting Matteo Capucci at name dot surname at gmail dot com.
For all things related to this archive refer to the same person.


Stream: theory: mathematics

Topic: Big Witt ring


view this post on Zulip John Baez (Oct 06 2024 at 17:09):

I'm trying to understand the "big Witt ring" of a commutative ring; there are various perspectives on it.

1) One slick abstract perspective is that the forgetful functor from [[lambda-rings]] to commutative rings has not only the expected left adjoint but also a right adjoint

$$ W : \mathsf{CommRing} \to \lambda\mathsf{Ring} $$

and $W(R)$ is the big Witt ring. This perspective is helpful in proportion to how well you understand lambda-rings. There are several perspectives on lambda-rings, each of which should give a different outlook on the big Witt ring. My favorite is that when you decategorify a 2-rig by taking its Grothendieck group, you get not only a ring but a lambda-ring. But there is another important perspective based on number theory, where a lambda-ring is a commutative ring equipped with commuting [[Frobenius lifts]], one for each prime $p$.

view this post on Zulip John Baez (Oct 06 2024 at 17:14):

2) The underlying set of $W(R)$ is

$$ \mathsf{CommRing}(\Lambda, R) $$

where $\Lambda$ is the underlying ring of the free lambda-ring on one generator. I will explain why later.

$\Lambda$ is a biring, i.e. a ring object in $\mathsf{CommRing}^{\text{op}}$, and the multiplication in $W(R)$ comes from the comultiplication in $\Lambda$.

view this post on Zulip John Baez (Oct 06 2024 at 17:23):

3) Since one can show the underlying commutative ring of the free lambda-ring on one generator is

$$ \Lambda = \mathbb{Z}[\lambda_1, \lambda_2, \dots ] $$

we have an isomorphism of sets

$$ W(R) \cong 1 + t R[[t]] \subset R[[t]] $$

where a ring homomorphism

$$ f \colon \Lambda \to R $$

is mapped to

$$ 1 + \sum_{n = 1}^\infty f(\lambda_n) t^n $$

Then the challenge is to describe the addition and multiplication on $W(R)$ in these terms. People often do this with explicit formulas, which I tend to find cryptic. The addition in $W(R)$ corresponds to the multiplication in $1 + tR[[t]]$, which is why we use this description. The multiplication in $W(R)$ is more tricky. For any $a \in R$ we get an element of $W(R)$ called $(1 - a t)^{-1}$, defined by

$$ (1 - a t)^{-1} = 1 + a t + a^2 t^2 + \cdots $$

and the multiplication $\cdot_W$ on $W(R)$ turns out to be determined once we know

$$ (1 - a t)^{-1} \cdot_W (1 - b t)^{-1} = (1 - a b t)^{-1} $$

This formula turns out to be very useful but I don't have a good understanding of how it comes from 2).
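
To make the addition claim concrete, here is a minimal sympy sketch (an illustration, with an arbitrary truncation order): since Witt addition is multiplication of power series, the $t^n$ coefficient of the Witt sum of $(1-at)^{-1}$ and $(1-bt)^{-1}$ is the convolution $\sum_{j+k=n} a^j b^k$.

```python
# Sketch: Witt addition is multiplication of power series.  We verify the
# geometric series expansion of (1 - a t)^{-1} and compute the coefficients
# of (1 - a t)^{-1} +_W (1 - b t)^{-1} = ((1 - a t)(1 - b t))^{-1}.
from sympy import symbols, series, expand, Poly

a, b, t = symbols('a b t')
N = 6  # truncation order

geo = series(1/(1 - a*t), t, 0, N).removeO()
assert expand(geo - sum(a**n * t**n for n in range(N))) == 0

witt_sum = series(1/((1 - a*t)*(1 - b*t)), t, 0, N).removeO()
p = Poly(expand(witt_sum), t)
for n in range(N):
    assert expand(p.coeff_monomial(t**n)
                  - sum(a**j * b**(n - j) for j in range(n + 1))) == 0
```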

view this post on Zulip John Baez (Oct 06 2024 at 17:31):

4) There's a ring isomorphism

$$ W(R) \cong R \times R \times R \times \cdots $$

where $a \in W(R)$ is sent to $(g_1(a), g_2(a), \dots)$, and

$$ g_i : W(R) \to R $$

sends $a \in W(R)$ to something called its $i$th ghost component. Here's an explicit formula for the ghost components that one often sees. Start by using the isomorphism

$$ W(R) \cong 1 + t R[[t]] $$

from 3) to write $a \in W(R)$ as a power series in $t$. Then write

$$ t \frac{d}{d t} \ln a = \sum_{i = 1}^\infty g_i t^i $$

This is quite cryptic at first sight, but there has got to be some conceptual interpretation, probably involving [[Adams operations]].
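
One can at least check symbolically that the ghost components of $(1-at)^{-1}$ are $(a, a^2, a^3, \dots)$, and that multiplying ghost vectors pointwise reproduces the product rule $(1-at)^{-1} \cdot_W (1-bt)^{-1} = (1-abt)^{-1}$ from 3). A small sympy sketch of that check:

```python
# Sketch: ghost components via g(w) = t (d/dt) ln w, expanded as a series.
# For w = (1 - a t)^{-1} this gives (a, a^2, a^3, ...), and pointwise
# multiplication of ghost vectors matches the product rule for these elements.
from sympy import symbols, log, diff, series, Poly, expand

a, b, t = symbols('a b t')
N = 6

def ghosts(w, n=N):
    g = series(t * diff(log(w), t), t, 0, n).removeO()
    p = Poly(expand(g), t)
    return [p.coeff_monomial(t**i) for i in range(1, n)]

ga  = ghosts(1/(1 - a*t))      # [a, a**2, ...]
gb  = ghosts(1/(1 - b*t))      # [b, b**2, ...]
gab = ghosts(1/(1 - a*b*t))    # [a*b, (a*b)**2, ...]
assert all(expand(x*y - z) == 0 for x, y, z in zip(ga, gb, gab))
```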

view this post on Zulip John Baez (Oct 06 2024 at 17:44):

5) Back to conceptual interpretations, Cartier noticed that the big Witt ring $W(R)$ can be seen as the ring of all formal curves starting at $1$ in the multiplicative group of $R$... or something like that. This is explained here, where that ring of formal curves starting at $1$ is called $C(\mathbb{G}_m, R)$.

This is probably just another way of thinking about 3), but it connects the big Witt ring to formal group laws, and in particular the "multiplicative formal group law", and I believe this should ultimately clarify the following quartet of facts: a) the $K$-theory of any space is a $\lambda$-ring, b) $K$-theory is a complex oriented cohomology theory, c) such cohomology theories are classified by formal group laws, d) $K$-theory corresponds to the multiplicative formal group law.

(Cartier generalized the big Witt ring to an arbitrary formal group law.)

view this post on Zulip John Baez (Oct 06 2024 at 19:19):

It should be possible to clearly see how all these facts follow from the definition 1), but I'm not there yet!

Here's how 1) implies 2). We use the fact that the forgetful functor

$$ U : \lambda\mathsf{Ring} \to \mathsf{CommRing} $$

has not only a left adjoint

$$ F : \mathsf{CommRing} \to \lambda\mathsf{Ring} $$

but also a right adjoint

$$ W : \mathsf{CommRing} \to \lambda\mathsf{Ring} $$

Let's use this to compute the underlying set of $W(R)$ for some commutative ring $R$.

The underlying set of any commutative ring $A$ is $\mathsf{CommRing}(\mathbb{Z}[x], A)$ since $\mathbb{Z}[x]$ is the free commutative ring on one generator.

Thus the underlying set of (the underlying commutative ring of) $W(R)$ is

$$ \mathsf{CommRing}(\mathbb{Z}[x], U(W(R))) $$

but since $U$ has a left adjoint $F$ this is

$$ \lambda\mathsf{Ring}(F(\mathbb{Z}[x]), W(R)) $$

and since $W$ has a left adjoint $U$ this is

$$ \mathsf{CommRing}(U(F(\mathbb{Z}[x])), R) $$

By general nonsense $F(\mathbb{Z}[x])$ is the free lambda-ring on one generator. The underlying commutative ring of this, $U(F(\mathbb{Z}[x]))$, is denoted $\Lambda$. So, from this we get

The underlying set of $W(R)$ is isomorphic to the set of ring homomorphisms from $\Lambda$ to $R$.

It happens that $\Lambda$ is the ring of [[symmetric functions]], but we didn't need this in the above argument. It also happens that the ring of symmetric functions is the polynomial ring on generators $\lambda^i$ called [[elementary symmetric functions]]. This is called the fundamental theorem of symmetric functions. We used this to see why 2) implies 3).

view this post on Zulip Morgan Rogers (he/him) (Oct 07 2024 at 05:50):

Where did this come from originally and why is it named after Witt? I've never heard of it, so I'm curious how it connects to other things and why you're interested in understanding it now ;)

view this post on Zulip David Corfield (Oct 07 2024 at 08:09):

There was an exchange between you and James Borger back here which touched on Big Witt vectors. Jack Morava and David Ben-Zvi chip in and get to Adams operations. Maybe a conversation worth mining.

view this post on Zulip John Baez (Oct 07 2024 at 17:18):

Morgan Rogers (he/him) said:

Where did this come from originally and why is it named after Witt? I've never heard of it, so I'm curious how it connects to other things and why you're interested in understanding it now ;)

Let me dig into the history....

Okay, it turns out Witt introduced a kind of "Witt vector" associated to an algebra over a field of characteristic $p$ back in 1936, in his paper Zyklische Körper und Algebren der Charakteristik $p$ vom Grad $p^n$. Struktur diskret bewerteter perfekter Körper mit vollkommenem Restklassenkörper der Charakteristik $p$. No, that's not the whole paper - it's just the title.

view this post on Zulip John Baez (Oct 07 2024 at 17:21):

Given a commutative ring and a prime $p$ we can make up a "$p$-typical Witt ring" containing these Witt vectors. But then someone noticed you can combine all these $p$-typical Witt rings in a single "big Witt ring", and that's what I'm talking about.

People use these for various mysterious number-theoretic tasks, which I would like to understand someday.

view this post on Zulip John Baez (Oct 07 2024 at 17:28):

But here's why I'm actually interested!

The big Witt ring is now best understood as the cofree lambda-ring on a commutative ring. Lambda-rings are important in representation theory, where they let us study things like exterior powers and symmetric powers of representations. But also, surprisingly, they're important in number theory, where they let us study things like "Frobenius lifts" - lifting the Frobenius endomorphism of a field to commutative algebras over that field.

view this post on Zulip John Baez (Oct 07 2024 at 17:30):

For a long time I've been interested, just as a hobby, in understanding why people think the Riemann Hypothesis is connected to the "field with one element". So I was very interested when lambda-rings and the big Witt ring became important in James Borger's approach to the field with one element:

His idea is that an algebra over the field with one element is a lambda-ring.

Digging into this, I've come to understand fairly well how lambda-rings arise from decategorifying concepts from representation theory. I've always liked representation theory, and @Joe Moeller and @Todd Trimble and I have shown that categories of representations tend to be 2-rigs, and the Grothendieck group of any 2-rig is a lambda-ring.

view this post on Zulip John Baez (Oct 07 2024 at 17:32):

But I'm less comfortable with how lambda-rings are connected to number theory and Frobenius lifts. Mainly, it seems like a miracle that lambda-rings show up in two different contexts: representation theory in characteristic zero, and number theory in characteristic $p$!

They can't really be different; they must be deeply connected, so I want to understand this.

view this post on Zulip John Baez (Oct 07 2024 at 17:36):

One way for me to start understanding this is to take the big Witt ring of a ring $R$, which doesn't seem to involve number theory or primes - it's just the cofree lambda-ring on $R$ - and see how it's built from $p$-typical lambda-rings, one for each prime.

view this post on Zulip John Baez (Oct 07 2024 at 17:38):

For years I've felt I don't have the intelligence to fully understand the big Witt ring - so I wished there were some sort of "half-Witt ring" to practice on. But it's gradually starting to make sense.

view this post on Zulip Morgan Rogers (he/him) (Oct 07 2024 at 18:11):

That sounds like fun! Someone suggested to me at some point that I should try and apply some of the stuff I've done with monoids to Lambda-rings, so I'll be keeping an eye on this topic.

view this post on Zulip John Baez (Oct 07 2024 at 18:34):

What could you do with lambda-rings, speculatively speaking?

view this post on Zulip John Baez (Oct 07 2024 at 18:38):

David Corfield said:

There was an exchange between you and James Borger back here which touched on Big Witt vectors. Jack Morava and David Ben-Zvi chip in and get to Adams operations. Maybe a conversation worth mining.

Yes! One difficulty I've had is connecting the 'big picture' ideas to the nitty-gritty computations with lambda-rings and the big Witt ring. So, I've been digging into the nitty-gritty and now maybe I'll be better able to connect it to the big picture.

view this post on Zulip John Baez (Oct 07 2024 at 18:39):

There's a nice connection between Adams operations and Frobenius operators which is starting to make sense to me.

view this post on Zulip Josselin Poiret (Oct 08 2024 at 08:27):

I haven't thought about this in a bit, but I remember that the explanation on the nLab was very useful for understanding why lambda-rings are interesting.

view this post on Zulip Morgan Rogers (he/him) (Oct 08 2024 at 16:20):

John Baez said:

What could you do with lambda-rings, speculatively speaking?

It's been just long enough that I can't remember; the general idea was to treat them as internal somethings in a topos of suitably chosen monoid actions, but that much is not exactly profound.

view this post on Zulip John Baez (Oct 08 2024 at 18:59):

Josselin Poiret said:

the explanation on the nLab was very useful for understanding why lambda-rings are interesting

Yes, thanks! It seems to have improved since I last read it, or maybe I just know more now. This is about what Borger calls the heterodox interpretation of a lambda-ring as a ring with a family of commuting Frobenius lifts, one for each prime.

view this post on Zulip John Baez (Oct 08 2024 at 19:22):

Equivalently it can be seen as a ring with commuting [[p-derivations]] - a generalization of the concept of derivation. This allows us to develop a number-theoretic generalization of the concept of Taylor series, building on the known analogy between the ring of formal power series $\mathbb{Z}[[x]]$ and the ring of $p$-adic integers, which gives a geometrical interpretation of localizing at a prime. If we go down this fascinating road, it's good to treat a lambda-ring as a [[Joyal delta-ring]].

view this post on Zulip John Baez (Oct 08 2024 at 19:25):

But I understand a lot more about what Borger calls the orthodox interpretation of a lambda-ring, as a ring equipped with $\lambda$-operations. The idea here is that if we have a category with well-behaved "exterior power" operations, like a category of vector bundles or group representations, these will endow its Grothendieck ring with $\lambda$-operations making it into a lambda-ring.

view this post on Zulip John Baez (Oct 08 2024 at 22:23):

I find this to be a great intro to the 'heterodox' interpretation of lambda-rings and the big Witt ring:

Ignore the scary title and read the nice review material!

view this post on Zulip Todd Trimble (Oct 12 2024 at 04:07):

Let me get back to you later.

view this post on Zulip John Baez (Oct 12 2024 at 05:01):

Todd and I were talking about how multiplication in the big Witt ring $W(R)$ of a commutative ring $R$ arises from comultiplication in the biring of symmetric functions $\Lambda$, via the formula

$$ W(R) = \mathsf{CRing}(\Lambda, R) $$

Recall that as a commutative ring,

$$ \Lambda = \mathbb{Z}[\lambda^1, \lambda^2, \dots ] $$

which we could conceivably write as

$$ \Lambda = \mathbb{Z}[\lambda^1(x), \lambda^2(x), \dots ] $$

to emphasize that $\Lambda$ is the free $\lambda$-ring on one generator $x$. In fact Todd uses such a notation for the free $\lambda$-ring on two generators $x$ and $y$, which is $\Lambda \otimes \Lambda$.

view this post on Zulip John Baez (Oct 12 2024 at 05:06):

Since the comultiplication

$$ \mu: \Lambda \to \Lambda \otimes \Lambda $$

is a ring homomorphism, it will be determined once we know what it does to each of the generators $\lambda^i$. I think Todd showed me how it works for $\lambda^1$ and I asked how it worked for $\lambda^2$, or something like that. Here was his reply - we've decided to talk about this here.

view this post on Zulip John Baez (Oct 12 2024 at 05:07):

You asked what the comultiplication

$$ \mu: \Lambda \to \Lambda \otimes \Lambda $$

applied to $\lambda^2$ looks like. Short answer:

$$ (\mu(\lambda^2))(x, y) = (\lambda^1(x))^2 \lambda^2(y) + \lambda^2(x)(\lambda^1(y))^2 - 2\lambda^2(x)\lambda^2(y) $$

where we think of $\Lambda \otimes \Lambda$ as the free lambda-ring on two generators $x, y$. This can also be written more nicely as $\sigma^2(x) \lambda^2(y) + \lambda^2(x) \sigma^2(y)$.

Longer answer: use the splitting principle, which guarantees that the 2-rig map $\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes n}$, sending the generator $x$ to a sum $x_1 \oplus \ldots \oplus x_n$ of $n$ independent bosonic subline objects, is an extension when restricted to polynomial functors of degree $\leq n$. Since $\lambda^2$ is degree $2$, this means in effect that we can pretend the generator $x$ of $\overline{k\mathsf{S}}$ is a sum $x_1 + x_2$ of two bosonic sublines. Then the 2-rig comultiplication $\overline{k\mathsf{S}} \to \overline{k\mathsf{S}} \boxtimes \overline{k\mathsf{S}}$, taking $x$ to $x \boxtimes y$ per our first paper, induces the map of lambda-rings $\Lambda \to \Lambda \otimes \Lambda$ that takes $x = x_1 + x_2$ to $x \otimes y = (x_1 + x_2)(y_1 + y_2) = x_1 y_1 + x_1 y_2 + x_2 y_1 + x_2 y_2$. Since this lambda-ring map preserves the $\lambda^2$ operation, we calculate

$$ \mu(\lambda^2 x) = \lambda^2 (\mu x) = \lambda^2(x_1 y_1 + x_1 y_2 + x_2 y_1 + x_2 y_2) $$

and use the exponential law for $\lambda^2$ plus vanishing of $\lambda^2(x_i y_j)$ to write this out long-hand. Sparing some gory details, this gives the answer in the short identity.
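
The short identity can be verified by exactly this brute expansion. A minimal sympy sketch, treating $\lambda^1, \lambda^2$ as the elementary symmetric polynomials $e_1, e_2$ in the split variables:

```python
# Sketch: write x = x1 + x2 and y = y1 + y2; then lambda^2 applied to the
# four products x_i y_j is e_2 of that 4-element multiset, and we compare
# with the claimed polynomial in e_1, e_2 of the x's and y's.
from sympy import symbols, expand

x1, x2, y1, y2 = symbols('x1 x2 y1 y2')
prods = [x1*y1, x1*y2, x2*y1, x2*y2]

# e_2 of four quantities: sum of products of distinct pairs
lhs = sum(prods[i]*prods[j] for i in range(4) for j in range(i + 1, 4))

e1x, e2x = x1 + x2, x1*x2
e1y, e2y = y1 + y2, y1*y2
rhs = e1x**2 * e2y + e2x * e1y**2 - 2 * e2x * e2y
assert expand(lhs - rhs) == 0
```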

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:35):

I wonder how much of this needs to be explained further?

To my taste, the symbols $\lambda^i$ and $\sigma^i$ refer primarily to certain functors, and secondarily to symmetric functions; ultimately these two points of view are united via the all-powerful splitting principle. More precisely, the ring $\Lambda$ is the Grothendieck group or ring based on isomorphism classes of Schur functors, which are essentially those endofunctors on the category of vector spaces (let's say over $\mathbb{C}$) that you can build up using $\otimes$ (together with its symmetric monoidal structure), coproducts, and splitting of idempotents. For example, $V \mapsto V \otimes V$ is a Schur functor. If $\sigma: V \otimes V \to V \otimes V$ denotes the symmetry that swaps tensor factors, then the endomorphisms

$$ \frac{1 + \sigma}{2}: V \otimes V \to V \otimes V, \qquad \frac{1 - \sigma}{2}: V \otimes V \to V \otimes V $$

are idempotent maps, and we can split these idempotents to obtain the symmetric square $S^2(V)$ and exterior square $\Lambda^2(V)$. Likewise, the symmetric power and exterior power functors, $V \mapsto S^n(V)$ and $V \mapsto \Lambda^n(V)$, are Schur functors. Tensor products and coproducts of Schur functors are again Schur functors, and Schur functors are closed under composition.
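
For a concrete instance of these idempotents, here is a small sympy sketch with $V = k^2$, where $\dim S^2(V) = 3$ and $\dim \Lambda^2(V) = 1$:

```python
# Sketch: the swap map sigma on V (x) V for dim V = 2, the symmetrizer
# (1+sigma)/2 and antisymmetrizer (1-sigma)/2, their idempotency, and the
# ranks giving dim S^2(V) = 3 and dim Lambda^2(V) = 1.
from sympy import Matrix, Rational, eye

# basis e_i (x) e_j of V (x) V indexed by r = 2*i + j; sigma swaps (i, j)
sigma = Matrix(4, 4, lambda r, c: 1 if (r // 2, r % 2) == (c % 2, c // 2) else 0)
sym = Rational(1, 2) * (eye(4) + sigma)
alt = Rational(1, 2) * (eye(4) - sigma)

assert sym * sym == sym and alt * alt == alt   # idempotent
assert sym + alt == eye(4)                     # complementary splitting
assert sym.rank() == 3 and alt.rank() == 1     # dims of S^2 and Lambda^2
```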

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:35):

The symbols $\lambda^i$ and $\sigma^i$ can be read as the isomorphism classes $[\Lambda^i]$ and $[S^i]$, regarded as elements of the ring $\Lambda$ (whose addition is induced from taking coproducts, and whose multiplication is induced from taking tensor products). It turns out -- and this is by no means trivial -- that as a ring, $\Lambda$ is isomorphic to the polynomial algebra $\mathbb{Z}[\lambda^1, \lambda^2, \ldots]$. It is also the polynomial algebra $\mathbb{Z}[\sigma^1, \sigma^2, \ldots]$: both sets $\{\lambda^i\}$ and $\{\sigma^i\}$ serve as polynomial bases. There are other famous bases as well, which I won't mention right now, but you can read about them in famous texts such as Representation Theory by Fulton and Harris, and Symmetric Functions and Hall Polynomials by Macdonald.
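
Reading $\lambda^i$ as the elementary symmetric function $e_i$ and $\sigma^i$ as the complete homogeneous symmetric function $h_i$ (the standard dictionary), the change of basis between the two is given by classical identities. A small sympy sketch checking two of them in enough variables:

```python
# Sketch: h_2 = e_1^2 - e_2 and h_3 = e_1^3 - 2 e_1 e_2 + e_3, checked in
# 4 variables (enough for identities of degree <= 3 to be faithful).
from sympy import symbols, expand, Mul
from itertools import combinations, combinations_with_replacement

xs = symbols('x1:5')

def e(k):  # elementary symmetric polynomial e_k
    return sum(Mul(*c) for c in combinations(xs, k))

def h(k):  # complete homogeneous symmetric polynomial h_k
    return sum(Mul(*c) for c in combinations_with_replacement(xs, k))

assert expand(h(2) - (e(1)**2 - e(2))) == 0
assert expand(h(3) - (e(1)**3 - 2*e(1)*e(2) + e(3))) == 0
```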

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:36):

But there is so much more to $\Lambda$! People go gaga over the richness of its structure and its interplay with the rest of mathematics. I'll try to indicate some main features of this structure by first pointing to similar features of a far simpler structure, namely the polynomial algebra $\mathbb{Z}[x]$, which represents the forgetful functor

$$ U: \mathsf{CRing} \to \mathsf{Set} $$

in the sense of a natural isomorphism $U \cong \mathsf{CRing}(\mathbb{Z}[x], -)$. Now, we can remind the forgetful functor of its (tautological) ring object structure, by pointing at natural transformations $a: U \times U \to U$ (whose components are the addition functions $U(R) \times U(R) \to U(R)$) and $m: U \times U \to U$ (multiplication). At the level of the representing object $\mathbb{Z}[x]$, these transformations are induced by ring homomorphisms

$$ \alpha: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x], \qquad \mu: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x] $$

Here we should pause to note that $A \otimes B$ is the coproduct of commutative rings $A, B$, which means, by the universal property of coproducts, that $\mathsf{CRing}(A \otimes B, -) \cong \mathsf{CRing}(A, -) \times \mathsf{CRing}(B, -)$. Thus $\mathbb{Z}[x] \otimes \mathbb{Z}[x]$ is the representing object of $U \times U$.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:36):

In any category with coproducts, we can define a notion of co-ring object, dual to the notion of a ring object in a category with products. So what we can say is that according to the above, $\mathbb{Z}[x]$ is a co-ring object in the category of rings. We call it a biring. I'll leave it as a semi-advanced exercise in applying the Yoneda lemma to figure out explicit formulas for the co-ring structure on $\mathbb{Z}[x]$ that we are talking about here.

Stepping back a little: any time a representable functor $\mathsf{CRing}(A, -): \mathsf{CRing} \to \mathsf{Set}$ has ring object structure -- or in other words lifts up through the forgetful functor $U: \mathsf{CRing} \to \mathsf{Set}$ to give an endofunctor $G: \mathsf{CRing} \to \mathsf{CRing}$ -- the representing object $A$ becomes a biring. It can be shown that such lifts of representable functors are necessarily limit-preserving, or better yet they are right adjoints. Thus right adjoint endofunctors on $\mathsf{CRing}$ are equivalent to biring structures. The case of the biring $\mathbb{Z}[x]$ corresponds to the identity endofunctor on $\mathsf{CRing}$ (which of course is a right adjoint).

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:36):

But wait, there's more! In our example, $\mathbb{Z}[x]$ carries another binary operation $\mathbb{Z}[x] \times \mathbb{Z}[x] \to \mathbb{Z}[x]$, namely polynomial composition $(p, q) \mapsto p \circ q$ (replacing the $x$ in $p(x)$ by $q(x)$). If you'll allow me to abuse language and write the identity functor as

$$ \mathsf{CRing}(\mathbb{Z}[x], -): \mathsf{CRing} \to \mathsf{CRing} $$

for the biring $\mathbb{Z}[x]$ sitting in the contravariant slot, then the polynomial composition $\mathbb{Z}[x] \times \mathbb{Z}[x] \to \mathbb{Z}[x]$ can be read as contravariantly inducing a transformation in the other direction,

$$ \mathsf{CRing}(\mathbb{Z}[x], -) \to \mathsf{CRing}(\mathbb{Z}[x], \mathsf{CRing}(\mathbb{Z}[x], -)) $$

(these manipulations might seem a tad puzzling, and indeed it takes some fancy footwork to put it just right, but I'm going to skip over that -- you can read Schur Functors and Categorified Plethysm for details). I'll just say that the polynomial composition on $\mathbb{Z}[x]$ corresponds to the identity transformation $\mathrm{Id} \to \mathrm{Id} \circ \mathrm{Id}$ that is the comultiplication for the tautological comonad structure on $\mathrm{Id}$.
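
As a toy model of the operation being described, polynomial composition really does behave like the co-operation of a comonad-style structure on $\mathbb{Z}[x]$, with $x$ as its unit. A tiny sympy sketch:

```python
# Sketch: composition (p, q) |-> p o q on Z[x], with x a two-sided unit
# and composition associative -- the structure that plethysm generalizes.
from sympy import symbols, expand

x = symbols('x')

def compose(p, q):
    return expand(p.subs(x, q))

p = x**2 + 3*x + 1
q = 2*x - 5
r = x**3 - x
assert compose(p, x) == expand(p) and compose(x, q) == q
assert compose(compose(p, q), r) == compose(p, compose(q, r))
```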

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:37):

More generally, if $A$ is a biring, and if there is a comonad structure on the lifted endofunctor $\mathsf{CRing}(A, -): \mathsf{CRing} \to \mathsf{CRing}$, then the comultiplication transfers over to an operation (a function) $A \times A \to A$ that behaves similarly to polynomial composition. This operation is called plethysm, and a biring equipped with a plethysm operation is called a plethory. So in summary, giving a plethory is equivalent to giving a right adjoint comonad on $\mathsf{CRing}$; the plethory is the representing object for that comonad.

So now I can tell you that the main structural feature of $\Lambda$ is that it carries a plethory structure. A rather complicated plethory structure that is still very far away from being fully understood.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:37):

Where does that plethory structure come from? Well, John mentioned at the top of this thread that the forgetful functor $U$ from the category of lambda-rings to the category of commutative rings has both a left adjoint $F$ (expected for abstract nonsense reasons) and a right adjoint $G$ (a much more specialized circumstance). So $G$ and also $U$ are right adjoints, hence the comonad $G \circ U$ is also a right adjoint. The plethory $\Lambda$ is the representing object for that right adjoint comonad.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:38):

This likely still sounds very mysterious, because for example, what are lambda-rings? Instead of saying directly what they are, I'll say in a different way where the comonad $GU$ comes from. In our paper Schur Functors and Categorified Plethysm, we explain that it is the result of decategorifying a 2-comonad that is much easier (conceptually) to understand. Namely, what we do is categorify the story I was telling, about how $\mathbb{Z}[x]$ with its plethory structure represents the identity comonad on commutative rings. In this particular categorification, commutative rings are replaced by 2-rigs, which are symmetric monoidal $\mathsf{Vect}$-enriched categories that have coproducts and idempotent splittings. The forgetful functor $\mathsf{CRing} \to \mathsf{Set}$ is replaced by the forgetful (2-)functor $\mathbf{2Rig} \to \mathsf{Cat}$. This forgetful functor is representable, by a category (rather, 2-rig) of abstract Schur functors, so this category $\mathrm{Schur}$ is the replacement of $\mathbb{Z}[x]$ at the categorified level, and it is a 2-birig that represents the identity functor on the 2-category of 2-rigs. It carries a 2-plethory structure, corresponding to the tautological 2-comonad structure on this identity functor.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:38):

The air up here on this categorified mountaintop is clean and clear; the view is simple and beautiful. But then we descend, from $\mathrm{Schur}$ down to its set of isomorphism classes, or really I mean its Grothendieck ring $\Lambda$. That descent process is called 'decategorification'. And it's a little tricky. It took us months of study to make sure of our footing and the right path down to the valley. After all, we start with the "easiest" 2-comonad in the world, the identity functor on 2-rigs, and somehow this decategorifies down to an extremely non-trivial comonad on commutative rings, namely this right adjoint comonad $GU$ I mentioned. Then we can (and do) define lambda-rings to be the coalgebras of this comonad.

Okay, I've just given a thumbnail sketch of our first paper.

view this post on Zulip Todd Trimble (Oct 12 2024 at 16:38):

To return to the topic, though: John quoted my reply to a question he asked about a specific calculation, about the biring $\Lambda$. As I said, $\Lambda$, even just as a biring let alone a plethory, is a pretty complicated beast, and I believe not completely grasped in terms of giving explicit formulas for the comultiplication (co-addition is rather easier to deal with). But some small calculations you can do by hand, and I was describing to him how to go about calculating what the comultiplication $\mu: \Lambda \to \Lambda \otimes \Lambda$ does to the element $\lambda^2$, by exploiting the subject of our second paper, the splitting principle [set in the context of 2-rigs].

All this is related to the second and third posts at the top of this thread, where John is again in search of ways to wrap one's head around the comultiplication, and he was quoting some stuff he saw in a paper by Niranjan Ramachandran which gives some hints. I can come around to discussing that as well, but in order for others to be able to follow along, I thought it would help to give some background, hence this string of ear-bending comments.

view this post on Zulip John Baez (Oct 12 2024 at 17:49):

Thanks, Todd! I was distracted last night and didn't quite know the best way to kick off the conversation.

view this post on Zulip John Baez (Oct 12 2024 at 17:55):

I just noticed a small notational point that might puzzle and perturb novices:

Todd Trimble said:

To my taste, the symbols $\lambda^i$ and $\sigma^i$ refer primarily to certain functors, and secondarily to symmetric functions; ultimately these two points of view are united via the all-powerful splitting principle. [...] If $\sigma: V \otimes V \to V \otimes V$ denotes the symmetry that swaps tensor factors, then the endomorphisms

$$ \frac{1 + \sigma}{2}: V \otimes V \to V \otimes V, \qquad \frac{1 - \sigma}{2}: V \otimes V \to V \otimes V $$

are idempotent maps, and we can split these idempotents to obtain the symmetric square $\Sigma^2(V)$ and exterior square $\Lambda^2(V)$. Likewise, the symmetric power and exterior power functors, $V \mapsto S^n(V)$ and $V \mapsto \Lambda^n(V)$, are Schur functors.

This point is that the symmetric functions $\sigma^i$ are not powers of the "switch" map $\sigma: V \otimes V \to V \otimes V$ - instead there is a map sending Schur functors to symmetric functions, and the $\sigma^i$ are the symmetric functions corresponding to the "$i$th symmetrized tensor power" functors $V \mapsto S^i(V)$.

In the study of the symmetric group, people like lots of things named $S$, or $\mathsf{S}$, or $\Sigma$, or $\sigma$, and sometimes we get carried away and the notations conflict!

view this post on Zulip Todd Trimble (Oct 12 2024 at 17:56):

Yeah, I edited to change notation in that comment from $\Sigma^2(V)$ (which is one tradition) to $S^2(V)$, which we seem to favor.

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:00):

Also, I did not explain how Schur functors give rise to symmetric functions. I wouldn't think it was completely common knowledge, how that goes. But I mentioned that the connection arises through this splitting principle that keeps coming up.

view this post on Zulip John Baez (Oct 12 2024 at 18:03):

I also have another slightly less picayune comment. If we went ahead and computed $(\mu(\lambda^n))(x, y)$, this formula:

$$ (\mu(\lambda^2))(x, y) = (\lambda^1(x))^2 \lambda^2(y) + \lambda^2(x)(\lambda^1(y))^2 - 2\lambda^2(x)\lambda^2(y) $$

would become the 2nd term in a sequence of similar expressions: polynomials in the $\lambda^i$ where the total degree of the $n$th polynomial is $2n$, it seems, if we count $\lambda^i$ as having degree $i$.

Are these polynomials we should recognize? Are they famous?

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:12):

Mm, I imagine they're quite famous, and also pretty intensively studied, but my impression is that we ("we" being the community of mathematicians, including the experts) don't know them all yet. I wish I knew what people called them by name. Atiyah and Tall give the lackluster notation $P_n$ to these polynomials (page 258), and I think that notation might be pretty common. But your posts at the top of the thread are all about these polynomials!

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:32):

So for example, the Encyclopedia of Mathematics uses the same notation $P_n$.

I've been scanning Hazewinkel for notation, and I guess it's true that the formulas can be made much more explicit if you use a different polynomial basis, like the power sum basis. He gives a number of such formulas around page 46. So maybe I'll eat my words a little, but anyway I don't know if the explicit list of polynomials purely in terms of the $\lambda^i$ is completely known. Maybe someone else can say.

(I suppose I should know Hazewinkel's article a lot better than I do, because there's evidently a lot of great stuff in it. I'm a little put off by his crapola typesetting job, but I guess that'll be more on me than on him.)

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:36):

And now that I'm looking at Hazewinkel further, I see that he brings up quasisymmetric functions in section 11, which are supposed to be really important in this biz. Joachim Kock gave an interesting talk about quasisymmetric functions at the CT 2024 conference. I should be looking more into these things.

view this post on Zulip John Baez (Oct 12 2024 at 18:40):

Do you have any hint as to why quasisymmetric functions are important? I don't know any conceptual explanation of them, so I sometimes have cynically wondered if it's a case of "what are you going to do when you're an expert on symmetric functions and you run out of good ideas? Invent quasisymmetric functions and generalize all the theorems to those!" It's probably not true.

(Lusztig once said "Some people like to take theorems about groups and generalize them to quantum groups. I like to find theorems about quantum groups that aren't like theorems about groups." He came up with some amazing results....)

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:49):

Eh, I don't yet. But Hazewinkel says at the beginning of section 11, "When looking at various universality properties of the Witt vectors and Symm (which is the topic of the next section) one rapidly stumbles over a (maximally) non commutative version, NSymm, and a (maximally) non cocommutative version, QSymm. This section is devoted to a brief discussion of these two objects. Somehow a good many things become easier to see and to formulate in these contexts (including certain explicit calculations). As I have said before, e.g. in [200], p. 56; [199], Ch. H1, p. 1, once one has found the right non commutative version, things frequently become more transparent, easier to understand, and much more elegant."

view this post on Zulip Todd Trimble (Oct 12 2024 at 18:54):

There's a certain amount of hardcore algebraic combinatorics to all this. Another buzzphrase that seems relevant and important to me here is this so-called Cauchy identity; see around pages 457-458 of Fulton and Harris. (Perhaps I'm jotting this down as a reminder just to myself to come back to it -- it would be very boring to the casual onlooker.)

view this post on Zulip John Baez (Oct 12 2024 at 21:15):

Thanks - I'll try to find out if anyone has a name for these polynomials $P_n$. That could unlock a lot of wisdom - or at least piles of cryptic and unmotivated identities that we might find conceptual explanations for. :upside_down:

view this post on Zulip John Baez (Oct 12 2024 at 21:16):

For other people listening in, let me try to give a conceptual explanation of these polynomials based on algebraic topology. Todd already knows this, at least implicitly, but I feel like saying it.

view this post on Zulip John Baez (Oct 12 2024 at 21:19):

The operations of direct sum and tensor product can be applied to matrices, and in particular to unitary matrices, so if $U(n)$ is the group of $n \times n$ unitary matrices then we get Lie group homomorphisms

$$ \oplus: U(m) \times U(n) \to U(m + n) $$

$$ \otimes: U(m) \times U(n) \to U(m \times n) $$

We can work with all $m, n$ simultaneously if we use the obvious inclusions

$$ U(n) \to U(n+1) $$

to define the colimit of topological (or even 'smooth') groups

$$ U = \lim_{\longrightarrow} U(n) $$

view this post on Zulip John Baez (Oct 12 2024 at 21:22):

Then we get binary operations which are group homomorphisms

$$ \oplus : U \times U \to U $$

$$ \otimes: U \times U \to U $$

in addition to the group operation, which is another binary operation. (By the way, I believe books on K-theory use an Eckmann-Hiltonesque argument to show that $\oplus$ is homotopic to the group operation, and even better.)

view this post on Zulip John Baez (Oct 12 2024 at 21:24):

These maps induce maps on the classifying space for stable complex vector bundles, $BU$:

$$ \oplus : BU \times BU \to BU $$

$$ \otimes: BU \times BU \to BU $$

view this post on Zulip John Baez (Oct 12 2024 at 21:25):

and thus we get maps on K-theory going backward, which we can call coaddition and comultiplication:

$$ \alpha : K(BU) \to K(BU) \otimes K(BU) $$
$$ \mu: K(BU) \to K(BU) \otimes K(BU) $$

view this post on Zulip John Baez (Oct 12 2024 at 21:32):

Since $K$ of any space also has a ring structure, these wind up making $K(BU)$ into a 'biring'. But this biring is just our friend the free $\lambda$-ring on one generator, which @Todd Trimble has been explaining. This is called $\Lambda$. As commutative rings we have

$$ K(BU) \cong \Lambda \cong \mathbb{Z}[\lambda^1, \lambda^2, \dots ] $$

Here $\lambda^i$, thought of as a symmetric function, is the $i$th elementary symmetric polynomial. But

$$ K(BU) \cong H^{\text{even}}(BU, \mathbb{Z}) $$

and thought of as an element of the integral cohomology of $BU$, $\lambda^i$ is called the $i$th Chern class. It's a cohomology class of degree $2i$.
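
The dictionary between $\lambda^i$ and the $i$th elementary symmetric polynomial can be seen in the usual splitting-principle way: for a sum of line bundles with first Chern classes ("Chern roots") $x_1, \dots, x_n$, the total Chern class is $\prod_i (1 + x_i t)$, whose $t^i$ coefficient is $e_i$. A small sympy sketch (using a formal variable $t$ to keep track of degree):

```python
# Sketch: the t^i coefficient of prod(1 + x_i t) is the i-th elementary
# symmetric polynomial e_i(x_1, ..., x_n), here with n = 4.
from sympy import symbols, expand, Poly, Mul
from itertools import combinations

t = symbols('t')
xs = symbols('x1:5')

total = expand(Mul(*[1 + xi*t for xi in xs]))  # total Chern class
p = Poly(total, t)
for i in range(len(xs) + 1):
    e_i = sum(Mul(*c) for c in combinations(xs, i))
    assert expand(p.coeff_monomial(t**i) - e_i) == 0
```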

view this post on Zulip John Baez (Oct 12 2024 at 21:49):

So now let's think about what comultiplication

$$ \mu: K(BU) \to K(BU) \otimes K(BU) $$

does to $\lambda^i$, in these terms!

I'll use the fact I hinted at: for a paracompact Hausdorff space $X$, the set of homotopy classes $[X, BU]$ is isomorphic to the set of stable complex vector bundles over $X$: that is, equivalence classes of vector bundles over $X$, where we count two as equivalent if they become isomorphic after summing with the same complex vector bundle.

Using this and the definition of K-theory we get

$$ K(X) \cong [X, BU] $$

After all $K(X)$ is defined to be the set of stable complex vector bundles over $X$, made into a commutative ring using $\oplus$ and $\otimes$.

view this post on Zulip John Baez (Oct 12 2024 at 21:57):

Thus, $K(BU) \cong [BU, BU]$ is a somewhat self-referential entity: it's the commutative ring of stable vector bundles on $BU$. It's a ring because of the operations $\oplus, \otimes : BU \times BU \to BU$ acting on the covariant argument in $[BU, BU]$, and a coring because of these operations acting on the contravariant argument in $[BU, BU]$.

view this post on Zulip John Baez (Oct 12 2024 at 21:59):

I'm sort of meandering, but from all this we get yet another interpretation of the elements

$$ \lambda^i \in \mathbb{Z}[\lambda^1, \lambda^2, \dots ] \cong \Lambda \cong K(BU) $$

Namely, they must come from stable vector bundles on $BU$!

view this post on Zulip John Baez (Oct 12 2024 at 22:01):

I should describe these stable vector bundles, but I won't now. Instead, I just want to say what the map

$$ \mu: K(BU) \to K(BU) \otimes K(BU) \cong K(BU \times BU) $$

does to each element $\lambda^i$.

view this post on Zulip John Baez (Oct 12 2024 at 22:09):

It works like this: we can take any stable vector bundle on $BU$ and pull it back along the tensor product map

$$ \otimes : BU \times BU \to BU $$

and get a stable vector bundle on $BU \times BU$. This induces a map

$$ K(BU) \to K(BU \times BU) \cong K(BU) \otimes K(BU) $$

and this is just our friend the comultiplication $\mu$.

So what does $\mu(\lambda^i)$ mean? We start with the stable vector bundle $\lambda^i$ on $BU$, and pull it back along $\otimes: BU \times BU \to BU$. I believe every stable vector bundle on $BU \times BU$ is an integral linear combination of tensor products of stable vector bundles $\lambda^j \boxtimes \lambda^k$, where $\boxtimes$ is the 'external' tensor product of stable vector bundles: if you've got one on some space $X$ and one on some space $Y$, you can tensor them and get one on $X \times Y$.

view this post on Zulip John Baez (Oct 12 2024 at 22:13):

If so, we should be able to take $\lambda^i$, pull it back along $\otimes$, and write the resulting $\mu(\lambda^i)$ in terms of the $\lambda^j \boxtimes \lambda^k$. And I believe Todd's calculation is an example of this. He wrote

$$ (\mu(\lambda^2))(x, y) = (\lambda^1(x))^2 \lambda^2(y) + \lambda^2(x)(\lambda^1(y))^2 - 2\lambda^2(x)\lambda^2(y) $$

but in my current notation I believe this means

$$ \mu(\lambda^2) = (\lambda^1)^{\otimes 2} \boxtimes \lambda^2 + \lambda^2 \boxtimes (\lambda^1)^{\otimes 2} - 2\lambda^2 \boxtimes \lambda^2 $$

view this post on Zulip John Baez (Oct 12 2024 at 22:17):

The first term here is not manifestly a tensor product of stable vector bundles $\lambda^j \boxtimes \lambda^k$, but it actually is: it's

$$ (\lambda^1 \boxtimes \lambda^2) \otimes (\lambda^1 \boxtimes 1) $$

where $1$ is the trivial line bundle (the identity in the ring $K(BU)$).

view this post on Zulip John Baez (Oct 12 2024 at 22:19):

So, in short, the polynomial people call $P_i$ answers this question:

Given a stable vector bundle $\lambda^i$ on $BU$, and pulling it back along $\otimes : BU \times BU \to BU$, how can we express it in terms of the stable vector bundles $\lambda^j \boxtimes \lambda^k$?

view this post on Zulip John Baez (Oct 12 2024 at 22:19):

We could also frame this in terms of Chern classes.

All this 'fluffy' stuff doesn't help us compute the polynomials $P_i$. And indeed, Todd already showed one way to do that. It simply says why we should care about these polynomials.

view this post on Zulip John Baez (Oct 16 2024 at 01:24):

@Todd Trimble had written to me some more about the big Witt ring, in which he starts analyzing some formulas from here:

In particular, this paper discusses a formula for multiplication in the big Witt ring which I mentioned earlier:

John Baez said:

3) Since one can show the underlying commutative ring of the free lambda-ring on one generator is

$$ \Lambda = \mathbb{Z}[\lambda_1, \lambda_2, \dots ] $$

we have an isomorphism of sets

$$ W(R) \cong 1 + t R[[t]] \subset R[[t]] $$

where a ring homomorphism

$$ f \colon \Lambda \to R $$

is mapped to

$$ 1 + \sum_{n = 1}^\infty f(\lambda_n) t^n $$

Then the challenge is to describe the addition and multiplication on $W(R)$ in these terms. People often do this with explicit formulas, which I tend to find cryptic. The addition in $W(R)$ corresponds to the multiplication in $1 + tR[[t]]$, which is why we use this description. The multiplication in $W(R)$ is more tricky. For any $a \in R$ we get an element of $W(R)$ called $(1 - a t)^{-1}$, defined by

$$ (1 - a t)^{-1} = 1 + a t + a^2 t^2 + \cdots $$

and the multiplication $\cdot_W$ on $W(R)$ turns out to be determined once we know

$$ (1 - a t)^{-1} \cdot_W (1 - b t)^{-1} = (1 - a b t)^{-1} $$

This formula turns out to be very useful but I don't have a good understanding of how it comes from 2).

view this post on Zulip John Baez (Oct 16 2024 at 01:26):

The first question is: why should we care about these elements $(1 - at)^{-1}$? And the second is: what does the above formula for a product of them mean? And the third is: does it really determine the product on all of the big Witt ring $W(R)$?

Todd was making progress on all these questions.

view this post on Zulip Todd Trimble (Oct 16 2024 at 01:29):

Yes, sorry, I was going to say something about that! But let me collect my thoughts.

view this post on Zulip Todd Trimble (Oct 16 2024 at 01:32):

The paper of Ramachandran that John linked to mentioned that there are several reasonable choices for the (big) Witt ring multiplication. This has a lot to do with how there are various reasonable choices for a nice polynomial basis of $\Lambda$.

view this post on Zulip Todd Trimble (Oct 16 2024 at 01:36):

So going back to the message at the top of the thread: one conceptual way to define $W(R)$ is that it is the hom-set $\mathsf{CRing}(\Lambda, R)$. Thanks to the rich plethory structure on $\Lambda$ that I was sketching earlier, the hom-set acquires a commutative ring structure and even a lambda-ring structure, and indeed furnishes the right adjoint to the forgetful functor from lambda-rings to commutative rings, as John mentioned earlier.

view this post on Zulip Todd Trimble (Oct 16 2024 at 01:42):

Now it seems that a lot of sources introduce $W(R)$ as consisting of formal power series with constant coefficient $1$, i.e., elements in $1 + tR[[t]]$. So there is an isomorphism $\mathsf{CRing}(\Lambda, R) \to 1 + tR[[t]]$, and John showed how this might go: using the polynomial basis $\lambda^i$, we can define this isomorphism as sending $f: \Lambda \to R$ to $1 + \sum_{i \geq 1} f(\lambda^i) t^i$.

Another possibility is to use the polynomial basis $\sigma^i$, and define the isomorphism so as to send $f$ to $1 + \sum_{i \geq 1} f(\sigma^i) t^i$.
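
These two identifications are related by a classical generating-function identity: reading $\lambda^i$ as $e_i$ and $\sigma^i$ as $h_i$, the series $E(t) = \sum_n e_n t^n$ and $H(t) = \sum_n h_n t^n$ satisfy $H(t)E(-t) = 1$. A sympy sketch in finitely many variables, where $E(t) = \prod_i (1 + x_i t)$ and $H(t) = \prod_i (1 - x_i t)^{-1}$:

```python
# Sketch: H(t) E(-t) = 1 for symmetric polynomials in 4 variables,
# checked up to order t^5.
from sympy import symbols, series, expand, Mul

t = symbols('t')
xs = symbols('x1:5')
N = 6

E = expand(Mul(*[1 + xi*t for xi in xs]))
H = series(Mul(*[1/(1 - xi*t) for xi in xs]), t, 0, N).removeO()
check = series(expand(H * E.subs(t, -t)), t, 0, N).removeO()
assert expand(check - 1) == 0
```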

view this post on Zulip Todd Trimble (Oct 16 2024 at 01:48):

Each of these has its uses. But before tackling what any of this has to do with those formulas John mentioned, it might not be a bad exercise to look for a moment at how addition works in the Witt ring $1 + tR[[t]]$. (No, it is not ordinary addition of power series!) It turns out that the same formula will work whether you use the $\lambda^i$ basis or the $\sigma^i$ basis, and it's based on the co-addition on $\Lambda$ that was lightly alluded to. Maybe I'll pause a moment.

view this post on Zulip John Baez (Oct 16 2024 at 02:04):

This sounds like a good plan! Please pause all night long if you want... I'm about to have dinner, and it's 3 hours later for you.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:53):

In fact, I can take this opportunity to go a smidge deeper into our first paper. First, how does co-addition work, from first principles? I plan to be very methodical about this, which might make it look heavy in places -- I'll try to ameliorate that by surrounding some of the main conclusions by extra white space, so that readers can skip ahead to get the main points.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:53):

The plan is to see how co-addition works in the toy example of $\mathbb{Z}[x]$, and then categorify that. In discussion above I said that the explicit formula for co-addition $\alpha: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x]$ can be derived as a "semi-advanced exercise" in using the Yoneda lemma, so I'll start with that. I'll use the notation $[A, B]$ to denote hom-sets (usually hom-sets that acquire extra structure). The Yoneda lemma is about representable functors. Here we have $\phi: [\mathbb{Z}[x], -] \cong U$ where $U: \mathsf{CRing} \to \mathsf{Set}$ is the forgetful functor; evaluated at a ring $R$, the isomorphism takes $f: \mathbb{Z}[x] \to R$ to $f(x) \in U(R)$. Similarly we have $[\mathbb{Z}[x] \otimes \mathbb{Z}[x], -] \cong U \times U$, instantiated by

$$ [\mathbb{Z}[x] \otimes \mathbb{Z}[x], -] \overset{(\pi_1, \pi_2)}{\longrightarrow} [\mathbb{Z}[x], -] \times [\mathbb{Z}[x], -] \overset{\phi \times \phi}{\longrightarrow} U \times U $$

where the first product projection $\pi_1$ is induced by the first coproduct coprojection $i_1: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x]: x \mapsto x \otimes 1$, and $\pi_2$ is induced by the second coproduct coprojection $i_2: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x]: x \mapsto 1 \otimes x$. Taking $R = \mathbb{Z}[x] \otimes \mathbb{Z}[x]$, chase the identity element $1_{\mathbb{Z}[x] \otimes \mathbb{Z}[x]}$ through the sequence

$$ [\mathbb{Z}[x] \otimes \mathbb{Z}[x], R] \overset{(\pi_1, \pi_2)}{\longrightarrow} [\mathbb{Z}[x], R] \times [\mathbb{Z}[x], R] \overset{\phi \times \phi}{\longrightarrow} U(R) \times U(R) \overset{+_R}{\longrightarrow} U(R) \overset{\phi^{-1}}{\longrightarrow} [\mathbb{Z}[x], R], $$

a la the proof of the Yoneda lemma. We get

$$ 1_{\mathbb{Z}[x] \otimes \mathbb{Z}[x]} \mapsto (x \overset{i_1}{\mapsto} x \otimes 1,\ x \overset{i_2}{\mapsto} 1 \otimes x) \mapsto (x \otimes 1, 1 \otimes x) \overset{+}{\mapsto} x \otimes 1 + 1 \otimes x \mapsto (x \mapsto x \otimes 1 + 1 \otimes x). $$

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:54):

  \;

In other words, the co-addition $\alpha: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x]$ is the unique ring map taking $x$ to $x \otimes 1 + 1 \otimes x$.

  \;

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:54):

The same type of calculation shows that the comultiplication $\mu: \mathbb{Z}[x] \to \mathbb{Z}[x] \otimes \mathbb{Z}[x]$ is the unique map taking $x$ to $(x \otimes 1) \cdot (1 \otimes x) = x \otimes x$.

(One could simply guess these formulas and check that they work, but I think it's nice to know how the Yoneda lemma removes any guesswork.)

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:55):

If one uses $\mathbb{Z}[x] \otimes \mathbb{Z}[x] \cong \mathbb{Z}[x, y]$, in effect identifying $x \otimes 1$ with $x \in \mathbb{Z}[x, y]$ and $1 \otimes x$ with $y \in \mathbb{Z}[x, y]$, then the co-addition becomes simply the ring map $\mathbb{Z}[x] \to \mathbb{Z}[x, y]$ taking $x$ to $x + y$, which makes everything look simple and obvious. The comultiplication takes $x$ to $xy$.
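
In this $\mathbb{Z}[x, y]$ picture, the way the co-operations recover the ring structure on hom-sets is easy to simulate: a homomorphism $f: \mathbb{Z}[x] \to R$ is just its value $f(x)$, and pushing through $x \mapsto x + y$ (resp. $x \mapsto xy$) and then applying $f$ and $g$ on the two legs yields $f(x) + g(x)$ (resp. $f(x)g(x)$). A tiny sympy sketch:

```python
# Sketch: co-addition x |-> x + y and comultiplication x |-> x*y on Z[x],
# inducing pointwise addition and multiplication of homomorphisms into R.
from sympy import symbols, expand

x, y, r, s = symbols('x y r s')

coadd = x + y   # alpha(x), with Z[x] (x) Z[x] ~ Z[x, y]
comul = x * y   # mu(x)

def induced(op, fx, gy):
    # substitute f's value on the x-leg and g's value on the y-leg,
    # then evaluate the resulting expression in R
    return expand(op.subs({x: fx, y: gy}, simultaneous=True))

assert induced(coadd, r, s) == r + s
assert induced(comul, r, s) == r * s
```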

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:55):

Moving in the opposite direction, suppose given a (commutative, cocommutative) biring $B$. The addition on $[B, R] = \mathsf{CRing}(B, R)$ is retrieved from the co-addition $\alpha: B \to B \otimes B$ as a composite

$$ [B, R] \times [B, R] \overset{\sim}{\longrightarrow} [B \otimes B, R] \overset{[\alpha, 1_R]}{\longrightarrow} [B, R] \qquad (1) $$

where the isomorphism obtains by the universal property of $B \otimes B$ as a coproduct. To be explicit, this isomorphism takes a pair of homomorphisms $(f: B \to R, g: B \to R)$ to the composite

$$ B \otimes B \overset{f \otimes g}{\longrightarrow} R \otimes R \overset{\nabla}{\longrightarrow} R $$

where the codiagonal $\nabla$ is precisely the multiplication $m: R \otimes R \to R$.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:56):

  \;

Hence the composite $(1)$ takes a pair of morphisms $(f, g)$ to the composite

$$ B \overset{\alpha}{\to} B \otimes B \overset{f \otimes g}{\to} R \otimes R \overset{m}{\to} R \qquad (2) $$

and this defines $f + g$ in the ring $[B, R]$.

  \;

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:56):

Replacing the co-addition $\alpha$ by the comultiplication $\mu$, the same construction as in $(2)$ produces the product $f \cdot g$ in $[B, R]$.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:56):

Now categorify all this. Replace $\mathbb{Z}[x]$, the free commutative ring on one generator, with the free 2-rig on one generator, which is the (additive) Cauchy completion of the $k$-linearization of the free symmetric monoidal category $\mathsf{S}$ on one generator. (To get anywhere interesting in categorifying commutative rings, you should add in some limits/colimits, and Cauchy completeness is a good place to start.) We write this as $\overline{k\mathsf{S}}$.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:57):

I think I mentioned before that this $\overline{k\mathsf{S}}$ is the representing 2-rig for the forgetful functor $\mathbf{2Rig} \to \mathsf{Cat}$, and on those grounds alone, through abstract nonsense, one can copy over (or categorify) the development above for $\mathbb{Z}[x]$, to derive a 2-birig structure on $\overline{k\mathsf{S}}$, with a categorified co-addition given by the unique (up to isomorphism) 2-rig map

$$ \alpha: \overline{k\mathsf{S}} \to \overline{k\mathsf{S}(x, y)} $$

(where the codomain is the free 2-rig on two generators $x, y$) that sends the generator $x$ of $\overline{k\mathsf{S}}$ to the formal coproduct $x \oplus y$ in $\overline{k\mathsf{S}(x, y)}$.

view this post on Zulip Todd Trimble (Oct 17 2024 at 18:58):

It is interesting to watch what $\alpha$ does to objects like $S^n$ and $\Lambda^n$ in $\overline{k\mathsf{S}}$. I'll start with $S^n$, the $n^{th}$ symmetric power. For any 2-rig $\mathcal{R}$, there is a 2-rig $\mathcal{R}[\mathbb{N}]$ of graded $\mathcal{R}$-objects (whose symmetric monoidal tensor is given by Day convolution, coming from $\mathbb{N}$), and one way we can view the symmetric power $S^n(r) = r^{\otimes n}/S_n$ for an object $r$ of $\mathcal{R}$, which I sometimes like to write as $r^{\otimes n}/n!$, is that it is the $n^{th}$ homogeneous component of a symmetric algebra construction, which I will write as

$$ \exp(r) = \sum_{n \geq 0} \frac{r^{\otimes n}}{n!} = \sum_{n \geq 0} S^n(r). $$

Here I'm thinking of the object $r$ as sitting in grade $1$ in $\mathcal{R}[\mathbb{N}]$, so that $S^n(r)$ sits in grade $n$. This $\exp(r)$ is the free commutative monoid on $r$, and the category of commutative monoid objects has $\otimes$ as its coproduct. This free functor gives a (partially defined) left adjoint [I say "partial" because if for example $r'$ is in degree $0$, then so would be $\exp(r')$, but maybe the 2-rig $\mathcal{R}$ we started with doesn't have infinite coproducts like $\sum_{n \geq 0} S^n(r)$ -- there's a better chance of success if the coproduct is spread across grades]. Since left adjoints preserve coproducts, we deduce an isomorphism

$$ \exp(r \oplus s) \cong \exp(r) \otimes \exp(s). $$

view this post on Zulip Todd Trimble (Oct 17 2024 at 19:00):

  \;
Whereupon, focusing on one grade at a time, we further deduce

$$ S^n(r \oplus s) \cong \sum_{j + k = n} S^j(r) \otimes S^k(s), $$

an identity which holds as a natural transformation between Schur functors of type $\mathcal{R} \to \mathcal{R}$. This holds in particular when $\mathcal{R} = \overline{k\mathsf{S}(x, y)}$.

  \;
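
Decategorified to dimensions of vector spaces, this is the familiar convolution identity $\binom{a+b+n-1}{n} = \sum_{j+k=n} \binom{a+j-1}{j}\binom{b+k-1}{k}$, since $\dim S^n(\mathbb{C}^d) = \binom{d+n-1}{n}$. A quick numeric sketch:

```python
# Sketch: dim S^n(C^{a+b}) = sum_{j+k=n} dim S^j(C^a) * dim S^k(C^b).
from math import comb

def dim_sym(d, n):  # dimension of the n-th symmetric power of C^d
    return comb(d + n - 1, n)

for a in range(1, 5):
    for b in range(1, 5):
        for n in range(6):
            assert dim_sym(a + b, n) == sum(
                dim_sym(a, j) * dim_sym(b, n - j) for j in range(n + 1))
```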

view this post on Zulip Todd Trimble (Oct 17 2024 at 19:01):

Decategorifying this "identity" by taking isomorphism classes, i.e., by taking Grothendieck rings, this implies that the induced co-addition $\alpha: \Lambda \to \Lambda \otimes \Lambda$ satisfies

$$ \alpha(\sigma^n) = \sum_{j + k = n} \sigma^j(x) \otimes \sigma^k(y) $$

where the right side is visibly a convolution product.

view this post on Zulip Todd Trimble (Oct 17 2024 at 19:02):

Consider now the induced addition on $[\Lambda, R]$ ($R$ a commutative ring), sending a pair $(f, g)$ of homomorphisms $\Lambda \to R$ to the homomorphism

$$ \Lambda \overset{\alpha}{\longrightarrow} \Lambda \otimes \Lambda \overset{f \otimes g}{\longrightarrow} R \otimes R \overset{m}{\longrightarrow} R. $$

This composite takes $\sigma^n$ to $\sum_{j + k = n} f(\sigma^j) g(\sigma^k)$. In other words, Witt ring addition is defined by

$$ (f +_W g)(\sigma^n) = \sum_{j + k = n} f(\sigma^j) g(\sigma^k) $$

and if we set up an isomorphism $[\Lambda, R] \overset{\sim}{\longrightarrow} 1 + tR[[t]]$ by $f \mapsto \sum_{n \geq 0} f(\sigma^n) t^n$, then the induced Witt ring addition on $1 + tR[[t]]$ is given by multiplying power series.
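
A finite-truncation sketch of exactly this statement: represent $f$ by the sequence $(f(\sigma^0) = 1, f(\sigma^1), \dots)$, form the convolution above, and compare with the product of the associated polynomials.

```python
# Sketch: the convolution (f +_W g)(sigma^n) = sum_{j+k=n} f(sigma^j) g(sigma^k)
# is the t^n coefficient of the product of the associated power series.
from sympy import symbols, expand, Poly

t = symbols('t')
fs = [1] + list(symbols('f1:6'))   # f(sigma^n) for n = 0..5
gs = [1] + list(symbols('g1:6'))
N = len(fs)

witt_sum = [sum(fs[j] * gs[n - j] for j in range(n + 1)) for n in range(N)]

F = sum(c * t**n for n, c in enumerate(fs))
G = sum(c * t**n for n, c in enumerate(gs))
p = Poly(expand(F * G), t)
for n in range(N):
    assert expand(p.coeff_monomial(t**n) - witt_sum[n]) == 0
```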

view this post on Zulip Todd Trimble (Oct 17 2024 at 19:02):

It turns out the same is true if we use instead the isomorphism $[\Lambda, R] \overset{\sim}{\longrightarrow} 1 + tR[[t]]$ given by $f \mapsto \sum_{n \geq 0} f(\lambda^n) t^n$. At the beginning of that story, we have the identity

$$ \Lambda^n(r \oplus s) \cong \sum_{j + k = n} \Lambda^j(r) \otimes \Lambda^k(s) $$

as a natural transformation between Schur functors on any 2-rig $\mathcal{R}$. To see this, we replace the 2-rig $\mathcal{R}[\mathbb{N}]$ of graded objects, with its "vanilla" symmetry $\sigma(u \otimes v) = v \otimes u$ for objects $u$ in grade $p$ and $v$ in grade $q$, with the more sophisticated symmetry that introduces a sign factor, $\sigma(u \otimes v) = (-1)^{pq} v \otimes u$. Otherwise, the entire story (symmetric algebra construction as free commutative monoid, etc.) remains the same, mutatis mutandis, hence we get this "exponential identity". Therefore, Witt addition on formal power series is again multiplication, even if we opt for this other identification with $1 + tR[[t]]$.

view this post on Zulip Todd Trimble (Oct 17 2024 at 19:08):

Now that I've fully explained how addition on the big Witt ring works, I can turn my attention to how multiplication works. Multiplication of $f, g: \Lambda \to R$ is defined by the composite

$$ \Lambda \overset{\mu}{\longrightarrow} \Lambda \otimes \Lambda \overset{f \otimes g}{\longrightarrow} R \otimes R \overset{m}{\longrightarrow} R $$

where this time we have to figure out how the comultiplication $\mu$ on $\Lambda$ works. It is, of course, gotten by decategorifying (taking isomorphism classes) the unique (up to isomorphism) 2-rig map

$$ \overline{k\mathsf{S}} \to \overline{k\mathsf{S}(x, y)} $$

that takes the generator $x$ to the formal tensor product $x \otimes y$.

Working this out in detail will involve the "splitting principle" for 2-rigs, which John will be discussing at the upcoming Octoberfest meeting, but perhaps I'll pause to take a break from this longish series of comments.

view this post on Zulip John Baez (Oct 18 2024 at 18:06):

Todd Trimble said:

Therefore, Witt addition on [Λ,R][\Lambda, R] again corresponds to multiplication of formal power series, even if we use this second identification [Λ,R]1+tR[[t]][\Lambda, R] \cong 1 + tR[[t]].

That's great! Maybe someday we should write a little exposition of the big Witt ring. (Better than a big exposition of the little Witt ring, explaining stuff like this. :upside_down:)

It might be fun to see what Witt addition looks like if we use the basis for Λ\Lambda given by power sum symmetric functions.

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:46):

Yes, all these seem like good suggestions!

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:46):

I'm going to push on with this series of posts; the next topic will be this splitting principle that I keep banging on about.

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:47):

For us, the splitting principle is the idea that to establish isomorphisms between Schur functors, it is permissible to pretend that they can be decomposed as coproducts of "line objects". It is analogous to the splitting principle in K-theory, where equations are verified by acting as if vector bundles split as coproducts of line bundles.

This sounds a little flaky perhaps, so I'll put it more precisely in a moment, but first I want to say that the situation reminds me of how those Italian mathematicians from the late Renaissance who developed the cubic formula -- Cardano, Tartaglia, etc. -- made the bold move to act as if i=1i = \sqrt{-1} were a thing. Even when all the roots of the cubic polynomial were real, they were arrived at by making use of imaginary elements. (True, they were fairly uncomfortable with the situation, and it took a few centuries before mathematicians felt generally at home with splitting extensions of fields.) The analogy is apt: the coefficients of a polynomial are symmetric functions of their roots, and the roots are analogous to the line objects we are about to discuss. Don't believe the analogy? That's okay. Humor me anyway by considering a linear transformation T:VVT: V \to V on a vector space. By passing to an extension of the scalar field if need be, i.e., adjoining roots, we can split VV into a coproduct of eigenspaces (which generically are lines), by splitting the characteristic polynomial into linear factors.

(At the risk of too much self-indulgence: this also reminds me of splitting light into a spectrum, where the energy levels of photons are given by eigenvalues of a suitable operator.)

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:47):

Okay, now let me state the splitting principle more precisely. For the purposes of this thread, define a line object in a 2-rig R\mathcal{R} to be a (nonzero) object rr satisfying Λ2(r)0\Lambda^2(r) \cong 0. (In our paper, we say instead "bosonic subline object".) Another way of saying it is that the canonical quotient rrS2(r)r \otimes r \to S^2(r) is an isomorphism, or equivalently that the symmetry σ:rrrr\sigma: r \otimes r \to r \otimes r that transposes factors equals the identity. Or equivalently still, that the tautological action of SnS_n on rnr^{\otimes n} is trivial for all nn. Finally, this last condition is equivalent to saying that for the symmetric powers SnS^n, we have Sn(r)rnS^n(r) \cong r^{\otimes n} for all nn.

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:48):

Just as kS\overline{k\mathsf{S}} with its generator xx is initial among 2-rigs equipped with an object, so kN\overline{k\mathbb{N}} (the linear Cauchy completion of the linearized discrete symmetric monoidal category N\mathbb{N}) is initial among 2-rigs equipped with a line object. Here kN\overline{k\mathbb{N}} is equivalent to the category of N\mathbb{N}-graded vector spaces of finite total dimension, which we in our paper denote as A\mathsf{A}, and the line object in this universal case is a 1-dimensional vector space concentrated in grade 11. Likewise, the walking 2-rig on nn line objects L1,,LnL_1, \ldots, L_n, denoted as An\mathsf{A}^{\boxtimes n}, is equivalent to the category of Nn\mathbb{N}^n-graded vector spaces of finite total dimension.

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:48):

Assume the ground field kk is of characteristic zero. Suppose given Schur objects F,G:SVectF, G: \mathsf{S} \to \mathsf{Vect}, also known as polynomial functors, say of degree nn or less; the latter means they are valued in finite-dimensional vector spaces, and that their restrictions to the symmetric groups SmS_m are zero for m>nm > n. Thus FF is given by (let's say right) linear representations F[j]F[j] of SjS_j for j=0,,nj = 0, \ldots, n. The polynomial or Schur functor itself takes a finite-dimensional vector space VV to F~(V):=j=0nF[j]SjVj\tilde{F}(V) := \sum_{j = 0}^n F[j] \otimes_{S_j} V^{\otimes j}. In fact this formula for the Schur functor F~\tilde{F} makes sense for any 2-rig, taking VV to be an arbitrary object in the 2-rig. In particular, it can be applied to V=L1LnV = L_1 \oplus \cdots \oplus L_n in An\mathsf{A}^{\boxtimes n}.

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:49):

The splitting principle we use concerns properties of the 2-rig map kSAn\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes n} that takes FF to F~(L1Ln)\tilde{F}(L_1 \oplus \cdots \oplus L_n). In the form that we use it here, it states that the restriction of this 2-rig map to the subcategory of Schur objects or polynomial functors of degree at most nn is essentially injective, i.e., if

F~(L1Ln)G~(L1Ln)\tilde{F}(L_1 \oplus \cdots \oplus L_n) \cong \tilde{G}(L_1 \oplus \cdots \oplus L_n)

for F,GF, G of degree at most nn, then FGF \cong G. (Actually we prove more: that this restricted functor is faithful and conservative as well. But the essential injectivity property just stated is the one that is really key.)

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:49):

We can play with this a bit. For example, we can calculate

Λj(L1Ln)\Lambda^j(L_1 \oplus \cdots \oplus L_n)

(the Schur functor Λj\Lambda^j is F~\tilde{F} in the case where F=sgnjF = \mathrm{sgn}_j, the sign representation of SjS_j), by exploiting the exponential identity mentioned earlier:

Λj(L1Ln)j1++jn=jΛj1(L1)Λjn(Ln).\Lambda^j(L_1 \oplus \cdots \oplus L_n) \cong \bigoplus_{j_1 + \ldots + j_n = j} \Lambda^{j_1}(L_1) \otimes \cdots \otimes \Lambda^{j_n}(L_n).

For a line object LL, Λp(L)=0\Lambda^p(L) = 0 for all p>1p > 1, as is easily shown by induction (since Λp+1(r)\Lambda^{p+1}(r) is a retract of rΛp(r)r \otimes \Lambda^p(r) in any 2-rig). Thus the only summands that survive on the right in the last display line are the ones where all the indices j1,,jnj_1, \ldots, j_n are 00 or 11 (and add up to jj). Thus

Λj(L1Ln)1i1<<ijnLi1Lij.\Lambda^j(L_1 \oplus \cdots \oplus L_n) \cong \bigoplus_{1 \leq i_1 < \ldots < i_j \leq n} L_{i_1} \otimes \cdots \otimes L_{i_j}.

Letting xix_i denote the isomorphism class [Li][L_i], the Grothendieck ring of An\mathsf{A}^{\boxtimes n} is isomorphic to the polynomial ring Z[x1,,xn]\mathbb{Z}[x_1, \ldots, x_n], and the isomorphism class of the coproduct on the right is

1i1<<ijnxi1xi2xij\sum_{1 \leq i_1 < \ldots < i_j \leq n} x_{i_1} x_{i_2} \cdots x_{i_j}

which is precisely the jthj^{th} elementary symmetric polynomial ej(x1,,xn)e_j(x_1, \ldots, x_n) as defined by the identity

j=0nej(x1,,xn)tj=i=1n(1+xit).\sum_{j = 0}^n e_j(x_1, \ldots, x_n) t^j = \prod_{i=1}^n (1 + x_i t).
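
(A quick sympy sanity check of this identity, with n = 4; just an illustration:)

import itertools
from functools import reduce
from operator import mul
import sympy as sp

def product(seq):
    return reduce(mul, seq, sp.Integer(1))

n = 4
t = sp.symbols('t')
xs = sp.symbols(f'x1:{n+1}')

gen = sp.expand(product(1 + x * t for x in xs))
for j in range(n + 1):
    e_j = sum(product(c) for c in itertools.combinations(xs, j))
    assert sp.expand(gen.coeff(t, j) - e_j) == 0
print("coeff of t^j in prod(1 + x_i t) = e_j(x_1,...,x_n)")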

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:50):

Similarly, the exponential identity for the symmetric powers yields

Sj(L1Ln)j1++jn=jSj1(L1)Sjn(Ln)S^j(L_1 \oplus \cdots \oplus L_n) \cong \bigoplus_{j_1 + \ldots + j_n = j} S^{j_1}(L_1) \otimes \cdots \otimes S^{j_n}(L_n)

and using the fact noted earlier that for line objects LL we have Sn(L)LnS^n(L) \cong L^{\otimes n}, this may be rewritten as

Sj(L1Ln)j1++jn=jL1j1Lnjn.S^j(L_1 \oplus \cdots \oplus L_n) \cong \bigoplus_{j_1 + \ldots + j_n = j} L_1^{\otimes j_1} \otimes \cdots \otimes L_n^{\otimes j_n}.

The isomorphism class of the expression on the right is

j1++jn=jx1j1xnjn\sum_{j_1 + \ldots + j_n = j} x_1^{j_1} \cdots x_n^{j_n}

which is precisely hj(x1,,xn)h_j(x_1, \ldots, x_n), the jthj^{th} complete homogeneous symmetric polynomial in nn variables, as defined by the identity

j=0hj(x1,,xn)tj=i=1n11xit.\sum_{j = 0}^\infty h_j(x_1, \ldots, x_n) t^j = \prod_{i=1}^n \frac1{1 - x_i t}.
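
(And the analogous sympy check for the complete homogeneous side, with n = 3, truncating the geometric series at t^4:)

import itertools
from functools import reduce
from operator import mul
import sympy as sp

def product(seq):
    return reduce(mul, seq, sp.Integer(1))

n, deg = 3, 4
t = sp.symbols('t')
xs = sp.symbols(f'x1:{n+1}')

# expand prod_i 1/(1 - x_i t) as a power series, truncated at t^deg
gen = sp.expand(product(sum(x**k * t**k for k in range(deg + 1)) for x in xs))
for j in range(deg + 1):
    h_j = sum(product(c) for c in itertools.combinations_with_replacement(xs, j))
    assert sp.expand(gen.coeff(t, j) - h_j) == 0
print("coeff of t^j in prod 1/(1 - x_i t) = h_j(x_1,...,x_n)")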

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:51):

Letting Λn\Lambda_n be the Grothendieck group of the category of Schur objects of degree at most nn, the splitting principle implies that the induced map

ΛnZ[x1,,xn],\Lambda_n \to \mathbb{Z}[x_1, \ldots, x_n],

taking [F][F] to [F~(L1Ln)][\tilde{F}(L_1 \oplus \cdots \oplus L_n)], is an injection. Of course [F~(L1Ln)][\tilde{F}(L_1 \oplus \cdots \oplus L_n)] is manifestly invariant under permutations of the elements xi=[Li]x_i = [L_i], so that elements in the image of this map are symmetric polynomials in the xix_i (of degree no more than nn). Recall that every symmetric polynomial in variables x1,,xnx_1, \ldots, x_n is uniquely a polynomial p(e1,,en)p(e_1, \ldots, e_n) in the elementary symmetric polynomials ej=ej(x1,,xn)e_j = e_j(x_1, \ldots, x_n). Provided that the total degree of p(e1,,en)p(e_1, \ldots, e_n) is no more than nn (where the degree of eje_j is of course jj), we have

p(λ1,,λn)Λn.p(\lambda^1, \ldots, \lambda^n) \in \Lambda_n.

Since the map ΛnZ[x1,,xn]\Lambda_n \to \mathbb{Z}[x_1, \ldots, x_n] carries λj\lambda^j to ej(x1,,xn)e_j(x_1, \ldots, x_n), we deduce that every class [F]Λn[F] \in \Lambda_n is a polynomial p(λ1,,λn)p(\lambda^1, \ldots, \lambda^n) of total degree no more than nn, which is already a nontrivial theorem. It says that every polynomial functor of degree less than or equal to nn is isomorphic to a suitable coproduct of tensor powers of exterior power functors.
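
For instance, in degrees 2 and 3 the theorem gives σ2=(λ1)2λ2\sigma^2 = (\lambda^1)^2 - \lambda^2 and σ3=(λ1)32λ1λ2+λ3\sigma^3 = (\lambda^1)^3 - 2\lambda^1\lambda^2 + \lambda^3. Here's a sympy check (my own sketch) of the corresponding symmetric-polynomial identities h_2 = e_1^2 - e_2 and h_3 = e_1^3 - 2 e_1 e_2 + e_3 in four variables:

import itertools
from functools import reduce
from operator import mul
import sympy as sp

def product(seq):
    return reduce(mul, seq, sp.Integer(1))

xs = sp.symbols('x1:5')

def e(j):
    return sum(product(c) for c in itertools.combinations(xs, j))

def h(j):
    return sum(product(c) for c in itertools.combinations_with_replacement(xs, j))

assert sp.expand(h(2) - (e(1)**2 - e(2))) == 0
assert sp.expand(h(3) - (e(1)**3 - 2*e(1)*e(2) + e(3))) == 0
print("sigma^2 and sigma^3 written as polynomials in the lambda^j")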

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:52):

We are beginning to come full circle. Passing to suitable limits, explained in our paper, we can summarize one of our consequences of the splitting principle by the intuitive formula

1+λ1t+λ2t2+=i=1(1+[Li]t)1 + \lambda^1 t + \lambda^2 t^2 + \cdots = \prod_{i=1}^\infty (1 + [L_i]t)

so that we are splitting an "infinite polynomial" into linear factors, a la splitting extensions in the sense of Galois theory, and the coefficients λi\lambda^i are thereby manifestly identified with symmetric functions in the xi=[Li]x_i = [L_i], which play a role similar to roots in the splitting extension.

view this post on Zulip Todd Trimble (Oct 18 2024 at 21:52):

Similarly, we may write

1+σ1t+σ2t2+=i=1(1[Li]t)1.1 + \sigma^1 t + \sigma^2 t^2 + \cdots = \prod_{i=1}^\infty (1 - [L_i]t)^{-1}.

This goes at least some distance towards answers to some of John's questions at the top of this thread, although we still have some ways to go. I'm going to take a break for the moment.

view this post on Zulip John Baez (Oct 19 2024 at 04:25):

I will take this opportunity to digress a bit and ponder Todd's suggestion that this formula arising from the splitting principle

1+λ1t+λ2t2+=i=1(1+[Li]t)1 + \lambda^1 t + \lambda^2 t^2 + \cdots = \prod_{i=1}^\infty (1 + [L_i]t)

amounts to splitting an "infinite polynomial" into linear factors. This analogy seems extremely strong. We can take any monic polynomial p(t)p(t) of degree nn, factor it as

p(t)=i=1n(t+ai) p(t) = \prod_{i=1}^n (t + a_i)

and then write the coefficients of this polynomial as

p(t)=i=0nei(a1,,an)tni p(t) = \sum_{i=0}^n e_i(a_1, \dots, a_n) t^{n-i}

where eie_i is a degree ii polynomial in a1,,ana_1, \dots, a_n. If I'm doing it right, eie_i is the ith elementary symmetric polynomial in nn variables. E.g.

e0=1e_0 = 1

e1=a1++ane_1 = a_1 + \cdots + a_n

e2=1j<knajake_2 = \sum_{1 \le j < k \le n} a_j a_k

and so on.
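
(Here's the cubic case of that statement, checked with sympy, just for fun:)

import sympy as sp

t, a, b, c = sp.symbols('t a b c')
p = sp.expand((t + a) * (t + b) * (t + c))
e1, e2, e3 = a + b + c, a*b + b*c + c*a, a*b*c
assert sp.expand(p - (t**3 + e1*t**2 + e2*t + e3)) == 0
print("coefficients of (t+a)(t+b)(t+c) are e_1, e_2, e_3")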

But the λi\lambda^i, thought of as symmetric functions, are very similar! They're the elementary symmetric functions, which are essentially elementary symmetric polynomials in infinitely many variables.

What exactly is the relation between what I just said and what you explained about

1+λ1t+λ2t2+=i=1(1+[Li]t)1 + \lambda^1 t + \lambda^2 t^2 + \cdots = \prod_{i=1}^\infty (1 + [L_i]t) ?

There are some differences in convention, e.g. I was talking about monic polynomials while you're looking at a 'comonic' power series (where the coefficient of the constant term is 1), and correspondingly I've got factors of t+ait + a_i while you've got factors of 1+[Li]t1 + [L_i] t. Those can presumably be fiddled so things match up better. But what is this business about factoring a formal power series into linear factors?

In my case I think there's really a Galois extension lurking around: if we treat the aia_i as formal variables, the field k(a1,,an)k(a_1, \dots, a_n) generated by the roots of the polynomial pp is an extension of the field k(e1,,en)k(e_1, \dots, e_n) generated by the coefficients, and the Galois group is SnS_n. But in your case we seem to be doing a field extension that's not Galois, whose 'Galois group' (group of automorphisms over the base field) is something like SS_\infty.

view this post on Zulip Todd Trimble (Oct 19 2024 at 13:49):

I think what you wrote, p(t)=i=1n(t+ai)=i=0nei(a1,,an)tnip(t) = \prod_{i=1}^n (t + a_i) = \sum_{i=0}^n e_i(a_1, \ldots, a_n) t^{n-i}, and what I wrote at 49 past the hour, j=0nej(x1,,xn)tj=i=1n(1+xit)\sum_{j=0}^n e_j(x_1, \ldots, x_n) t^j = \prod_{i=1}^n (1 + x_i t), are basically the same thing. "My p(t)p(t)" is tnt^n times your p(1/t)p(1/t), is the way I would match things up.

As I've mentioned to you and Joe in private conversation, I very much have in mind that either way, Q(x1,,xn)\mathbb{Q}(x_1, \ldots, x_n) is a splitting field extension of its subfield Q(e1,,en)\mathbb{Q}(e_1, \ldots, e_n), with Galois group SnS_n. When the textbooks talk about unsolvability of the quintic and so on, that's really the formal framework of what they're discussing (maybe replace here Q\mathbb{Q} by any field). So we start with indeterminates e1,,ene_1, \ldots, e_n that have no special meaning attached to them, and pass to the splitting field of p(t)=tn+e1tn1++enp(t) = t^n + e_1 t^{n-1} + \cdots + e_n (alternate those terms if you like) and basically wind up with Q(x1,,xn)\mathbb{Q}(x_1, \ldots, x_n) which is abstractly isomorphic to the original field, but which (now thinking geometrically) fibers over it differently, with fibers given by SnS_n-torsors.

[I'll say again that I perceive a kind of unity between the various uses of the word "splitting" (splitting a polynomial, splitting into eigenspaces, splitting into line bundles, etc., even splitting field in the sense of representation theory, although I would need to recall the story of why I thought that's similar).]

But anyway, this fibering reminds me of configuration spaces, how nn-tuples of distinct points in C\mathbb{C}, say, fiber over nn-element subsets of C\mathbb{C}. Jim D. sometimes talks about this sort of thing, too.

In my write-up here, I'm thinking of Λ\Lambda as a ring filtered by the Λn\Lambda_n, but one can argue that really what we should be thinking of is that the quotient ring Λ/(λn+1)\Lambda/(\lambda^{n+1}) is Z[e1,,en]\mathbb{Z}[e_1, \ldots, e_n]. This would correspond to a lambda-ring generated by Young diagrams with nn rows or fewer, whereas the filtration component Λn\Lambda_n is not a ring, and corresponds to Young diagrams with nn boxes or fewer.

We didn't quite get to a full explanation of the Λ/(λn+1)\Lambda/(\lambda^{n+1}) picture in our paper, which we certainly hold (without proving this) to be the Grothendieck ring of the 2-rig of algebraic representations of the multiplicative monoid hom(V,V)\hom(V, V) where VV is an nn-dimensional vector space. Same as the monoid MnM_n of n×nn \times n matrices. We denote this 2-rig of algebraic representations as

Rep(Mn)\mathsf{Rep}(M_n)

and the "splitting principle pretense" of acting as if matrices can be diagonalized, split into 1-dimensional eigenspaces, is formalized by considering the pullback functor

Rep(Mn)Rep(kn)\mathsf{Rep}(M_n) \to \mathsf{Rep}(k^n)

where knk^n is the multiplicative submonoid of diagonal matrices, and showing that this pullback functor satisfies our trio of conditions: faithful, conservative, and essentially injective. (For the readers out there, this is one of our main results.) We also observe that Rep(kn)\mathsf{Rep}(k^n) is equivalent to An\mathsf{A}^{\boxtimes n}, the walking 2-rig on nn line objects.

Anyway, what I am leading up to here is that maybe in some ways it's better to think of kS\overline{k\mathsf{S}} not as a colimit or union of its filtered pieces kSn\overline{k\mathsf{S}_{\leq n}}, as we do in the paper, but as a limit of 2-rig quotients kS/(Λn+1)\overline{k\mathsf{S}}/(\Lambda^{n+1}). Not a limit in a "naive" 2-rig sense, but in a graded 2-rig sense. This is analogous to how we typically treat the cohomology algebra H(BU)H^\ast(BU): not as an inverse limit of H(BU(n))Z[e1,,en]H^\ast(BU(n)) \cong \mathbb{Z}[e_1, \ldots, e_n] in the category of rings (now interpreting the eie_i as Chern classes!), but as an inverse limit in the category of graded rings, leading to the polynomial algebra Λ=Z[e1,e2,]\Lambda = \mathbb{Z}[e_1, e_2, \ldots] in infinitely many variables.

I'm thinking that this inverse limit perspective on Λ\Lambda, placing it in the same neighborhood as how we treat A\mathsf{A}^{\boxtimes \infty} as an inverse limit in a graded sense, might lead to a more harmonious picture of what is going on at the level of Galois groups. For example, it might clarify whether we are thinking of this SS_\infty as the full permutation group, or just the union of the SnS_n (I'm thinking the former is more appropriate).

view this post on Zulip John Baez (Oct 19 2024 at 22:11):

All this is really exciting. It's good to sort out what are limits and what are quotients. As far as the 'splitting fields' Q[x1,,xn]\mathbb{Q}[x_1, \dots, x_n] go, if we want homomorphisms between them, the maps clearly must go this way:

Q[x1,,xn]Q[x1,,xn,xn+1] \mathbb{Q}[x_1, \dots, x_n] \to \mathbb{Q}[x_1, \dots, x_n, x_{n+1}]

view this post on Zulip John Baez (Oct 19 2024 at 22:13):

We have similar homomorphisms for the fields these are extending

Q[e1,,en]Q[e1,,en,en+1]\mathbb{Q}[e_1, \dots, e_n] \to \mathbb{Q}[e_1, \dots, e_n, e_{n+1}]

and it seems all this induces inclusions of Galois groups

SnSn+1 S_n \to S_{n+1}

view this post on Zulip Todd Trimble (Oct 19 2024 at 22:22):

Right, if you're using fields, you have to go this direction (maybe you want to use parentheses instead of square brackets for fields). It might be in fact that fields are awkward. Maybe there's an okay sense of speaking of Z[x1,,xn]\mathbb{Z}[x_1, \ldots, x_n] as a "Galois extension" of Z[λ1,,λn]\mathbb{Z}[\lambda^1, \ldots, \lambda^n], though, with Galois group SnS_n?

The reason I bring this up is that we go the "bad" direction A(n+1)An\mathsf{A}^{\boxtimes (n+1)} \to \mathsf{A}^{\boxtimes n} in our paper. Here "bad" means the wrong direction if we consider fields of fractions. (Of course, the fields of fractions construction is not functorial. I guess it is functorial however on the category of integral domains and injective maps between them.)

view this post on Zulip John Baez (Oct 19 2024 at 22:26):

Back in week261 of This Week's Finds, I wrote a lot about this stuff in the special case n = 3. I explained how this is related to thinking of S3S_3 as a Coxeter group, and hinted at generalizations of the theory of symmetric polynomials to other Dynkin diagrams:

Imagine we're trying to solve a cubic equation. We can always divide by the coefficient of the cubic term, so it's enough to consider equations like this:

z^3 + Az^2 + Bz + C = 0

If we could solve this and find the roots a, b, and c, we could write it as:

(z - a)(z - b)(z - c) = 0

But this means

A = -(a + b + c)
B = ab + bc + ca
C = -abc

Note that A, B, and C don't change when we permute a, b, and c. So, they're called "symmetric polynomials" in the variables a, b, and c.

You see this directly, but there's also a better explanation: the coefficients of a polynomial depend on its roots, but they don't change when we permute the roots.

I can't resist mentioning a cool fact, which is deeply related to the trefoil: every symmetric polynomial of a, b, and c can be written as a polynomial in A, B, and C - and in a unique way!

In fact, this sort of thing works not just for cubics, but for polynomials of any degree. Take a general polynomial of degree n and write the coefficients as functions of the roots. Then these functions are symmetric polynomials, and every symmetric polynomial in n variables can be written as a polynomial of these - and in a unique way.

But, back to our cubic. Note that -A/3 is the average of the three roots. So, if we slide z over like this:

x = z + A/3

we get a new cubic equation for which the average of the three roots is zero. This new cubic equation will be of this form:

x^3 + Bx + C = 0

for some new numbers B and C. In other words, the "A" in this new cubic is zero, since we translated the roots to make their average zero.

So, to solve cubic equations, it's enough to solve cubics like x^3 + Bx + C = 0. This is a great simplification. When you first see it, it's really exciting. But then you realize you have no idea what to do next! This must be why it's called a "depressed cubic".

In fact, Scipione del Ferro figured out how to solve the "depressed cubic" shortly after 1500. So, you might think he could solve any cubic. But, NEGATIVE NUMBERS HADN'T BEEN INVENTED YET. This prevented him from reducing any cubic to a depressed one!

It's sort of hilarious that Ferro was solving cubic equations before negative numbers were worked out. It should serve as a lesson: we mathematicians often work on fancy stuff before understanding the basics. Often that's why math seems hard! But often it's impossible to discover the basics except by working on fancy stuff and getting stuck.

Here's one trick for solving the depressed cubic x^3 + Bx + C = 0. Write

x = y - B/(3y)

Plugging this in the cubic, you'll get a quadratic equation in y^3, which you can solve. From this you can figure out y, and then x.
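
(For anyone following along, here's a quick sympy check of this trick; below, z is just a stand-in for y^3.)

import sympy as sp

y, z, B, C = sp.symbols('y z B C')
x = y - B / (3 * y)
expr = sp.expand((x**3 + B*x + C) * y**3)   # clear the denominators
quadratic = z**2 + C*z - B**3 / 27          # the resulting quadratic in z = y^3
assert sp.expand(expr - quadratic.subs(z, y**3)) == 0
print("x^3 + Bx + C = 0 becomes (y^3)^2 + C*y^3 - B^3/27 = 0")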

Alas, I have no idea what this trick means. Does anyone know? Ferro and Tartaglia used a more long-winded method that seems just as sneaky. Later Lagrange solved the cubic yet another way. I like his way because it contains strong hints of Galois theory.

You can see all these methods here:

6) Cubic function, Wikipedia,
http://en.wikipedia.org/wiki/Cubic_equation

So, I won't say more about solving the cubic now. Instead, I want to explain the "discriminant". This is a trick for telling when two roots of our cubic are equal. It turns out to be related to the trefoil knot.

For a quadratic equation ax^2 + bx + c = 0, the two roots are equal precisely when b^2 - 4ac = 0. That's why b^2 - 4ac is called the "discriminant" of the quadratic. The same idea works for other equations; let's see how it goes for the cubic.

Suppose we were smart enough to find the roots of our cubic

x^3 + Bx + C = 0

and write it as

(x - a)(x - b)(x - c) = 0

Then two roots are equal precisely when

(a - b)(b - c)(c - a) = 0

The left side isn't a symmetric polynomial in a, b, and c; it changes sign whenever we switch two of these variables. But if we square it, we get a symmetric polynomial that does the same job:

D = (a - b)^2 (b - c)^2 (c - a)^2

This is the discriminant of the cubic! By what I said about symmetric polynomials, it has to be a polynomial in B and C (since A = 0). If you sweat a while, you'll see

D = -4B^3 - 27C^2
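
(Here's a sympy check of that formula, using c = -a-b to impose the depressed condition:)

import sympy as sp

a, b = sp.symbols('a b')
c = -a - b                                  # depressed: roots sum to zero
B = a*b + b*c + c*a
C = -a*b*c
D = sp.expand((a - b)**2 * (b - c)**2 * (c - a)**2)
assert sp.expand(D - (-4*B**3 - 27*C**2)) == 0
print("D = -4B^3 - 27C^2")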

So, here's the grand picture: we've got a 2-dimensional space of cubics with coordinates B and C. Sitting inside this 2d space is a curve consisting of "degenerate" cubics - cubics with two roots the same. This curve is called the "discriminant locus", since it's where the discriminant vanishes:

4B^3 + 27C^2 = 0

If we only consider the case where B and C are real, the discriminant locus looks like this:

                   |C
           o       |
            o      |
               o   |
       -----------o-------------
               o   |           B
            o      |
           o       |
                   |

It's smooth except at the origin, where it has a sharp point called a "cusp".

Now here's where the trefoil knot comes in. The equation for the discriminant locus:

4B^3 + 27C^2 = 0

view this post on Zulip John Baez (Oct 19 2024 at 22:28):

should remind you of the equation for the trefoil:

u^2 = v^3

Indeed, after a linear change of variables they're the same! But, for the trefoil we need u and v to be complex numbers. We took them to be unit complex numbers, in fact.

So, the story is this: we've got a 2-dimensional complex space of complex cubics. Sitting inside it is a complex curve, the discriminant locus. In our new variables, it's this:

u^2 = v^3

If we intersect this discriminant locus with the torus

|u| = |v| = 1

we get a trefoil knot. But that's not all!

Normal folks think of knots as living in ordinary 3d space, but topologists often think of them as living in a 3-sphere: a sphere in 4d space. That's good for us. We can take this 4d space to be our 2d complex space of complex cubics! We can pick out spheres in this space by equations like this:

|u|^2 + |v|^3 = c (c > 0)

These are not round 3-spheres, thanks to that annoying third power. But, they're topologically 3-spheres. If we take any one of them and intersect it with our discriminant locus, we get a trefoil knot!

This is clear when c = 2, since then we have

|u|^2 + |v|^3 = 2

and

u^2 = v^3

which together imply

|u| = |v| = 1

But if you think about it, we also get a trefoil knot for any other c > 0. This trefoil shrinks as c -> 0, and at c = 0 it reduces to a single point, which is also the cusp here:

                     |u
                     |      o
                     |     o
                     |   o
          -----------o-------------
                     |   o        v
                     |     o
                     |      o
                     |

We don't see trefoil knots in this picture because it's just a real 2d slice of the complex 2d picture. But, they're lurking in the background!

Now let me say how the group of permutations of three things gets into the game. We've already seen the three things: they're the roots a, b, and c of our depressed cubic! So, they're three points on the complex plane that add to zero. Being a physicist at heart, I sometimes imagine them as three equal-mass planets, whose center of mass is at the origin.

The space of possible positions of these planets is a 2d complex vector space, since we can use any two of their positions as coordinates and define the third using the relation

a + b + c = 0

So, there are three coordinate systems we can use: the (a,b) system, the (b,c) system and the (c,a) system. We can draw all three coordinate systems at once like this:

                b
                 \       /
                  \     /
                   \   /
                    \ /
             --------o--------a
                    / \
                   /   \
                  /     \
                 /       \
                c

The group of permutations of 3 things acts on this picture by permuting the three axes. Beware: I've only drawn a 2-dimensional real vector space here, just a slice of the full 2d complex space.

Now suppose we take this 2d complex space and mod out by the permutation symmetries. What do we get? It turns out we get another 2d complex vector space! In this new space, the three coordinate axes shown above become just one thing... but this thing is a curve, like this:

              o
               o
                  o
                     o
                  o
               o
              o

Look familiar? Sure! It's just the discriminant locus we've seen before.

Why does it work this way? The explanation is sitting before us. We've got two 2d complex vector spaces: the space of possible ordered triples of roots of a depressed cubic, and the space of possible coefficients. There's a map from the first space to the second, since the coefficients are functions of the roots:

B = ab + bc + ca
C = -abc

These functions are symmetric polynomials: they don't change when we permute a, b, and c. And, it follows from what I said earlier that we can get any symmetric polynomial as a function of these - under the assumption that a+b+c = 0, that is.

So, the map where we mod out by permutation symmetries of the roots is exactly the map from roots to coefficients.

The lines in this picture are places where two roots are equal:

            c=a
              \       /
               \     /
                \   /
                 \ /
          --------o-------- b=c
                 / \
                /   \
               /     \
              /       \
            a=b

So, when we apply the map from roots to coefficients, these lines get mapped to the discriminant locus:

                   |
           o       |
            o      |
               o   |
       -----------o-------------
               o   |
            o      |
           o       |
                   |

You should now feel happy and quit reading... unless you know a bit of topology. If you do know a little topology, here's a nice spinoff of what we've done. Though I didn't say it using so much jargon, we've already seen that space of nondegenerate depressed cubics is C^2 minus a cone on the trefoil knot. So, the fundamental group of this space is the same as the fundamental group of S^3 minus a trefoil knot. This is a famous group: it has three generators x,y,z, and three relations saying that:

x conjugated by y is z
y conjugated by z is x
z conjugated by x is y

On the other hand, we've seen this space is the space of triples of distinct points in the plane, centered at the origin, mod permutations. The condition "centered at the origin" doesn't affect the fundamental group. So, this fundamental group is another famous group: the "braid group on 3 strands". This has two generators:

\ /  |
 /   |          X
/ \  |

and

|  \ /
|   /           Y
|  / \

and one relation, called the "Yang-Baxter equation" or "third Reidemeister move":

\ /  |        |  \ /
 /   |        |   /
/ \  |        |  / \
|  \ /        \ /   |
|   /     =    /    |           XYX = YXY
|  / \        / \   |
\ /  |        |  \ /
 /   |        |   /
/ \  |        |  / \

So: the 3-strand braid group is isomorphic to the fundamental group of the complement of the trefoil! You may enjoy checking this algebraically, using generators and relations, and then figuring out how this algebraic proof relates to the geometrical proof.
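
(If you'd like a head start on the algebraic check, here is one way it can go, in sketch form. Set u = XYX and v = XY in the braid group. Then

v^3 = (XYX)(YXY) = (XYX)(XYX) = u^2

using the braid relation YXY = XYX in the middle, so u and v satisfy the trefoil relation. Conversely X = v^{-1}u and Y = u^{-1}v^2, and from u^2 = v^3 alone one recovers the braid relation:

XYX = (v^{-1}u)(u^{-1}v^2)(v^{-1}u) = u

YXY = (u^{-1}v^2)(v^{-1}u)(u^{-1}v^2) = u^{-1}v^3 = u

So the two presentations define isomorphic groups.)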

I find all this stuff very pretty...

... but what's really magnificent is that most of it generalizes to any Dynkin diagram, or even any Coxeter diagram! (See "week62" for those.)

Yes, we've secretly been studying the Coxeter diagram A_2, whose "Coxeter group" is the group of permutations of 3 things, and whose "Weyl chambers" look like this:

                 \       /
                  \     /
                   \   /
                    \ /
             --------o--------
                    / \
                   /   \
                  /     \
                 /       \

Let me just sketch how we can generalize this to A_{n-1}. Here the Coxeter group is the group of permutations of n things, which I'll call n!.

Let X be the space of n-tuples of complex numbers summing to 0. X is a complex vector space of dimension n-1. We can think of any point in X as the ordered n-tuple of roots of some depressed polynomial of degree n. Here "depressed" means that the leading coefficient is 1 and the sum of the roots is zero. This condition makes polynomials sad.

The permutation group n! acts on X in an obvious way. The quotient X/n! is isomorphic (as a variety) to another complex vector space of dimension n-1: namely, the space of depressed polynomials of degree n. The quotient map

X -> X/n!

is just the map from roots to coefficients!

Sitting inside X is the set D consisting of n-tuples of roots where two or more roots are equal. D is the union of a bunch of hyperplanes, as we saw in our example:

                 \       /
                  \     /
                   \   /
                    \ /
             --------o--------
                    / \
                   /   \
                  /     \
                 /       \

Sitting inside X/n! is the "discriminant locus" D/n!, consisting of degenerate depressed polynomials of degree n - that is, those with two or more roots equal. This is a variety that's smooth except for some sort of "cusp" at the origin:

              o
               o
                  o
                     o
                  o
               o
              o

The fundamental group of the complement of the discriminant locus is the braid group on n strands. The reason is that this group describes homotopy classes of ways that n points in the plane can move around and come back to where they were (but possibly permuted). These points are the roots of our polynomial.

On the other hand, the discriminant locus is topologically the cone on some higher-dimensional knot sitting inside the unit sphere in C^{n-1}. So, the fundamental group of the complement of this knot is the braid group on n strands.

This relation between higher-dimensional knots and singularities was investigated by Milnor, not just for the A_n series of Coxeter diagrams but more generally:

7) John W. Milnor, Singular Points of Complex Hypersurfaces, Princeton U. Press, 1969.

The other Coxeter diagrams give generalizations of braid groups called Artin-Brieskorn groups. Algebraically you get them by taking the usual presentations of the Coxeter groups and dropping the relations saying the generators (reflections) square to 1.

view this post on Zulip Todd Trimble (Oct 19 2024 at 22:39):

Thanks for recalling all this! I've never read this carefully, but now it seems I shall.

This gives a much more elaborated picture (for n=3n = 3) of what I was waving my hands at earlier, when I spoke of configuration spaces of nn-tuples of distinct points mapping down onto the space of nn-element subsets (or is the latter called the configuration space?). The fundamental group of the space below is the full braid group, and the fundamental group of the space above is the pure braid group, which is the kernel of the quotient BnSnB_n \to S_n.

view this post on Zulip John Baez (Oct 19 2024 at 22:58):

Now I really want to find an analogous story for the BCn\text{BC}_n series of Coxeter groups and their Artin-Brieskorn braid groups. (The Bn\text{B}_n Dynkin diagrams have the same Coxeter groups as the Cn\text{C}_n, so I'm lumping them together, especially since BnB_n also has a completely different meaning in what you just wrote!)

The BCn\text{BC}_n Coxeter group is the symmetry group of the nn-cube (I hope I'm getting the numbers to match correctly here), or if you prefer, the nn-dimensional 'orthoplex', the nn-dimensional generalization of an octahedron. I like to think of it as the full rotation-reflection symmetry group of the nn coordinate axes in Rn\mathbb{R}^n.

This is the symmetry group of nn pairs of things, where the pairs can be permuted, and the two roots within each pair can be switched, but the roots within each pair are "joined at the hip". It's the wreath product of SnS_n and Z/2\mathbb{Z}/2.

To get this group as the Galois group of something, maybe I should be looking for a polynomial with 2n2n roots that come in pairs, something like

Q(x)=i=1n(x2ai2)Q(x) = \prod_{i=1}^n (x^2 - a_i^2)
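
As a quick sanity check on this guess (a sketch, with n = 3): the coefficients of such a Q(x)Q(x) are, up to sign, the elementary symmetric polynomials in the squares ai2a_i^2, so they are invariant under permuting the aia_i and under the sign flips aiaia_i \mapsto -a_i:

import itertools
from functools import reduce
from operator import mul
import sympy as sp

def product(seq):
    return reduce(mul, seq, sp.Integer(1))

n = 3
x = sp.symbols('x')
a = sp.symbols(f'a1:{n+1}')
Q = sp.expand(product(x**2 - ai**2 for ai in a))
squares = [ai**2 for ai in a]
for j in range(n + 1):
    e_j = sum(product(c) for c in itertools.combinations(squares, j))
    assert sp.expand(Q.coeff(x, 2*(n - j)) - (-1)**j * e_j) == 0
print("coefficients of Q(x) are (-1)^j e_j(a_1^2, ..., a_n^2)")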

view this post on Zulip Todd Trimble (Oct 19 2024 at 23:12):

Let's see, in the case of the Coxeter group SnS_n, down below, you localize away from (geometrically, take the complement of) the discriminant locus; up above, you remove some hyperplanes where two coordinates are equated.

So I guess in your proposal, the space above would consist of pairs (a1,a1),,(an,an)(a_1, -a_1), \ldots, (a_n, -a_n) which are all distinct from each other and their negatives, and also nonzero, and down below, the space of possible coefficients of this polynomial, which again can be given by localizing away from the locus of a suitable variant of a discriminant (so as to forbid any ai=0a_i = 0 from being a root; I guess it's the usual discriminant times a12an2a_1^2 \cdots a_n^2).

view this post on Zulip John Baez (Oct 20 2024 at 00:15):

Hmm. I'm confused about a lot of things, but people have written about "symplectic symmetric functions", and I'm hoping these should give a way of working with the cohomology ring H(BSp)H^\ast(\mathrm{BSp}) just as the usual symmetric functions can be identified with elements of H(BU)H(BGL)H^\ast(\mathrm{BU}) \cong H^\ast(\mathrm{BGL}).

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:20):

Very interesting idea! (Did I say something wrong or confusing in my last message?) Oh well, we can certainly discuss this.

view this post on Zulip John Baez (Oct 20 2024 at 00:20):

There's a paper that might be relevant:

I haven't read it, but the MathSciNet review says:

The authors extend the use of Young diagram methods from the case of GL(n) to the case of the other classical groups SO(2n+1), Sp(2n) and SO(2n). They establish a number of propositions attesting to the fact that the character rings of rational representations of all the classical groups are polynomial rings over certain irreducible characters associated with totally symmetric and totally antisymmetric irreducible representations. Various determinantal expansions and generating functions for the characters of arbitrary irreducible representations of each of the classical groups [H. Weyl, The classical groups, their invariants and representations, second edition, Princeton Univ. Press, Princeton, N.J., 1946; D. E. Littlewood, The theory of group characters and matrix representation of groups, second edition, Clarendon Press, Oxford, 1950] are discussed and the connection between characters of GL(n) and both Young diagrams and Young tableaux is noted. The authors then introduce the universal character ring, Λ [I. G. Macdonald, Symmetric functions and Hall polynomials, Clarendon Press, New York, 1979; MR0553598], and define bases {χ_{GL}(λ)}_λ, {χ_O(λ)}_λ and {χ_{Sp}(λ)}_λ from which the characters of irreducible representations of each of the classical groups may be obtained by means of appropriate specialization homomorphisms. The relationship between the bases is established and the branching rules for the restriction from GL(n) to SO(n) and Sp(n) are determined as well as the rules for decomposing the tensor product of representations of both SO(n) and Sp(n).
  The results are not new; those not already given in the works of Littlewood [op. cit.] and F. D. Murnaghan [The theory of group representations, Johns Hopkins Press, Baltimore, Md. 1938; Jbuch 64, 964; The unitary and rotation groups.

view this post on Zulip John Baez (Oct 20 2024 at 00:23):

I'd read in Weyl's book about generalizations of Young diagram techniques for orthogonal and symplectic groups, but for some reason I'd never thought about that stuff in conjunction with symmetric functions. Macdonald's anemic index doesn't contain the words "symplectic" or "orthogonal", so I'm having trouble finding out about these various bases. It sounds like he's somehow embedding the rings of orthogonal and symplectic characteristic classes into the ring of characteristic classes for BGL\mathrm{BGL}, i.e. the usual ring Λ\Lambda of symmetric functions. I would like to start by treating them separately, as their own independent rings.

view this post on Zulip John Baez (Oct 20 2024 at 00:25):

Todd Trimble said:

(Did I say something wrong or confusing in my last message?)

I just don't have a clear idea of what's going on. I guess you're probably right in what you said.

view this post on Zulip John Baez (Oct 20 2024 at 00:28):

My mind is just a bit blown by the bigger picture we seem to be stumbling on. It gets even bigger if we remember Bott periodicity and thus that BO\mathrm{BO} looped 4 times gets you BSp\mathrm{BSp}, which looped 4 times gets you back to BO\mathrm{BO}.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:38):

Let me now pick up where I left off in the general exposition. I promised to get into some nitty-gritties involving the comultiplication μ:ΛΛΛ\mu: \Lambda \to \Lambda \otimes \Lambda, considering Λ\Lambda as a biring. Now that we have the splitting principle in hand, we're in a position to do that.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:39):

By the way, for anyone reading these posts who is relatively new to the general topic of lambda-rings, I might pose an exercise at this point: what can you say about line objects in kS\overline{k\mathsf{S}}; alternatively, what can you say about elements \ell in Λ\Lambda such that λ2()=0\lambda^2(\ell) = 0? I'll reveal the answer later.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:39):

Here's another exercise we can do right now: in any 2-rig, prove that the tensor product of two line objects is also a line object. (Let's drop the nonzero requirement for that one. Actually, in our paper, we don't impose that condition. I only put it in there, parenthetically, as a kind of nod to what is done elsewhere in the literature, but morally it's better to leave it out. That's why in our paper we use the term "subline" by the way, because for example 00 satisfies λ2(0)0\lambda^2(0) \cong 0, but it isn't at all 1-dimensional. Nonetheless, here in this series, I'll keep saying "line" because it's short.)

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:39):

For this exercise, one should decide which formulation of line object is the most convenient. Our choices are: that Λ2(r)0\Lambda^2(r) \cong 0; that the canonical quotient rrS2(r)r \otimes r \to S^2(r) is an isomorphism; or that the symmetry σ:rrrr\sigma: r \otimes r \to r \otimes r is the identity.

If you chose the third as the most convenient, then I think you chose well, because for one thing you don't need all the infrastructure of a 2-rig to make sense of it; you only use the symmetric monoidal structure. (Recall that in our paper we call these objects "bosonic subline objects", as distinguished from "fermionic subline objects". The terminology is meant to recall the mathematics of supersymmetry, which actually plays an important role, especially in categorified calculations that have anything to do with negatives -- see for example section 7 of our first paper, where we need to transition from a rig-plethory Λ+\Lambda_+ to the ring-plethory Λ\Lambda.)

Having made this choice, the exercise becomes fairly straightforward. For example, it's easy to give a proof via string diagrams (go ahead and try it!).

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:40):

That exercise is important for what we do. In the spirit of the splitting principle, given objects r,rr, r' in a 2-rig that are finite coproducts of line objects, rL1Lmr \cong L_1 \oplus \cdots \oplus L_m and rL1Lnr' \cong L_1' \oplus \cdots \oplus L_n', their tensor product is also a coproduct of line objects,

rri,jLiLj,r \otimes r' \cong \bigoplus_{i, j} L_i \otimes L_j',

according to that exercise.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:40):

In a moment we'll actually apply the splitting principle to understand comultiplication μ\mu, but first I want to extend it to say that not only is our canonical kSAn\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes n} (to the category of Nn\mathbb{N}^n-graded spaces) essentially injective when restricted to polynomial functors of degree no more than nn, but so are 2-tensor products of such 2-rig maps, i.e.,

kS(x,y)kSkSAmAn\overline{k\mathsf{S}(x, y)} \simeq \overline{k\mathsf{S}} \boxtimes \overline{k\mathsf{S}}\to \mathsf{A}^{\boxtimes m} \boxtimes \mathsf{A}^{\boxtimes n}

is essentially injective when restricted to polynomial functors of degree no more than mm in xx and nn in yy.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:40):

On to comultiplication. Abstractly, we understand μ:ΛΛΛ\mu: \Lambda \to \Lambda \otimes \Lambda. Its effect on isomorphism classes [F][F] of a Schur object or polynomial functor (polynomial species) FF is to take it to [F~(xy)][\tilde{F}(x \otimes y)], where F~(xy)\tilde{F}(x \otimes y) lives in kS(x,y)kSkS\overline{k\mathsf{S}(x, y)} \simeq \overline{k\mathsf{S}} \boxtimes \overline{k\mathsf{S}}. But now we want to understand how to calculate explicitly this map in terms of a chosen polynomial basis for Λ\Lambda, say

ΛZ[λ1,λ2,]\Lambda \cong \mathbb{Z}[\lambda^1, \lambda^2, \ldots]

and for that it suffices to know how to calculate λn(xy)\lambda^n(x \otimes y) as an element in

Λ(x,y)Z[λ1(x),λ2(x),;λ1(y),λ2(y),].\Lambda(x, y) \cong \mathbb{Z}[\lambda^1(x), \lambda^2(x), \ldots; \lambda^1(y), \lambda^2(y), \dots].

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:41):

I'm not going to do it for all nn. I just want to explain how I would go about it for small nn if I were locked in a room with pen and paper and no internet or books. Experts in the area would know how to do it efficiently (as far as the state of the art allows), but that's not the point here; the modest point is simply to understand it.

In fact I'm just going to recall how it goes for n=2n = 2 (which we will need later anyway), and wave my hands a little at the case for higher nn. In fact, I can just quote John quoting me:

John Baez said:

You asked what does the comultiplication

μ:ΛΛΛ\mu: \Lambda \to \Lambda \otimes \Lambda

applied to λ2\lambda^2 look like. Short answer:

(μ(λ2))(x,y)=(λ1(x))2λ2(y)+λ2(x)(λ1(y))22λ2(x)λ2(y)(\mu(\lambda^2))(x, y) = (\lambda^1(x))^2 \lambda^2(y) + \lambda^2(x)(\lambda^1(y))^2 - 2\lambda^2(x)\lambda^2(y)

where we think of ΛΛ\Lambda \otimes \Lambda as the free lambda-ring on two generators x,yx, y. This can also be written more nicely as σ2(x)λ2(y)+λ2(x)σ2(y)\sigma^2(x) \lambda^2(y) + \lambda^2(x) \sigma^2(y).

Longer answer: use the splitting principle, which guarantees that the 2-rig map kSAn\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes n}, sending the generator xx to a sum x1xnx_1 \oplus \ldots \oplus x_n of nn independent bosonic subline objects, is an extension when restricted to polynomial functors of degree n\leq n. Since λ2\lambda^2 is degree 22, this means in effect that we can pretend the generator xx of kS\overline{k\mathsf{S}} is a sum x1+x2x_1 + x_2 of two bosonic sublines. Then the 2-rig comultiplication kSkSkS\overline{k\mathsf{S}} \to \overline{k\mathsf{S}} \boxtimes \overline{k\mathsf{S}}, taking xx to xyx \boxtimes y per our first paper, induces the map of lambda-rings ΛΛΛ\Lambda \to \Lambda \otimes \Lambda that takes x=x1+x2x = x_1 + x_2 to xy=(x1+x2)(y1+y2)=x1y1+x1y2+x2y1+x2y2x \otimes y = (x_1 + x_2)(y_1 + y_2) = x_1y_1 + x_1y_2 + x_2y_1 + x_2y_2. Since this lambda-ring map preserves the λ2\lambda^2 operation, we calculate

μ(λ2x)=λ2(μx)=λ2(x1y1+x1y2+x2y1+x2y2)\mu(\lambda^2 x) = \lambda^2 (\mu x) = \lambda^2(x_1y_1 + x_1y_2 + x_2y_1 + x_2y_2)

and use the exponential law for λ2\lambda^2 plus vanishing of λ2(xiyj)\lambda^2(x_i y_j) to write this out long-hand. Sparing some gory details, this gives the answer in the short identity.

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:42):

I'll just add a few notes to this explanation:

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:42):

Anyhow, to sum it all up, what you do to compute the λn(xy)\lambda^n(xy) is expand 1i,jn(1+xiyjt)\prod_{1 \leq i, j \leq n} (1 + x_i y_j t) and then write the coefficients as polynomials in the elementary symmetric polynomials ei(x),ej(y)e_i(x), e_j(y).
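
To make this concrete, here is a sympy version of the n = 2 computation just quoted, confirming μ(λ2)=σ2(x)λ2(y)+λ2(x)σ2(y)\mu(\lambda^2) = \sigma^2(x)\lambda^2(y) + \lambda^2(x)\sigma^2(y) (again just a sketch of mine):

import itertools
from functools import reduce
from operator import mul
import sympy as sp

def product(seq):
    return reduce(mul, seq, sp.Integer(1))

x1, x2, y1, y2 = sp.symbols('x1 x2 y1 y2')
lines = [xi * yj for xi in (x1, x2) for yj in (y1, y2)]  # the four products x_i y_j
lam2_xy = sum(product(c) for c in itertools.combinations(lines, 2))  # e_2 of them

e1x, e2x = x1 + x2, x1*x2                    # lambda^1(x), lambda^2(x)
e1y, e2y = y1 + y2, y1*y2
sig2x, sig2y = e1x**2 - e2x, e1y**2 - e2y    # sigma^2 = (lambda^1)^2 - lambda^2

assert sp.expand(lam2_xy - (sig2x*e2y + e2x*sig2y)) == 0
print("mu(lambda^2) = sigma^2(x) lambda^2(y) + lambda^2(x) sigma^2(y)")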

view this post on Zulip Todd Trimble (Oct 20 2024 at 00:43):

Finally: the answer to the exercise: there are no nonzero line objects in kS\overline{k\mathsf{S}}!

One might hastily think that the generator xx itself is surely a line object: as a functor SVect\mathsf{S} \to \mathsf{Vect}, it vanishes at nn for every n1n \neq 1; at n=1n = 1, its value is the 1-dimensional (trivial) representation of S1S_1. The trouble is that the tensor product on kS\overline{k\mathsf{S}} is not the pointwise one (that would be the Hadamard product, as some people say); it's the Day convolution one (aka the Cauchy product). Tensor powers using the Day convolution "spread out", and their retracts are also spread out.

Or, one might think that you can get line objects in kS\overline{k\mathsf{S}} by pulling back the canonical line object NVect\mathbb{N} \to \mathsf{Vect} (in kN\overline{k\mathsf{N}}) along the symmetric monoidal quotient SN\mathsf{S} \to \mathbb{N}. The trouble is that the resulting pullback functor kNkS\overline{k\mathsf{N}} \to \overline{k\mathsf{S}} is only lax monoidal; we require strong monoidality for our 2-rig maps. It's pushforward (i.e., left Kan) along this symmetric monoidal quotient functor that is strong monoidal.

view this post on Zulip Todd Trimble (Oct 20 2024 at 01:03):

John Baez said:

My mind is just a bit blown by the bigger picture we seem to be stumbling on.

Yes, I get a little sense of the enormity as well.

It gets even bigger if we remember Bott periodicity and thus that BO\mathrm{BO} looped 4 times gets you BSp\mathrm{BSp}, which looped 4 times gets you back to BO\mathrm{BO}.

Yes, well, I'm not ready to think about that! I suppose if I were trying to return my mind back to real Bott periodicity, I would at first take a Clifford algebra approach, which I probably never properly learned in the first place.

The amount of latent geometry we began to stumble on in the second paper is itself pretty daunting. In the long series of comments I just posted, it seems we're running into the Segre embedding again, induced by

km×knkmnk^m \times k^n \overset{\otimes}{\longrightarrow} k^{mn}

((x1,,xm),(y1,,yn))(xiyj)1im,1jn.((x_1, \ldots, x_m), (y_1, \ldots, y_n)) \mapsto (x_i y_j)_{1 \leq i \leq m, 1 \leq j \leq n}.

This appeared around the point that we were describing 2-rigs of algebraic representations, but you thought it would be better to remove the words "Segre embedding", so as to not scare off readers. :-)

view this post on Zulip Todd Trimble (Oct 20 2024 at 16:03):

I said:

Finally: the answer to the exercise: there are no nonzero line objects in kS\overline{k\mathsf{S}}!

I lied. The species that takes 0S0 \in \mathsf{S} to kk and otherwise nn to 00, i.e., the monoidal unit, is a nonzero line object. (The monoidal unit in any 2-rig is a line object.) But that's the only one, up to isomorphism.

view this post on Zulip John Baez (Oct 20 2024 at 21:52):

Todd Trimble said:

John Baez said:

It gets even bigger if we remember Bott periodicity and thus that BO\mathrm{BO} looped 4 times gets you BSp\mathrm{BSp}, which looped 4 times gets you back to BO\mathrm{BO}.

Yes, well, I'm not ready to think about that! I suppose if I were trying to return my mind back to real Bott periodicity, I would at first take a Clifford algebra approach, which I probably never properly learned in the first place.

I got to know Bott periodicity pretty well blogging and giving talks about the tenfold way, which unifies real and complex Bott periodicity. For some reason I never noticed until now that our friend Λ\Lambda arises from U\mathrm{U}, which is one of the ten infinite loop spaces in the tenfold way, and that others should give 'mutant' versions of symmetric functions. But I will resist derailing this thread with that.

On with the big Witt ring!

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:11):

It's now time to look at the big Witt ring W(R)=[Λ,R]W(R) = [\Lambda, R], for a commutative ring RR. This on the other hand has lots of line objects, I mean line elements! But we will need to familiarize ourselves with how its lambda-ring structure works.

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:12):

There are two abstract ways to define lambda-rings: either as coalgebras of the right adjoint comonad [Λ,]:CRingCRing[\Lambda, -]: \mathsf{CRing} \to \mathsf{CRing}, or as algebras of its left adjoint monad, typically denoted Λ:CRingCRing\Lambda \odot -: \mathsf{CRing} \to \mathsf{CRing}. A standard abuse of language is that \odot may be thought of either as the action constraint of an actegory structure where birings act on rings, or as a monoidal product on the category of birings, but abstractly it is easy to understand: if BB is a biring, then [B,]:CRingCRing[B, -]: \mathsf{CRing} \to \mathsf{CRing} is a right adjoint, and we define the functor B:CRingCRingB \odot -: \mathsf{CRing} \to \mathsf{CRing} to be its left adjoint. If B,CB, C are two birings, then the composition of right adjoints [C,[B,]]=[C,][B,][C, [B, -]] = [C, -] \circ [B, -] is again a right adjoint endofunctor on CRing\mathsf{CRing}, and therefore is of the form [D,][D, -] for some new biring DD. We define BCB \odot C to be this biring, so that [BC,R][C,[B,R]][B \odot C, R] \cong [C, [B, R]], naturally in rings RR. This defines a monoidal (just monoidal, not symmetric monoidal) product on the category of birings, and by abuse of language there is an isomorphism

(BC)RB(CR)(B \odot C) \odot R \cong B \odot (C \odot R)

for birings B,CB, C and a (commutative) ring RR. A plethory may be defined in at least three ways: as a monoid in the monoidal category of birings under \odot; as a biring BB together with a monad structure on the endofunctor BB \odot -; or as a biring BB together with a comonad structure on [B,][B, -].

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:12):

The ur-example, perhaps the original example of a plethory, is Λ\Lambda, but there is a plethora of plethories. I'll just mention one class of examples quickly. If CC is a cocommutative Z\mathbb{Z}-coalgebra, i.e., a cocommutative comonoid in the category of abelian groups under tensor product, then for any ring RR, the abelian group of additive homomorphisms Ab(UC,UR)\mathsf{Ab}(UC, UR) between underlying abelian groups carries a commutative ring structure, where multiplication of homomorphisms f,g:UCURf, g: UC \to UR is given by the expected formula

UCδUCUCfgURURmUR.UC \overset{\delta}{\longrightarrow} UC \otimes UC \overset{f \otimes g}{\longrightarrow} UR \otimes UR \overset{m}{\longrightarrow} UR.

The functor Ab(UC,U):CRingCRing\mathsf{Ab}(UC, U-): \mathsf{CRing} \to \mathsf{CRing} is a right adjoint endofunctor. Since the symmetric Z\mathbb{Z}-algebra construction S:AbCRingS: \mathsf{Ab} \to \mathsf{CRing} is left adjoint to U:CRingAbU: \mathsf{CRing} \to \mathsf{Ab}, we have that the right adjoint endofunctor is represented by SUCSUC, which is thereby a biring. But if moreover CC carries a cocommutative Z\mathbb{Z}-bialgebra structure, then the multiplication μ:UCUCUC\mu: UC \otimes UC \to UC induces a comonad structure on Ab(UC,U)\mathsf{Ab}(UC, U-), and in this way SUCSUC becomes a plethory. This type of plethory is called a linear plethory.

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:12):

Anyway, for technical reasons we chose in our first paper to emphasize the first point of view, defining a lambda-ring as a coalgebra of [Λ,][\Lambda, -], or of the big Witt ring comonad WW. Of course W(R)W(R) itself would be the cofree lambda-ring cogenerated by RR. Its coalgebra structure is given by the comonad comultiplication δR:WRWWR\delta R: WR \to WWR, which is a map of type [Λ,R][Λ,[Λ,R]][\Lambda, R] \to [\Lambda, [\Lambda, R]]. If hom(R,S)\hom(R, S) denotes the ordinary hom-set of functions between rings R,SR, S, then [Λ,R][\Lambda, R] sits inside hom(Λ,R)\hom(\Lambda, R), and [Λ,[Λ,R]][\Lambda, [\Lambda, R]] sits inside

hom(Λ,hom(Λ,R))hom(Λ×Λ,R):Φ((r,s)Φ(r)(s))\hom(\Lambda, \hom(\Lambda, R)) \cong \hom(\Lambda \times \Lambda, R): \Phi \mapsto ((r, s) \mapsto \Phi(r)(s))

and it turns out that the comonad comultiplication [Λ,][Λ,[Λ,]][\Lambda, -] \to [\Lambda, [\Lambda, -]] is a restriction of a map hom(Λ,)hom(Λ×Λ,)\hom(\Lambda, -) \to \hom(\Lambda \times \Lambda, -) induced by an operation Λ×ΛΛ\Lambda \times \Lambda \to \Lambda. This map is closely related to "plethystic multiplication", except that one has to be careful to get the order right. Given a pair of isomorphism classes ([τ],[ρ])Λ×Λ([\tau], [\rho]) \in \Lambda \times \Lambda of polynomial functors, the map Λ×ΛΛ\Lambda \times \Lambda \to \Lambda takes this pair to their composition (aka substitution) ρτ\rho \circ \tau as polynomial functors; considered as species, the formula is

ρτ=nρ[n]kSnτn\rho \circ \tau = \sum_n \rho[n] \otimes_{kS_n} \tau^{\otimes n}

where the tensorial exponent refers to the Day convolution on kS\overline{k\mathsf{S}}. (See our first paper, top of page 35.) The class [ρτ][\rho \circ \tau] is denoted [ρ][τ][\rho] \bullet [\tau].

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:13):

We can now say that the comonad comultiplication on the big Witt ring W(R)=[Λ,R]W(R) = [\Lambda, R] is given by the map

(f:ΛR)([τ]([ρ]f([ρτ])))(f: \Lambda \to R) \mapsto ([\tau] \mapsto ([\rho] \mapsto f([\rho \circ \tau])))

If we want to compute the value λi(f)\lambda^i(f) for an element f:ΛRf: \Lambda \to R of the lambda-ring W(R)W(R), this formula tells us what to do: it's the homomorphism λi(f):ΛR\lambda^i(f): \Lambda \to R defined by

[λi(f)](λj):=f(λjλi).[\lambda^i(f)](\lambda^j) := f(\lambda^j \bullet \lambda^i).

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:16):

Now, very much in keeping with our ruminations on the connection between the splitting principle and splittings of polynomials into linear factors 1+xit1 + x_i t, we can at least guess what line elements should look like in W(R)W(R). First, I ought to define a "line element" in a general lambda-ring RR. The definition I'll adopt (and I believe something like this appears in the literature; I need to check up on that) is that it's an element rRr \in R such that λn(r)=0\lambda^n(r) = 0 for all n2n \geq 2.

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:17):

The statement is that line elements in W(R)W(R) are homomorphisms ΛR\Lambda \to R that take λ1\lambda^1 to some element aRa \in R, and λn\lambda^n to 00 for n>1n > 1. In other words, under the identification [Λ,R]1+tR[[t]][\Lambda, R] \cong 1 + tR[[t]] taking ff to 1+n1f(λn)tn1 + \sum_{n \geq 1} f(\lambda^n) t^n, the line elements in 1+tR[[t]]1 + tR[[t]] are of the form 1+at1 + at. In the first place, it is necessary that a line element ff be of this form, because if λnf=0\lambda^n f = 0 for all n2n \geq 2, then evaluation at λ1\lambda^1 yields

0=[λn(f)](λ1)=f(λ1λn)=f(λn)0 = [\lambda^n(f)](\lambda^1) = f(\lambda^1 \bullet \lambda^n) = f(\lambda^n)

for all n2n \geq 2. For sufficiency, we must show that if f(λn)=0f(\lambda^n) = 0 for all n2n \geq 2, then λm(f)=0\lambda^m(f) = 0 for all m2m \geq 2, or that f(λiλm)=0f(\lambda^i \bullet \lambda^m) = 0 for all i1i \geq 1 and m2m \geq 2. Since ff is a ring homomorphism, it is enough to see that λiλm\lambda^i \bullet \lambda^m belongs to the ideal (λ2,λ3,)(\lambda^2, \lambda^3, \ldots). For now I am going to leave this as an exercise in using the splitting lemma. The basic point is that

ei(xi1xi2xim),e_i(x_{i_1}x_{i_2}\cdots x_{i_m}),

where the arguments of eie_i range over all monomials xi1xi2ximx_{i_1}x_{i_2}\cdots x_{i_m} with i1<<imi_1 < \ldots < i_m, when written as a polynomial in the elementary symmetric functions eje_j, contains no terms of type e1ke_1^k, because no monomial of type x1kx_1^k can occur in its expansion in terms of the xix_i.
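
Here is a small computational rendering of that exercise (a sympy sketch of my own, truncated to finitely many variables): the specialization x_1 = a, x_2 = ... = x_N = 0 realizes the quotient by the ideal (e_2, e_3, ...), and the plethysm e_i ∘ e_m, formed exactly as in the displayed expression, visibly dies under it for m ≥ 2.

```python
import itertools
import sympy as sp

N = 5
x = sp.symbols(f'x1:{N + 1}')   # x1, ..., x5
a = sp.Symbol('a')

def e(k, args):
    # Elementary symmetric polynomial e_k of the given quantities.
    return sum(sp.prod(c) for c in itertools.combinations(args, k))

def plethysm_e(i, m):
    # e_i plethysm e_m: feed the monomials of e_m(x1..xN) into e_i,
    # exactly as in the displayed expression above.
    monomials = [sp.prod(c) for c in itertools.combinations(x, m)]
    return e(i, monomials)

# Killing x2, ..., xN realizes the quotient by the ideal (e_2, e_3, ...),
# under which e_1 maps to a and e_m maps to 0 for m >= 2.
line = {x[0]: a, **{xi: 0 for xi in x[1:]}}
for i in range(1, 4):
    for m in range(2, 4):
        assert plethysm_e(i, m).subs(line) == 0
```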

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:17):

Thus, we have identified line elements in 1+tR[[t]]1 + tR[[t]] with elements of type 1+at1 + at, aRa \in R, under the identification [Λ,R]1+tR[[t]][\Lambda, R] \cong 1 + tR[[t]] using the λi\lambda^i polynomial basis. What do these elements look like when we use the identification [Λ,R]1+tR[[t]][\Lambda, R] \cong 1 + tR[[t]] using the σi\sigma^i basis?

I claim that if f(λ1)=af(\lambda^1) = a and f(λn)=0f(\lambda^n) = 0 for all n2n \geq 2, then f(σn)=anf(\sigma^n) = a^n for all nn. One way to see this is by induction on nn. It is true for n=1n = 1, since σ1=λ1\sigma^1 = \lambda^1. For n>1n > 1, use the following beautiful identity that holds in Λ\Lambda:

σnσn1λ1+σn2λ2+(1)nλn=0.\sigma^n - \sigma^{n-1} \lambda^1 + \sigma^{n-2}\lambda^2 - \cdots + (-1)^n \lambda^n = 0.

(John and Joe and I have discussed this many times; it has a relatively short conceptual proof at the 2-rig level, using ideas of superalgebra. I may include a proof a little later in these notes.) Applying the ring homomorphism f:ΛRf: \Lambda \to R to this identity, and the assumption that f(λn)f(\lambda^n) vanishes for n2n \geq 2, we see that

f(σn)=f(σn1)f(λ1)=f(σn1)af(\sigma^n) = f(\sigma^{n-1}) \cdot f(\lambda^1) = f(\sigma^{n-1}) a

and the induction goes through.
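
Under the dictionary λ^k ↔ e_k (elementary symmetric) and σ^n ↔ h_n (complete homogeneous), the beautiful identity is the classical relation sum_{k=0}^n (-1)^k e_k h_{n-k} = 0. A quick truncated check, as a sympy sketch in finitely many variables:

```python
import itertools
import sympy as sp

N = 4
x = sp.symbols(f'x1:{N + 1}')

def e(k):
    # lambda^k <-> elementary symmetric e_k (zero for k > N).
    return sum(sp.prod(c) for c in itertools.combinations(x, k))

def h(k):
    # sigma^k <-> complete homogeneous symmetric h_k.
    return sum(sp.prod(c)
               for c in itertools.combinations_with_replacement(x, k))

# sigma^n - sigma^{n-1} lambda^1 + ... + (-1)^n lambda^n = 0:
for n in range(1, 6):
    identity = sum((-1) ** k * e(k) * h(n - k) for k in range(n + 1))
    assert sp.expand(identity) == 0
```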

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:18):

Thus, when written in the σn\sigma^n basis, a line element in W(R)W(R) necessarily has the form

1+at+a2t2+=11at.1 + at + a^2 t^2 + \cdots = \frac1{1 - at}.

Finally, if 1+at,1+bt1 + at, 1 + bt are line elements [again in the λi\lambda^i basis], I claim that their Witt product is 1+abt1 + ab t; this too can be deduced by exploiting the splitting principle. As a consequence, in the σi\sigma^i basis, the Witt product of 11at\frac1{1-at} and 11bt\frac1{1-bt} must be 11abt\frac1{1 - abt}.

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:18):

Incidentally, there is a famous ring involution ω:ΛΛ\omega: \Lambda \to \Lambda that takes λi\lambda^i to σi\sigma^i and σi\sigma^i to λi\lambda^i. (We explain this in terms of 2-rig theory at the end of our second paper.) This induces a ring involution on 1+tR[[t]]1 + tR[[t]] that takes 1+at1 + at to 11at\frac1{1-at}. This allows us to deduce that

11atW11bt=11abt\frac1{1-at} \ast_W \frac1{1-bt} = \frac1{1 - abt}

even if we use the λi\lambda^i basis throughout!

view this post on Zulip Todd Trimble (Oct 21 2024 at 03:22):

(It's possible I'm slipping up somewhere in this last part, but it's late where I am, and I'm going to bed.)

view this post on Zulip Todd Trimble (Oct 21 2024 at 18:21):

Yes, I retract the claim that 11atW11bt=11abt\frac1{1-at} \ast_W \frac1{1-bt} = \frac1{1 - abt} even if we use the λi\lambda^i-basis. If ϕ:[Λ,R]1+tR[[t]]\phi: [\Lambda, R] \to 1 + tR[[t]] sends f:ΛRf: \Lambda \to R to i0f(λi)ti\sum_{i \geq 0} f(\lambda^i) t^i, then certainly the composite

1+tR[[t]]ϕ1[Λ,R][ω,1][Λ,R]ϕ1+tR[[t]]1 + tR[[t]] \overset{\phi^{-1}}{\longrightarrow} [\Lambda, R] \overset{[\omega, 1]}{\longrightarrow} [\Lambda, R] \overset{\phi}{\longrightarrow} 1 + tR[[t]]

is an involution, and it sends 1+at1 + at to 11at\frac1{1-at}. But [ω,1][\omega, 1] isn't a ring involution itself, because ω:ΛΛ\omega: \Lambda \to \Lambda is not a biring involution; it's only a ring involution. (There should be a quick counterexample to show it's not a biring involution; maybe I'll cook one up later.)

view this post on Zulip Todd Trimble (Oct 21 2024 at 18:21):

So anyway, we do have the explicit formula

11atW11bt=11abt\frac1{1-at} \ast_W \frac1{1-bt} = \frac1{1 - abt}

under the σi\sigma^i-basis, i.e., transferring the god-given big Witt multiplication on [Λ,R][\Lambda, R] over to 1+tR[[t]]1 + tR[[t]] using the identification [Λ,R]1+tR[[t]][\Lambda, R] \to 1 + tR[[t]] given by fi0f(σi)tif \mapsto \sum_{i \geq 0} f(\sigma^i) t^i. But I actually like this formula less than the corresponding explicit formula using the λi\lambda^i-basis, which is

(1+at)W(1+bt)=1+abt.(1 + at) \ast_W (1 + bt) = 1 + abt.

This is because I want to think in terms of good old-fashioned product formulas

1+i1aiti=n1(1+bnt)1 + \sum_{i \geq 1} a_i t^i = \prod_{n \geq 1} (1 + b_n t)

and liken these to splitting principles, like 1+i1λiti=i1(1+xit)1 + \sum_{i \geq 1} \lambda^i t^i = \prod_{i \geq 1} (1 + x_i t), which comes from the 2-rig map kSA\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes \infty} that sends the generator x=Λ1x = \Lambda^1 to the sum of line objects L1L2L_1 \oplus L_2 \oplus \cdots.
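
As a concrete cross-check of the σ^i-basis product formula (and of the ghost-component description from earlier in this thread), here is a sympy sketch: it computes the Witt product over Q by passing to ghost components g_n (read off from t d/dt log), multiplying them pointwise, and exponentiating back. The truncation order N and the reliance on Q-coefficients are simplifications of mine.

```python
import sympy as sp

t = sp.Symbol('t')
N = 6  # work modulo t^(N+1); a truncation of my choosing

def ghosts(f):
    # Ghost components g_1..g_N of f in 1 + t*Q[[t]],
    # read off from t * d/dt log f = sum_n g_n t^n.
    g = sp.series(t * sp.diff(sp.log(f), t), t, 0, N + 1).removeO()
    return [g.coeff(t, n) for n in range(1, N + 1)]

def from_ghosts(g):
    # Invert the ghost map over Q: log f = sum_n (g_n / n) t^n.
    logf = sum(g[n - 1] * t**n / n for n in range(1, N + 1))
    return sp.series(sp.exp(logf), t, 0, N + 1).removeO()

def witt_mul(f1, f2):
    # Witt multiplication is pointwise multiplication of ghosts.
    return from_ghosts([u * v for u, v in zip(ghosts(f1), ghosts(f2))])

a, b = sp.symbols('a b')
lhs = witt_mul(1 / (1 - a * t), 1 / (1 - b * t))
rhs = sp.series(1 / (1 - a * b * t), t, 0, N + 1).removeO()
assert sp.expand(lhs - rhs) == 0   # 1/(1-at) *_W 1/(1-bt) = 1/(1-abt)
```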

view this post on Zulip Todd Trimble (Oct 21 2024 at 18:22):

Ramachandran remarks that the explicit formulas (whether in terms of 1+at1 + at or 11at\frac1{1-at}) plus functoriality of WW are enough to pin down the big Witt multiplication on 1+tR[[t]]1 + tR[[t]]. It would be nice to understand this point better. I'm thinking there must be a splitting principle type of explanation. It might be something like this. A general element in 1+tR[[t]]1 + tR[[t]], say i0aiti\sum_{i \geq 0} a_i t^i where a0=1a_0 = 1, may be written as i0f(λi)ti\sum_{i \geq 0} f(\lambda^i) t^i where f:ΛRf: \Lambda \to R is the unique ring map sending λi\lambda^i to aia_i. In the category of commutative rings, consider the pushout RR' of the span

RfΛZ[[x1,x2,]]bdR \overset{f}{\longleftarrow} \Lambda \to \mathbb{Z}[[x_1, x_2, \ldots]]_{\mathrm{bd}}

where on the right we have power series of bounded degree (admitting formal infinite sums like x12+x22+x32+x_1^2 + x_2^2 + x_3^2 + \ldots). This thing on the right is the Grothendieck ring K0(A)K_0(\mathsf{A}^{\boxtimes \infty}), and the map on the right is obtained by applying K0K_0 to the canonical 2-rig map kSA\overline{k\mathsf{S}} \to \mathsf{A}^{\boxtimes \infty}.

f(x1x2+x1x3++x2x3+)f'(x_1 x_2 + x_1 x_3 + \cdots + x_2 x_3 + \cdots)

where f:Z[[x1,x2,]]bdRf': \mathbb{Z}[[x_1, x_2, \ldots]]_{\mathrm{bd}} \to R' is the other coprojection. Adapting the formal manipulations of Ramachandran to this framework, the idea is that

f(x1x2+x1x3++x2x3+)=f(x1)f(x2)+f(x1)f(x3)+f'(x_1 x_2 + x_1 x_3 + \cdots + x_2 x_3 + \cdots) = f'(x_1)f'(x_2) + f'(x_1)f'(x_3) + \cdots

is the coefficient of t2t^2 of a formal product i1(1+f(xi)t)1+tR[[t]]\prod_{i \geq 1} (1 + f'(x_i) t) \in 1 + tR'[[t]], and this formal infinite product is supposed to be (if I'm interpreting Ramachandran correctly) an infinite Witt sum of line elements, since Witt addition is given by multiplying power series, as we saw earlier. So, then, the lambda-ring map W(i):W(R)W(R)W(i): W(R) \to W(R') takes 1+i1aiti1 + \sum_{i \geq 1} a_i t^i to an infinite Witt sum

W[1+f(xi)t]\sum_W [1 + f'(x_i) t]

('WW' stands for 'Witt') and similarly would take some other element 1+i1biti1 + \sum_{i \geq 1} b_i t^i to some other formal infinite Witt sum

W[1+g(xi)t]\sum_W [1 + g'(x_i) t]

and the Witt product of these two elements (supposing for now they wind up in the same 1+tR[[t]]1 + tR'[[t]]) is going to be defined by a third infinite Witt sum

W[1+f(xi)g(xj)t]\sum_W [1 + f'(x_i) g'(x_j) t]

which can be expanded and rewritten back in 1+tR[[t]]1 + tR[[t]]. As you can see, it looks like a bunch of shenanigans, and there is the mild challenge of making honest sense of these formal manipulations, but that seems to be the idea.
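
To see the shenanigans land somewhere honest, one can expand the pairwise product in finitely many variables and match coefficients against elementary symmetric functions. In the sympy sketch below (my own finite-variable rendering), a_k, b_k are my names for e_k in the x's and y's; the check recovers the first two universal Witt multiplication polynomials in these λ-coordinates.

```python
import itertools
import sympy as sp

N = 3
t = sp.Symbol('t')
x = sp.symbols(f'x1:{N + 1}')
y = sp.symbols(f'y1:{N + 1}')

def e(k, vs):
    # Elementary symmetric polynomial e_k in the variables vs.
    return sum(sp.prod(c) for c in itertools.combinations(vs, k))

# The Witt product of prod(1 + x_i t) and prod(1 + y_j t):
product = sp.expand(sp.prod([1 + xi * yj * t for xi in x for yj in y]))

a1, a2 = e(1, x), e(2, x)   # lambda-coordinates of the first factor
b1, b2 = e(1, y), e(2, y)   # lambda-coordinates of the second factor

# First two universal multiplication polynomials, verified:
assert sp.expand(product.coeff(t, 1) - a1 * b1) == 0
assert sp.expand(product.coeff(t, 2)
                 - (a1**2 * b2 + a2 * b1**2 - 2 * a2 * b2)) == 0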

view this post on Zulip John Baez (Oct 21 2024 at 23:13):

Now you've gotten to the stuff I really want to understand! It will take me a while to absorb this and reply.

view this post on Zulip Todd Trimble (Oct 22 2024 at 01:56):

The pushout of rings I mentioned at the end probably ought to be replaced by the pushout of the span

R(f,g)ΛΛZ[[x1,x2,;y1,y2,]]bdR \overset{(f, g)}{\longleftarrow} \Lambda \otimes \Lambda \to \mathbb{Z}[[x_1, x_2, \ldots; y_1, y_2, \ldots]]_{\mathrm{bd}}

where f,gf, g are the two elements we want to Witt-multiply.

view this post on Zulip John Baez (Oct 22 2024 at 21:23):

It's taking me a while to find enough time to think about this stuff, but please don't mistake that for lack of interest. I will get around to it.

view this post on Zulip Todd Trimble (Oct 22 2024 at 21:25):

No worries at all! I'm mulling some stuff over anyway, about ways of putting those manipulations of Ramachandran on solid ground.

view this post on Zulip Todd Trimble (Oct 22 2024 at 22:14):

So the more I think about this, the more I suspect what Ramachandran is doing is a kind of "joke" in the sense of Littlewood. (For anyone who doesn't know what is meant by this, see this MO post.)

My own favorite example of a Littlewood joke is the proof of a statement from spectral theory: if A,BA, B are operators on a vector space and 1AB1 - AB is invertible, then so is 1BA1 - BA. Proof: Write

(1BA)1=1+BA+BABA+BABABA+=1+B(1+AB+ABAB+)A=1+B(1AB)1A.(1 - BA)^{-1} = 1 + BA + BABA + BABABA + \cdots = 1 + B(1 + AB + ABAB + \cdots)A = 1 + B(1 - AB)^{-1} A.

The punchline of the joke is that 1+B(1AB)1A1 + B(1 - AB)^{-1} A is in fact the inverse of 1BA1 - BA, as one can easily verify. The telling of the joke is the stuff in the middle, which strictly speaking isn't really legitimate, but it doesn't really matter: it's just a vehicle for how to remember the punchline, which is all that is needed in a rigorous proof. (I'm reminded here of Abel's comment on Gauss's mathematical writing style: "He is like the fox, who effaces his tracks in the sand with his tail"; in other words, he never gives you the inner motivation, or explanations of how he arrived at his insights. Presumably Gauss would have written down 1+B(1AB)1A1 + B(1 - AB)^{-1} A without further commentary.)

If I'm right, those infinite Witt sums in Ramachandran's proof are part of a Littlewood joke about how Witt products work, in a way that can be made rigorous but at the cost of using somewhat more roundabout expressions. I'll try to flesh out these thoughts soon.

view this post on Zulip Oscar Cunningham (Oct 23 2024 at 11:12):

(I don't think foxes actually do that.)

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 15:47):

(Sounds like expressing yourself in a very literary but logically and scientifically debatable way can be seen as cool when you're a serious math guy.

It reminds me of this extract from the Wikipedia page « Geometric group theory »:
« In the introduction to his book Topics in Geometric Group Theory, Pierre de la Harpe wrote: "One of my personal beliefs is that fascination with symmetries and groups is one way of coping with frustrations of life's limitations: we like to recognize symmetries which allow us to recognize more than what we can see. In this sense the study of geometric group theory is a part of culture, and reminds me of several things that Georges de Rham practiced on many occasions, such as teaching mathematics, reciting Mallarmé." »

This phenomenon can also be seen in the titles of math papers on the arXiv. Rarer words are used more than the most common ones, even when this makes less sense according to dictionaries. For instance, the word « via » is used more than « by » or « through » in titles of the form « characterization of … via/by/through … ».)

view this post on Zulip Kevin Carlson (Oct 23 2024 at 17:03):

"Via" and "by" and "through" aren't syntactically interchangeable, to be sure. You shouldn't say "Characterization of a class A by theorem B", whereas "via" works well there. Instead you say "by application of theorem B", or something. This is usually the situation with rarer words; there are few direct substitutions in English or, I'm sure, in most languages. "Utilize" for "use" is a famous exception that proves the rule!

view this post on Zulip John Baez (Oct 23 2024 at 17:14):

A little point:

Todd Trimble said:

First, I ought to define a "line element" in a general lambda-ring RR. The definition I'll adopt (and I believe something like this appears in the literature; I need to check up on that) is that it's an element rRr \in R such that λn(r)=0\lambda^n(r) = 0 for all n2n \geq 2.

I think I see how to show any object in a 2-rig with Λ2x0\Lambda^2 x \cong 0 automatically has Λnx0\Lambda^n x \cong 0 for n>2n > 2. So I would hope that in any lambda-ring λ2(r)=0\lambda^2(r) = 0 implies λn(r)=0\lambda^n(r) = 0 for n>2n > 2. However, my ability to do computations in lambda-rings is not up to the job of proving this!

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:29):

Kevin Carlson said:

"Via" and "by" and "through" aren't syntactically interchangeable, to be sure. You shouldn't say "Characterization of a class A by theorem B", whereas "via" works well there. Instead you say "by application of theorem B", or something. This is usually the situation with rarer words; there are few direct substitutions in English or, I'm sure, in most languages. "Utilize" for "use" is a famous exception that proves the rule!

Ok for « by ». But isn't « through » better than « via »? If I remember what I found by looking on the internet, « through » can be interpreted as « by means of », whereas « via » is also interpreted as « by means of » today, but in the past was used more strictly, for a city through which you pass on a journey. I guess it is used in the sense of « by means of » today because it is a Latin word and so sounds more poetic and sophisticated.

I also discovered that « cf. » stands for « confer », which means « compare » in Latin. It was used in the past in a more restricted way, in particular in legal texts, to mean « compare with the other reference, which says something different. » But today it is used to mean « see », almost all the time to point to a reference where more details are given, so to support a point, which is almost the opposite of what it means in Latin.

At least for cf., it shows that people tend to use uncommon words just to sound cool, distorting their original meaning.

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:30):

(Sorry for this interruption which has nothing to do with the very cool math you were talking about.)

view this post on Zulip Todd Trimble (Oct 23 2024 at 17:32):

Responding to John here: frankly, I get confused about some things here myself, especially when it comes to dealing with negatives. A question that's been needling me is whether every lambda-ring comes from a 2-rig, i.e., whether we can cook up a 2-rig whose Grothendieck ring is the given lambda-ring. That would be a big help!

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:33):

A related observation (which can be made this time on the paper on the splitting principle!) is that people tend to say that two categories are equivalent when in fact they are even isomorphic, because « equivalent » sounds cooler.

view this post on Zulip Todd Trimble (Oct 23 2024 at 17:46):

I've noticed this thing about "cf." (some people write instead c.f., not knowing where the abbreviation comes from), and sometimes I have pangs of conscience about using it in this slightly looser way, but then again, language is constantly changing, so I waffle back and try not to worry about it much, feeling that most people reading would get the drift.

I wouldn't attribute "equivalence" to wanting to sound cooler, necessarily (and it sounds unkind to think so). For some personalities, it could be an act of hedging one's bets or playing it safe. It's a tricky business.

There is a spot in the splitting principles paper where we deal with a 1-limit in a 2-category (where we introduce A\mathsf{A}^{\boxtimes \infty}), which requires thinking up to isomorphism with very specific models in mind, but then other ways of referring to the object up to equivalence could be appropriate if one wishes to evoke other ways of thinking about it. But I'm actually not sure what you're talking about. Do you want to tell us more specifically what you had in mind?

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:50):

Sure. I was thinking about Lemma 5.2. I think the equivalence is actually an isomorphism here.

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:52):

Todd Trimble said:

I've noticed this thing about "cf." (some people write instead c.f., not knowing where the abbreviation comes from), and sometimes I have pangs of conscience about using it in this slightly looser way, but then again, language is constantly changing, so I waffle back and try not to worry about it much, feeling that most people reading would get the drift.

Ahah, almost the same for me. Now that I know precisely the original meaning, I feel bad about using it to mean "see" but I'm so used to that it's difficult to stop doing it. But I also think that we don't have to be so conservative about words and so it doesn't matter much.

view this post on Zulip Todd Trimble (Oct 23 2024 at 17:54):

Jean-Baptiste Vienney said:

Sure. I was thinking about Lemma 5.2. I think the equivalence is actually an isomorphism here.

Eh, I think I'll let John or Joe respond. (I have to leave the house!)

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 17:58):

Well, now I think that it is probably not an isomorphism :sweat_smile:. It is just that some category theory people defined the category of affine schemes as CommAlgop\mathrm{CommAlg}^{op} in talks. So I might be completely wrong.

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 18:00):

Hmm, but you actually define the category of affine schemes precisely like this ahah (as the opposite of CommAlg\mathbf{CommAlg}). So I wasn't wrong about your Lemma 5.2 after all.

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 18:02):

It must be that you wanted to give a simple definition, but then felt uneasy about it in the statement and proof of Lemma 5.2, because you know it is not the usual definition, and the usual definition is only equivalent, not isomorphic, to yours.

view this post on Zulip Joe Moeller (Oct 23 2024 at 18:09):

I think it is something like this, but maybe upside down. It's morally correct to treat them as equivalent, but it's also good manners to give a definition of things. So we take advantage of the equivalence to shortcut the definition. Then if we ever asked ourselves whether these two things are at odds, we moved on from it quickly. I don't actually see the benefit of the isomorphism. I think of isomorphisms of categories as equivalences with a bijection on objects. Bijections are good for counting, but I don't want to count anything here.

view this post on Zulip Jean-Baptiste Vienney (Oct 23 2024 at 18:36):

Equivalences of categories confuse me because I always think of the equivalence between the category of matrices over kk and the category of finite-dimensional vector spaces over kk, which feel quite different to me: the first one is a world where you think in terms of coordinates and need to make arbitrary choices to do so (if you start from abstract vector spaces); in the second one you think without coordinates. You can compute easily in the first one with a computer, but it would be more difficult to work with the second one on a computer. I tend to feel that category theory is too coarse here (like a coarser topology) and sweeps some subtleties under the rug (I mean, if you consider that two equivalent categories are more or less the same category). But maybe it's just a psychological problem of mine. So when I read "equivalent" but it is in fact "isomorphic" (or when it is not even clear which it should be because of how things are written), it creates these questionings in my mind and distracts me from the real point of the work under consideration.

view this post on Zulip John Baez (Oct 23 2024 at 20:33):

Jean-Baptiste Vienney said:

Well, now I think that it is probably not an isomorphism :sweat_smile:. It is just that some category theory people defined the category of affine schemes as CommAlgop\mathrm{CommAlg}^{op} in talks. So I might be completely wrong.

This is an example of why I say "equivalence": to show you have an isomorphism of categories you have to check everything very carefully.

But also there's no advantage to doing so! If we are doing category theory in the usual style, isomorphisms between objects in a 2-category like Cat\mathbf{Cat} are considered no more useful than equivalences. They are merely distracting.

(I am avoiding the word [[evil]], because it's not politically correct. :upside_down:)

view this post on Zulip Todd Trimble (Oct 23 2024 at 20:33):

I think that saying "isomorphism" instead would also be distracting for many readers, because it would stop them in their tracks and make them think, "really, isomorphic?", when so much else is already going on. I agree with Joe that there's no real benefit to saying "isomorphism" here, and I expect most readers will go along with "equivalence" without a murmur (it's certainly not wrong to say equivalence). The distinction is not worth bothering about.

Jean-Baptiste Vienney said:

It must be that you want to give a simple definition but then you feel bad about it in the proof and statement of Lemma 5.2 because you know that this is not the usual definition and the usual definition is only equivalent and not isomorphic to your definition.

I assure you that I didn't "feel bad" about anything! Well, not here anyway. I feel bad that we didn't finish the proofs of more theorems! :-)

view this post on Zulip John Baez (Oct 23 2024 at 20:43):

I don't want to think about whether Lemma 5.2 could be stating an isomorphism - it's like thinking about how many threads of cotton are in my sock, when I just want to put on my sock.

Let me just tell lurkers what this lemma says.

This lemma says that the category of monoids internal to the category of affine schemes is equivalent to the category of commutative bialgebras. It's a triviality, but the two viewpoints have a different feel to them. In one case I picture a geometrical object, an affine scheme, which is equipped with a multiplication making it a monoid. In the other case I imagine a vector space which is equipped with a commutative multiplication and a comultiplication that get along with each other. So the first picture is 'geometrical' while the second is 'algebraic'.

For example in the first picture I might imagine the Lie group SL(2,R)\mathrm{SL}(2,\mathbb{R}), which is shaped like some sort of 3-dimensional hyperboloid in 4-space, but I'd view it as an algebraic variety equipped with a group structure. In the second picture I'd imagine the commutative algebra

k[a,b,c,d]/adbc1 k[a,b,c,d]/\langle ad - bc - 1 \rangle

and think about how to equip it with a comultiplication arising from the group structure of SL(2,R)\mathrm{SL}(2, \mathbb{R}) . I find this second picture a lot less intuitive, but it has the advantage that in the end we just have a vector space with some operations and co-operations - so our linear algebra skills become useful.

view this post on Zulip Morgan Rogers (he/him) (Oct 23 2024 at 21:21):

In a talk I'm giving to some algebraic topologists tomorrow I'm going to talk about the general process of externalization (of which the above amounts to a particular case). These different perspectives on objects can be wildly different and I'm confident there are many cases where the different perspectives have not been adequately exploited :star_struck:

view this post on Zulip Todd Trimble (Oct 23 2024 at 21:24):

I suspect that many people who will open that paper have never gotten seriously familiar with categories of comodules of coalgebras, or at least not to anything like the extent they are familiar with modules over algebras. I got an inkling of their importance for algebraic representations of algebraic monoids through conversations some years back with Jim Dolan, but even so it was really only while we were developing ideas for this paper that I really began getting my hands dirty with them. (That's just an expression; no shade on comodule theory, which is very clean and beautiful, and sometimes surprising!)

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:04):

Getting back to our topic --

The jokey aspect is that Ramachandran manipulates certain infinite Witt sums (= infinite products of formal power series with constant coefficient 11), but the language of commutative rings doesn't accommodate infinite products in general. So there's a faint odor of bullshit to what he's doing, even though his arguments are succinct and suggestive, in the style of a good Littlewood joke.

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:05):

Here is my attempt to put what Ramachandran does on grounds that seem more rigorous to me. Taking it from the top: let f,g:ΛRf, g: \Lambda \to R be two elements in the big Witt ring W(R)W(R), and let fgf \ast g denote their Witt product that we are trying to describe. Using the fact that ΛΛ\Lambda \otimes \Lambda is the coproduct of Λ\Lambda with itself in the category of commutative rings, there is an induced ring map that I'll denote as (f,g):ΛΛR(f, g): \Lambda \otimes \Lambda \to R. As mentioned earlier, this is the composite

ΛΛfgRRmR.\Lambda \otimes \Lambda \overset{f \otimes g}{\longrightarrow} R \otimes R \overset{m}{\longrightarrow} R.

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:05):

Next, the splitting principle gives a 2-rig extension ϕ:kSA\phi: \overline{k \mathsf{S}} \to \mathsf{A}^{\boxtimes \infty}, or better yet an extension of graded 2-rigs. Here A\mathsf{A}^{\boxtimes \infty} is defined to be the 2-rig consisting of functors N()FinVect\mathbb{N}^{(\infty)} \to \mathsf{FinVect}, where N()\mathbb{N}^{(\infty)} is the commutative monoid of natural number sequences (m1,m2,)(m_1, m_2, \ldots) whose sum m1+m2+m_1 + m_2 + \cdots is finite, and we regard this commutative monoid as a discrete symmetric monoidal category. This is graded by the sum m=m1+m2+m = m_1 + m_2 + \cdots; the set of sequences with that sum is denoted N()(m)\mathbb{N}^{(\infty)}(m). The 2-rig extension takes xkSx \in \overline{k \mathsf{S}} to the functor that is constantly the ground field kk on elements of the component m=1m = 1, and 00 for other mm. The elements of N()(1)\mathbb{N}^{(\infty)}(1) are (1,0,0,)(1, 0, 0, \ldots), (0,1,0,)(0, 1, 0, \ldots), (0,0,1,)(0, 0, 1, \ldots), etc., and the ithi^{th} line object LiL_i takes the ithi^{th} of these elements to kk, and all other elements of N()\mathbb{N}^{(\infty)} to 00. Hence ϕ(x)\phi(x) is L1L2L_1 \oplus L_2 \oplus \cdots. Decategorifying this 2-rig map gives a commutative ring map, even a lambda-ring map Λ=K(kS)K(A)\Lambda = K(\overline{k\mathsf{S}}) \to K(\mathsf{A}^{\boxtimes \infty}), sending λj\lambda^j to the jthj^{th} elementary symmetric function ej(x1,x2,)e_j(x_1, x_2, \ldots) as we have discussed, where xix_i is the isomorphism class [Li][L_i]. So far, this is all on solid ground.

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:06):

Applying WW to this ring map ϕ:ΛK(A)\phi: \Lambda \to K(\mathsf{A}^{\boxtimes \infty}), we get the assignment

i0λitin1(1+xnt)\sum_{i \geq 0} \lambda^i t^i \mapsto \prod_{n \geq 1} (1 + x_n t)

as long as we say to ourselves that the right side is suggestive shorthand for the well-founded expression j0ej(x1,x2,)tj\sum_{j \geq 0} e_j(x_1, x_2, \ldots) t^j, which makes sense in our context (yes, the coefficients eje_j are "infinite sums", but they make sense as elements in K(A)[N(),Z]K(\mathsf{A}^{\boxtimes \infty}) \cong [\mathbb{N}^{(\infty)}, \mathbb{Z}], just as ordinary formal power series in A[[t]]A[[t]], which ostensibly are infinite sums n0antn\sum_{n \geq 0} a_n t^n, make perfect sense as sequences NA\mathbb{N} \to A of elements of AA).
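
For instance, here is that shorthand checked literally in finitely many variables, as a minimal sympy sketch:

```python
import itertools
import sympy as sp

N = 4
t = sp.Symbol('t')
x = sp.symbols(f'x1:{N + 1}')

# prod_n (1 + x_n t) is shorthand for sum_j e_j(x) t^j; check it literally.
product = sp.expand(sp.prod([1 + xn * t for xn in x]))
for j in range(N + 1):
    e_j = sum(sp.prod(c) for c in itertools.combinations(x, j))
    assert sp.expand(product.coeff(t, j) - e_j) == 0
```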

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:06):

[By the way, if we interpret what Ramachandran is doing with his infinite Witt sums, we are led to write down infinite products of type

i111xit\prod_{i \geq 1} \frac1{1 - x_i t}

as an element of W(K(A))W(K(\mathsf{A}^{\boxtimes \infty})). Yes, one can make sense of this, but... I don't know about you, but taking infinite products of linear terms 1+xit1 + x_i t feels more familiar and comfortable to me than taking infinite products of geometric series. I suppose that's silly, since Euler didn't blink an eye writing down

p11ps=n1ns,\prod_p \frac1{1 - p^{-s}} = \sum_{n \geq 1} n^{-s},

but anyway I'll stick to how I'm setting this up here.]

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:07):

I said above that I want to consider the pushout i:RRi: R \to R', where we push out the injection ϕϕ:ΛΛK(A)K(A)\phi \otimes \phi: \Lambda \otimes \Lambda \to K(\mathsf{A}^{\boxtimes \infty}) \otimes K(\mathsf{A}^{\boxtimes \infty}) along the map (f,g):ΛΛR(f, g): \Lambda \otimes \Lambda \to R.

And one could hope, at least for the limited purpose of trying to put a gloss on what Ramachandran is doing, that this i:RRi: R \to R' is also an injection. (There might be some really principled way of seeing that, but I don't know what it would be.) It would follow that the induced map W(i):W(R)W(R)W(i): W(R) \to W(R'), which by definition is [Λ,i][\Lambda, i], is also an injection.

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:07):

In the pushout square, there's also a map coming "down",

K(A)K(A)R.K(\mathsf{A}^{\boxtimes \infty}) \otimes K(\mathsf{A}^{\boxtimes \infty}) \to R'.

So the plan is to Witt-multiply the elements i1(1+xit)\prod_{i \geq 1} (1 + x_i t) and j1(1+yjt)\prod_{j \geq 1} (1 + y_j t), where the first product sits in the image of

W(K(A))W(i1)W(K(A)K(A))W(K(\mathsf{A}^{\boxtimes \infty})) \overset{W(i_1)}{\hookrightarrow} W(K(\mathsf{A}^{\boxtimes \infty}) \otimes K(\mathsf{A}^{\boxtimes \infty}))

with i1i_1 being the first coproduct coprojection in the category of commutative rings, and the second product similarly sits in the image of W(i2)W(i_2). Their Witt-product is the simple-looking

i,j(1+xiyjt)W(K(A)K(A))\prod_{i, j} (1 + x_i y_j t) \in W(K(\mathsf{A}^{\boxtimes \infty}) \otimes K(\mathsf{A}^{\boxtimes \infty}))

which is again a shorthand for something more complicated-looking, but what is going on can be derived at the 2-rig level. (I am tempted to give this 2-rig level explanation now, but I'll resist.) Now push this element down to W(R)W(R'). The result is a corresponding Witt-product in W(R)W(R'). This gives the image in W(R)W(R') of the desired Witt-product in W(R)W(R); this desired element is uniquely determined, by injectivity of W(i):W(R)W(R)W(i): W(R) \to W(R').

view this post on Zulip Todd Trimble (Oct 24 2024 at 00:08):

This joke is by now more like a shaggy dog story, and I think that's about as far as I'll take it for now. I'm hoping it will make sense to John at least, how this account fits with the verbiage set down in Ramachandran's paper. Somehow I imagine the cognoscenti reading his paper smiling and nodding knowingly at this passage, and with others having cartoon question marks popping out of their heads, because what he writes really is cryptic unless you already know (or until you figure out) the story.

Having unraveled what I think he was getting at, I think it makes matters harder than necessary, although all the ideas are there. But again, all we have to do is figure out how comultiplication μ:ΛΛΛ\mu: \Lambda \to \Lambda \otimes \Lambda works, then take the composite

ΛμΛΛfgRRmR.\Lambda \overset{\mu}{\longrightarrow} \Lambda \otimes \Lambda \overset{f \otimes g}{\longrightarrow} R \otimes R \overset{m}{\longrightarrow} R.

You figure out how comultiplication works using the splitting principle, along the lines sketched way back here in a simple case, which was amplified further here.

view this post on Zulip John Baez (Oct 24 2024 at 15:27):

Thanks for getting to the bottom of multiplication in the big Witt ring, @Todd Trimble! It looks a lot simpler and less problematic in the λi\lambda^i basis, which almost makes me wonder: why bother with the σi\sigma^i basis?

I guess Ramachandran provides one answer to this question. As you pointed out, in the σi\sigma^i basis a line element in W(R)W(R) looks like

11at \displaystyle{\frac{1}{1 - at}}

Ramachandran wants to relate the big Witt ring to zeta functions over finite fields Fq\mathbb{F}_q; one of the simplest of these is the zeta function of the affine line, which is

Z(A1,t)=11qt \displaystyle{ Z(\mathbb{A}^1, t) = \frac{1}{1 - qt} }

I suspect the resemblance is no coincidence.

I just now noticed the double appearance of the word "line": line element versus affine line. That could be a coincidence: it's hard for me to connect these two kinds of line.

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:33):

Oh! Interesting observation. I'll/we'll have to ponder whether there's some reason that zeta functions would jibe better with the σi\sigma^i.

view this post on Zulip John Baez (Oct 24 2024 at 15:34):

It's all rather mysterious. But the zeta function of the affine line should not be too mysterious.

In general, the coefficient of tnt^n in the zeta function Z(X,t)Z(X,t) is defined to be the number of Fqn\mathbb{F}_{q^n}-points of the scheme XX.

The number of Fqn\mathbb{F}_{q^n}-points of the affine line is simply qnq^n. So we get

Z(A1,t)=1+qt+q2t2+ Z(\mathbb{A}^1, t) = 1 + qt + q^2 t^2 + \dots

or in other words

Z(A1,t)=11qt\displaystyle{ Z(\mathbb{A}^1, t) = \frac{1}{1 - qt} }

view this post on Zulip John Baez (Oct 24 2024 at 15:38):

The really big result Ramachandran proves in this area is that

Z(X×Y,t)=Z(X,t)WZ(Y,t) Z(X \times Y, t) = Z(X,t) \ast_W Z(Y,t)

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:39):

Curious. The σn\sigma^n refer to the class of SnS^n, and for a line object LL we have Sn(L)LnS^n(L) \cong L^{\otimes n}. But this is in contrast to the cartesian product LnL^n, whose size is qnq^n.

I'll have to peer at that paper some more.

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:40):

That's a very pretty formula for the zeta function!

view this post on Zulip John Baez (Oct 24 2024 at 15:41):

We can test Ramachandran's formula

Z(X×Y,t)=Z(X,t)WZ(Y,t) Z(X \times Y, t) = Z(X,t) \ast_W Z(Y,t)

in an example:

Z(A2,t)=11qtW11qt=11q2t Z(\mathbb{A}^2, t) = \displaystyle{\frac{1}{1 - qt} \ast_W \frac{1}{1 - qt} = \frac{1}{1 - q^2t} }

=1+q2t+q4t2+ = 1 + q^2 t + q^4 t^2 + \cdots

This is right, since the affine plane has q2nq^{2n} points over Fqn\mathbb{F}_{q^n}.

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:42):

Yes indeed.

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:46):

Ah, so the fleeting remark I made about the Euler product formula might not have been too far off the mark.

view this post on Zulip John Baez (Oct 24 2024 at 15:50):

Right! Actually I'm confused: rereading Ramachandran I think my claim that Z(X,t)Z(X,t) is just the generating function for the number of Fqn\mathbb{F}_{q^n}-points of XX is wrong, but weirdly all the specific computations I did seem to work.

view this post on Zulip John Baez (Oct 24 2024 at 15:51):

The top formula of his equation (11) is the right formula relating Z(X,t)Z(X,t) to numbers of points.
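
That formula is presumably the standard one, Z(X,t) = exp(sum_{n >= 1} #X(F_{q^n}) t^n / n). As a sanity check, plugging in #A^1(F_{q^n}) = q^n really does return 1/(1 - qt); a quick sympy sketch (truncation order is my choice):

```python
import sympy as sp

q, t = sp.symbols('q t')
N = 8  # truncation order

# Z(A^1, t) = exp( sum_{n>=1} #A^1(F_{q^n}) t^n / n ), with q^n points:
logZ = sum(q**n * t**n / n for n in range(1, N + 1))
Z = sp.series(sp.exp(logZ), t, 0, N + 1).removeO()
expected = sp.series(1 / (1 - q * t), t, 0, N + 1).removeO()
assert sp.expand(Z - expected) == 0
```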

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:53):

Yes, that formula is familiar. You and Jim have those nice papers on the nLab, whose titles you will be able to recall more quickly than I can. One of them interprets the Hasse-Weil zeta function. The other is about zeta functions of Z-sets generally.

view this post on Zulip Todd Trimble (Oct 24 2024 at 15:57):

The stuff on Euler characteristics is extremely interesting. (And the very coarse "cutting apart" of a scheme XX into YY and XYX \setminus Y reminds me very much of Schanuel's papers on negative sets and Euler characteristic. I mean like this one. I don't know if there are others particularly, except for "What is the length of a potato?" where the Euler characteristic plays a starring role. Here it is, courtesy of Tom Leinster's website -- thanks @Tom Leinster !)

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:01):

You were also telling me about motives some months back, which by my memory also involve this coarse cutting apart of schemes, reminiscent of how a projective space is a "sum"

kn+11k1=1+k++kn\frac{k^{n+1} - 1}{k - 1} = 1 + k + \ldots + k^n

where the left side is the quotient of punctured (n+1)(n+1)-space by the action of the multiplicative group kk^\ast, and the right side is a decomposition into Schubert cells.

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:06):

Hopefully you can remind me at some point of the things you were telling me about motives, if what I said rings any bells (or even if doesn't).

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:31):

I'll mention one pretty cool result from Schanuel's negative sets paper. It starts off with a reason that the open interval can be thought of as a negative set, indeed a proxy for 1-1. Take an open interval xx, say (0,1)(0, 1), and divide it into three parts: (0,1/2){1/2}(1/2,1)(0, 1/2) \cup \{1/2\} \cup (1/2, 1). Hence "2x+1x2x + 1 \sim x". If we could cancel xx, then "x1x \sim -1".

Next, consider the category of bounded polyhedra. A polyhedron is by definition a subset of some Euclidean space Rn\mathbb{R}^n contained in the smallest Boolean subalgebra of P(Rn)P(\mathbb{R}^n) that contains loci of the form L(x1,,xn)0L(x_1, \ldots, x_n) \geq 0 where LL is an affine function. A bounded polyhedron is what you think it is. A morphism between polyhedra is the graph of a function between them that is itself a polyhedron. The category of polyhedra PP, or the subcategory of bounded polyhedra P0P_0, has some good properties, such as extensivity. The equivalence \sim comes from an isomorphism in this category.

Let B(P0)B(P_0) be the Burnside rig of P0P_0.

Theorem (Schanuel): The canonical rig map

N[x]/(x2x+1)B(P0)\mathbb{N}[x]/ (x \sim 2x + 1) \to B(P_0)

is an isomorphism. This is well worth pondering.

view this post on Zulip John Baez (Oct 24 2024 at 16:32):

Everything you say is ringing bells! :bell:

Ramachandran actually talks about zeta functions from a somewhat "motivic" point of view, but this is based on the low-budget approach to motives based on the "Grothendieck ring of varieties", as explained quite tersely on page 6. The idea is that you take the rig category of varieties, decategorify and group complete it to get a commutative ring, and then impose the extra relations

[X]=[XY]+[Y] [X] = [X-Y] + [Y]

whenever YY is a subvariety of XX. (I am simplifying a bit here.)

view this post on Zulip John Baez (Oct 24 2024 at 16:34):

The high-end, difficult approach to motives seeks instead to define a rig category of motives, rather than merely this commutative ring, which is intended to be some sort of decategorification of that dreamt-of category.

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:35):

Mm. I think in fact we touched recently on the fact that motives should form a 2-rig, no?

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:36):

Or something close to it!

view this post on Zulip John Baez (Oct 24 2024 at 16:37):

Umm, maybe!

Ramachandran uses GFKGF_K to denote the Grothendieck ring of varieties over a field KK , and intriguingly writes:

The genesis of GFKGF_K dates back to 1964 (it was considered by Grothendieck [8, p.174] in his letter (dated August 16, 1964) to J.-P. Serre; it is the first written mention of the word "motives"). The ring GFKGF_K is a shadow (decategorification) of the category of motives; some aspects of the yoga of motives are not seen at the level of GFKGF_K.

Wow, he even says "decategorification".

view this post on Zulip Todd Trimble (Oct 24 2024 at 16:38):

I mean, we maybe have to change kk-linearity to something else, but we should have a symmetric monoidal category with coproducts and idempotent splittings. Just a passing thought for the moment.

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:00):

I want to get back to the Schanuel paper I was just describing. It must be understood of course that morphisms in the category P0P_0, let's say isomorphisms, need not be continuous at all -- the graph of a function can be broken up in pieces, as we saw in the case x2x+1x \to 2x + 1.

Two very interesting definitions: (1) the Euler characteristic of a commutative rig RR is the universal quotient RE(R)R \to E(R) to a commutative rig that enjoys additive cancellation; (2) the dimension of a commutative rig RR is the quotient RR/(1+11)R \to R/(1 + 1 \sim 1).

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:05):

The Euler characteristic of N[x]/(x2x+1)\mathbb{N}[x]/ (x \sim 2x + 1) is the expected quotient to Z\mathbb{Z} (easy to check I think).

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:09):

I'll copy what Schanuel says about the dimension of N[x]/(x2x+1)\mathbb{N}[x]/ (x \sim 2x + 1):

Equally simple, if less familiar, is D(R)D(R): it is

D(R)={0=d,1=d0,d1,d2,}D(R) = \{ 0 = d^{-\infty}, 1 = d^0, d^1, d^2, \ldots \}

with didj=di+jd^i d^j = d^{i+j} and di+dj=dmax(i,j)d^i + d^j = d^{\max(i, j)}. The exponential notation is in keeping with the idea that multiplying polyhedra adds dimensions, while adding gives the maximum of the two dimensions.
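
Both invariants are easy to play with in code. In the following sketch (my own encoding, not Schanuel's), D is modeled max-plus style on exponents, E is just Z, and we check that dim(x) = d^1 and the Euler characteristic -1 both respect the defining relation x ~ 2x + 1 of the open interval:

```python
# Dimension rig D: elements are exponents i in {-inf} union N, standing
# for d^i. Addition is max (adding polyhedra takes the larger dimension),
# multiplication is + (multiplying polyhedra adds dimensions).
NEG_INF = float('-inf')  # represents 0 = d^{-inf}

def D_add(i, j):
    return max(i, j)

def D_mul(i, j):
    return NEG_INF if NEG_INF in (i, j) else i + j

# The open interval x satisfies x ~ 2x + 1 in the Burnside rig.
dim_x, dim_1 = 1, 0  # d^1 and d^0
assert D_add(D_add(dim_x, dim_x), dim_1) == dim_x   # 2x + 1 ~ x in D

# Its Euler characteristic -1 respects it in Z (which has cancellation):
chi_x = -1
assert 2 * chi_x + 1 == chi_x                        # 2(-1) + 1 == -1
```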

view this post on Zulip John Baez (Oct 24 2024 at 17:20):

Todd Trimble said:

I mean, we maybe have to change kk-linearity to something else, but we should have a symmetric monoidal category with coproducts and idempotent splittings.

Yes, I believe motives form a 2-rig. I can't believe I didn't notice that. I often find myself thinking about two things and noticing only later that they're connected.

I was a bit confused about the kk-linearity but yes, the category of pure motives defined using 'numerical equivalence' is a kk-linear abelian category; people don't emphasize the symmetric monoidal structure so much but it should exist.

(There are potentially many categories of pure motives defined using different 'adequate equivalence relations' on cycles, but some of the Standard Conjectures say some of these equivalence relations are the same as numerical equivalence... let us not sink into this mire now!)

Indeed, something that confused me for a while (and apparently still) is that the field kk we're talking about here is typically different than the field our varieties (or schemes) are defined over! You know how cohomology has 'coefficients'. Motives are like a universal cohomology theory for varieties defined over some field F\mathbb{F} with coefficients in some field kk. Right now we're talking about the field of coefficients.

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:25):

Instead of typing this out, I can just refer you to page 382 for the demonstration of the theorem (that I just ascribed to Schanuel).

The same considerations apply to other structures, the so-called o-minimal structures. An archetypal example is where semialgebraic sets replace the semilinear sets that constitute the category PP. These examples are the propositionally (or Boolean-)definable sets of a model R\mathbb{R} of a logical theory where the language for semialgebraic sets, say, would be given by (R,0,1,+,,,)(R, 0, 1, +, -, \cdot, \leq). In the types of theories I have in mind, there is a quantifier elimination theorem (e.g., the Tarski-Seidenberg theorem) that says the image of a definable set under a linear projection is itself definable. And there is also o-minimality, which says that the only definable subsets of the real line are finite unions of points and intervals. Model theorists, those clever devils, know how to tease out an incredible amount of geometric structure from these two conditions.

Anyway, I think the rough upshot is that Schanuel's theorem extends to such cases as semialgebraic sets.

view this post on Zulip John Baez (Oct 24 2024 at 17:30):

For anyone interested, we're talking about Schanuel's "What is the length of a potato?" and his "Negative sets have Euler characteristic and dimension".

The latter, unfortunately, is paywalled: I hope the Kazakhs have liberated it.

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:30):

John Baez said:

Yes, I believe motives form a 2-rig. I can't believe I didn't notice that. I often find myself thinking about two things and noticing only later that they're connected.

I was a bit confused about the kk-linearity but yes, the category of pure motives defined using 'numerical equivalence' is a kk-linear abelian category; people don't emphasize the symmetric monoidal structure so much but it should exist.

Well, this is incredible. I sort of took a seat-of-the-pants guess there. I'm eager to learn more!

Where do you find this?

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:31):

(The Kazakhs are doing just fine, btw. I have a tab open.)

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:37):

By the way, I read about 25 minutes ago that the zeta functions under study are valued in W(Z)W(\mathbb{Z}). It should be mentioned that Z\mathbb{Z} is itself a lambda-ring, where the λi\lambda^i act on Z\mathbb{Z} by n(ni)n \mapsto \binom{n}{i}. These functions generate, as a ring, precisely the integer-valued polynomial functions ZZ\mathbb{Z} \to \mathbb{Z}.
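
For what it's worth, the exponential law λ^n(r+s) = sum_{j+k=n} λ^j(r) λ^k(s) holds for this structure because it is exactly Vandermonde's identity for binomial coefficients; a brute-force sympy check (grid size is my choice):

```python
import sympy as sp

def lam(i, n):
    # The lambda operations on Z: lambda^i(n) = binomial(n, i).
    return sp.binomial(n, i)

# lambda^n(r+s) = sum_{j+k=n} lambda^j(r) lambda^k(s) is Vandermonde's
# identity; check it on a grid of integers, including negatives.
for r in range(-5, 6):
    for s in range(-5, 6):
        for n in range(0, 6):
            assert lam(n, r + s) == sum(lam(j, r) * lam(n - j, s)
                                        for j in range(n + 1))
```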

view this post on Zulip John Baez (Oct 24 2024 at 17:39):

Todd Trimble said:

Where do you find this?

I've read a lot about motives here and there. I believe you'd like this, which defines pure motives and states their known properties:

He also has more advanced papers but this covers everything I just said.

By the way, when he says the category of motives is not Tannakian, he (like many other people) is sort of crying wolf: they're not Tannakian with a certain bad symmetric monoidal structure, but they are with the good 'super' symmetric monoidal structure, where you stick in minus signs in the expected places.

(So I take it back about the symmetric monoidal structure being less discussed - I seem to have forgotten lots of stuff I knew.)

view this post on Zulip Todd Trimble (Oct 24 2024 at 17:49):

I'm making a note for later about the occurrence of Chow rings on page 5 of Milne's paper (we were talking about these in connection with Grothendieck-Riemann-Roch).

view this post on Zulip John Baez (Oct 24 2024 at 18:07):

While I do want to learn algebraic geometry, I don't feel I have much of a chance doing anything new when it comes to Chow rings. To work with those, you need a good understanding of algebraic varieties. For example, to define the intersection map in equation (1) on Milne's page 5, you need Chow's moving lemma, which says that given two subvarieties you can move one a bit so that they're in general position. This is like an algebraic geometry version of the fact that given a kk-dimensional submanifold and an (nk)(n-k)-dimensional submanifold of an nn-manifold, you can isotope one of them so that they intersect in finitely many points. I suffered through learning the techniques for doing such things in differential topology, but I have no desire to go through it all again in the more rigid context of algebraic geometry! And this is just the basic stuff: it gets a lot worse. This is why the Standard Conjectures remain conjectures.

Where I think I might contribute is in figuring out how to distill some concepts from algebraic geometry, formulate them using category theory, and prove (easy) things about them.

view this post on Zulip John Baez (Oct 24 2024 at 18:18):

So, when it comes to Ramachandran's paper, I'm not seriously interested in proving anything about motives or the Grothendieck ring of varieties. But his proof that the zeta function of a variety obeys

Z(X×Y,t)=Z(X,t)WZ(Y,t) Z(X \times Y, t) = Z(X,t) \ast_W Z(Y,t)

is just a calculation which doesn't really use anything about varieties! It probably works for the Hasse-Weil zeta function of any functor from finite commutative rings to finite sets. (We think of such a functor as telling us the set of RR-points of some gadget for each finite commutative ring RR, but we don't have to say what this gadget is! The functor says it all.)

view this post on Zulip John Baez (Oct 24 2024 at 18:20):

So, the kind of question I'm really interested in now is what does multiplication in the big Witt ring really mean - and why does it make Ramachandran's identity hold?

view this post on Zulip John Baez (Oct 24 2024 at 18:20):

And that's what you've been explaining!

view this post on Zulip Todd Trimble (Oct 24 2024 at 18:36):

Slogan from Ramachandran (p. 15):

Motivic measures are invariants of algebraic varieties that behave like Euler characteristics.

view this post on Zulip Todd Trimble (Oct 24 2024 at 18:44):

To be honest, I'm having trouble identifying where Ramachandran's proof of Theorem 2.1 begins and ends. I'll continue reading and scanning (between this and the Milne paper you linked to).

view this post on Zulip John Baez (Oct 24 2024 at 18:57):

It probably starts at the bottom of page 10 where it says Proof (of Thm. 2.1). But it relies on the previous lemmas.

view this post on Zulip Todd Trimble (Oct 24 2024 at 18:59):

Oh, I glided right over that text. Thanks.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:00):

Oh I see, these ghost components are all about the Adams operations.

view this post on Zulip John Baez (Oct 24 2024 at 19:01):

Good god, it's Adams' ghost! :ghost:

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:02):

Well, this just looks ridiculously easy, I have to say. :-)

view this post on Zulip John Baez (Oct 24 2024 at 19:02):

Seriously, that's frigging amazing.

view this post on Zulip John Baez (Oct 24 2024 at 19:02):

It's another case of how two things I'm struggling to understand turn out to be related.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:04):

I mean, I think this is more or less how it goes. Give me a minute to think.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:10):

So I'm looking at my notes that I shared with you, split2rigs. (I want time to think whether I want those publicly shared yet.) Notation: σt=n0σntn\sigma_t = \sum_{n \geq 0} \sigma^n t^n. So then the Adams operations ψn\psi^n can be defined by a generating function

ψt=n1ψntn\psi_t = \sum_{n \geq 1} \psi^n t^n

where ψt\psi_t is this thing, tt times the logarithmic derivative of σt\sigma_t. We've talked about these things before. The Adams operations are of course ring homomorphisms.
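
Under the splitting principle this is concrete: with sigma_t = prod_i 1/(1 - x_i t), the logarithmic derivative yields the power sums, ψ^n = p_n = sum_i x_i^n, which is the ghost-component story again. A truncated sympy check (the number of variables and the truncation order are my choices):

```python
import sympy as sp

N, M = 4, 6  # N variables, truncate at t^M
t = sp.Symbol('t')
x = sp.symbols(f'x1:{N + 1}')

sigma_t = sp.prod([1 / (1 - xi * t) for xi in x])
psi_t = sp.series(t * sp.diff(sp.log(sigma_t), t), t, 0, M + 1).removeO()

# The coefficient of t^n should be the n-th power sum p_n = sum_i x_i^n.
for n in range(1, M + 1):
    p_n = sum(xi**n for xi in x)
    assert sp.expand(psi_t.coeff(t, n) - p_n) == 0
```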

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:12):

Well, "of course". I mean that of course you've seen me talk about this before. We recently went through a proof in that split2rigs, where I invoke the beautiful identity

σnσn1λ1+σn2λ2=0\sigma^n - \sigma^{n-1} \lambda^1 + \sigma^{n-2} \lambda^2 - \ldots = 0

for all nn.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:14):

What the hell, split2rigs.pdf.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:15):

Anyway, you see tt times the logarithmic derivative in the proof of Lemma 2.3 in Ramachandran.

view this post on Zulip John Baez (Oct 24 2024 at 19:17):

Yes, I think we talked about that. But are the ghost components simply the components of an element of W(R)W(R) in the ψn\psi^n basis? Is that what you're about to tell me?

view this post on Zulip John Baez (Oct 24 2024 at 19:18):

If so, the ring homomorphism property of the ψn\psi^n should do something good for these ghost components.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:19):

Todd Trimble said:

Well, "of course". I mean that of course you've seen me talk about this before. We recently went through a proof in that split2rigs, where I invoke the beautiful identity

σnσn1λ1+σn2λ2=0\sigma^n - \sigma^{n-1} \lambda^1 + \sigma^{n-2} \lambda^2 - \ldots = 0

for all nn.

I'll just mention that the "beautiful identity" can be written as σtλt=1\sigma_t \cdot \lambda_{-t} = 1, symbolically

σt=1λt\sigma_t = \frac1{\lambda_{-t}}

and that's where this whole connection with 11at\frac1{1 - at} being the σi\sigma^i basis analogue of the line elements 1+at1 + at comes into play.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:20):

John Baez said:

Yes, I think we talked about that. But are the ghost components simply the components of an element of W(R)W(R) in the ψn\psi^n basis? Is that what you're about to tell me?

Yes, I think you could put it that way.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:30):

John Baez said:

If so, the ring homomorphism property of the ψn\psi^n should do something good for these ghost components.

Yes, that's the message! The ring homomorphism property of the ψn\psi^n gives the first line of the proof of Theorem 2.1.

view this post on Zulip Todd Trimble (Oct 24 2024 at 19:49):

So I'm speculating out loud here, really just playing around, but looking at Ramachandran's Remark 2.4, I get a sense that a zeta function Z(X,t)W(Z)Z(X, t) \in W(\mathbb{Z}) on motives XX might be derivable by taking advantage of the 2-rig structure on the category of motives (I'm following Milne and considering rational equivalence classes of algebraic cycles -- he uses this symbol \sim to cover either the rational equivalence case or the numerical equivalence case). I'll let MM denote this 2-rig of motives. Then its Grothendieck ring K(M)K(M) is a lambda-ring. That means we get a canonical WW-coalgebra structure (which is a WW-coalgebra map = lambda-ring map)

η:K(M)W(K(M))\eta: K(M) \to W(K(M))

and now I'm half-wondering whether composition with the lambda-ring map W(d):W(K(M))W(Z)W(d): W(K(M)) \to W(\mathbb{Z}) where I'll describe dd in a moment, could morally be a zeta function

ζ:K(M)W(Z).\zeta: K(M) \to W(\mathbb{Z}).

Here I'm guessing that d:K(M)Zd: K(M) \to \mathbb{Z} could be a dimension function, maybe. Or better yet -- Euler characteristic?! It might itself come from a suitable 2-rig map MFinVectQM \to \mathsf{FinVect}_\mathbb{Q}. (Eh, maybe not.)

(As I say, this is playing around at the moment.)

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:00):

Ha ha ha, see just about the first line of the paper --

Steve Lichtenbaum’s philosophy [38, 37, 39] that special values of arithmetic zeta functions and motivic L-functions are given by suitable Euler characteristics.

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:02):

And also of course "Euler characteristic" appears in Remark 2.4. (But I need to think about this more slowly and carefully.)

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:03):

But my god, equation (15) is beautiful!

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:09):

There is a pretty pregnant comment on page 15:

• Theorem 2.1 says that XZ(X,t)X \mapsto Z(X, t) gives rise to a motivic measure Z:GKFqW(Z)Z : GK_{\mathbb{F}_q} \to W(\mathbb{Z}).

and now I remember the slogan I jotted down:

Todd Trimble said:

Slogan from Ramachandran (p. 15):

Motivic measures are invariants of algebraic varieties that behave like Euler characteristics.

(Sorry, this is a real jumble of half-baked thoughts...)

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:27):

On a different front: Ramachandran talks about the Grothendieck ring K(VarF)K(\mathrm{Var}_F) of schemes of finite type over a field FF. He says this is a pre-lambda-ring. So that's a kind of poor cousin of an actually lambda ring. A pre-lambda-ring RR is given by a map

RW(R)=[Λ,R]:r(λiλi(r))R \to W(R) = [\Lambda, R]: r \mapsto (\lambda^i \mapsto \lambda^i(r))

(so I'm defining the lambda operations in terms of the given structure map) satisfying an exponential law

λn(r+s)=n=j+kλj(r)λk(s)\lambda^n(r + s) = \sum_{n = j + k} \lambda^j(r) \lambda^k(s)

and I think that's about it. I'm roughly thinking that if K(VarF)K(\mathrm{Var}_F) is merely a pre-lambda ring, and not a lambda-ring, that may be because VarF\mathrm{Var}_F lacks good categorical properties (its not being a 2-rig, for instance). Working with a 2-rig of motives could address this. (?)

view this post on Zulip John Baez (Oct 24 2024 at 20:28):

Indeed, all this stuff is great! What I'd really like to do is "devein" it - as people say of shrimp - and remove the stuff related to varieties, leaving pure 2-rig theory. Of course we may need a 2-rig with extra structure and properties to get various things to work.

view this post on Zulip John Baez (Oct 24 2024 at 20:29):

Todd Trimble said:

On a different front: Ramachandran talks about the Grothendieck ring K(VarF)K(\mathrm{Var}_F) of schemes of finite type over a field FF. He says this is a pre-lambda-ring.

I seem to recall somewhere he says it's an open question whether it's a lambda-ring. Let me see what he says about that...

view this post on Zulip John Baez (Oct 24 2024 at 20:30):

Yeah, at the bottom of page 18 he says there are 4 pre-lambda-ring structures on K(VarF)K(\mathsf{Var}_F), and says for one it's not known if it's a lambda-ring. I guess this one is the only pre-lambda-ring structure he actually discusses.

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:32):

I'm getting a little lost there...

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:33):

I'd want to study this ζ^μ\hat{\zeta}_\mu thing.

view this post on Zulip John Baez (Oct 24 2024 at 20:34):

I edited my comment. I'm annoyed that he has a ζμ\zeta_\mu and a ζ^μ\hat{\zeta}_\mu running around and I don't see how he defines ζ^μ\hat{\zeta}_\mu.

view this post on Zulip John Baez (Oct 24 2024 at 20:35):

Oh, I see, it's defined by that factorization diagram near the bottom of page 18.

view this post on Zulip John Baez (Oct 24 2024 at 20:36):

Anyway, I don't understand this portion of the paper yet.

view this post on Zulip Todd Trimble (Oct 24 2024 at 20:37):

Yeah, I'm still not getting it.