reading through Baez's topos theory blog posts · learning: reading & references

A few disclaimers: I'm not sure how far I'll get! Also, my discussion here is unlikely to be self-contained; if you want to follow along, you may need to reference the blog posts. And finally, I am almost certainly going to make a lot of mistakes!

Please feel free to join in, or just to point out when I'm confused about something! I can't guarantee I'll have the energy to give you a full response if you do so, but please know that I will still appreciate whatever you choose to post.

David Egolf (Mar 26 2024 at 17:29):

I'm going to take a puzzle/exercise-based approach. I find it helps me focus my thoughts to have a particular thing I'm trying to figure out. (Sometimes I'll even jump straight to an exercise before reading a section! Then that exercise helps motivate my reading.)

Morgan Rogers (he/him) (Mar 26 2024 at 17:35):

The exercise is "Check that...", right? Do you have an idea of where to start? :wink:

David Egolf (Mar 26 2024 at 17:40):

Yes, that's right! And I think I do have an idea of where to start! It'll take me a minute to type it out, though.

David Egolf (Mar 26 2024 at 17:45):

We have a functor $F:\mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}$. Here, $\mathcal{O}(X)^{\mathrm{op}}$ is the category of open sets of a topological space $X$, where we have a unique morphism from $U$ to $V$ exactly if $V \subseteq U$. I think of a morphism from $U$ to $V$ as saying "$U$ contains $V$".

The first thing I want to note is that $\mathcal{O}(X)^{\mathrm{op}}$ is a poset. Consequently, all diagrams commute in $\mathcal{O}(X)^{\mathrm{op}}$! In particular, this diagram commutes for any $U_i, U_j \subseteq U$:
diagram

David Egolf (Mar 26 2024 at 17:50):

Next, I'll use the fact that functors send commutative diagrams to commutative diagrams. That means that this diagram in $\mathsf{Set}$ also commutes:
diagram

By $r_{U \to V}$ I mean the "restriction function" that restricts things from $U$ to $V$, for $V \subseteq U$. This is the image under $F$ of the unique morphism from $U$ to $V$ in $\mathcal{O}(X)^{\mathrm{op}}$.

David Egolf (Mar 26 2024 at 17:53):

Now, let's pick some $s \in FU$. Since this diagram commutes, we have that $r_{U_i \to U_i \cap U_j} \circ r_{U \to U_i}(s) = r_{U_j \to U_i \cap U_j} \circ r_{U \to U_j}(s)$. I believe this is just different notation for the thing we wanted to prove!
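For anyone who likes to poke at a concrete case, here is a minimal Python sketch of that equation. It assumes a toy setup that is not in the blog post: a four-point set with every subset open, and sections modelled as plain dicts. The name `restrict` is just an illustrative helper.

```python
# Hypothetical finite example: X = {1, 2, 3, 4} with the discrete topology,
# so every subset is open. A "section" over U is modelled as a dict U -> value.

def restrict(section, V):
    """Restrict a section (dict keyed by points) to the subset V of its domain."""
    assert V <= set(section), "can only restrict to a subset of the domain"
    return {x: section[x] for x in V}

U = {1, 2, 3, 4}
U_i = {1, 2, 3}
U_j = {2, 3, 4}

s = {1: 0.5, 2: -1.0, 3: 2.0, 4: 7.0}   # plays the role of an element of FU

# Two routes from U down to the overlap, plus the direct restriction.
via_i = restrict(restrict(s, U_i), U_i & U_j)
via_j = restrict(restrict(s, U_j), U_i & U_j)
direct = restrict(s, U_i & U_j)
```

Both routes give the same section on the overlap. The code only checks one instance, while the argument above covers every presheaf at once.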

Peva Blanchard (Mar 26 2024 at 18:01):

David Egolf (Mar 26 2024 at 18:12):

I use this amazing website to draw the diagrams: https://q.uiver.app/ .
Then all I have to do is take screenshots, and paste them into my draft!

David Egolf (Mar 27 2024 at 17:33):

David Egolf (Mar 27 2024 at 17:39):

To show that $F$ is a presheaf, we need to show that it is a functor $F: \mathcal{O}(\mathbb{R})^{\mathrm{op}} \to \mathsf{Set}$. Now, for each open $U \subseteq \mathbb{R}$, we have that $FU$ is the set of continuous real-valued functions on $U$. To talk about "continuous" functions means that there needs to be some topology on $U$. I think it's reasonable to assume that $U$ is equipped with the subspace topology it inherits from $\mathbb{R}$.

I'm a bit intrigued by the fact that we then have $FU = \mathsf{Top}(U, \mathbb{R})$, where $U$ is equipped with the subspace topology. This leads me to consider the functor $\mathsf{Top}(-,\mathbb{R}): \mathsf{Top}^{\mathrm{op}} \to \mathsf{Set}$.

David Egolf (Mar 27 2024 at 17:43):

I'd now like to dream up some functor $G: \mathcal{O}(\mathbb{R})^{\mathrm{op}} \to \mathsf{Top}^{\mathrm{op}}$, so that we can express $F$ as $F = \mathsf{Top}(-, \mathbb{R}) \circ G$.

David Egolf (Mar 27 2024 at 17:44):

It remains to show that $G$ is really a functor, and that $F = \mathsf{Top}(-, \mathbb{R}) \circ G$.

Peva Blanchard (Mar 27 2024 at 18:17):

spoiler

Regarding the definition of $F$. On objects: $F$ sends an open subset $U$ to the set $FU$ of continuous real-valued functions on $U$. On arrows: $F$ maps the containment $U \supseteq V$ to the function $FU \rightarrow FV$ that sends a continuous function $f$ on $U$ to its restriction to $V$. At this point, we could dive into the details of why the restriction of a continuous function to a subset of its domain is indeed a continuous function, given the relevant induced topologies on their domains. But that would be an exercise in topology, so ... I'll invoke the "reductio ad obvious-um" ...

The fact that $F$ preserves identities and composition amounts to:
* restricting to the entire domain is the same as not restricting anything at all
* when $U \supseteq V \supseteq W$, restricting a function $f : U \rightarrow \mathbb{R}$ to $V$ and then restricting to $W$ is the same as restricting $f$ directly to $W$.

I think the crux of the exercise is in proving that $F$ also satisfies the sheaf condition.

David Egolf (Mar 27 2024 at 18:22):

To show that $G$ is a functor, I think the only tricky thing to check (unless I'm missing something) is as follows: We want to show that if $U$ and $V$, with $V \subseteq U$, are open subsets of $\mathbb{R}$ equipped with the subspace topology inherited from $\mathbb{R}$, then the inclusion function $i:V \to U$ is continuous.

To show this, I want to use this property of the subspace topology: "Let $Y$ be a subspace of $X$ and let $i:Y \to X$ be the inclusion map. Then for any topological space $Z$, a map $f: Z \to Y$ is continuous if and only if the composite map $i \circ f$ is continuous".

In our case, we have the inclusion map $i_{V \to U}: V \to U$ and the two inclusions to $\mathbb{R}$, namely: $i_V: V \to \mathbb{R}$ and $i_U: U \to \mathbb{R}$. Since $i_U \circ i_{V \to U} = i_V$ and $i_V$ is continuous, we conclude (applying the property with $Y = U$, $X = \mathbb{R}$, $Z = V$ and $f = i_{V \to U}$) that $i_{V \to U}: V \to U$ is also continuous.

David Egolf (Mar 27 2024 at 18:24):

@Peva Blanchard I was strongly considering aiming for a more direct proof! I'll be interested to take a look at the "spoiler" in a bit!

David Egolf (Mar 27 2024 at 18:49):

It remains to show that $F = \mathsf{Top}(-, \mathbb{R}) \circ G$. To show this, it suffices to show the following:

Let's consider $r:U \to V$ in $\mathcal{O}(\mathbb{R})^{\mathrm{op}}$. So, $V \subseteq U$. First, we note that $\mathsf{Top}(-, \mathbb{R}) \circ G(U)$ first equips $U$ with the subspace topology and then gives the set of all real-valued continuous functions on $U$.

Next, we consider $\mathsf{Top}(-, \mathbb{R}) \circ G(r)$. $G(r)$ is the morphism $i^{\mathrm{op}}$ from $U$ to $V$ corresponding to the (continuous) inclusion function $i:V \to U$. Then, $\mathsf{Top}(-, \mathbb{R})(i^{\mathrm{op}}) = i^*: \mathsf{Top}(U, \mathbb{R}) \to \mathsf{Top}(V, \mathbb{R})$. Here, $i^*$ acts by precomposition, so it sends a continuous function $f:U \to \mathbb{R}$ to the function $f \circ i:V \to \mathbb{R}$. We note that $\mathsf{Top}(-, \mathbb{R}) \circ G(r) = i^*$ is indeed acting to restrict functions, as desired.

David Egolf (Mar 27 2024 at 18:52):

I've taken a look at this now! Thanks for chiming in! I went for the less direct approach in part because it felt like it made it easier for me to realize that there are topology things going on. (The "reductio ad obvious-um" is great :laughing:, but I wanted to try and work out the details this time!)

I think there's going to be a bit more topology involved in showing that $F$ also satisfies the sheaf condition... :upside_down:

Peva Blanchard (Mar 27 2024 at 19:09):

Btw, I find this format very nice. I read John's blog post series on topos not that long ago, but I never took the time to do the puzzles in detail. If you don't mind I'll probably join you on some puzzles, using the "spoiler" feature.

David Egolf (Mar 27 2024 at 19:18):

Julius Hamilton (Mar 28 2024 at 11:53):

Julius Hamilton (Mar 28 2024 at 11:55):

David Egolf (Mar 28 2024 at 16:44):

Sounds great! By the way, I think it's very much in the spirit of this topic to "think out loud" a bit on these exercises. So, if you feel like it, please feel free to share some thoughts on the exercise - whether you're stuck on it or whether you have completed it!

David Egolf (Mar 28 2024 at 16:45):

David Egolf (Mar 28 2024 at 16:56):

David Egolf (Mar 28 2024 at 17:03):

To give an analogy to imaging, we might think of each $s_i$ as a picture of some region in space. Then we would like to be able to "stitch together" a bunch of pictures that agree on their overlaps to get one picture of a larger area. Depending on what conditions we place on the pictures in question, this may or may not be possible! For example if we care about "plausibility" in some sense, note that we can't always stitch together plausible images of small areas to make a plausible image of some larger area.

David Egolf (Mar 28 2024 at 17:12):

Let us assume that we have selected some $U \in \mathcal{O}(\mathbb{R})$, a bunch of $U_i \in \mathcal{O}(\mathbb{R})$ so that $U = \cup_i U_i$, and for each $i$ an $s_i \in FU_i$ so that these selected continuous real-valued functions "agree on overlaps" in the sense mentioned above. We wish to show that these $s_i$ can be glued together to form a real-valued continuous function $s:U \to \mathbb{R}$ that restricts to $s_i$ on each $U_i$, and further that this resulting function is unique.

Now, a function $s:U \to \mathbb{R}$ is completely determined by the value that it takes at each point in $U$. Since $\cup_i U_i = U$, an arbitrary point $x$ in $U$ is in some $U_i$. And since $s|_{U_i} = s_i$ we have $s(x) = s_i(x)$. So, the value of $s$ is determined at each point once we pick $s_i$ for each $i$. So, if $s$ exists, it is certainly unique.

David Egolf (Mar 28 2024 at 17:15):

Let's define $s:U \to \mathbb{R}$ as follows. For $x \in U$, if $x \in U_i$, let $s(x) = s_i(x)$. This definition gives us a function, because if $x \in U_i$ and $x \in U_j$, then $s_i(x) = s_j(x)$. It remains to show that $s$ is continuous.
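The set-level part of this gluing (everything except continuity) can be sketched in Python. This is only an illustration under a simplifying assumption: each $s_i$ is modelled as a dict whose keys stand in for the points of $U_i$, and `glue` is a hypothetical helper name.

```python
def glue(sections):
    """Glue dict-valued sections that agree on overlaps.

    `sections` is a list of dicts; the keys of each dict play the role of U_i
    and the dict itself plays the role of s_i. Returns the glued dict s on the
    union of the domains, or raises ValueError if two sections disagree.
    """
    s = {}
    for s_i in sections:
        for x, value in s_i.items():
            # If x was already assigned, the new section must agree there.
            if x in s and s[x] != value:
                raise ValueError(f"sections disagree at {x!r}")
            s[x] = value
    return s

s_1 = {1: 0.5, 2: -1.0, 3: 2.0}
s_2 = {2: -1.0, 3: 2.0, 4: 7.0}
s = glue([s_1, s_2])
```

Restricting `s` back to the keys of `s_1` or `s_2` recovers the original sections. The sketch says nothing about continuity, which is the genuinely topological step.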

David Egolf (Mar 28 2024 at 17:21):

At this point, I recall the "gluing lemma" from topology (the quote below is from "Introduction to Topological Manifolds" by Lee):

David Egolf (Mar 28 2024 at 17:23):

This lemma assures us that $s$ is indeed continuous! I think that concludes this puzzle. (If I was working this out offline, I'd consider trying to prove the relevant part of this "gluing lemma". But to keep this topic a bit more focused, I'll not do that here).

Reid Barton (Mar 28 2024 at 17:28):

Here's another (possibly tricky) puzzle for you. When you proved that $F$ was a presheaf, you introduced another functor, $G$. Does this proof that $F$ is a sheaf have some formulation that involves $G$?

Reid Barton (Mar 28 2024 at 17:29):

(To make things a bit simpler, I suggest getting rid of the "op"s in the definition of $G$.)

John Baez (Mar 28 2024 at 18:16):

I would like to see a proof of the gluing lemma! To my mind this is the most interesting part of the whole puzzle. It's also not hard to prove. In fact, I never knew anyone had stated it formally as a "lemma" in some book.

The reason it's important is this: to show that $\mathbb{R}$-valued continuous functions form a sheaf, it turns out that the crucial step - the only step where something about continuity matters - is the step where you show that continuity is a "local" property.

That is, you can check if a function is continuous by running around your space, looking at each point, and asking "is the function continuous here?" Your function is continuous iff the answer is "yes" at each point.

Later I give an example of a property that's not local: namely, for an $\mathbb{R}$-valued function to be bounded is not local. So there's no sheaf of bounded $\mathbb{R}$-valued functions.

So, by outsourcing this "gluing lemma" to some textbook, I think you're missing out on an insight that this puzzle was designed to deliver.

Peva Blanchard (Mar 28 2024 at 18:16):

This gluing business is really what clicked for me when trying to understand sheaves. Continuity is a "local" property, so it goes well with gluing. I'll try to prove the gluing lemma.

spoiler

Let $s$ be the function as defined above. We only need to show that $s$ is continuous. Let $x \in U$, and $y = s(x)$. Let $V$ be a neighborhood of $y$. It suffices to show that the pre-image $W = s^{-1}(V)$ is a neighborhood of $x$. By construction, there exists $i$ such that $x \in U_i$ and $s_{|U_i} = s_i$, so $s(x) = s_i(x)$. We know that $s_i$ is continuous, so $W_i = s^{-1}_i(V)$ is a neighborhood of $x$ in $U_i$, and hence in $U$ because $U_i$ is open. Hence, $W \supseteq U_i \cap W \supseteq W_i$ is also a neighborhood of $x$.

To be complete, we should show that "being continuous at every point" entails "being continuous". The latter means that the pre-image of any open subset is an open subset. Let $V$ be an arbitrary open subset of $\mathbb{R}$, and let $U = s^{-1}(V)$. For each $x \in U$, $V$ is a neighborhood of $s(x)$. Because $s$ is continuous at $x$, we know that $U$ is a neighborhood of $x$. In other words, $U$ is a neighborhood of each of its elements. This implies that $U$ is an open subset.

David Egolf (Mar 28 2024 at 19:06):

Thanks for pointing that out, @John Baez ! (And @Reid Barton , thanks for suggesting another puzzle - I'll potentially take a look at it in a bit! It sounds intriguing.)

I was already a bit "spoiled" on the proof of the gluing lemma, when I looked it up in my book earlier. Lee uses what he calls the "Local Criterion for Continuity" to prove the lemma in the case where the $\{A_i\}$ form an open cover of $X$. Here is a statement of the "Local Criterion for Continuity" from Lee's book:

If we accept this, then we can use this to prove that our $s \in FU$ is continuous (we recall that $s:U \to \mathbb{R}$). Let us pick an arbitrary point $x$ from $U \subseteq \mathbb{R}$. We want to show that there is a neighborhood of $x$ on which the restriction of $s$ is continuous. Since $U = \cup_i U_i$, we know that $x \in U_i$ for some $i$, and since $U_i$ is open it is a neighborhood of $x$. The restriction of $s$ to $U_i$ is $s_i$, which we know is continuous. By the "Local Criterion for Continuity", we conclude that $s: U \to \mathbb{R}$ is continuous.

David Egolf (Mar 28 2024 at 19:08):

It might be good to take this a step further and prove the "Local Criterion for Continuity" described above. I'll probably give this a try in a bit!

John Baez (Mar 28 2024 at 22:23):

Yes, that "local criterion for continuity" is the key step. It's actually a wonderful fact: while continuity is defined in a somewhat "global" way for a function $f: X \to Y$, there turns out to be a concept of a function being "continuous at a point", such that $f$ is continuous iff it is continuous at each point $x \in X$.

If you know how to prove this in your sleep, then of course there's no need to prove it here! But otherwise, it's worth thinking about.

Julius Hamilton (Mar 29 2024 at 13:03):

That’s exactly what I want to do. I get self conscious that my amateur sloppiness feels like fluff. But I will. One sec.

Julius Hamilton (Mar 29 2024 at 13:20):

I live under time constraints (like all of us) so if it seems like I could easily get the answers to these questions just by reading more, I’m just trying to make it clear that I was encouraged by David Egolf to “think out loud” and it is helpful for me to be able to learn on-the-go like this. Thanks.

I took a real analysis class as an undergraduate but did not take a lot of other standard math classes. I never really studied topology.

The definition seems simple (I already looked it up). I want to make my thinking super rigorous which is why I am trying to formulate everything in Coq lately. That’s been fun but means I also need to learn more about Coq itself.

(This is meant to be on-topic, I’m saying I prefer to learn by expressing math in Coq).

Set is a fundamental keyword in Coq - I think a Type. I know there is the meta-type Sort as well. I am not clear in some regards how Coq treats types and sets differently. For example, I don’t know if Types have zero implication of their size, ie how many terms are assumed to exist in any given type.

I need to express “some specific element / term of type Set, but I don’t know which one (because it’s meant to be arbitrary)”. I think this is the Parameter keyword but not sure yet.

A topology is arguably of Coq Sort “Record”. I understand Record to be yet another of these mathematical ways of describing “collections” of things. A Record tries to have no commitment to any theories of mathematics, like Sets do. A record is a Type, but it can contain “multiple things”. So, a topology is:

A Record T
Consisting of a set X
And a set O of subsets of X
Such that the 3 topology conditions hold. They are:
The empty set and X are in O.
O is closed under arbitrary unions.
O is closed under finite intersections.
(I never took the time to think about why unions can be arbitrary but intersections have to be finite).
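As a sanity check on those three conditions, here is a rough Python sketch for the finite case only. It is not Coq and not a formalization: `is_topology` is a hypothetical helper, and subsets are modelled as frozensets.

```python
from itertools import combinations

def is_topology(X, opens):
    """Check the topology axioms for a finite set X and a collection of subsets."""
    X = frozenset(X)
    opens = {frozenset(U) for U in opens}
    # Every open set must be a subset of X.
    if not all(U <= X for U in opens):
        return False
    # The empty set and X itself must be open.
    if frozenset() not in opens or X not in opens:
        return False
    # On a finite collection, closure under pairwise unions and intersections
    # implies closure under all unions and all finite intersections.
    for U, V in combinations(opens, 2):
        if U | V not in opens or U & V not in opens:
            return False
    return True

good = is_topology({1, 2, 3}, [set(), {1}, {1, 2}, {1, 2, 3}])
bad = is_topology({1, 2, 3}, [set(), {1}, {2}, {1, 2, 3}])   # {1, 2} is missing
```

A finite example cannot show why intersections are restricted to finite ones. In $\mathbb{R}$, the open intervals $(-1/n, 1/n)$ intersect over all $n$ to $\{0\}$, which is not open.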

Actually, Topology should not be a record, it should use keyword Definition, since a Record is better for a single instance of an object? Not sure.

Record T : Type :=
| Parameter X : Set
| Parameter S : Set :=
(assume we just import a “power set function” until I have time to define one myself)
| (I haven’t learned how to state constraints on a type yet, but I need to state here the topology requirements. I guess it’s of type Prop, since they are Boolean. Something like Definition has_empty : S -> Prop := (assume we import a definition of empty set and set membership).)

There’s my thinking aloud before I head off to work. I wanted to work up to questions I had about presheafs. I was stuck on thinking about inclusion mappings.

Julius Hamilton (Mar 29 2024 at 13:21):

Baez’s post mentions that we need to reverse the direction of the arrows (O(X)^{op}) and I was trying to fully understand.

Julius Hamilton (Mar 29 2024 at 13:24):

I have actually been spending a lot of time thinking about what the real nature of being “functional” is. I know the common definition. But I want to see clearly why the properties of categories come from mappings being functional (sometimes).

An inclusion mapping is functional. It maps an element of a subset of S to the same element in the set S. How do you define that mathematically?

Julius Hamilton (Mar 29 2024 at 13:26):

I guess we can express “mapped to itself” with an expression like f: S1 -> S such that S1 \subset S and f(x) = x. This is allowed by the axiom of extensionality? Though they are elements in different sets, there is already an equality relation defined on the elements of the two respective sets.

Julius Hamilton (Mar 29 2024 at 13:26):

How can we reverse an inclusion mapping? It wouldn’t be functional. An inclusion mapping is injective but not surjective.

Julius Hamilton (Mar 29 2024 at 13:26):

David Egolf (Mar 29 2024 at 17:08):

Thanks for joining in @Julius Hamilton ! I don't know enough about Coq to understand your questions relating to it (hopefully someone else here does, though!).

I can talk a bit about how I understand $\mathcal{O}(X)$ and $\mathcal{O}(X)^{\mathrm{op}}$ though, in case that is helpful to you.

Julius Hamilton (Mar 29 2024 at 17:08):

David Egolf (Mar 29 2024 at 17:14):

In my understanding, $\mathcal{O}(X)$ is a category that we can create when we're given a topology on the set $X$. To create this category, we need to say what its objects and morphisms are.

As you mentioned above, any topology on $X$ has a collection of subsets of $X$ - the "open sets". The open sets of our topology are the objects of $\mathcal{O}(X)$.

And given two open sets $U$ and $V$ in our topology on $X$, we can ask this question: Is $U$ a subset of $V$? If the answer to this question is "yes!", then we put a morphism from $U$ to $V$. Otherwise, we don't put a morphism from $U$ to $V$.

David Egolf (Mar 29 2024 at 17:17):

Then $\mathcal{O}(X)^{\mathrm{op}}$ is another category: it's just like $\mathcal{O}(X)$ except we turn all the arrows around. So now we put a morphism from $U$ to $V$ exactly if $V \subseteq U$.
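Here is a small Python sketch of these two categories for a made-up four-element topology on $\{1, 2, 3\}$. The names `has_morphism` and `has_morphism_op` are illustrative only. Since there is at most one morphism between any two objects, a yes/no answer is all that needs to be stored.

```python
# Hypothetical topology on X = {1, 2, 3}: a chain of four open sets.
opens = [frozenset(), frozenset({1}), frozenset({1, 2}), frozenset({1, 2, 3})]

def has_morphism(U, V):
    """In O(X): there is a (unique) morphism U -> V exactly if U is a subset of V."""
    return U <= V

def has_morphism_op(U, V):
    """In O(X)^op: there is a (unique) morphism U -> V exactly if V is a subset of U."""
    return has_morphism(V, U)

# List every arrow (including identities) in each category.
arrows = [(U, V) for U in opens for V in opens if has_morphism(U, V)]
arrows_op = [(U, V) for U in opens for V in opens if has_morphism_op(U, V)]
```

The arrows of the opposite category are exactly the arrows of the original with source and target swapped, and no function between the sets is ever involved.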

David Egolf (Mar 29 2024 at 17:19):

I don't really think of these morphisms as functions. I think of them more like "yes!" answers to a yes/no question.

I'm not sure I was really addressing your point of confusion :sweat_smile:. Hopefully this is still somewhat helpful!

Julius Hamilton (Mar 29 2024 at 17:28):

I think I was confused regarding how simple a morphism can be. I’ll think more about that.

Julius Hamilton (Mar 29 2024 at 17:39):

s is some arbitrary set.
U is an open set in O(X).
Does it matter if it turned out that s was a set in O(X)? Since O(X) is surely a sub-category of Set?

I’ll assume an open cover of U is a union of sets in O(X) such that U \subset of the union.

Baez’s post said we need to flip the morphisms precisely so we can “restrict” the functor. So this is saying, “only those elements of s such that there exists an element x in U_i for which F(x) \in s”. ?

We can’t do this without flipping the morphisms? I’d like to think about how. (Thinking out loud :wink:)

David Egolf (Mar 29 2024 at 17:53):

The notation $s \in FU$ means that $s$ is an element of the set $FU$. For example, depending on what $F$ does, $s$ might be a continuous real-valued function with domain $U$.

John Baez (Mar 29 2024 at 17:56):

@Julius Hamilton - I think it's very helpful to focus on a specific example of a sheaf when trying to understand the definition of sheaf, and (unsurprisingly) I recommend the example David is talking about, where $FU$ is the set of continuous functions $f: U \to \mathbb{R}$.

Or, if "continuous" is distracting to you, think about the sheaf where $FU$ is the set of all functions $f: U \to \mathbb{R}$.

If you read all the sheaf axioms keeping some example like this in mind, they should make more sense.

John Baez (Mar 29 2024 at 18:05):

(The reason I picked a sheaf of continuous functions is because topos theory originated as a generalization of topology, as the name suggests - so ideas from topology can help explain what people are doing in topos theory. There are also other ways to get into topos theory, but my course notes - and the book they're based on - start with topology. Luckily you only need to know a small amount of topology.)

David Egolf (Mar 29 2024 at 18:53):

In the spirit of trying to better understand why we can detect continuity of a function in a local way, I'll now try to prove this:

I got stuck along the way. To get unstuck, I referenced Lee's book on topological manifolds. So, what I wrote below closely follows the proof in Lee's book.

If $f$ is continuous, then for any point $x \in X$, we have that $X$ is a neighborhood of $x$ on which the restriction of $f$ (which is just $f$) is continuous.

For the other direction, we assume that each point of $X$ has a neighborhood (an open set) on which the restriction of $f$ is continuous. We wish to show that $f: X \to Y$ must then be continuous. To show that $f$ is continuous, we consider an arbitrary open subset $V$ of $Y$. We wish to show that the preimage of $V$ under $f$ is open in $X$.

I'll call this preimage $f^*(V)$. Now, a set $A$ is open exactly if every point $x$ in $A$ is contained in an open subset $U_x \subseteq A$. Thinking of openness in this way seems likely helpful, as it involves a condition that can be checked at each point - and we are trying to understand why continuity involves a condition that can be checked at each point.

Now, we know that any $x \in f^*(V)$ has some neighborhood $U_x$ such that $f|_{U_x}: U_x \to Y$ is continuous. Since $f|_{U_x}$ is continuous and $V$ is open, $(f|_{U_x})^*(V)$ is also open in the subspace topology on $U_x$. Thus, it is the intersection of some open set $A$ (in the topology on $X$) with $U_x$. Since $U_x$ is open and $A$ is open, $(f|_{U_x})^*(V)$ is also open (in the topology on $X$).

We are hoping that $(f|_{U_x})^*(V)$ is an open subset of $f^*(V)$ that contains $x$. This would provide the neighborhood of "breathing room" about $x$ in $f^*(V)$ that we need for $f^*(V)$ to be open.

By definition, $(f|_{U_x})^*(V)$ is the subset of $U_x$ that maps into $V$ under $f$. So, its elements are exactly those that are: (1) in $U_x$ and (2) map to $V$ under $f$. Thus, $(f|_{U_x})^*(V) = U_x \cap f^*(V)$. Note that $x \in U_x \cap f^*(V)$. We also note that $(f|_{U_x})^*(V) = U_x \cap f^*(V)$ is a subset of $f^*(V)$.

We conclude that an arbitrary point of $f^*(V)$ has a neighborhood contained in $f^*(V)$, so $f^*(V)$ is open. Since $V$ was an arbitrary open subset of $Y$, $f$ is continuous.
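As a compact summary of this direction of the argument, it can be written as one chain of equalities, using the notation of this message with $U_x$ the chosen neighborhood of each $x \in f^*(V)$. This is only a condensed sketch of the steps already given:

```latex
f^*(V) \;=\; \bigcup_{x \in f^*(V)} \bigl( U_x \cap f^*(V) \bigr)
       \;=\; \bigcup_{x \in f^*(V)} (f|_{U_x})^*(V)
```

Each set in the union on the right is open in $X$, and a union of open sets is open.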

Julius Hamilton (Mar 29 2024 at 19:02):

A presheaf $F$ is a functor from $O(X)^{op}$ to $\mathbf{Set}$. (Just restating definitions to exercise myself).

Baez says that we can see why we would want to take the opposite category of $O(X)$ if we think of one possible presheaf $F$ sending each $U \in Ob(O(X))$ to that set in $\mathbf{Set}$ that contains all real-valued functions over $U$. If $X$ is the real numbers, let’s consider a common topology over the reals. (Which is?)

Topologies are a way to express geometric concepts. Why are they so fundamentally defined in terms of “open sets”?

Perhaps it has to do with continuity and limits. Maybe it allows us to define the epsilon-delta condition without recourse to a distance metric?

Julius Hamilton (Mar 29 2024 at 19:06):

Eric M Downes (Mar 29 2024 at 19:29):

I'm sure there are more sophisticated ways of thinking about this, but that is how I approach it.

David Egolf (Mar 29 2024 at 19:58):

When we say (part of) a diagram "commutes", we mean that two different sequences of morphisms compose to the same morphism. If a category is a poset, it has at most one morphism from $A$ to $B$, for any objects $A$ and $B$. Therefore, if I have two different sequences of morphisms from $A$ to $B$, when I compose the morphisms in each sequence there's only one possible morphism for me to get as a result!

Consequently, these two sequences of morphisms must compose to the same morphism. And hence the corresponding (part of a) diagram must commute.

Julius Hamilton (Mar 29 2024 at 22:00):

That is such a beautifully simple explanation. You have a knack for clear simple understanding.

Julius Hamilton (Mar 29 2024 at 22:19):

I’d never thought of that before. I’m curious to know what those elements might be called in an abstract algebra setting. I’ve been thinking about how a one object category is a monoid, and if there is a corresponding abstract algebraic structure for a “multi-object category”. The thing is, not all the arrows (elements of the “structure”) compose with one another. All algebraic structures I know of are defined by closure and by “totality”.

I think your point is that all the arrows in a thin category are generators. Now I have to think about how categories can have non-generator arrows.
I think the most basic way of expressing the compositional requirement of the arrows in a category is, “if they can compose, they do.” “If they are composable, then compose.”

Julius Hamilton (Mar 30 2024 at 00:31):

Every diagram in a poset commutes.
Functors preserve commutative diagrams (why)?
In $O(X)^{op}$, the morphisms essentially say “contains”.
If we think of $F$ as mapping a set to the set of functions defined on that set, we can think of the “contains” morphism in $O(X)^{op}$ as corresponding, under $F$, to a restriction of some function to some subset of its domain.
Baez basically just asks us to show that if you restrict some $s \in FU$ (which can be thought of as a function) to an open subset $U_i \subseteq U$, and restrict it to some other open subset $U_j \subseteq U$, you can further restrict both of those restricted functions to $U_i \cap U_j$, and they are the same. Which David showed.

Julius Hamilton (Mar 30 2024 at 00:40):

In order to show it is a presheaf, I think we have to show $X$ has a natural topology whose open sets form a thin category, and then that $F$ fulfills the functor axioms (if we reverse the direction of the arrows). I’ve been trying to tell myself “a functor is a morphism in the category of categories” as a single idea to remind myself of the definition. I think the important thing is if two arrows $f, g$ in $C$ compose, then arrows $Ff, Fg$ must compose in such a way that $Fg \circ Ff = F(g \circ f)$. Which basically says, “when you map the morphisms over, you can take the composition before, or you can take it after”.

Julius Hamilton (Mar 30 2024 at 00:42):

David Egolf (Mar 30 2024 at 02:35):

I don't have energy right now to respond in detail to your comments, @Julius Hamilton. But I did notice that above you asked "why?", regarding the fact that functors send commutative diagrams to commutative diagrams. You might find it a helpful exercise to pick a particular (simple) commutative diagram, and then aim to show that applying a functor $F$ to that diagram yields a commutative diagram.

Julius Hamilton (Mar 30 2024 at 02:35):

David Tanzer (Mar 30 2024 at 04:47):

To prove it, need to first formalize what it means for a diagram to be commutative.

David Tanzer (Mar 30 2024 at 06:16):

As an aside, in an alternative definition, one could take preserving diagram commutativity as a defining characteristic of a functor; then recover $T(f \circ g) = T(f) \circ T(g)$ by applying this to a simple diagram. Not as efficient as a technical definition, but seems conceptually useful.
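In the usual direction, here is a sketch of how the composition rule $T(f \circ g) = T(f) \circ T(g)$ gives commutativity of the image of a commutative triangle and of a commutative square. The morphism names $f, g, h, k$ are just illustrative labels:

```latex
h = g \circ f \;\implies\; T(h) = T(g \circ f) = T(g) \circ T(f)

g \circ f = k \circ h \;\implies\; T(g) \circ T(f) = T(g \circ f) = T(k \circ h) = T(k) \circ T(h)
```

Longer paths work the same way, by applying the rule repeatedly along each path.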

David Egolf (Mar 30 2024 at 16:14):

Reid Barton said:
Before moving on to the next puzzle, I want to think about this for a bit:

The words "possibly tricky" strike some fear into my heart, but this sounds fun to think about - so I'll see what I can figure out...

David Egolf (Mar 30 2024 at 16:22):

David Egolf (Mar 30 2024 at 21:53):

Imagine we have two open sets $U_1$ and $U_2$ (in the standard topology on $\mathbb{R}$) with $U = U_1 \cup U_2$. For $\mathsf{Top}(-, \mathbb{R}) \circ G$ to be a sheaf, I think we need to be able to construct a unique real-valued continuous function $s:GU \to \mathbb{R}$ from a pair of real-valued continuous functions $s_1:GU_1 \to \mathbb{R}$ and $s_2:GU_2 \to \mathbb{R}$, provided that $s_1$ and $s_2$ agree on $GU_1 \cap GU_2$. [Or should they agree on $G(U_1 \cap U_2)$? Still figuring this out...]

We also want to be able to go the other way: given the $s:GU \to \mathbb{R}$ constructed from $s_1$ and $s_2$ above, we want to be able to recover $s_1$ and $s_2$ from $s$ by appropriately restricting $s$.

When we have two batches of information that we want to be equivalent, that makes me start to think of limits or colimits!

David Egolf (Mar 30 2024 at 22:05):

I'm pretty sure what I just wrote above isn't quite right. Or, at least, I'm quite confused about it.

But here's a picture illustrating the very rough idea I have in mind, that I'm still working to clearly spell out:
picture

David Egolf (Mar 30 2024 at 22:07):

Very roughly, I'm starting to wonder if $G$ should do something like "preserving pullbacks", if $\mathsf{Top}(-, \mathbb{R}) \circ G$ is to be a sheaf. But it will take me some thinking to express this idea more clearly!

John Baez (Mar 30 2024 at 22:16):

John Baez (Mar 30 2024 at 22:24):

I go into this in a bit more detail in Part 3 of my course. Just two short paragraphs. But you seem to be enjoying discovering this stuff on your own, which is really better.

John Baez (Mar 30 2024 at 22:27):

If you try to develop a subject on your own, the way you're doing, it can become much easier to understand what people are doing when you read the 'official' treatment.

John Baez (Mar 30 2024 at 22:36):

I'm losing track of who said what where, but I think I saw someone derive this "local criterion for continuity" from something more basic, which might be called the "local criterion for openness":

A subset $S$ of a topological space is open $\iff$ each point $x \in S$ is contained in some open set $O_x$ contained in $S$.

This is amusingly easy to prove. For the $\implies$ direction just take $O_x = S$. For the $\Leftarrow$ direction just note $S = \bigcup_{x \in S} O_x$ and use the fact that a union of open sets is open.
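For a finite toy example, this criterion can be checked by brute force. The sketch below assumes a made-up topology on $\{1, 2, 3\}$, with `is_open_locally` as a hypothetical helper name, and compares the pointwise check against direct membership in the list of open sets.

```python
from itertools import chain, combinations

# Hypothetical topology on X = {1, 2, 3}.
X = frozenset({1, 2, 3})
opens = [frozenset(), frozenset({1}), frozenset({1, 2}), frozenset({1, 2, 3})]

def is_open_locally(S, opens):
    """True if every point x of S lies in some open set O_x contained in S."""
    S = frozenset(S)
    return all(any(x in O and O <= S for O in opens) for x in S)

# Every subset of X, as frozensets.
subsets = [frozenset(c) for c in chain.from_iterable(
    combinations(sorted(X), r) for r in range(len(X) + 1))]

# For each subset, does the pointwise check agree with "S is in the topology"?
agreement = {S: is_open_locally(S, opens) == (S in opens) for S in subsets}
```

The two notions agree on all eight subsets here. That agreement relies on `opens` being closed under unions, which is exactly the fact used in the $\Leftarrow$ direction.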

John Baez (Mar 30 2024 at 22:38):

So we can say openness is a 'local' condition: to see if a set is open, you can run around checking some condition about all its points, and the set is open iff this condition holds for all its points.

And this implies that continuity is also a local condition: to see if a function is continuous, you can run around checking some condition at all points of its domain, and the function is continuous iff this condition holds for all those points.

David Egolf (Mar 30 2024 at 22:40):

Yes, I made implicit use of this "local criterion for openness" above! Before doing so, I had never noticed this connection between continuity being "locally detectable" and openness being "locally detectable"! Cool stuff! :smile:

John Baez (Mar 30 2024 at 22:40):

And there's a "for all" in both of these "local criteria". I haven't thought about it hard, but I bet this is connected to the fact that the sheaf condition can be stated in terms of limits. (A "for all" is a limit, and the pullback you were looking at is also a limit.)

John Baez (Mar 30 2024 at 22:43):

I guess for now the main moral is that sheaves are all about "locally detectable" properties.

Peva Blanchard (Mar 30 2024 at 22:46):

It is as if it was something that is trivially parallelizable: I imagine a (possibly uncountable) set of agents that would check for each point

x

whether the property holds "around" that point, and, most importantly, they don't need to communicate/synchronize. And the agents are indistinguishable (each of them runs the same check procedure).

John Baez (Mar 30 2024 at 23:00):

That sounds right! It's a nice thought. I can try to make it slightly more precise. There's an agent for each point. Each agent runs the same check procedure, which can be checked in an arbitrarily small neighborhood of their point. Then at the end we report the answer "true" if and only if they all get the answer "true".

John Baez (Mar 30 2024 at 23:02):

Important properties of functions

f : \mathbb{R} \to \mathbb{R}

like continuity, differentiability, smoothness, analyticity, upper and lower semicontinuity, and measurability all work like this.

John Baez (Mar 30 2024 at 23:02):

Later in my posts I talk about the idea of a 'germ', which is connected to this stuff.

Peva Blanchard (Mar 30 2024 at 23:22):

Oh yes, I remember now the formal definition of 'germ', but it's the first time that I get an inverse "tree-like" mental picture of it. Here's what I have in mind.

Initially, we have one agent that checks whether the property

P

holds over the open set

X

. The agent can spawn (possibly uncountably many) new agents, that are clones of their genitor, each of them being responsible for an open subset of

X

. The parent agent reports true iff all its children report true. And the process continues like this.

This is a very poor algorithm: depending on the topological space, this could take a transfinite number of steps and a transfinite number of agents.

What I find amusing is that the opposite category of open subsets of

X

somehow describes this big clone-spawning branching process. The points of

X

are exactly the (infinitely long) branches.

Peva Blanchard (Mar 30 2024 at 23:32):

ps: To be more precise, I should force agents to merge when they work on the same open subset.

David Egolf (Mar 31 2024 at 00:10):

The idea of "local agents that can work in parallel" reminds me a whole lot of an ultrasound reconstruction technique I know of, where the reconstruction at each point can be computed in isolation of the reconstruction at different points. But this is not quite analogous because the agents in this case would report a number, not just a "yes!" or "no!" regarding whether some property holds.

David Egolf (Mar 31 2024 at 00:11):

One could also consider checking a reconstructed image point by point, and at each point asking if the reconstruction at that point is "plausible" (in some sense) given the observations. (This would probably involve assessing whether the observed data "relevant to this point" is similar enough to what we'd expect if our reconstruction at this point reflects the true object).

However, just because a reconstructed image agrees (in some sense) with the observed data at each point does not imply that the entire reconstructed image agrees with the observed data. So, in this example, "reconstruction plausibility" is not "locally detectable".

David Egolf (Mar 31 2024 at 00:16):

However, starting next week, I'm hoping to move on to the next puzzle in the blog post, which is this:

Peva Blanchard (Mar 31 2024 at 00:44):

I'll cheat a bit (because I remember later posts from John's series). I think these Boolean agents actually are the truth values of the topos of sheaves over

X

. When the root agent reports "yes!", then the property holds everywhere. When one of its children does, then the property holds on the associated subset. I think we can think of this "collection of boolean agents over

X

" as a specific sheaf.

Now regarding agents that would report numbers. I think we should remember that the parent agent aggregates the results of its children. In the "yes/no" case, the aggregation is simply a big conjunction. If, on the other hand, "point"-children report numbers, then the parent can aggregate just by making a tuple out of them, indexed by their location. In other words, each agent really reports a real-valued function, and the parent aggregation process amounts to gluing the functions reported by its children, i.e., taking a categorical limit.
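The two kinds of aggregation described here can be sketched in Python. This is only an illustration under simplifying assumptions: sections are modelled as dictionaries from sample points to values, and `aggregate_truth` and `glue` are hypothetical names, not anything from the blog posts.

```python
def aggregate_truth(reports):
    """Boolean agents: the parent reports True iff every child does."""
    return all(reports)


def glue(sections):
    """Function-valued agents: merge the children's sections into one.

    Each section is a dict {point: value}. Gluing only makes sense when
    the sections agree wherever their domains overlap.
    """
    glued = {}
    for section in sections:
        for point, value in section.items():
            if point in glued and glued[point] != value:
                raise ValueError("sections disagree on an overlap")
            glued[point] = value
    return glued


print(aggregate_truth([True, True, False]))
print(glue([{0: 1.0, 1: 2.0}, {1: 2.0, 2: 3.0}]))
```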

I will stop there, as otherwise, it's just going to be another burrito tutorial.

John Baez (Mar 31 2024 at 01:44):

Good stuff, Peva! I'm too tired to think hard about what you said, so I'll just report the slogan "topos theory is the study of local truth".

Madeleine Birchfield (Mar 31 2024 at 15:16):

Do presheaf categories still have subobject classifiers if Set does not have a subobject classifier because one is a constructive predicativist?

Morgan Rogers (he/him) (Mar 31 2024 at 15:23):

Eric M Downes (Mar 31 2024 at 18:13):

"Generators" is most common for such elements in most contexts; famously the symmetric group

S_n

can be generated by just two maps

m\mapsto m+1\pmod{n}

and

(12)

. What is the operation under which a permutation group specifically can be said to be closed?

A family of subsets

\mathcal{F}(X)

can generate a topology

\mathrm{cl}_{\cup,\cap}(\mathcal{F})

. The topology is the closure under arbitrary unions and finite intersections. Is a topology a category?

There is such a thing as a delooping; simplest context is a finite commutative monoid, in which every element is an object. Draw an arrow

x\to y

just when

\exists z;~y=zx

(Green's relations). Take your favorite finite commutative monoid (without inverses if you want to deal with fewer arrows), how few arrows can you specify, such that asserting closure under composition of arrows fills in the rest of the Cayley table?

The above arrow drawing requires associativity of elements. "Magmas" are the non-associative binops. For a familiar structure that is closed in an elemental sense but not closed in another very meaningful sense, consider the rock-paper-scissors magma

\begin{array}{c|ccc}& r&p&s\\ \hline r & r & p & r\\ p & p & p & s\\ s & r & s & s\end{array}

This binary operator is not associative. You can rephrase the associativity condition as a kind of closure (or lack thereof) under a certain familiar operation. What is it? How many elements must the closed structure have?
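The non-associativity claim can be checked by brute force. A small Python sketch, with the table transcribed from the array above (row element first, column element second); `mul` and `associativity_failures` are names chosen here for illustration.

```python
# TABLE[a][b] is the entry in row a, column b of the rock-paper-scissors table.
TABLE = {
    "r": {"r": "r", "p": "p", "s": "r"},
    "p": {"r": "p", "p": "p", "s": "s"},
    "s": {"r": "r", "p": "s", "s": "s"},
}


def mul(a, b):
    """The binary operation of the magma."""
    return TABLE[a][b]


def associativity_failures():
    """All triples (a, b, c) with (a*b)*c != a*(b*c)."""
    elems = "rps"
    return [(a, b, c) for a in elems for b in elems for c in elems
            if mul(mul(a, b), c) != mul(a, mul(b, c))]


print(associativity_failures())
```

For instance, (r·p)·s = p·s = s, while r·(p·s) = r·s = r.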

Eric M Downes (Mar 31 2024 at 18:36):

I think your point is that all the arrows in a thin category are generators. Now I have to think about how categories can have non-generator arrows.

No, you can have non-generator arrows in a thin category. Consider a poset

\big(\{x,y,z\},\leq\big)

and there are two "generator" arrows

x\leq y, y\leq z

what third arrow must also be present?

Eric M Downes (Mar 31 2024 at 18:44):

(And, having answered that question, and recalling there is at most one arrow between any two objects in a thin category, you should understand why all diagrams* in a thin category commute.)

David Egolf (Apr 01 2024 at 16:29):

We start by showing

F

is a functor

F: {\mathcal{O}(\mathbb{R})}^{\mathrm{op}} \to \mathsf{Set}

, which means it is a presheaf. On objects,

F

sends an open set

U \subseteq \mathbb{R}

to the set

FU

of bounded continuous real-valued functions on

U

. Note: to determine if a function from

U

is continuous, we need to put a topology on

U

. To talk about continuity, we equip

U

with the subspace topology it inherits from

\mathbb{R}

On morphisms,

F

sends a morphism

r:A \to B

to the corresponding restriction function, which sends a bounded continuous real-valued function

f:A \to \mathbb{R}

to

f \circ i:B \to \mathbb{R}

, where

i: B \to A

is the inclusion map. We saw earlier that inclusion maps like

i

are continuous, and therefore the restriction of a continuous function is continuous. Further, restricting a bounded function yields a bounded function.

For each object

U

in

{\mathcal{O}(\mathbb{R})}^{\mathrm{op}}

,

F

sends the identity morphism

1_U: U \to U

to the identity function on

FU

. This is because restricting a function to its own domain leaves the function unchanged.

If we have the situation

r \circ r' = r''

in

{\mathcal{O}(\mathbb{R})}^{\mathrm{op}}

, then we have

F(r) \circ F(r') = F(r'')

. That is because restricting a function to some domain in two steps yields the same result as restricting it to that domain all at once.

We conclude that

F

is a functor

F: {\mathcal{O}(\mathbb{R})}^{\mathrm{op}} \to \mathsf{Set}

and hence a presheaf.

David Egolf (Apr 01 2024 at 16:34):

Next, we show that

F

is a separated presheaf but not a sheaf. If

F

was a sheaf, we'd always be able to do the following:

If this

s

always exists and is unique, then

F

is a sheaf. If

s

doesn't always exist, but is unique when it exists, then

F

is a separated presheaf.

David Egolf (Apr 01 2024 at 16:40):

In this puzzle, if

s

exists it is unique. For any

x \in U

, since

U = \cup_i U_i

, we have that

x \in U_i

for some

i

. Then, since

s|_{U_i} = s_i

, we have that

s(x) = s_i(x)

. So, the value of

s

at every point is fixed (if it exists) once we pick all our

s_i

But

s

doesn't always exist! That's because if you glue together an infinite number of bounded real-valued continuous functions that agree on overlaps, you don't always get a bounded function! Intuitively, if you run around and check that each little bit of a function is locally bounded, you can't conclude that the whole thing is bounded.

David Egolf (Apr 01 2024 at 16:56):

There is quite a bit of discussion before the next puzzle! So, I'll try to introduce the next puzzle a little.

To my understanding, part of the goal of the next puzzle is to work towards a notion of morphism between categories of sheaves. And since each category of sheaves is an "elementary topos", this is relevant for thinking about morphisms between elementary topoi.

David Egolf (Apr 01 2024 at 17:03):

And why do we care about morphisms between topoi? Here are a couple possible reasons:

David Egolf (Apr 01 2024 at 17:06):

Here,

f_\ast F

is defined as:

(f_\ast F)(V) = F(f^{-1} V)

for each open subset

V

of a topological space

Y

. Note that

f: X \to Y

is a continuous function and

F

is a presheaf on

X

. We also have

f^{-1} V = \{x \in X :\; f(x) \in V \} \subseteq X

. Note that

f^{-1} V

is open because

V

is open and

f

is continuous.

Roughly, our goal here is to make a presheaf on

Y

given a continuous function

f:X \to Y

and a presheaf

F

on

X

Peva Blanchard (Apr 01 2024 at 21:59):

Consider the topological space

X = (0,1]

, the half-open unit interval, with the induced topology from

\mathbb{R}

. For all

n \in \mathbb{N}

, let

U_n = \left(\frac{1}{1 + n}, 1\right]

and

f_n(x) = \frac{1}{x}

on

U_n

. The

U_n

cover

X

, i.e.,

X = \bigcup_n U_n

, and each

f_n

is bounded. Moreover, for any

n \leq m

, we have

U_n \cap U_m = U_n

and

f_n

and

f_m

obviously match on

U_n

. If the presheaf

\mathcal{F}

of bounded functions were a sheaf, there would exist a bounded function

f

defined on

X

such that

f_{|U_n} = f_n

. In particular, for all

n

Peva Blanchard (Apr 01 2024 at 22:11):

There are variants of this idea: presheaf of Lipschitz functions (take

\sqrt{x}

or

\ln x

in the example above) (edit:

\frac{1}{x}

works as well), presheaf of functions with bounded derivatives of order

k

(just integrate

k

times the previous examples).

David Egolf (Apr 01 2024 at 22:22):

The idea is to consider the identity function

f: \mathbb{R} \to \mathbb{R}

that sends

x

to

x

. Then, we can get each

s_i

by restricting

f

to, say,

U_i=(i, i+2)

. Each

s_i

is then bounded, and the collection of

s_i

agrees on overlaps, but when we try to glue together all the

s_i

, our resulting function isn't bounded anymore.
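A rough numerical illustration of this example, in Python. The helper names (`interval`, `bound_on`, `bound_of_glued`) are invented for this sketch, and only finitely many of the U_i are ever inspected, so it shows the trend but is not a proof.

```python
def interval(i):
    """The open interval U_i = (i, i + 2)."""
    return (i, i + 2)


def bound_on(i):
    """Supremum of |x| over U_i, a bound for the identity restricted to U_i."""
    a, b = interval(i)
    return max(abs(a), abs(b))


def bound_of_glued(n):
    """Smallest bound that works on every U_i with -n <= i <= n at once."""
    return max(bound_on(i) for i in range(-n, n + 1))


# Each piece has a finite bound, but the bound needed for the glued
# function keeps growing as more pieces are included.
print([bound_of_glued(n) for n in (1, 10, 100)])
```

No single number bounds all the pieces simultaneously, which is why the glued function fails to be a section of the presheaf of bounded functions.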

Peva Blanchard (Apr 01 2024 at 22:29):

oh yes, nice! The common pattern to get a "non-sheafy presheaf" is to start from an invalid candidate defined globally such that the restrictions of this candidate to specific open subsets satisfy a condition.

Each invalid candidate has a sort of singularity at some point (e.g., 0 in my example, and

+\infty

in yours), and then we just restrict to open subsets that avoid "just enough" this point.

Peva Blanchard (Apr 01 2024 at 22:50):

Mmh, these examples do not work any more if

X

is compact. Compact means that from any covering we can extract a finite cover of

X

. If

X

is compact, is the presheaf of bounded functions on

X

a sheaf? It seems so.

David Egolf (Apr 01 2024 at 22:59):

If

X

is compact, couldn't it still have a non-compact open subset

U

? And then we could maybe set up an unbounded function

f

defined on

U

to show that we don't have a sheaf. (Restrict

f

to a bunch of

U_i

where

\cup_i U_i = U

, and where

f|_{U_i}

is bounded for each

i

. Then these glue together to

f

, which is unbounded. So then, we can't always glue together a bunch of compatible

FU_i

to make an element of

FU

.)

Peva Blanchard (Apr 01 2024 at 23:05):

~~I'm not sure that a compact set can contain a non-compact open subset. (I'm thinking about the closed unit interval $[0,1]$ ).~~ (edit: oh my brain ...)

David Egolf (Apr 01 2024 at 23:07):

My initial thought was that something like

(0.4,0.6)

would be a non-compact open subset of the topological space

[0,1]

. But I am a bit shaky on compactness, so maybe I'm just confused. (I'd need to review this stuff!)

Peva Blanchard (Apr 01 2024 at 23:09):

Peva Blanchard (Apr 01 2024 at 23:11):

I focused on the total space, but yes we can reproduce the example on any open subset.

Peva Blanchard (Apr 01 2024 at 23:16):

Now, I'm looking for a topological space

X

such that the presheaf of bounded functions is indeed a sheaf. From our discussion, it suffices that any open subset of

X

be compact, right? I'm wondering what kind of space is that.

Peva Blanchard (Apr 01 2024 at 23:17):

One example: any set

X

equipped with the trivial topology (

\emptyset

and

X

are the only open sets). Topologically, such a space behaves like a space with one point.

Peva Blanchard (Apr 01 2024 at 23:41):

Peva Blanchard (Apr 01 2024 at 23:42):

In other words: any set

X

with a topology admitting a finite number of open sets.

David Egolf (Apr 01 2024 at 23:45):

I wonder if there are any examples where

X

has an infinite number of open sets.
(Interesting stuff! I need to take a break for today - I have to manage my energy carefully - but of course please feel free to keep posting here.)

Peva Blanchard (Apr 01 2024 at 23:50):

Sure! I'll probably continue under another topic, so as not to divert the purpose of yours.

John Baez (Apr 01 2024 at 23:56):

Digression: here's another example of a separated presheaf that's not a sheaf, which I just thought of. Take

\mathbb{N}

with its usual topology, where all subsets are open, and let

F

be the presheaf where

F(U)

for any

U \subset \mathbb{N}

is the set of computable partial functions

f: \mathbb{N} \to \mathbb{N}

whose domain includes

U

John Baez (Apr 01 2024 at 23:58):

So, simply put,

F(U)

consists of all partially defined functions

f

from the natural numbers to the natural numbers such that you can write a computer program which halts and spits out

f(n)

when

n \in U

John Baez (Apr 01 2024 at 23:58):

John Baez (Apr 02 2024 at 00:00):

But for

U = \mathbb{N}

there are lots of functions from

U

to

\mathbb{N}

that aren't computable.

Peva Blanchard (Apr 02 2024 at 00:37):

Mmh, it's not as easy to come up with an explicit example (i.e., a witness of the non-sheafiness of

F

).

Peva Blanchard (Apr 02 2024 at 00:43):

Define

s_i : U_i \rightarrow \mathbb{N}

as the function that outputs

1

if the

i

-th Turing machine halts, and

0

otherwise. All the

s_i

's agree on the intersections (the

U_i

's are disjoint).

If

F

were a sheaf, then there would be a total computable function on

\mathbb{N}

that would solve Turing's halting problem, whence a contradiction.

Peva Blanchard (Apr 02 2024 at 00:46):

It's a bit weird, I had to convince myself that

s_i

does belong to

F(U_i)

(still not 100% sure). It seems trivially true since

U_i

is finite. The strangeness comes from the fact that I am invoking the halting problem's oracle to define

s_i

David Egolf (Apr 02 2024 at 17:01):

This sounds cool! But I'm having a hard time wrapping my mind around it. Is the idea that we need a single program that takes in any

n

and then produces the corresponding

f(n)

? Or are we allowed to have different programs, say one for each

n

, to calculate

f(n)

I'm guessing it's the first - we need a single program that can handle any

n

. Then if

f:U \to \mathbb{N}

and

U

is a singleton

\{n\}

, given

f(n)

we can write a very short program that outputs

f(n)

given

n

: just output

f(n)

Then if

U

is finite, and we know

f(n)

for each

n \in U

, we can still write a single program that will run in a finite amount of time, and that outputs

f(n)

given

n

. We can just create a bunch of if/then statements that check to see if the given input value corresponds to the output value

f(n)

as

n

varies. Since

U

is finite, there will be a finite number of if/then statements that run, and so the run-time will be finite.
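The finite case can be sketched directly in Python. `make_lookup_program` is a hypothetical name, and the dictionary passed in stands for the already-known values f(n) for n in U; the loop plays the role of the finite chain of if/then statements.

```python
def make_lookup_program(values):
    """Build one program for a function known on a finite set U.

    `values` maps each n in U to f(n). The returned function performs at
    most len(values) comparisons, so it always halts.
    """
    table = dict(values)

    def program(n):
        for key, output in table.items():  # finitely many if/then checks
            if n == key:
                return output
        raise ValueError("input is outside U")

    return program


f_on_U = make_lookup_program({2: 7, 5: 0, 11: 3})
print(f_on_U(5))
```

The program text grows with the size of U, which is the obstruction once U is infinite.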

David Egolf (Apr 02 2024 at 17:05):

But if

U

is infinite, and there's no clever trick to figure out the values of

f

quickly, then I was going to say that the approach I outlined above wouldn't always run in a finite amount of time, because we'd need an infinite number of if/then statements. But, for any finite

n

, I think we'd only have to run a finite number of if/then statements to look up the appropriate value for

f(n)

. So it seems like the runtime would be finite for any input

n

David Egolf (Apr 02 2024 at 17:08):

Maybe the problem with the program I outlined above (in the case where

U

is infinite) is that it would need to be infinite in length (even though its runtime for any input would always be finite). That doesn't sound like a legitimate "computer program"!

John Baez (Apr 02 2024 at 17:28):

You need one program that takes in any

n \in U

and halts after printing out

f(n)

The other option, one program for each

n

, would say that every function

f: U \to \mathbb{N}

is computable. For any

n, m \in \mathbb{N}

you can write a program which prints

m

when you input

n

John Baez (Apr 02 2024 at 17:29):

I can take

\mathbb{N}

and cover it with singletons. The restriction of any function

f: \mathbb{N} \to \mathbb{N}

to any singleton is computable, even if

f

is not computable.

John Baez (Apr 02 2024 at 17:31):

In simple rough terms: we're failing to get a sheaf because you can't always glue together infinitely many programs into one program.

Or even more tersely: computability of functions

f: \mathbb{N} \to \mathbb{N}

is not a local property.

David Tanzer (Apr 02 2024 at 21:49):

David Tanzer (Apr 02 2024 at 21:57):

i.e., generally can't glue compatible local constant functions into a global constant function

John Baez (Apr 02 2024 at 23:09):

Nice! We were talking about examples of separated presheaves that are not sheaves, and the presheaf of constant functions is actually a separated presheaf.

Reminder: a presheaf is a sheaf if given sections

s_\alpha

on open sets

U_\alpha

covering

U

which agree when restricted to the overlaps

U_\alpha \cap U_\beta

, there exists a unique section

s

on

U

that restricts to each of the

s_\alpha

. If we have uniqueness but perhaps not existence, then our presheaf is called separated. As David showed a while back, the presheaf of bounded real-valued functions on a space is separated but usually not a sheaf.

John Baez (Apr 02 2024 at 23:11):

Btw, one reason this concept is important is that there's a trick called 'sheafification' that turns a presheaf into a sheaf. One way to do it involves doing a certain maneuver twice. The first pass turns the presheaf into a separated presheaf, and the second pass turns it into a sheaf! It's kind of amazing.

John Baez (Apr 02 2024 at 23:21):

It's probably too technical to get into now, but in case anyone cares, this maneuver is called the "plus construction", and you can read about it on the nLab.

David Tanzer (Apr 03 2024 at 07:34):

Cool, thanks for the reminder about the separated aspect. All these examples put a good spotlight on the glueing condition. Now that we've solidly established the definition of a sheaf, which feels rather substantive, I will somewhat naively now ask: what are a couple of cool things that we can do with sheaves, in at least a semi-applied sense? I'm sure there are many; just fishing around here for some favorites.

David Tanzer (Apr 03 2024 at 07:55):

p.s. I know that we're headed towards the topos side of town; in this question I'm fishing around for some good immediate / semi-concrete applications. For example, they're somehow going to give us insight into the structure of manifolds? Or stuff in computer science, ...

David Tanzer (Apr 03 2024 at 07:59):

(If this would go beyond a few high level points, it could be spun off into a separate topic)

Peva Blanchard (Apr 03 2024 at 08:40):

David Egolf (Apr 03 2024 at 16:19):

David Egolf (Apr 03 2024 at 16:22):

Here,

f_\ast F

is defined as:

(f_\ast F)(V) = F(f^{-1} V)

for each open subset

V

of a topological space

Y

. Note that

f: X \to Y

is a continuous function and

F:{\mathcal{O}(X)}^{\mathrm{op}} \to \mathsf{Set}

is a presheaf on

X

We also have

f^{-1} V = \{x \in X :\; f(x) \in V \} \subseteq X

. Note that

f^{-1} V

is open because

V

is open and

f

is continuous.

Roughly, our goal here is to make a presheaf on

Y

given a continuous function

f:X \to Y

and a presheaf

F:{\mathcal{O}(X)}^{\mathrm{op}} \to \mathsf{Set}

on

X

David Egolf (Apr 03 2024 at 16:23):

I wonder if a continuous function

f: X \to Y

induces a functor

f': {\mathcal{O}(Y)}^{\mathrm{op}} \to {\mathcal{O}(X)}^{\mathrm{op}}

. If it does, then we could form

f_\ast F

as

F \circ f': {\mathcal{O}(Y)}^{\mathrm{op}} \to \mathsf{Set}

David Egolf (Apr 03 2024 at 16:29):

Let's see. If

f: X \to Y

is a continuous function, let's try to define a functor

f': {\mathcal{O}(Y)}^{\mathrm{op}} \to {\mathcal{O}(X)}^{\mathrm{op}}

as follows:

David Egolf (Apr 03 2024 at 16:32):

Our proposed functor

f': {\mathcal{O}(Y)}^{\mathrm{op}} \to {\mathcal{O}(X)}^{\mathrm{op}}

automatically respects composition, because all diagrams commute in a poset. And if

1_V

is the identity morphism for

V \in {\mathcal{O}(Y)}^{\mathrm{op}}

, then this gets mapped to the identity morphism on

f^{-1}(V)

, as desired.

It seems that a continuous function

f: X \to Y

does in fact induce a functor

f': {\mathcal{O}(Y)}^{\mathrm{op}} \to {\mathcal{O}(X)}^{\mathrm{op}}

David Egolf (Apr 03 2024 at 16:35):

If that is true, then I think that

f_* F

is just

F \circ f':{\mathcal{O}(Y)}^{\mathrm{op}} \to \mathsf{Set}

. For an open set

V \in {\mathcal{O}(Y)}^{\mathrm{op}}

, it spits out

F(f'(V)) = F(f^{-1}(V))

, which is what

f_*F

is supposed to do. And it is indeed a functor, because composing two functors yields a functor.
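On finite sets the object part of this composite is easy to write down. The Python sketch below makes several assumptions purely for illustration: the spaces are finite (think of them as discrete, so continuity is automatic), the map f is given as a dict, and the example presheaf `bit_functions` sends U to the set of all functions U → {0, 1}. Only the action on objects is modelled.

```python
from itertools import product


def preimage(f, V, X):
    """f^{-1}(V) = {x in X : f(x) in V}, for f given as a dict on X."""
    return frozenset(x for x in X if f[x] in V)


def bit_functions(U):
    """Example presheaf on objects: all functions U -> {0, 1}."""
    points = sorted(U)
    return {tuple(zip(points, bits))
            for bits in product((0, 1), repeat=len(points))}


def pushforward(F, f, X):
    """f_*F on objects: V |-> F(f^{-1}(V))."""
    return lambda V: F(preimage(f, V, X))


X = frozenset({0, 1, 2})
f = {0: "a", 1: "a", 2: "b"}  # a map X -> Y with Y = {"a", "b"}
f_star_F = pushforward(bit_functions, f, X)
print(len(f_star_F(frozenset({"a"}))))  # sections over f^{-1}({"a"}) = {0, 1}
```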

John Baez (Apr 03 2024 at 16:42):

My own favorite applications of sheaves are the ones that made people invent sheaves in the first place - applications to algebraic geometry and topology. I don't know how deeply we want to get into those here. But it's not surprising that some of the most exciting applications of a concept are the ones that made people take the trouble to develop it in the first place!

Briefly, since a bounded analytic function must be constant, there are no everywhere defined analytic functions on the Riemann sphere except constants - all the interesting ones have poles. This issue affects all of complex analysis and algebraic geometry. This puts pressure on us to either accept 'partially defined' functions as full-fledged mathematical objects or work with sheaves of functions, e.g. work with lots of different open sets

U

in the Riemann sphere and let

F(U)

be the set of analytic functions everywhere defined on

U

Mathematicians took the second course, because partially defined functions where you haven't specified the domain of definition are a pain to work with. So nowadays all of algebraic geometry (subsuming chunks of complex analysis, and much much more) is founded on sheaves. In this subject one can do a lot of amazing things with sheaves. Later on these tricks expanded to algebraic topology. And this is how a typical math grad student (like me) is likely to encounter sheaves.

Needless to say, I'm happy to get into more detail about what we actually do with sheaves. But it's quite extensive: the proof of Fermat's Last Theorem and pretty much all the other big results in algebraic geometry rely heavily on sheaves.

John Baez (Apr 03 2024 at 16:47):

This is near the start of a series of over a hundred videos that works through the proof of Fermat's Last Theorem step by step.

John Baez (Apr 03 2024 at 16:53):

But this list of prerequisites is very intimidating. Sheaves have a lot of exciting applications in pure math that are infinitely easier to explain.

David Tanzer (Apr 03 2024 at 19:14):

David Egolf (Apr 04 2024 at 16:51):

In the previous puzzle, we showed that the "direct image" of a presheaf

F

on

X

is a presheaf

f_\ast F

on

Y

. As a first step in showing that this gives us a functor, we still need to figure out how our direct image functor

D: \widehat{\mathcal{O}(X)} \to \widehat{\mathcal{O}(Y)}

acts on morphisms between presheaves (which are natural transformations).

For my easy reference, I'll note that

\widehat{\mathcal{O}(X)} = [{\mathcal{O}(X)}^{\mathrm{op}}, \mathsf{Set}]

and

\widehat{\mathcal{O}(Y)} = [{\mathcal{O}(Y)}^{\mathrm{op}}, \mathsf{Set}]

David Egolf (Apr 04 2024 at 16:56):

This one is going to take me some thought. I don't have any intuition for natural transformations between presheaves yet. I think what I'll do to start with is draw a naturality square describing part of a natural transformation between two presheaves. Hopefully that will help me find some intuition!

David Egolf (Apr 04 2024 at 17:10):

If

U' \subseteq U

, we have a (unique) morphism from

U

to

U'

in

{\mathcal{O}(X)}^{\mathrm{op}}

. Let

F,G: {\mathcal{O}(X)}^{\mathrm{op}} \to \mathsf{Set}

be presheaves on

X

. Then, to have a natural transformation

\alpha: F \to G

, we need this square to commute for all pairs

(U, U')

where

U

and

U'

are open sets of

X

such that

U' \subseteq U

David Egolf (Apr 04 2024 at 17:16):

Intuitively, for any

x \in FU

, the natural transformation component

\alpha_U: FU \to GU

tells us how to view that

F

-data on

U

as some

G

-data on

U

. Further, this process needs to respect restriction.

So we can expect there to be a natural transformation, for example, from the presheaf of bounded and continuous functions on

X

to the presheaf of continuous functions on

X

. In this case, each

\alpha_U

is an inclusion function.

But I wouldn't expect there to be a natural transformation from the presheaf of continuous functions on

X

to the presheaf of continuous and bounded functions on

X

. That's because I can't think of a nice way of converting any continuous function to a corresponding continuous and bounded function.

Peva Blanchard (Apr 04 2024 at 17:18):

I am not so sure about the latter. If

GU

is the singleton set consisting of the zero function on

U

, then we can define

\alpha_{U}

as the unique map that sends every element of

FU

to zero.

John Baez (Apr 04 2024 at 17:21):

Puzzle. Is there a natural transformation from the presheaf of continuous functions to the presheaf of continuous and bounded functions that sends some functions to non-constant functions?

Peva Blanchard (Apr 04 2024 at 17:29):

John Baez (Apr 04 2024 at 17:38):

I don't think that's "almost" the answer. I think it's exactly the answer! If there are any non-constant continuous functions on your space, your natural transformation will convert all continuous functions to bounded continuous functions, and send some to non-constant continuous functions.

David Egolf (Apr 04 2024 at 17:50):

I think this doesn't work though. That's because a restriction of an unbounded function might be bounded. If that happens, then the naturality square doesn't commute if one tries to follow the approach I described above.

David Egolf (Apr 04 2024 at 17:54):

Peeking at @Peva Blanchard 's answer... Huh, I did not expect arctan to show up! I guess its virtue is that it takes in any input, and squishes it down to a fixed finite range. Further, it does this without sending two inputs to the same output. And you can "squish" a function down and then restrict it, or you can restrict it first and then squish it down, and you'll get the same answer. So our naturality square will commute!

John Baez (Apr 04 2024 at 17:54):

Anything defined using cases is going to have trouble being natural. It sometimes works - but when I try to do something natural, I avoid methods that involve different cases, because the spirit of naturality is to do something that works uniformly for all cases.

John Baez (Apr 04 2024 at 17:56):

What Peva did is postcompose with a bounded continuous function; this turns any continuous function into a bounded continuous function. People who take real analysis use arctan as their go-to guy for this purpose, because this is also 1-1, so postcomposing with it doesn't lose any information, but they could equally well use tanh or lots of other things.

John Baez (Apr 04 2024 at 17:56):

David Egolf (Apr 04 2024 at 17:58):

That's actually a really cool point! If

f: X \to Y

is any continuous function, and

g: Y \to \mathbb{R}

is continuous and bounded, then

g \circ f

is also continuous and bounded! And if this post-composition doesn't lose information (which I think corresponds to

g

being a monomorphism), then we've managed to produce a continuous bounded function that "still remembers" the original unbounded function that it came from!
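The "squish then restrict equals restrict then squish" observation can be checked on sampled data. In this Python sketch a section is modelled, as a simplifying assumption, by a dict from finitely many sample points to values; `restrict` and `squish` are hypothetical helper names.

```python
import math


def restrict(section, subset):
    """Restrict a section (a dict from points to values) to a subset."""
    return {x: section[x] for x in subset}


def squish(section):
    """Postcompose with arctan, landing in the bounded range (-pi/2, pi/2)."""
    return {x: math.atan(value) for x, value in section.items()}


# An unbounded-looking function 1/x sampled near 0, and a smaller domain.
s = {x: 1.0 / x for x in (0.001, 0.01, 0.1, 1.0)}
smaller = (0.001, 0.1)

# Both ways around the naturality square give the same result.
print(squish(restrict(s, smaller)) == restrict(squish(s), smaller))
```

Because arctan is applied value by value, it never needs to know which open set the section lives on, and that is what makes the square commute.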

Peva Blanchard (Apr 04 2024 at 17:59):

This way of "applying the same procedure pointwise". I think it relates to the way one checks the sheaf condition.

Peva Blanchard (Apr 04 2024 at 18:02):

John Baez (Apr 04 2024 at 18:03):

To get a natural transformation between presheaves what you do to sections needs to be "local": you can restrict a section to a smaller open set and do the operation, or do the operation and then restrict, and these need to agree.

But notice that there are other local operations: for example differentiation gives a map from the presheaf of smooth real-valued functions on the real line to itself.

John Baez (Apr 04 2024 at 18:04):

(I'm saying "presheaf" a lot here. Each time I could have said "sheaf", but I don't think I'm using the sheaf condition in what I'm saying.)

John Baez (Apr 04 2024 at 18:05):

(I should add that a map between sheaves is just defined to be a map between presheaves that happen to be sheaves.)

Peva Blanchard (Apr 04 2024 at 18:09):

I have the feeling that, in the case of sheaves, a natural transformation is uniquely determined by what it does "pointwise" (more precisely, on the germs).

Peva Blanchard (Apr 04 2024 at 18:14):

Something like: the set of natural transformations from an

A

-valued sheaf

F

to a

B

-valued sheaf

G

is equivalently described by a sheaf with values in

Set(A, B)

(the functions from

A

to

B

).

John Baez (Apr 04 2024 at 18:14):

Analysts would never say differentiation is done "pointwise" - so yes, I think the correct word should be something like "germwise".

John Baez (Apr 04 2024 at 18:15):

This could become a theorem once we (= David) officially study germs; then we could show a map between sheaves is determined by what it does to germs.

David Egolf (Apr 05 2024 at 17:13):

Building on the discussion above, I think I can now start to work out how we can get a natural transformation between two direct image functors

f_*F: \mathcal{O}(Y)^{\mathrm{op}} \to \mathsf{Set}

and

f_*G: \mathcal{O}(Y)^{\mathrm{op}} \to \mathsf{Set}

. If we have a natural transformation

\alpha: F \to G

, then for each open subset

V

of

Y

, we need to figure out how to compute

f_*G

-data on

V

from

f_*F

-data on

V

. This will serve as the

V

-th component of a natural transformation from

f_*F

to

f_*G

.

So, let's assume we have the two sets

f_*F(V)

and

f_*G(V)

. We're looking for a function

\beta_V: f_*F(V) \to f_*G(V)

. Let

x \in f_*F(V)

. Our goal is to figure out

\beta_V(x)

Since

x \in f_*F(V)

, we have

x \in F(f^{-1}(V))

. From this, we need to get some element

\beta_V(x) \in f_*G(V) = G(f^{-1}(V))

. To do this, we can use our natural transformation

\alpha: F \to G

. Since

\alpha_{f^{-1}V}: F(f^{-1}(V)) \to G(f^{-1}(V))

, we can just provide

x

to this function and get out

\beta_V(x)

We've arrived at the following idea: Let our direct image functor

D:\widehat{\mathcal{O}(X)} \to \widehat{\mathcal{O}(Y)}

send a natural transformation

\alpha: F \to G

to the natural transformation

D(\alpha):f_*F \to f_*G

having

V

-th component

D(\alpha)_V = \alpha_{f^{-1}(V)}
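To make this concrete, here is a toy sketch on finite sets, assuming sections are integer-valued functions stored as dicts and taking "parity, pointwise" as a stand-in natural transformation. All names here (`preimage`, `restrict`, `alpha`, `D_alpha`) are hypothetical.

```python
def preimage(f, V, X):
    """Inverse image f^{-1}(V) of V under f: X -> Y, with f given as a dict."""
    return frozenset(x for x in X if f[x] in V)

def restrict(section, U):
    """Restrict a section (a dict point -> value) to the subset U."""
    return {x: v for x, v in section.items() if x in U}

def alpha(U, section):
    """Component alpha_U of a toy natural transformation: parity, pointwise."""
    return {x: v % 2 for x, v in section.items()}

def D_alpha(f, X, V, section):
    """Component D(alpha)_V, defined to be alpha_{f^{-1}(V)}."""
    return alpha(preimage(f, V, X), section)

X = frozenset({1, 2, 3, 4})
f = {1: "a", 2: "a", 3: "b", 4: "c"}    # a map X -> Y = {"a", "b", "c"}
V = frozenset({"a", "b"})
W = frozenset({"a"})                     # W is a subset of V

s = {1: 5, 2: 8, 3: 7}                   # an element of f_*F(V) = F(f^{-1}(V))

# The two ways around the naturality square for W inside V.
restrict_then_apply = D_alpha(f, X, W, restrict(s, preimage(f, W, X)))
apply_then_restrict = restrict(D_alpha(f, X, V, s), preimage(f, W, X))
```

In this example both routes give the same section over the inverse image of `W`, which is the kind of commuting square we want from a natural transformation.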

David Egolf (Apr 05 2024 at 17:29):

Next, let's check that

D(\alpha): f_*F \to f_*G

really is a natural transformation. Evaluating these functors at some morphism

U \to V

in

{\mathcal{O}(Y)}^{\mathrm{op}}

, we get this square:

And this square diagram commutes, because

\alpha

is a natural transformation. We conclude that any naturality square for

D(\alpha)

commutes, and hence

D(\alpha)

is a natural transformation. So,

D

is sending natural transformations to natural transformations, as it should.

David Egolf (Apr 05 2024 at 17:51):

It only remains to show that

D:\widehat{\mathcal{O}(X)} \to \widehat{\mathcal{O}(Y)}

is a functor!

First, we need to check that

D(1_F) = 1_{D(F)}

for any identity morphism

1_F: F \to F

in

\widehat{\mathcal{O}(X)}

. By our definition of

D

, we have

D(1_F)_V = (1_F)_{f^{-1}(V)}

. Since each component of

1_F

is an identity function,

(1_F)_{f^{-1}(V)}

is the identity function from

F(f^{-1}(V))

to

F(f^{-1}(V))

. So, we see that

D(1_F)

is the identity natural transformation from

f_*F

to itself. (Indeed, the identity natural transformation from

f_*F

to itself has as its

V

-th component the identity function on

F(f^{-1}V)

.)

David Egolf (Apr 05 2024 at 18:01):

Lastly, we need to check that

D

respects composition. That is, we need to show that

D(\alpha \circ \beta) = D(\alpha) \circ D(\beta)

for two composable morphisms

\alpha, \beta

. To show that two natural transformations are equal, it suffices to show that each of their components are equal. So, we wish to show that

D(\alpha \circ \beta)_V = D(\alpha)_V \circ D(\beta)_V

, for any

V \in \mathcal{O}(Y)^{\mathrm{op}}

By definition of

D

, we have

D(\alpha \circ \beta)_V = (\alpha \circ \beta)_{f^{-1}V}

. We also have

D(\alpha)_V = \alpha_{f^{-1}V}

and

D(\beta)_V = \beta_{f^{-1}V}

. So,

D(\alpha)_V \circ D(\beta)_V = \alpha_{f^{-1}V} \circ \beta_{f^{-1}V}

. By definition of vertical composition of natural transformations, we have that

\alpha_{f^{-1}V} \circ \beta_{f^{-1}V} = (\alpha \circ \beta)_{f^{-1}V}

We conclude that

D

respects composition! And now we can conclude that taking direct images using a continuous function

f: X \to Y

yields a functor

D:\widehat{\mathcal{O}(X)} \to \widehat{\mathcal{O}(Y)}

David Egolf (Apr 05 2024 at 18:04):

Whew, that felt like a lot. I suppose this sort of thing gets quicker with practice! But I wonder if there is a faster (more abstract?) way to work this out, as well.

David Tanzer (Apr 05 2024 at 18:47):

It would be cool if there were a higher level / more systematic way of proving such things. A proof assistant? I haven't used them. But that wouldn't seem to help with the basic understanding. It's hard to see a way around needing to unpack definitions and verify them in detail. I appreciate the clarity, detail and completeness of your posts here!

Peva Blanchard (Apr 05 2024 at 21:30):

I think there is a higher level way of doing this, but, at some point, we still need to work out the details.

We have a continuous function

f : X \rightarrow Y

. This function

f

is equivalently described as a functor

f^{\star} :O(Y) \rightarrow O(X)

, viewing the poset of open sets as a category.

of the functor "taking the opposite category", and the functor "hom-ing into Set".

It remains to show that

H f^{\star}

is indeed the direct image functor. (I'll skip that part for now)

Peva Blanchard (Apr 05 2024 at 21:39):

The tedious details are still present: I haven't proved that the functor "hom-ing into

Set

" is well-defined and a functor. This is, I think, proven exactly as @David Egolf did.

John Baez (Apr 05 2024 at 22:36):

For what it's worth, it doesn't feel like a lot to me. I think if this were part of a book it would be less than a page. Some of the work is coming up with the ideas: that's the fun part. But a lot of the work in writing these arguments is just formatting things in LaTeX. I'm very glad you're doing it, because you're helping other people. But it's less work on paper.

Once you do this kind of argument for a few years, the standard moves become so ingrained that they're almost automatic... except when they're not, meaning that some brand new move is required.

John Baez (Apr 05 2024 at 22:43):

I'm the opposite: what I really want is some software that will go out to dinner and talk to my friends so I can stay home and prove theorems.

John Baez (Apr 05 2024 at 23:10):

As for "systematic", I think @David Egolf's approach to this question was perfectly systematic. To prove P implies Q, he expanded out P using definitions to get a short list of things to check, and then checked each of these using Q, which he expanded out just enough to get this done.

John Baez (Apr 05 2024 at 23:14):

Category theory is full of proofs like this; many mathematicians look down on it because it's not tricky enough, but to me that's a virtue. The main hard part is keeping track of nested layers of structure involved... and a main reason for doing lots of proofs like this is to get good at keeping a lot of structures in mind.

John Baez (Apr 05 2024 at 23:27):

The number theorist Serge Lang has an exercise in his book Algebra that goes like this:

John Baez (Apr 05 2024 at 23:30):

He is of course joking to some extent, and definitely showing off. But the hard part in homological algebra - or other kinds of category theory - is developing an intuition for the structures involved so you can guess what's true. The proofs of theorems are often easy in comparison.

David Egolf (Apr 07 2024 at 15:41):

We have now arrived at the final puzzle in the first blog post! For context, recall that

f:X \to Y

is a continuous function, that

F

is a presheaf on

X

, and

f_*F

is the corresponding direct image presheaf on

Y

. Here's the puzzle:

We saw above that

f_*F

is a presheaf. So, we only need to check that we can "glue together" things appropriately: if we start with a bunch of

s_i \in f_*F(V_i)

that agree on overlaps, so that

{s_i}|_{V_i \cap V_j}={s_j}|_{V_i \cap V_j}

for all

i

and

j

, then there is always a unique

s \in f_*F(V)

that restricts to

s_i

on

V_i

for each

i

. Here, each

V_i

is an open subset of

Y

and

V = \cup_i V_i

David Egolf (Apr 07 2024 at 16:32):

Let's start out with a bunch of

s_i \in f_*F(V_i)

, which agree on overlaps and where

\cup_i V_i = V

. We want to show there is a unique

s \in f_*F(V) = F(f^{-1}V)

that restricts to each

s_i

on

V_i

Note that

s_i \in F(f^{-1}V_i)

for each

i

, by definition of

f_*F

. I want to use the fact that

F

is a sheaf to glue together these

s_i

to get some

s \in F(f^{-1}V) = f_*F(V)

David Egolf (Apr 07 2024 at 16:43):

First, let's show that

\cup_i f^{-1}V_i = f^{-1}V

, making use of the fact that

\cup_i V_i = V

Let

x \in f^{-1}V

. That means that there is some

y \in V

so that

f(x) = y

. Since

V = \cup_i V_i

, that means that

y \in V_i

for some

i

. Thus,

x \in f^{-1}V_i

for some

i

. Hence,

x \in \cup_i f^{-1}V_i

. We conclude that

f^{-1}V \subseteq \cup_i f^{-1}V_i

Next, let

x \in \cup_i f^{-1} V_i

. That means that there is some

i

so that

x \in f^{-1}V_i

. Thus, there is some

y \in V_i

so that

f(x)=y

. Since

\cup_i V_i = V

, we know that

V_i \subseteq V

. Hence

f(x) \in V

and thus

x \in f^{-1} V

. Therefore,

\cup_i f^{-1} V_i \subseteq f^{-1} V

David Egolf (Apr 07 2024 at 17:34):

The next order of business is to talk about "agreeing on overlaps". With respect to

f_*F

, we know that

{s_i}|_{V_i \cap V_j} = {s_j}|_{V_i \cap V_j}

for any

i

and

j

. A particular element

s_i

of

f_*F(V_i)

is restricted to

V_i \cap V_j

as follows: note that

s_i

is an element of

F(f^{-1}V_i)

and then restrict it (using the fact that

F

is a presheaf, and so provides a notion of restriction) using

F

to an element of

F(f^{-1}(V_i \cap V_j)) = f_*F(V_i \cap V_j)

So, if

{s_i}|_{V_i \cap V_j} = {s_j}|_{V_i \cap V_j}

with respect to

f_*F

, then this means that restricting

s_i

(using

F

) from an element of

F(f^{-1}V_i)

to an element of

F(f^{-1}(V_i \cap V_j))

yields the same result as restricting

s_j

(using

F

) from an element of

F(f^{-1}V_j)

to an element of

F(f^{-1}(V_i \cap V_j))

Now, we'd like to show that if

s_i \in f_*F(V_i)

and

s_j \in f_*F(V_j)

agree on overlaps with respect to

f_*F

, then they agree on overlaps with respect to

F

, where we view

s_i

as an element of

F(f^{-1}V_i)

and

s_j

as an element of

F(f^{-1}V_j)

. To show they agree on overlaps with respect to

F

, we need to show that restricting

s_i

from an element of

F(f^{-1}V_i)

to an element of

F(f^{-1}V_i \cap f^{-1}V_j)

yields the same result as restricting

s_j

from an element of

F(f^{-1}V_j)

to an element of

F(f^{-1}V_i \cap f^{-1}V_j)

By the above discussion, this follows provided that

F(f^{-1}V_i \cap f^{-1}V_j) = F(f^{-1}(V_i \cap V_j))

David Egolf (Apr 07 2024 at 17:37):

We now show that

F(f^{-1}V_i \cap f^{-1}V_j) = F(f^{-1}(V_i \cap V_j))

. It suffices to show that

f^{-1}V_i \cap f^{-1}V_j = f^{-1}(V_i \cap V_j)

Let

x \in f^{-1}(V_i \cap V_j)

. That means there is some

y \in V_i \cap V_j

so that

f(x) = y

. Since

y \in V_i \cap V_j

, we know that

y \in V_i

and

y \in V_j

. Hence,

x \in f^{-1}V_i

and

x \in f^{-1}V_j

. Thus,

x \in f^{-1}V_i \cap f^{-1}V_j

, and so

f^{-1}(V_i \cap V_j) \subseteq f^{-1}V_i \cap f^{-1}V_j

Next, let

x \in f^{-1}V_i \cap f^{-1}V_j

. That means that

f(x) \in V_i

and

f(x) \in V_j

. Hence

f(x) \in V_i \cap V_j

and therefore

x \in f^{-1}(V_i \cap V_j)

. Therefore,

f^{-1}V_i \cap f^{-1}V_j \subseteq f^{-1}(V_i \cap V_j)

We conclude that

f^{-1}(V_i \cap V_j) = f^{-1}V_i \cap f^{-1}V_j

, and so

F(f^{-1}V_i \cap f^{-1}V_j) = F(f^{-1}(V_i \cap V_j))

David Egolf (Apr 07 2024 at 17:40):

From the above discussion, if

\cup_i V_i = V

(where each

V_i

is an open subset of

Y

) and we have a bunch of

s_i \in f_*F(V_i)

(as

i

varies) that agree on overlaps with respect to

f_*F

, then we have:

Since

F

is a sheaf, there is then a unique

s \in F(f^{-1}V) = f_*F(V)

that restricts (using

F

) to

s_i

on each

f^{-1}V_i

. We hope that this

s

restricts to each

s_i

on

V_i

with respect to

f_*F

. This is true, because restricting

s

to an element of

f_*F(V_i)

with respect to

f_*F

is done by restricting

s

to an element of

F(f^{-1}V_i)

with respect to

F

, and we know this yields

s_i

(by definition of

s

).

David Egolf (Apr 07 2024 at 17:46):

So, I think we have managed to show there is at least one "gluing together" of our

s_i \in f_*F(V_i)

to get a

s \in f_*F(V)

that restricts to

s_i

on each

V_i

. It remains to show that there is only one way to do this, so that

s

is the unique "gluing" of our

s_i

David Egolf (Apr 07 2024 at 17:54):

Let's imagine we've got some

s' \in f_*F(V)

that restricts (with respect to

f_*F

) to

s_i

on

V_i

, for each

i

. That means it restricts from

s' \in F(f^{-1}V)

to

s_i \in F(f^{-1}V_i)

(with respect to

F

) for each

i

. That is,

s'

is a valid "gluing" of all the

s_i \in F(f^{-1}V_i)

. Since

F

is a sheaf, there is only one such gluing, namely

s

. Therefore,

s'=s

Consequently, there is exactly one way to "glue together" our

s_i \in f_*F(V_i)

to get a

s \in f_*F(V)

. We conclude that

f_*F

is indeed a sheaf!
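As a sanity check, here is a finite toy version of the gluing argument, assuming the sheaf on the source is "all integer-valued functions" with sections stored as dicts. The helper names are hypothetical, and `glue` just takes the union of compatible dicts.

```python
def preimage(f, V):
    """Inverse image of V under f, where f is a dict whose keys form the domain."""
    return frozenset(x for x in f if f[x] in V)

def restrict(section, U):
    """Restrict a section (a dict point -> value) to the subset U."""
    return {x: v for x, v in section.items() if x in U}

def glue(sections):
    """Glue sections that agree on overlaps into one section on the union."""
    glued = {}
    for s in sections:
        for x, v in s.items():
            if x in glued and glued[x] != v:
                raise ValueError("sections disagree on an overlap")
            glued[x] = v
    return glued

f = {1: "a", 2: "a", 3: "b", 4: "c"}     # a map X -> Y
V1 = frozenset({"a", "b"})
V2 = frozenset({"b", "c"})

s1 = {1: 10, 2: 20, 3: 30}               # element of f_*F(V1) = F(f^{-1}(V1))
s2 = {3: 30, 4: 40}                      # element of f_*F(V2) = F(f^{-1}(V2))

s = glue([s1, s2])                       # element of F(f^{-1}(V1 u V2))
```

Restricting `s` to the inverse image of `V1` or `V2` gives back `s1` or `s2`, and the domain of `s` is exactly the inverse image of the union.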

John Baez (Apr 07 2024 at 18:11):

John Baez (Apr 07 2024 at 18:13):

The only change I might make is to pull out a few facts as "lemmas", since they don't really involve presheaves per se: they are properties of the inverse image

f^{-1}(V)

of a subset

V \subset Y

along a function

f: X \to Y

Of course the inverse image of an open subset along a continuous map is open, but these properties are even more fundamental: they work for any subset and any map.

John Baez (Apr 07 2024 at 18:16):

Just for good measure, let

{}^c

denote the operation of taking the complement of a subset.

John Baez (Apr 07 2024 at 18:18):

I think you used Lemma 1 only in the case of the intersection of two subsets. You used the general case of Lemma 2. And you didn't need Lemma 3 at all.

John Baez (Apr 07 2024 at 18:21):

Subsets of a set form a [[complete boolean algebra]] - this jargon is a way of capturing all the rules that govern intersections, unions and complements in classical logic. Lemmas 1-3 say that

f^{-1}

is a morphism of complete boolean algebras!
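For anyone who likes to test such things by brute force, here is a sketch that checks the binary versions of these three properties (intersections, unions, complements) for one small map that is neither injective nor surjective. The names are made up for illustration, and of course this only checks one finite example.

```python
from itertools import combinations

def preimage(f, V):
    """Inverse image of V under f, where f is a dict whose keys form the domain."""
    return frozenset(x for x in f if f[x] in V)

def subsets(S):
    """All subsets of the finite set S."""
    items = list(S)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]

f = {1: "a", 2: "a", 3: "b", 4: "c"}     # not injective: 1 and 2 both go to "a"
X = frozenset(f)
Y = frozenset({"a", "b", "c", "d"})      # not surjective: nothing maps to "d"

preserves_intersections = all(
    preimage(f, A & B) == preimage(f, A) & preimage(f, B)
    for A in subsets(Y) for B in subsets(Y))

preserves_unions = all(
    preimage(f, A | B) == preimage(f, A) | preimage(f, B)
    for A in subsets(Y) for B in subsets(Y))

preserves_complements = all(
    preimage(f, Y - A) == X - preimage(f, A)
    for A in subsets(Y))
```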

John Baez (Apr 07 2024 at 18:26):

If someone out there has never thought about this, it's worth comparing the 'image' operation. The image of a subset

V \subseteq X

under a function

f: X \to Y

is defined by

f(V) = \{ f(x) \mid x \in V \}

The image operation does not obey analogues of all three of Lemmas 1-3. I.e. it doesn't preserve all unions, intersections and complements.

John Baez (Apr 07 2024 at 18:26):

Moral: inverse image is 'better' than image. This is one reason it's nice that we use inverse image, not image, to define continuity.

John Baez (Apr 07 2024 at 18:27):

All these simple thoughts will get refined more and more as one digs deeper into topos theory.

David Egolf (Apr 07 2024 at 18:41):

That is very cool! I'll plan to think a bit more about this, as well as the puzzle you gave relating to the image operation.

John Baez (Apr 07 2024 at 19:27):

Great! I wish someone had told me - way back when I was a youth - that 'inverse image' is better behaved than 'image', and also had explained why. Back then inverse image seemed like a more sneaky concept than image, in part because of its name. So it seemed a bit weird that it was used in the definition of continuity. Of course from another viewpoint this makes perfect sense: this gives a definition of continuity of maps between metric spaces that matches the

\epsilon-\delta

definition! But it's much more satisfying to understand the fundamental role of inverse images.

David Egolf (Apr 08 2024 at 18:46):

I've reviewed some things relating to Boolean algebras, and I think this is making sense!

One interesting point jumped out to me. When I think about morphisms "preserving the structure", I usually think of equations like this one:

g(x \cup y) = g(x) \cup g(y)

. That is, the binary operation

\cup

only gets applied once on each side of the equation. This is in contrast to something like

f^{-1}(\cup_i V_i) = \cup_i f^{-1}(V_i)

, where potentially we are taking the union of an infinite number of sets on each side of the equation.

I assume that the idea is to "preserve equations". If we know that we are mapping between two complete Boolean algebras, then arbitrary (small) meets and joins always exist in both the source and target Boolean algebras. So then for any collection of

V_i

in some complete Boolean algebra,

\lor_i V_i

always exists - call it

V

. Consequently, we can always write this kind of equation

\lor_i V_i = V

for any collection of elements

V_i

. Asking this equation to be preserved under

g

would mean we'd want

\lor_i g(V_i) = g(V) = g(\lor_i V_i)

I guess the moral of the story is this: if we have fancier equations that hold in all the structures of interest, we'll get fancier corresponding requirements for a structure-preserving map.

David Egolf (Apr 08 2024 at 18:52):

For

f^{-1}

to be a morphism of complete Boolean algebras, there's another condition we'll want it to meet. It's simple, but I thought it might be nice to note explicitly. In particular, for

f: X \to Y

and

f^{-1}: \mathcal{P}(Y) \to \mathcal{P}(X)

, we'll want

U \subseteq V

in

\mathcal{P}(Y)

to imply that

f^{-1}(U) \subseteq f^{-1}(V)

in

\mathcal{P}(X)

To see that this holds, let

x \in f^{-1}(U)

. That means that

f(x) \in U

. Since

U \subseteq V

, that means

f(x) \in V

and hence

x \in f^{-1}(V)

. Therefore,

f^{-1}(U) \subseteq f^{-1}(V)

, as desired.

Peva Blanchard (Apr 08 2024 at 19:03):

I think the idea of "preserving equations" is right. It can also be rephrased as "preserving limits/colimits". When seeing a complete boolean algebra as a category, then the join (resp. meet) of an arbitrary collection of elements is literally the colimit (resp. limit) of this collection.

David Egolf (Apr 08 2024 at 19:25):

I now want to think about the image operator

I: \mathcal{P}(X) \to \mathcal{P}(Y)

for a function

f: X \to Y

. We have

I(U) = \{f(x) | x \in U\}

for any

U \in \mathcal{P}(X)

. Let's see in which ways

I

fails to be a morphism between the complete Boolean algebras

\mathcal{P}(X)

and

\mathcal{P}(Y)

First, notice that

I(X)

is not necessarily

Y

. So the biggest ("top") element is not always mapped to the top element! (This is because

f

is not always surjective). However,

I

does map the empty set to the empty set, and it preserves arbitrary unions. Also, if

U \subseteq U'

, then

I(U) \subseteq I(U')

I

doesn't preserve intersections in general. For example, if

U

and

U'

are disjoint non-empty subsets of

X

, but

I(U) = I(U')

, then

I(U \cap U') = I(\emptyset) = \emptyset

but

I(U) \cap I(U') = I(U)

is not empty. This sort of thing can happen when

f

isn't injective.

I

also doesn't preserve complements in general. That is, if

U^c = V

, then we don't necessarily have that

I(U)^c = I(V) = I(U^c)

. For example, let

U = X

. Then

I(X^c) = I(\emptyset) = \emptyset

. But

I(X)

isn't necessarily all of

Y

(as

f

isn't necessarily surjective). Therefore,

I(X)^c

is not always empty.
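Here is a tiny concrete counterexample along these lines, as a sketch with made-up names. The map is chosen to be neither injective nor surjective.

```python
def image(f, U):
    """Image of the subset U under f, where f is a dict."""
    return frozenset(f[x] for x in U)

f = {1: "a", 2: "a", 3: "b"}             # 1 and 2 collide; "c" is never hit
X = frozenset(f)
Y = frozenset({"a", "b", "c"})

U = frozenset({1})
U_prime = frozenset({2})                 # disjoint from U, with the same image

image_of_intersection = image(f, U & U_prime)
intersection_of_images = image(f, U) & image(f, U_prime)

image_of_complement = image(f, X - U)
complement_of_image = Y - image(f, U)

unions_ok = image(f, U | U_prime) == image(f, U) | image(f, U_prime)
top_preserved = image(f, X) == Y
```

In this example the image of the intersection is empty while the intersection of the images contains "a", the two complement computations disagree, and the top element is not preserved, but the union still behaves.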

John Baez (Apr 08 2024 at 21:23):

Great, now you see exactly how inverse image is better than image! Inverse image sends maps between sets to maps of complete boolean algebras, and indeed gives a functor from

\mathrm{Set}

to the opposite of the category of complete boolean algebras. I like to call this "the duality between set theory and logic".

You meant it may not be empty... as you made clear in your very next sentence.

John Baez (Apr 08 2024 at 21:26):

If you can stand one more puzzle about this: do you see the way in which inverse image is 'logically simpler' and 'easier to compute' than image?

David Egolf (Apr 08 2024 at 21:31):

Since I assumed that

U

and

U'

are non-empty, and also that

I(U) = I(U')

, I think

I(U) \cap I(U')

actually isn't empty in this case. But one wouldn't need to assume

I(U) = I(U')

! More generally, I think the idea is that even if

U

and

U'

are disjoint,

I(U)

and

I(U')

can have some elements in common.

David Egolf (Apr 08 2024 at 21:32):

Hmmm, that's interesting. Nothing immediately comes to mind, but I'll give it some thought!

David Egolf (Apr 08 2024 at 21:38):

My initial thought is that an inverse image seems harder to compute than an image!

It seems like it requires at least as many evaluations of

f

to compute an inverse image, as compared to an image.

John Baez (Apr 08 2024 at 21:39):

John Baez (Apr 08 2024 at 21:46):

I was thinking: given

f: X \to Y

and a subset of its domain, what do we have to do to check whether a given element of

Y

is in the image of that subset?

Given

f

and a subset of its codomain, what do we have to do to check whether a given element of

X

is in the inverse image of that subset?

John Baez (Apr 08 2024 at 21:48):

I believe this explains why inverse image is 'better': always a boolean algebra homomorphism.

David Egolf (Apr 08 2024 at 21:52):

I have to take a look at what you just said! Here's the idea that came to mind for me, though:

I think we can use the fact that the inverse image interacts nicely with unions, intersections, and complements. For example, let's say I want to compute the inverse image of

V = \cap_i V_i

, and I know the inverse image of each

V_i

. Then I can just compute the intersection of the inverse images of the

V_i

By contrast, if I know the image of a bunch of

U_i

, I can't directly compute the image of

U = \cap_i U_i

in an analogous way. That's because

I(\cap_i U_i)

is not necessarily equal to

\cap_i I(U_i)

So, the inverse image operation is computationally nicer than the image operation in this sense: you can compute more things directly, when given some known prior things.

David Egolf (Apr 08 2024 at 21:58):

Assume we have

f: X \to Y

and a subset

U

of

X

. To check if

y \in Y

is in the image of

U

, we need to check each element of

U

and see if we ever get

y

Let's now assume we have a subset

V

of

Y

. To check if

x \in X

is in the inverse image of

V

, we just need to compute

f(x)

and see if it lands in

V

So, we'll need fewer evaluations of

f

to see if a particular element is in an inverse image, as compared to what we need to check if an element is in an image!
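Here's a rough sketch of that comparison in code, assuming finite sets and counting evaluations with a hypothetical wrapper `counted`. The two membership tests are just the definitions written out.

```python
def counted(f):
    """Wrap f so that we can count how many times it gets evaluated."""
    counter = {"calls": 0}
    def wrapper(x):
        counter["calls"] += 1
        return f(x)
    return wrapper, counter

def in_preimage(f, V, x):
    """x is in the inverse image of V exactly when f(x) is in V."""
    return f(x) in V

def in_image(f, U, y):
    """y is in the image of U exactly when some x in U has f(x) == y."""
    return any(f(x) == y for x in U)

square, counter = counted(lambda x: x * x)

V = {0, 1, 4, 9}
x_is_in_preimage = in_preimage(square, V, 3)
calls_for_preimage = counter["calls"]    # a single evaluation of f

counter["calls"] = 0
U = range(100)
y_is_in_image = in_image(square, U, 50)  # 50 is not a square, so we search all of U
calls_for_image = counter["calls"]
```

The inverse image test is one evaluation followed by a membership check in `V`, while the image test is an existential search over `U` that, in the worst case, has to look at every element.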

David Egolf (Apr 08 2024 at 21:59):

David Egolf (Apr 08 2024 at 22:31):

Peva Blanchard (Apr 09 2024 at 07:49):

In complexity theory, enumerating elements of a subset and checking that an element is a member of a subset are somehow related, but not really equivalent (this depends on how "easy to compute" is defined).

a. For instance, regarding your second bullet point, it seems that you have the following picture in mind (I may be wrong):

b. Now the related way is to ask for procedures that decide on whether an element is a member of a subset. In that case, your first bullet point can be restated as:

I think cases a and b are "as easy as each other". They seem "dual", although I don't know if we can make this statement precise.

Peva Blanchard (Apr 09 2024 at 07:50):

Things become complicated if we try to apply one case starting from the other. I mean:

Peva Blanchard (Apr 09 2024 at 08:53):

Assuming that one can also check the equality "y = f(x)", and one can enumerate elements of the domain of

f

, here are two (informal) ways:

Peva Blanchard (Apr 09 2024 at 09:33):

Oh! I think there is a more "categorical" rewording of all of the above. Given any category

C

and morphism

f: X \rightarrow Y

in

C

, we get two functors:

Peva Blanchard (Apr 09 2024 at 10:08):

(this is the kind of things that makes my head spin: over/under, post/pre, left/right)

Peva Blanchard (Apr 09 2024 at 12:11):

(edit: deleted the latest message. I thought I could reformulate the above in terms of adjunctions between pre/post-composition and left/right Kan extension/lift, but it was incorrect.)

David Egolf (Apr 09 2024 at 16:14):

Thanks for the link on "dovetailing". That's a neat concept I'd not heard of before!

I'm not quite understanding why you want to enumerate the elements

(y,x)

of

V \times dom(f)

, though. Wouldn't it be less work to just loop over all the elements in

dom(f)

(which I've been assuming is all of

X

) and then apply

f

to those, and see if we ever get an element of

V

David Egolf (Apr 09 2024 at 16:18):

I suppose these two loops effectively involve considering each element of

V \times dom(f)

, unless we add a "break" statement in the second loop. The break statement could let us quit out of the second loop early if we find some

v \in V

that

y=f(x)

is equal to.

That explains, I think, why you wanted to enumerate all elements of

V \times dom(f)

. I think we basically had the same idea; I was just struggling to see that the way you formalized the idea matched (at least roughly) what I had in mind!

Peva Blanchard (Apr 09 2024 at 16:26):

I think it is the same idea. Dovetailing is relevant when the sets are infinite. For instance, in your pseudo-code example (the two loops), the inner loop can be infinite, so you never enumerate all of

V \times dom(f)

. But, besides that technical point, you really expressed the same idea.

David Egolf (Apr 09 2024 at 16:41):

Peva Blanchard (Apr 09 2024 at 16:44):

Now, it seems to me that, in maths, unless you work in computability theory, it is unusual to consider structures that can be enumerated. It makes no sense, a priori, for most mathematical structures, e.g., the real numbers. The other way, i.e., testing an element is more common, and, by varying what is meant by "testing", easier to generalize.

This is why I think, the inverse image

f^{-1}(V)

seems "logically easier" to conceive than the image

f(U)

. More precisely, in

Set

, the mother of all tests is the membership test

y \in V

, i.e., a characteristic function

\chi_V : Y \rightarrow \{0,1\}

. You can transport these characteristic functions along

f

just by pre-composing with

f
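As a quick sketch of this, assuming subsets are represented by 0/1-valued test functions (the names `chi` and `pullback` are made up for illustration): pre-composing the test for a subset with a map gives the test for the inverse image.

```python
def chi(V):
    """Characteristic function of the subset V: returns 1 on V and 0 elsewhere."""
    return lambda y: 1 if y in V else 0

def pullback(f, test):
    """Transport a test on Y to a test on X by pre-composing with f: X -> Y."""
    return lambda x: test(f(x))

f = lambda x: x % 3                      # a map from integers to {0, 1, 2}
V = {0, 2}

chi_preimage = pullback(f, chi(V))       # the membership test for f^{-1}(V)
values = [chi_preimage(x) for x in range(6)]
```

No enumeration of any set is needed here: the test for the inverse image is built directly from the test for `V` and the map.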

Peva Blanchard (Apr 09 2024 at 16:45):

David Egolf (Apr 09 2024 at 16:47):

There's no rush! What you are saying is interesting, and I look forward to thinking about it! If there's more you want to say on that topic, please feel free to keep posting about it here. I can always shuffle the "part two" messages to the bottom afterwards.

David Egolf (Apr 09 2024 at 17:00):

In this case the blue area is

Y

and the black line is

X

. Our continuous map

p:Y \to X

projects each point down to the corresponding point on the black line.

Peva Blanchard (Apr 09 2024 at 17:04):

There is another bundle that may be familiar to you. Say we encode RGB pixel values with real values between

0

and

1

, i.e.,

C = [0,1]^3

is the space of colors. Let

S = [0,1]^2

be the unit square, thought of as a canvas. Then we have a (trivial) bundle

S \times C \rightarrow S

(just the projection on the first factor) that represent all the possible RGB-images on the unit square.

David Egolf (Apr 09 2024 at 17:10):

That's a neat example! I'm imagining all possible RGB values "floating over" each point of our square. A section of that bundle I think corresponds to an RGB image over (an open subset) of the unit square.
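Here is how I might sketch this in code, assuming points of the square are pairs of floats and colors are triples of floats. The particular image `gradient_image` is just a made-up example of a section.

```python
def p(element):
    """Bundle projection S x C -> S: forget the color, keep the point."""
    point, color = element
    return point

def gradient_image(point):
    """A section: choose, for each point of the canvas, a color sitting over it."""
    x, y = point
    return (point, (x, y, 0.5))

sample_points = [(0.0, 0.0), (0.25, 0.5), (1.0, 1.0)]

# The defining property of a section: projecting back down returns the point.
is_section = all(p(gradient_image(pt)) == pt for pt in sample_points)
```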

David Egolf (Apr 09 2024 at 17:12):

Here's a picture to visualize the concept of "section", using the bundle I drew above:
section

The orange line is a section of our bundle

p:Y \to X

over the yellow subset of

X

In this case, we might imagine that the black line is an observed noisy signal, and the blue "envelope" describes at each point the possible "actual" (without noise) signal values. A section then is a "point-wise plausible guess" for a portion of a denoised signal.

John Baez (Apr 09 2024 at 17:25):

At first you said the black line was a picture of X, but to me it looks like a picture of a section... they're both reasonable interpretations but people tend to use the latter, and draw X as a horizontal line down below the bundle itself, so that p "projects down" from Y to X. Later you seem to have used the latter interpretation because you said "the black line is an observed noisy signal", which sounds like a section to me.

John Baez (Apr 09 2024 at 17:26):

John Baez (Apr 09 2024 at 17:27):

People like to use E instead of Y and B instead of X, and call E the total space and B the base space of the bundle.

Peva Blanchard (Apr 09 2024 at 17:49):

Morgan Rogers (he/him) (Apr 09 2024 at 19:02):

It might also be useful to have some more explicitly defined examples, like the exponential map from

\mathbb{C}

to

\mathbb{C} - \{0\}

;)

Peva Blanchard (Apr 09 2024 at 19:07):

oh yes, another one would be

z \mapsto z^2

on the complex numbers, or more generally

z \mapsto z^n

(Which makes me wonder, since

e^z = \sum_n \frac{z^n}{n!}

, if we can combine multiple bundles together)

Morgan Rogers (he/him) (Apr 09 2024 at 19:29):

There are ways to combine bundles over general spaces

X

but the kind you're thinking of (taking products and sums) relies on the algebraic structure of

\mathbb{C}

. See if you can figure out how you might use the addition and multiplication of the complex numbers to combine bundles! Power series are a bonus challenge ;)

John Baez (Apr 09 2024 at 20:18):

Yes, that picture shows a 'bundle' in the extremely general sense introduced here (a continuous map from a space

E

to a space

B

), but not a [[fiber bundle]]. For a fiber bundle we typically want the 'fibers'

p^{-1}(b)

to be homeomorphic to each other for all

b \in B

, while in the picture some fibers are homeomorphic to

[0,1]

and others to

[0,1] \cup [2,3]

Peva Blanchard (Apr 09 2024 at 21:43):

I see. Indeed, if we consider only bundles over

\mathbb{C}

(or any other ring actually), we can do something as follows. Let

p_1 : E_1 \rightarrow \mathbb{C}

and

p_2 : E_2 \rightarrow \mathbb{C}

, then we can define

p_1 + p_2 : E_1 \times E_2 \rightarrow \mathbb{C}

as the composite

E_1 \times E_2 \xrightarrow{(p_1, p_2)} \mathbb{C} \times \mathbb{C} \xrightarrow{+} \mathbb{C}

We can do the same for any other binary continuous operations like, e.g., multiplication.

Similarly, given a complex number

s

and a bundle

p : E \rightarrow \mathbb{C}

, I can define the bundle

s \cdot p : E \rightarrow \mathbb{C}

by pointwise multiplication

e \mapsto s \cdot p(e)
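A minimal sketch of these two constructions, assuming a bundle is represented just by its projection as a Python function into the complex numbers (the names `bundle_sum` and `scale` are hypothetical):

```python
def bundle_sum(p1, p2):
    """The projection of p1 + p2, defined on pairs (e1, e2) in E1 x E2."""
    return lambda pair: p1(pair[0]) + p2(pair[1])

def scale(s, p):
    """The projection of s . p, sending e to s * p(e)."""
    return lambda e: s * p(e)

p1 = lambda z: z * z                     # the bundle z -> z^2 over the complex numbers
p2 = lambda z: z                         # the identity bundle

total = bundle_sum(p1, p2)
value_of_sum = total((1j, 2 + 0j))       # (1j)^2 + 2 = 1

scaled = scale(2j, p1)
value_of_scaled = scaled(1 + 1j)         # 2j * (1 + 1j)^2 = 2j * 2j = -4
```

Note that the total space of the sum is the product of the two total spaces, which is why the number of variables grows with each operation.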

Now, let's use the notation

z : \mathbb{C} \rightarrow \mathbb{C}

for the bundle corresponding to the identity morphism. Then we have

z^n : \mathbb{C}^n \rightarrow \mathbb{C}

, and more generally for any polynomial

\begin{align*} P(z) : \prod_{0 \le k \le n} \mathbb{C}^k &\rightarrow \mathbb{C} \\ ((z_{ki})_{1 \le i \le k})_{0 \le k \le n} &\mapsto \sum_{0 \le k \le n} a_k \cdot \prod_{1 \le i \le k} z_{ki} \end{align*}

Peva Blanchard (Apr 09 2024 at 21:49):

(This is a weird entity. I would expect the total space to look more like a polynomial, but here it is just a big cartesian product)

Peva Blanchard (Apr 09 2024 at 21:58):

If

P

is a power series instead of a mere polynomial, the bundle

P(z)

is not well-defined. It is not clear at all if the series converges; plus it is a weird series as it involves infinitely many different variables.

Peva Blanchard (Apr 09 2024 at 22:07):

If I assume that the series has a positive radius of convergence, e.g. 1, it might help. We know then that for any complex number

s

with

|s| < 1

, the series

\sum_k a_k \cdot s^k

converges to a well-defined (complex) value, and the induced function is continuous.

I could restrict my setting to the situation where the

z_{ki}

all have modulus less than

1

. Even so, as we have infinitely many variables, I'm not sure that the series

\sum_k a_k \prod_{1 \le i \le k} z_{ki}

converges, let alone that it depends continuously on the

z_{ki}

's.

Peva Blanchard (Apr 09 2024 at 22:19):

Morgan Rogers (he/him) (Apr 10 2024 at 05:37):

Nice attempt! You got the main ideas I think, but some things to note: first, at the moment you have a lot of variables around; there is a way to reduce them by also precomposing with something. Second, there are indeed a bunch of subtleties to beware of for power series: for it to define a bundle, you not only need to restrict to a subset where the series converges, you need to make sure the function is continuous! It's hard to express those conditions categorically, which is why you'll rarely see analysis and category theory talking to each other. Or to turn that comment into an exercise: one way you might hope to express a power series is as a diagram of bundles over

\mathbb{C}

whose colimit would determine the power series. But this can't work because of the way we define morphisms of bundles; can you see why?

Peva Blanchard (Apr 10 2024 at 11:42):

Indeed, there is a brutal way to reduce the number of variables. It suffices to precompose what I did with the diagonal map

z \mapsto (z, z, z, \dots)

. This amounts to considering only the bundles of the form

\mathbb{C} \rightarrow \mathbb{C}

More precisely, given a polynomial

P

, this amounts to considering the function

z \mapsto P(z)

as a bundle

\mathbb{C} \rightarrow \mathbb{C}

When

P

is a (formal) power series, then the domain of

P

cannot be the whole complex plane. For instance,

So we must restrict the domain. Let's consider all the ways to restrict this power series. That is, we consider all the bundles

U \rightarrow \mathbb{C}

with

z \mapsto P(z)

, where

U

is an open subset where the series converges and is continuous. My knowledge about complex analysis is a bit rusty, so I'm being a bit sketchy here...

These bundles are objects in the over category

Top/\mathbb{C}

. Let's consider the category

\mathcal{P}

with those bundles as objects, and as morphisms the ones induced by inclusion of subsets

U \subseteq V

. We could take the colimit

L : X \rightarrow \mathbb{C}

of

\mathcal{P}

in

Top/\mathbb{C}

, provided it exists (I don't know of any argument for that off the top of my head).

It looks like

X

would be the "maximal" domain of definition of the power series

P

. But, thanks to your hint, I think this is wrong. That's because there are other morphisms in

Top/\mathbb{C}

, e.g., homeomorphisms. So, there could be issues like the total space

X

being homeomorphic to the maximal domain of

P

(???).

Peva Blanchard (Apr 10 2024 at 11:46):

Oh yes, in particular, the coefficients of the power series are not preserved by homeomorphisms.

Peva Blanchard (Apr 10 2024 at 11:47):

Peva Blanchard (Apr 10 2024 at 11:48):

Which means that one cannot hope to recover the power series from the colimit

L

, even if it exists.

John Baez (Apr 10 2024 at 13:25):

By the way, there's a huge amount to say about analytic functions and power series using sheaves: this is one of the things sheaves were developed for!

Morgan Rogers (he/him) (Apr 10 2024 at 15:10):

Great work! I was actually trying to hint at something a bit more basic than this: the fact that morphisms of bundles over

X

fix the values in

X

. I can't express the bundle corresponding to a power series as the colimit of the partial sums because there aren't bundle morphisms between the bundles corresponding to those partial sums in general!
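A small worked example may make this concrete. The choice of partial sums is illustrative and not taken from the thread: take the bundles given by 1 + z and 1 + z + z^2, both with total space the complex plane. A bundle morphism h from the first to the second would be a continuous map with (1 + z + z^2) composed with h equal to 1 + z, that is:

```latex
\begin{align*}
1 + h(z) + h(z)^2 &= 1 + z \\
\iff\quad h(z)^2 + h(z) &= z \\
\iff\quad \bigl(2h(z) + 1\bigr)^2 &= 1 + 4z .
\end{align*}
```

So 2h + 1 would have to be a continuous square root of 1 + 4z on the whole complex plane. No such function exists, since following a loop around z = -1/4 flips the sign of the root. Hence there is no bundle morphism from the first partial sum to the next one in this example.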

David Egolf (Apr 10 2024 at 16:24):

I was wondering if one could come up with a procedure for enumerating (listing the elements of, I assume?) a subset

U \subseteq X

of interest given a way to test if elements in

X

are in

U

. But I suppose if

X

has infinitely many elements, this could be very impractical - this procedure may often require an infinite number of tests to be run. From that perspective, it does make more sense to focus on testing individual elements - as that is something that we can probably actually do!

Peva Blanchard (Apr 10 2024 at 17:14):

Actually, if all you have is a testing procedure for

U

, via a characteristic function

X \rightarrow \{0,1\}

that consumes elements of

X

, there is no generic way to build a procedure that produces elements of

U

. For that you need another assumption, e.g., that you already have a procedure that produces elements of

X

, which you can then "filter" using the characteristic function.
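The "filter" idea can be sketched as follows. The names `enumerate_subset`, `naturals`, and `is_multiple_of_3` are hypothetical, chosen only for this illustration:

```python
from itertools import count, islice

def enumerate_subset(produce_x, in_u):
    """Yield the elements of U, given a producer for X and a test for U."""
    return (x for x in produce_x() if in_u(x))

# Hypothetical example: X = natural numbers, U = multiples of 3.
naturals = lambda: count(0)
is_multiple_of_3 = lambda n: n % 3 == 0

# Take only finitely many elements, since the enumeration never ends.
first_five = list(islice(enumerate_subset(naturals, is_multiple_of_3), 5))
```

Note that if U happened to be empty, asking for even one element would loop forever: the test alone never tells us that nothing will turn up.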

Peva Blanchard (Apr 10 2024 at 17:17):

I don't want to reveal too much about John's later posts (also because I don't have an expert knowledge in those things), but these "testing procedures", or "characteristic functions", will play a crucial role with respect to an important notion in topos theory, namely that of "subobject classifier".

John Baez (Apr 10 2024 at 22:53):

Yes, I was focused on testing procedures. My claim that inverse images are "logically simpler" than images merely meant this:

In this sense, images are defined using an existential quantifier

\exists

. So:

But since inverse images are defined much more simply, without any quantifiers, they preserve unions, intersections and complements!
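A finite sanity check of this claim, as a minimal sketch. The sets and the function `f` below are made up for illustration, with `f` stored as a dict:

```python
def image(f, s):
    """f(S) = {y : there exists x in S with f(x) = y}."""
    return {f[x] for x in s}

def preimage(f, domain, t):
    """f^{-1}(T) = {x : f(x) in T}; just a membership test, no quantifier."""
    return {x for x in domain if f[x] in t}

# Hypothetical example: f sends both 0 and 1 to the same point "a".
X = {0, 1, 2}
Y = {"a", "b"}
f = {0: "a", 1: "a", 2: "b"}

S1, S2 = {0}, {1}
T1, T2 = {"a"}, {"b"}

pre_unions = preimage(f, X, T1 | T2) == preimage(f, X, T1) | preimage(f, X, T2)
pre_intersections = preimage(f, X, T1 & T2) == preimage(f, X, T1) & preimage(f, X, T2)
pre_complements = preimage(f, X, Y - T1) == X - preimage(f, X, T1)

# Images fail for intersections: S1 and S2 are disjoint, but their images overlap.
img_intersections = image(f, S1 & S2) == image(f, S1) & image(f, S2)
```

In this example the three preimage checks hold, while the image check fails because `f` is not injective.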

David Egolf (Apr 11 2024 at 17:27):

I'm feeling tired today, but I'd like to try and make at least a little progress on the current puzzle. Here it is again:

David Egolf (Apr 11 2024 at 17:32):

First, I want to show that

\Gamma_p: \mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}

is a functor, and hence a presheaf. Here's what it does on objects and morphisms:

David Egolf (Apr 11 2024 at 17:36):

To show that

\Gamma_p

is a functor, we first need to show that restricting a section of

p

actually gives us a section of

p

. Let

s:U \to Y

be a section of

p

over

U

, where

U

is an open subset of

X

. We want to show that

s|_V: V \to Y

given by

s|_V = s \circ i_{V \to U}

is a section of

p

over

V

. (Here

V

is an open subset of

X

with

V \subseteq U

.)

To do this, it suffices to show that

p \circ s|_V = 1_V

. Checking this at an element

v \in V

, we find

p \circ s|_{V}(v) = p \circ (s \circ i_{V \to U})(v) = (p \circ s) \circ i_{V \to U}(v) = 1_U \circ i_{V \to U}(v) = 1_U(v) = v

. We conclude that

p \circ s|_V = 1_V

, as desired. (Also,

s|_V

is continuous, as it is given by composing continuous functions).

David Egolf (Apr 11 2024 at 17:42):

Next, we want to show that

\Gamma_p(1_U) = 1_{\Gamma_p(U)}

for any object (open subset of

X

)

U

. By definition,

\Gamma_p(1_U): \Gamma_p(U) \to \Gamma_p(U)

is the function that takes a given section

s: U \to Y

and restricts its domain to

U

, yielding

s: U \to Y

. We note that this is the identity function on

\Gamma_p(U)

, so that

\Gamma_p(1_U) = 1_{\Gamma_p(U)}

David Egolf (Apr 11 2024 at 17:45):

To finish showing that

\Gamma_p: \mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}

is a functor (and hence a presheaf), we need to show that

\Gamma_p

respects composition. That is, if we have an equation of the form

r \circ r' = r''

in

\mathcal{O}(X)^{\mathrm{op}}

, then we need to show that

\Gamma_p(r) \circ \Gamma_p(r') = \Gamma_p(r'')

. This is true because restricting the domain of a section in two steps, or restricting the domain all at once yields the same result.

We conclude that

\Gamma_p: \mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}

is a presheaf!

David Egolf (Apr 11 2024 at 17:49):

The next order of business is to show that

\Gamma_p

is not only a presheaf, but also a sheaf! But that's a job for another day, when I have a bit more energy.

Peva Blanchard (Apr 11 2024 at 19:41):

spoiler

Let $(U_i)_{i\in I}$ be an open cover of $U$ , and let $(s_i)_{i\in I}$ be a family of compatible sections

$s_{i|U_i \cap U_j} = s_{j|U_i \cap U_j}$

By definition, $s_i : U_i \rightarrow Y$ is a continuous function such that $p \circ s_i = 1_{U_i}$ . Since the presheaf of continuous functions is a sheaf, there exists a (unique) continuous function $s : U \rightarrow Y$ that restricts to $s_i$ on each $U_i$ . It remains to show that $p \circ s = 1_U$ . But the latter is easily checked pointwise. Hence, $s$ is a section of $p$ .

Any other section $z : U \rightarrow Y$ of $p$ that restricts to $s_i$ on each $U_i$ must be equal, as a continuous function, to the function $s$ (because, again, continuous functions form a sheaf). In other words, $s$ is uniquely determined.

This concludes the proof that the presheaf of sections of $p$ is actually a sheaf.

John Baez (Apr 12 2024 at 10:16):

Great! Good luck on your energy levels. I think you'll find that showing that the presheaf of sections of a bundle is a sheaf is similar to the earlier problem where you showed that the presheaf of continuous real-valued functions is a sheaf. Indeed that earlier problem can be seen as a special case of this one if you take

Y = X \times \mathbb{R}

David Egolf (Apr 14 2024 at 15:50):

I now want to show that the presheaf of sections

\Gamma_p: \mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}

is in fact a sheaf.

To do this, let's start out with a bunch of

s_i \in \Gamma_p(U_i)

as

i

varies. Let's require that

(s_i)|_{U_i \cap U_j} = (s_j)|_{U_i \cap U_j}

for all

i,j

and

\cup_i U_i = U

. (Here, each

U_i

is an open subset of

X

). To conclude that

\Gamma_p

is a sheaf, we need to show that there always exists a unique

s \in \Gamma_p(U)

such that

s|_{U_i} = s_i

for all

i

Recalling that

p:Y \to X

, any particular

s_i:U_i \to Y

is a continuous map such that

p \circ s_i = 1_{U_i}

. Intuitively, we want to "glue together" these sections to get a section

s \in \Gamma_p(U)

of

p

over

U

. If

s

exists, it is unique. That is because for any

u \in U

we have

u \in U_i

for some

i

, so we must have

s(u) = s|_{U_i}(u) = s_i(u)

. This produces a well-defined function

:U \to Y

because

(s_i)|_{U_i \cap U_j} = (s_j)|_{U_i \cap U_j}

for all

i,j

It remains to show that

s

exists. To do that, we need to check that defining

s

by

s(u) = s_i(u)

(where

u \in U_i

) gives us a continuous function

s: U \to Y

such that

p \circ s = 1_U

We start by considering continuity. By the "local criterion for continuity" discussed above, a function

s:U \to Y

is continuous exactly if for any point

u \in U

there is a neighborhood

U_u

of

u

such that

s|_{U_u}: U_u \to Y

is continuous. For any

u \in U

, there is some

U_i

so that

u \in U_i

, because

\cup_i U_i = U

. And by assumption we know that

s|_{U_i} = s_i

is continuous. We conclude that

s: U \to Y

is continuous.

Next, we need to show that

p \circ s = 1_U

. For any

u \in U

, there is some

i

so that

u \in U_i

. Then,

p \circ s(u) = p \circ s_i(u) = 1_{U_i}(u) = u

. We conclude that

p \circ s = 1_U

, as desired.
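The set-level part of this gluing argument can be sketched in code. This toy model ignores the topology entirely (every function counts as continuous), and the bundle `p` and the sections `s1`, `s2` are hypothetical data chosen for illustration:

```python
def is_section(p, s, u):
    """s is a section of p over u if it is defined exactly on u and p(s(x)) = x."""
    return set(s) == set(u) and all(p[s[x]] == x for x in u)

def glue(sections):
    """Glue compatible sections (dicts: point of X -> point of Y) into one dict.

    Raises ValueError if two sections disagree on an overlap.
    """
    glued = {}
    for s in sections:
        for x, y in s.items():
            # setdefault returns the existing value if x was already covered.
            if glued.setdefault(x, y) != y:
                raise ValueError(f"sections disagree at {x!r}")
    return glued

# Hypothetical toy bundle p: Y -> X with two points of Y over each point of X.
X = {0, 1, 2}
p = {(x, k): x for x in X for k in ("lo", "hi")}

s1 = {0: (0, "lo"), 1: (1, "hi")}   # section over U1 = {0, 1}
s2 = {1: (1, "hi"), 2: (2, "lo")}   # section over U2 = {1, 2}
s = glue([s1, s2])                  # candidate section over U1 union U2
```

The compatibility condition on overlaps is exactly what makes `glue` well defined, and `is_section(p, s, X)` plays the role of the final pointwise check.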

David Egolf (Apr 14 2024 at 16:08):

Thanks for the good luck! Sometimes hoping for higher energy levels does feel a bit like waiting for a lucky dice roll; I find it's quite difficult to predict my energy levels accurately.

I want to consider the case

p: X \times \mathbb{R} \to X

, where

p

sends

(x,a)

to

x

for any

a

. Then a section of

p

over

U

(where

U

is an open subset of

X

) is a continuous function

s:U \to X \times \mathbb{R}

such that

p \circ s = 1_U

. I want to show that a section of

p

over

U

gives us a real-valued continuous function

:U \to \mathbb{R}

, and a real-valued continuous function

:U \to \mathbb{R}

gives us a section of

p

over

U

A section

U \to X \times \mathbb{R}

is in particular a continuous function. Therefore, by the universal property of products, it corresponds to two continuous functions: (1) a function

:U \to X

and (2) a function

U \to \mathbb{R}

. So, given a section

s:U \to X \times \mathbb{R}

, we get a real-valued continuous function

:U \to \mathbb{R}

, given by

\pi_\mathbb{R} \circ s: U \to \mathbb{R}

Let's now start with a continuous function

f: U \to \mathbb{R}

. We want to construct a section of

p

over

U

from

f

. By the universal property of products, to get a continuous function

s: U \to X \times \mathbb{R}

, we just need to specify a continuous function from

U

to

X

and a continuous function from

U

to

\mathbb{R}

. Let's take our function

:U \to X

to be the inclusion

i:U \to X

(which is continuous because

U

has the subspace topology), and our function

:U \to \mathbb{R}

to be

f

.

We want to show that the induced function

s: U \to X \times \mathbb{R}

is in fact a section. Indeed,

p \circ s(u) = p(u, f(u)) = u

, as desired.

David Egolf (Apr 14 2024 at 16:15):

I guess what I really wanted to show is that there is a bijection between the set of sections of

p

over

U

and the set of real-valued continuous functions from

U

. I'm running out of steam, but this seems important to note: Since

p \circ s= 1_U

for any section

s

, we have that

s(u)

is of the form

(u, f(u))

for some

f

. That is, the function

:U \to X

induced by a section

s

must be the inclusion.

I'm hoping that one can make use of this fact to show that the procedures I described above ((1) for constructing a continuous real-valued function on

U

from a section of

p

over

U

and (2) for constructing a section of

p

over

U

from a continuous real-valued function on

U

) are in fact inverses of one another.
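Here is a minimal sketch of why the two procedures undo each other, on a finite set where continuity plays no role. The function names and the sample data are assumptions made for the illustration:

```python
def function_from_section(s):
    """Procedure (1): keep the second coordinate of s(u) = (u, f(u))."""
    return {u: s[u][1] for u in s}

def section_from_function(f):
    """Procedure (2): pair the inclusion u -> u with f."""
    return {u: (u, f[u]) for u in f}

# Hypothetical data: U = {0, 1, 2} and a real-valued function on it.
f = {0: 1.5, 1: -2.0, 2: 0.0}
s = section_from_function(f)

round_trip_f = function_from_section(section_from_function(f))
round_trip_s = section_from_function(function_from_section(s))
```

The second round trip only returns `s` because the first coordinate of `s[u]` is forced to be `u`, which is the observation made above.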

John Baez (Apr 14 2024 at 16:20):

Yes, that's a really important observation. And there's nothing really special about

\mathbb{R}

here. More generally this is how you can take any section of a bundle

p: X \times A \to X

over

U \subseteq X

and turn it into a continuous function

f: U \to A

. And you're right: this gives a bijection between such sections and continuous functions

f: U \to A

John Baez (Apr 14 2024 at 16:21):

So, sections of bundles are a generalization of continuous functions. I'll let you do the work, but I wanted to have the fun of stating the dramatic conclusion!

Peva Blanchard (Apr 14 2024 at 20:21):

Just to formalize the statement. Does it mean that there is a natural isomorphism between the sheaf of continuous functions on

X

and the sheaf of sections of the bundle

X \times \mathbb{R} \rightarrow X

?

John Baez (Apr 15 2024 at 08:41):

It's not much extra work to state (or prove) the idea more generally: for any topological spaces

X

and

A

, there's a natural isomorphism between the sheaf of continuous

A

-valued functions on

X

and the sheaf of sections of the bundle

X \times A \to X

Peva Blanchard (Apr 15 2024 at 09:24):

It also motivates "fiber bundles". The common fiber somehow acts like the codomain of values of the "functions". The difference is that the total space is not necessarily neatly decomposed as a cartesian product

X \times A

Also, I understand why we would want to consider "étale spaces": the ultimate form of this correspondence game is the equivalence between the category of sheaves on

X

and the category of étale spaces over

X

Peva Blanchard (Apr 15 2024 at 09:33):

Mmh, I think my mental picture is wrong ... an étale space

E \rightarrow X

does not seem to generalize fiber bundles.

John Baez (Apr 15 2024 at 10:51):

Yes, those are both right. We could make the second one more precise if we wanted, either in lowbrow ways or in highbrow ways using sheaf cohomology. But now is probably not the time to do that, especially since the "course" he's going through does not introduce fiber bundles.

John Baez (Apr 15 2024 at 10:52):

The course talks more about étale spaces fairly soon, so let's wait a bit and revisit this. But you're right: I would say étale spaces generalize covering spaces, which are fiber bundles with discrete fiber.

David Egolf (Apr 15 2024 at 20:13):

In the next puzzle, we work on building a functor

\Gamma: \mathsf{Top}/X \to \widehat{\mathcal{O}(X)}

. We've already seen how to make a sheaf

\Gamma_p

on

X

from a continuous function

p:Y \to X

. It remains to figure out how

\Gamma

acts on morphisms in

\mathsf{Top}/X

, and then to check that our resulting

\Gamma

really is a functor.

David Egolf (Apr 15 2024 at 20:23):

First, we recall that a morphism from a bundle

p:Y \to X

to a bundle

p': Y' \to X

is a continuous function

f:Y \to Y'

such that

p' \circ f = p

In picture form, this commutative diagram describes a morphism from

p

to

p'

:
a morphism from p to p'

Notice that if

y \in Y

"sits over"

x \in X

(so that

p(y)=x

), then

f(y) \in Y'

also sits over

x

(as

p'(f(y)) = p(y)=x

). We might think of our continuous function

f:Y \to Y'

as being possible to decompose into several pieces, where the

x_{th}

piece of

f

maps

p^{-1}(x)

to some subset of

(p')^{-1}(x)

David Egolf (Apr 15 2024 at 20:37):

To show that

f \circ s: U \to Y'

is a section of

p': Y' \to X

, we need to show that

p' \circ (f \circ s): U \to X

sends each element of

U

to itself. Noting that

p' \circ f = p

, we find

p' \circ (f \circ s)(u) = (p' \circ f)(s(u)) = p(s(u))

. Since

s

is a section of

p

over

U

, we have

p(s(u)) = u

. We conclude that

p' \circ (f \circ s)(u) = u

for all

u \in U

, and so

f \circ s

is indeed a section of

p'

over

U

David Egolf (Apr 15 2024 at 20:39):

The next order of business is to describe what

\Gamma

does on morphisms. But I'll stop here for today!

Peva Blanchard (Apr 15 2024 at 21:05):

I'll just restate, in this setting, an example that we discussed earlier. Let's just look at the case where

Y = \mathbb{R} \times X

and

Y' = [-\pi/2, \pi/2] \times X

. The bundles

p

and

p'

are just the projections onto the second factor. Consider the map

\begin{align*} f : Y &\rightarrow Y' \\ (v, x) &\mapsto (\arctan v, x) \end{align*}

I choose

\arctan

, but clearly we can replay this game with any function

\mathbb{R} \rightarrow \mathbb{R}

Peva Blanchard (Apr 15 2024 at 21:12):

\begin{align*} g : \mathbb{R} \times \mathbb{R} &\rightarrow [-\pi/2, \pi/2] \times \mathbb{R} \\ (v, x) &\mapsto (\arctan(v - x), x) \end{align*}

I think this is an example of a morphism in

Top/\mathbb{R}

which does not arise from a continuous real-valued function

\mathbb{R} \rightarrow \mathbb{R}

as before.

David Egolf (Apr 16 2024 at 16:22):

I'll work in

\mathsf{Top}/X

. Let us assume we have two bundles of this form:

p:A \times X \to X

and

p':A' \times X \to X

, where

p(a,x)=x

and

p'(a',x)=x

for all

a \in A

,

a' \in A'

, and

x \in X

If we have a continuous function

f:A \to A'

, then the function

(f, 1_X):A \times X \to A' \times X

which sends

(a,x)

to

(f(a),x)

is continuous. Is it also a morphism of bundles from

p

to

p'

? Let's consider

p' \circ (f,1_X)(a,x)

for some

(a,x) \in A \times X

. We get

p' \circ (f, 1_X)(a,x) = p'(f(a),x) = x = p(a,x)

. We conclude that from any continuous function

f:A \to A'

we get a morphism of bundles from

p

to

p'

given by

(f,1_X):A \times X \to A' \times X

David Egolf (Apr 16 2024 at 16:23):

I think then the question is: what other morphisms exist from

p:A \times X \to X

to

p':A' \times X \to X

that cannot be produced in this way?

David Egolf (Apr 16 2024 at 16:30):

Any continuous function

h

from

A \times X

to

A' \times X

induces a continuous function

f

from

A \times X

to

A'

. Let's define

f_x: A \to A'

by

f_x(a) = f(a,x)

. In some cases, this

f_x

can vary as

x

does!

I'm guessing we can't induce a morphism of bundles where this kind of thing happens when we start out with just a single continuous function from

A

to

A'

Peva Blanchard (Apr 16 2024 at 16:33):

Yes exactly! To be complete, we should prove that there is no

f

such that

g(v, x) = (f(v), x)

for all

v, x
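A quick numeric check points toward the proof. This is only a sketch, with `g` transcribed from the formula above using Python's `math.atan`. If such an f existed, then f(0) would have to equal the first component of g(0, x) for every x, but that component changes with x:

```python
import math

def g(v, x):
    """The candidate morphism over R: (v, x) -> (arctan(v - x), x)."""
    return (math.atan(v - x), x)

# g commutes with the projections onto the second factor ...
second_component = g(0.0, 1.0)[1]

# ... but its first component at v = 0 depends on x, so it cannot be f(0).
first_at_x0 = g(0.0, 0.0)[0]   # arctan(0)
first_at_x1 = g(0.0, 1.0)[0]   # arctan(-1)
```

Since arctan(0) = 0 and arctan(-1) = -π/4, no single value f(0) can satisfy g(0, x) = (f(0), x) for both x = 0 and x = 1.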

David Egolf (Apr 17 2024 at 19:09):

I think I have a good guess regarding how to finish describing

\Gamma

. Once I get a bit more energy - hopefully soon - I will type that up here. But today I need to rest up!

Julius Hamilton (Apr 19 2024 at 14:12):

I used to have energy problems, but then I started taking meds (in case that might help you).

David Egolf (Apr 22 2024 at 16:05):

Alright, let me take a stab at describing what

\Gamma: \mathsf{Top}/X \to \widehat{\mathcal{O}(X)}

does on morphisms. Let's assume we have two bundles over

X

, namely

p:Y \to X

and

p':Y' \to X

. These induce presheaves (indeed sheaves) on

X

, by sending each open subset of

X

to an appropriate set of sections over that subset.

Let's call these sheaves

\Gamma_p: \mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}

and

\Gamma_{p'}: \mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}

. Given a morphism of bundles from

p

to

p'

induced by a continuous function

f:Y \to Y'

, we want to define a natural transformation

\Gamma(f): \Gamma_p \to \Gamma_{p'}

Let's set up a naturality square corresponding to the morphism

r: U \to V

in

\mathcal{O}(X)^{\mathrm{op}}

where

V

and

U

are open subsets of

X

and

V \subseteq U

. We recall that:

Given a section

s

of

p

over

U

and a morphism of bundles

f:Y \to Y'

, we can form

f \circ s: U \to Y'

. We saw earlier that this is indeed a section of

p'

over

U

. So, post-composing by

f

provides a function from sections of

p

over

U

to sections of

p'

over

U

David Egolf (Apr 22 2024 at 16:06):

Based on the above, we now draw a proposed naturality square corresponding to the morphism

r:U \to V

in

\mathcal{O}(X)^{\mathrm{op}}

:
square

To show we get a natural transformation from

\Gamma_p

to

\Gamma_{p'}

in this way, we still need to show this square commutes for an arbitrary morphism

r: U \to V

David Egolf (Apr 22 2024 at 16:17):

Let's pick an

s:U \to Y \in \Gamma_p(U)

and trace it around the diagram. Restricting its domain to

V

can be accomplished by precomposing with the (continuous) inclusion map

i:V \to U

. Going around the top right side of the square, we get

f_* \circ |_V(s) = f_*(s \circ i) = f \circ (s \circ i)

. Going around the bottom left side of the square, we get

|_V \circ f_*(s) = |_V(f \circ s) = (f \circ s) \circ i

. By associativity of composition, these two results are equal, and so the square commutes.

We conclude that post-composing with

f

at each component describes a natural transformation

\Gamma(f):\Gamma_p \to \Gamma_{p'}
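The commuting square can be checked on toy data, as a sketch only. The topology is ignored, and the helper names `restrict` and `push_forward` (standing in for the restriction map and for post-composition with f) as well as the sample bundles are assumptions made for this example:

```python
def restrict(s, v):
    """Restriction to V: precompose the section s with the inclusion V -> U."""
    return {x: s[x] for x in v}

def push_forward(f, s):
    """f_*: postcompose the section s with the bundle morphism f."""
    return {x: f[s[x]] for x in s}

# Hypothetical toy data: Y and Y' both sit over X = {0, 1, 2}.
U, V = {0, 1, 2}, {0, 1}
s = {0: (0, "a"), 1: (1, "b"), 2: (2, "a")}            # section of p over U
f = {(x, t): (x, t.upper()) for x in U for t in "ab"}  # f: Y -> Y' over X

top_right = push_forward(f, restrict(s, V))    # restrict first, then f_*
bottom_left = restrict(push_forward(f, s), V)  # f_* first, then restrict
```

Both routes produce the same section over V, which is associativity of composition in disguise.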

David Egolf (Apr 22 2024 at 16:22):

Next, let's show that

\Gamma:\mathsf{Top}/X \to \widehat{\mathcal{O}(X)}

is a functor.

First we need to show that

\Gamma(1_p) = 1_{\Gamma(p)}

for any bundle

p:Y \to X

. The identity morphism

1_p

of

p

is induced by the (continuous) identity function

1_Y: Y \to Y

. So,

\Gamma(1_p): \Gamma_p \to \Gamma_p

is the natural transformation which post-composes by

1_Y

at each component. This is indeed the identity natural transformation from

\Gamma(p) = \Gamma_p

to itself, as desired.

David Egolf (Apr 22 2024 at 16:28):

Finally, we need to show that

\Gamma(f \circ f') = \Gamma(f) \circ \Gamma(f')

for two composable bundle morphisms

f

and

f'

. Let's compare the components of these two natural transformations. The

U_{th}

component of

\Gamma(f \circ f')

is a function that post composes

f \circ f'

after a section

s

, so it is the function

s \mapsto (f \circ f') \circ s

. The

U_{th}

component of

\Gamma(f) \circ \Gamma(f')

is given by composing the

U_{th}

component of

\Gamma(f)

after the

U_{th}

component of

\Gamma(f')

. That means that the

U_{th}

component of

\Gamma(f) \circ \Gamma(f')

corresponds to a function

s \mapsto f \circ (f' \circ s)

. By associativity of composition, we conclude that the

U_{th}

component of

\Gamma(f \circ f')

and

\Gamma(f) \circ \Gamma(f')

are equal. So,

\Gamma(f \circ f') = \Gamma(f) \circ \Gamma(f')

, as desired.

We conclude that

\Gamma: \mathsf{Top}/X \to \widehat{\mathcal{O}(X)}

is indeed a functor!

David Egolf (Apr 23 2024 at 15:18):

I'm excited, because the next section of the current blog post talks about "germs"! The rough intuition I have for germs is that they can be used to describe the different possible "very local behaviours" super close to a point.

For example, "The Rising Sea" (by Vakil) defines germs at a point

x

to be equivalence classes of smooth functions defined on open sets containing

x

: we say that

f:U \to Y

is in the same equivalence class as

f': U' \to Y

if there is some

V \subseteq U \cap U'

such that

x \in V

and

f|_V = f'|_V

. So, intuitively, two functions defined on open sets containing

x

are in the same germ at

x

if they restrict to the same function when we "zoom in" to some open set that is "close enough" to

x

David Egolf (Apr 23 2024 at 15:23):

Above, we saw how to make a presheaf on

X

from a bundle over

X

. We now want to go in the other direction: can we make a bundle over

X

from a presheaf on

X

?

Presheaves on

X

and bundles over

X

can both be viewed as "attaching information" to parts of

X

. Given a bundle

f:Y \to X

, the data "attached" to some point

x \in X

is

f^{-1}(x) \subseteq Y

. Given a presheaf

F: \mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}

, the data "attached" to an open subset

U

is

F(U)

So, to make a bundle from a presheaf, we need to figure out how to attach data to individual points of

X

given data attached to each open subset of

X

David Egolf (Apr 23 2024 at 15:36):

Assume we have some presheaf

F: \mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}

. We can come up with a set

\Lambda(F)_x

to "attach" to

x \in X

as follows:

David Egolf (Apr 23 2024 at 15:40):

Next time, I'd like to think about

\Lambda(F)_x

in the particular case where

F

is the presheaf (which is also a sheaf) that sends each open subset

U

of

X

to the set of continuous real-valued functions

:U \to \mathbb{R}

John Baez (Apr 23 2024 at 15:59):

Good! That's a great example for getting a more concrete picture of germs. I recommend taking

X = \mathbb{R}

so you can actually graph these continuous functions and visualize these germs. And in this example I also recommend comparing the sheaves of continuous, smooth, and analytic real-valued functions.

Remember, a function from an open set of

\mathbb{R}

to

\mathbb{R}

is analytic if at each point it has a Taylor series with a positive radius of convergence.

The reason I bring this up is that derivatives, Taylor series and germs are three famous ways to study how a function looks in an arbitrarily small neighborhood of a point. And there are some revealing differences in the 3 cases listed above!

Peva Blanchard (Apr 23 2024 at 16:13):

f(x) = \begin{cases} 0 &\text{if } x \le 0 \\ e^{-\frac{1}{x^2}} &\text{otherwise} \end{cases}

One can try to describe the germ of

f

at

x = 0

, when regarded as a continuous, resp. smooth, function.

It also turns out that

f

is not analytic, because of what happens at

x = 0
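To spell out what happens at 0, here is a standard fact about this function, stated without proof: every derivative of f vanishes at 0, while f itself is strictly positive to the right of 0.

```latex
\begin{align*}
f^{(n)}(0) &= 0 \quad \text{for all } n \ge 0, \\
\sum_{n \ge 0} \frac{f^{(n)}(0)}{n!}\, x^n &= 0 \quad \text{for all } x, \\
f(x) = e^{-1/x^2} &> 0 \quad \text{for all } x > 0 .
\end{align*}
```

So the Taylor series of f at 0 converges everywhere, yet it agrees with f on no neighborhood of 0. As smooth germs at 0, f and the zero function are different even though they share the same Taylor series.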

John Baez (Apr 23 2024 at 16:31):

Yes, this is a great example of how the germs of smooth functions differ from those of analytic functions. There is more to say about this but I'll let David proceed at his own desired pace so he's not "drinking from a firehose".

JR (Apr 23 2024 at 20:53):

FWIW this function is often used (after integrating and introducing radial or similar coordinates) to construct (relatively) explicit partitions of unity.

David Egolf (Apr 24 2024 at 15:59):

I next want to understand this part of the blog post, where

F

is the presheaf which sends each open subset

U

of

X

to the set of continuous real-valued functions

F(U) = \mathsf{Top}(U, \mathbb{R})

Specifically I'd like to prove the statement "any two such functions give the same germ iff they become equal when restricted to some open neighborhood of

x

".

Once I've done that, then I think it would be good to do the things that John Baez suggests above: take

X = \mathbb{R}

and make some graphs to visualize germs, and compare the germs we get when considering sheaves of continuous, smooth, or analytic

\mathbb{R}

-valued functions. (Then I think it'll be time for the next puzzle, probably!)

David Egolf (Apr 24 2024 at 16:51):

I'm not sure where to start, but I think it may be helpful to get some sense for what a cone under our diagram

F \circ I: O_x \to \mathsf{Set}

is like.

So, let

\alpha: F \circ I \to \Delta_S

be a cone under

F \circ I

. Note that

\alpha

is a natural transformation from

F \circ I

to the functor

\Delta_S: O_x \to \mathsf{Set}

that is constant at the set

S

. Since

\alpha

is a natural transformation, all its "naturality squares" must commute.

Let's examine the naturality square for the morphism

r:U \to V

in

O_x

, where

U

and

V

are open subsets of

X

each containing

x

, such that

V \subseteq U

. Here's the corresponding naturality square:
square

Since

\alpha

is a cone under

F \circ I

, this diagram commutes. That means

\alpha_V \circ |_V = \alpha_U

. This will be useful to know in a moment. Intuitively, this tells us that restricting a continuous function (from an open subset containing

x

to an open subset containing

x

) doesn't change the germ of

x

it corresponds to.

I'm interested in the case where two continuous functions

f:U \to \mathbb{R}

and

f':U' \to \mathbb{R}

get mapped to the same germ (same element of

\Lambda(F)_x

). And I want to show that this happens when there is some open

V \subseteq U' \cap U

so that

x \in V

and

f|_V = f'|_V

. Let's draw part of a cone

\alpha: F \circ I \to \Delta_S

(for some set

S

) under the diagram

F \circ I

in the situation where there is some open

V \subseteq U \cap U'

containing

x

:
diagram

We have that

\alpha_V \circ |_V = \alpha_{U \cap U'}

and

\alpha_{U \cap U'} \circ |_{U' \to U \cap U'} = \alpha_{U'}

. That implies that

\alpha_V \circ |_V \circ |_{U' \to U \cap U'} = \alpha_{U'}

. Similarly,

\alpha_V \circ |_V \circ |_{U \to U \cap U'} = \alpha_{U}

Let's now assume that we have continuous functions

f:U \to \mathbb{R}

and

f':U' \to \mathbb{R}

such that

|_V \circ |_{U \to U \cap U'}(f) = |_V \circ |_{U' \to U \cap U'}(f')

. Thus,

\alpha_V \circ |_V \circ |_{U \to U \cap U'}(f) = \alpha_V \circ |_V \circ |_{U' \to U \cap U'}(f')

. Therefore,

\alpha_{U}(f) = \alpha_{U'}(f')

So, we see that if

f:U \to \mathbb{R}

and

f':U' \to \mathbb{R}

restrict to the same function on some open subset of

U \cap U'

that contains

x

, then they get mapped to the same element by any cone under

F \circ I: O_x \to \mathsf{Set}

. In particular, they must correspond to the same germ of

x

It remains to show that if two continuous functions

f:U \to \mathbb{R}

and

f':U' \to \mathbb{R}

correspond to the same germ of

x

, then they must restrict to the same function on some open

V \subseteq U \cap U'

containing

x

. I'll leave that for next time, though.
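The criterion under discussion (agreement on some open neighborhood of x inside both domains) can be written out directly on a finite topological space. This is a sketch of the criterion itself, not of the colimit definition, and the space, the list `opens`, and the functions `f` and `g` are made-up examples:

```python
def same_germ(f, g, x, opens):
    """True if f and g agree on some open V with x in V, V inside both domains."""
    common = set(f) & set(g)
    return any(
        x in v and v <= common and all(f[y] == g[y] for y in v)
        for v in opens
    )

# Hypothetical finite space X = {0, 1, 2} with a small topology.
opens = [frozenset(), frozenset({0}), frozenset({0, 1}), frozenset({0, 1, 2})]

f = {0: 1.0, 1: 2.0, 2: 3.0}   # defined on U  = {0, 1, 2}
g = {0: 1.0, 1: 5.0}           # defined on U' = {0, 1}

agree_at_0 = same_germ(f, g, 0, opens)   # they agree on the open set {0}
agree_at_1 = same_germ(f, g, 1, opens)   # every open set containing 1 also sees f(1) != g(1)
```

Here `f` and `g` have the same germ at 0 but different germs at 1, even though they are different functions on different domains.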

John Baez (Apr 24 2024 at 17:38):

Good work! We'll be able to draw a lot of lessons from what you're doing now, because many of the ideas you're coming up with now (and will come up with next time :upside_down: ) apply in far more general situations than the one you're considering here. But I won't distract you with those lessons until you're done!

Jacob Zelko (Apr 24 2024 at 18:51):

I have to say a huge thank you to @David Egolf and @John Baez (and multiple others) for this wonderful discussion here on Topos Theory. I am not yet at the point of getting into these blogs like David has, but I am slowly beginning to catch hints of Topos Theory in some of the readings/investigations I have been doing. At some point I think I shall converge back to this discussion but am very thankful it is on this Zulip for future reference. At any rate, I follow along with great curiosity in silent reflection of these points!

David Egolf (Apr 25 2024 at 17:49):

Next, I want to show that if two continuous functions

f:U \to \mathbb{R}

and

f':U' \to \mathbb{R}

(with

U

and

U'

being open sets containing

x

) correspond to the same germ of

x

, then they must restrict to the same function on some open

V \subseteq U \cap U'

containing

x

To get there, I first want to think conceptually about what it means for our set of germs

\Lambda(F)_x

to be (part of the data of) a colimit of

F \circ I: O_x \to \mathsf{Set}

. The full data of the colimit of

F \circ I

is some natural transformation

\alpha: F \circ I \to \Delta_{\Lambda(F)_x}

. By definition of a colimit, this cocone of

F \circ I

is initial in the category of cocones of

F \circ I

. That is, for every other cocone

\beta: F \circ I \to \Delta_S

(where

\Delta_S

is the functor

:O_x \to \mathsf{Set}

constant at some set

S

), there is a unique natural transformation

\Delta_g: \Delta_{\Lambda(F)_x} \to \Delta_S

so that

\Delta_g \circ \alpha = \beta

The triangle diagram lives in the category

[O_x, \mathsf{Set}]

of functors from

O_x

to

\mathsf{Set}

, together with natural transformations between them. The arrow at the bottom

g:\Lambda(F)_x \to S

is a function; it is a morphism in

\mathsf{Set}

. Note that a natural transformation from one constant functor to another is induced by a morphism from the object the first functor is constant at to the object the second functor is constant at.

In this diagram,

\alpha: F \circ I \to \Delta_{\Lambda(F)_x}

I think can be viewed as an "observation" of the functor

F \circ I

. I think our goal is to find the "most informative observation" of the functor

F \circ I: O_x \to \mathsf{Set}

having a target of some constant functor

:O_x \to \mathsf{Set}

. Indeed, I think that

\alpha

is the "most informative" observation of

F \circ I: O_x \to \mathsf{Set}

, in the sense that any other observation of it

\beta

can be computed as

\Delta_g \circ \alpha

for some

\Delta_g

Let's think about this a bit more using components. Let

\alpha_U: (F \circ I)(U) \to \Lambda(F)_x

be the

U

-th component of

\alpha

, where

U

is some open subset of

X

containing

x

. The

U

-th component of

\Delta_g

is just

g: \Lambda(F)_x \to S

, and so the commutativity of our diagram implies that

g \circ \alpha_U = \beta_U

. Taking some particular

f \in (F \circ I)(U)

, so that

f:U \to \mathbb{R}

is a continuous function defined on the open set

U

containing

x

, we learn that

g(\alpha_U(f)) = \beta_U(f)

. So, given the germ that

f

belongs to, namely

\alpha_U(f)

, we can compute the observation

\beta_U(f)

using some

g

. For this reason, I think it makes sense to say that the germ of a particular function

f:U \to \mathbb{R}

at

x

is the "most informative" observation of that function "locally about

x

".
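The component-level statement above can be collected into one display. This is only a restatement of the universal property under the notation already in use; the last implication is the consequence that the later contradiction argument relies on.

```latex
% Universal property of the colimit, written in components.
% For every cocone \beta with tip S there is a unique function g such that
\[
  g \circ \alpha_U = \beta_U \quad \text{for all } U \in O_x ,
  \qquad\text{that is,}\qquad
  \beta_U(f) = g\bigl(\alpha_U(f)\bigr) \quad \text{for all } f \in (F \circ I)(U).
\]
% Consequently, functions with equal germs receive equal observations:
\[
  \alpha_U(f) = \alpha_{U'}(f')
  \;\Longrightarrow\;
  \beta_U(f) = g\bigl(\alpha_U(f)\bigr) = g\bigl(\alpha_{U'}(f')\bigr) = \beta_{U'}(f').
\]
```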

Next time, I want to make use of this intuition to show that if two continuous functions

f:U \to \mathbb{R}

and

f':U' \to \mathbb{R}

correspond to the same germ of

x

, then they must restrict to the same function on some open

V \subseteq U \cap U'

containing

x

. To do that, here's my current rough plan:

John Baez (Apr 25 2024 at 23:01):

I don't think a proof by contradiction is necessary here, but you can try it and then perhaps straighten it out to a direct proof.

David Egolf (Apr 26 2024 at 15:43):

This makes me want to find a direct proof! But I'll start out with the (attempted) proof by contradiction, and see what happens.

Let

\alpha: F \circ I \to \Delta_{\Lambda(F)_x}

be the proposed colimit of

F \circ I:O_x \to \mathsf{Set}

. And to obtain a contradiction, assume we have two continuous functions

f:U \to \mathbb{R}

and

f':U' \to \mathbb{R}

(with

U

and

U'

open sets containing

x

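), such that:

\alpha_U(f) = \alpha_{U'}(f')

(so that they correspond to the same germ of

x

), but there is no open

V \subseteq U \cap U'

containing

x

on which they restrict to the same function.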

We aim to construct a cocone

\beta: F \circ I \to \Delta_S

(for some set

S

) of

F \circ I

so that there is no

\Delta_g: \Delta_{\Lambda(F)_x} \to \Delta_S

satisfying

\Delta_g \circ \alpha = \beta

. That would show that

\alpha

can't possibly act in this way if we want it to be a colimit.

David Egolf (Apr 26 2024 at 15:51):

To construct

\beta

my plan is to use what I think is supposed to be the actual colimit. We define an equivalence relationship on real-valued functions defined on an open set of

X

containing

x

. We decree that

h:U \to \mathbb{R} \sim h':U' \to \mathbb{R}

exactly if there is some open set

V \subseteq U \cap U'

containing

x

on which

h|_V = h'|_V

. Then, we form the set

S

by having one element per equivalence class. I'll call the element of

S

corresponding to the equivalence class of

h:U \to \mathbb{R}

by the name

[h]

. Then, we let

\beta_U(h:U \to \mathbb{R}) = [h]

Notice that if there is no open

V \subseteq U \cap U'

containing

x

where

f:U \to \mathbb{R}

and

f':U' \to \mathbb{R}

restrict to the same function, then

f

and

f'

are not equivalent. That means that

\beta_U(f) \neq \beta_{U'}(f')

. We will aim to use this in a minute to obtain a contradiction.

(I'd love to finish this off today, but I think I'll need to rest up and come back to this hopefully tomorrow!)

Peva Blanchard (Apr 26 2024 at 16:58):

Yes, I think an explicit construction of the colimit as a quotient by an equivalence relation is the right way!

Here is a very basic example with finite sets. The bottom right corner is the colimit of the diagram consisting of the three other corners. The square brackets enclose equivalence classes.

The way I like to see it is in two steps: first we take the disjoint union (the bullets

a,\dots,e

), and then we glue things together by adding wires (labeled

0,1

). The equivalence classes correspond to the connected components of the resulting graph.
image.png
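The two-step recipe (disjoint union, then gluing along the maps and taking connected components) can be sketched for finite sets. This is a minimal illustration, not the diagram from the picture: the function name `colimit`, the object names, and the particular span of sets are all assumptions made for the example.

```python
from collections import defaultdict

def colimit(sets, maps):
    """Colimit of a finite diagram of finite sets.

    sets: dict mapping an object name to a list of its elements
    maps: list of (source, target, function) triples, function given as a dict
    Returns a set of frozensets; each frozenset is one equivalence class of
    tagged elements (object_name, element).
    """
    # Step 1: disjoint union. Tag every element with the name of its set.
    parent = {(name, x): (name, x) for name, elems in sets.items() for x in elems}

    def find(p):
        # Follow parent pointers to the representative of p's class.
        while parent[p] != p:
            parent[p] = parent[parent[p]]  # path halving
            p = parent[p]
        return p

    # Step 2: add a "wire" between each element and its image under each map.
    for src, tgt, f in maps:
        for x, y in f.items():
            parent[find((src, x))] = find((tgt, y))

    # Step 3: the equivalence classes are the connected components.
    classes = defaultdict(set)
    for p in parent:
        classes[find(p)].add(p)
    return {frozenset(c) for c in classes.values()}

# Hypothetical span A <- C -> B, so the colimit is a pushout.
sets = {"C": [0, 1], "A": ["a", "b"], "B": ["c", "d", "e"]}
maps = [
    ("C", "A", {0: "a", 1: "b"}),
    ("C", "B", {0: "c", 1: "c"}),
]
result = colimit(sets, maps)
print(len(result))
```

In this example both elements of `C` land on `c`, so `a`, `b` and `c` end up in one class, while `d` and `e` stay in classes of their own. The germ construction follows the same pattern, with the restriction maps playing the role of the wires.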

David Egolf (Apr 26 2024 at 17:38):

David Egolf (Apr 28 2024 at 17:24):

I think we're in the home stretch now. The next thing I want to do is to show that the following really defines an equivalence relationship on real-valued functions mapping from open subsets of

X

that contain

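x \in X

:

h:U \to \mathbb{R} \sim h':U' \to \mathbb{R}

exactly if

h|_V = h'|_V

for some open

V \subseteq U \cap U'

containing

x

. Reflexivity is immediate, since any

h:U \to \mathbb{R}

agrees with itself on

U

. For transitivity, if

h:U \to \mathbb{R} \sim h':U' \to \mathbb{R}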

and

h':U' \to \mathbb{R} \sim h'':U'' \to \mathbb{R}

, then we want to show that

h \sim h''

. Since

h \sim h'

, there is some open

V \subseteq U \cap U'

containing

x

so that

h|_V = h'|_V

. And since

h' \sim h''

there is some open

V' \subseteq U' \cap U''

containing

x

so that

h'|_{V'} = h''|_{V'}

. Now,

V \cap V'

is open, contains

x

, and is a subset of both

V

and

V'

. So on

V \cap V'

, we have that

h|_{V \cap V'} = h'|_{V \cap V'}

and

h'|_{V \cap V'} = h''|_{V \cap V'}

. Hence

h|_{V \cap V'} = h''|_{V \cap V'}

and thus

h \sim h''

. For symmetry, if

h:U \to \mathbb{R} \sim h':U' \to \mathbb{R}

, we want to show that

h' \sim h

. Since

h \sim h'

, there is some open

V \subseteq U \cap U'

containing

x

so that

h|_V = h'|_V

. That implies that

h'|_V = h|_V

and hence

h' \sim h

We conclude that

\sim

is indeed an equivalence relation on the set of real-valued functions having some open domain of

X

that contains

x

David Egolf (Apr 28 2024 at 17:34):

Next, I want to show that

\beta

is a cocone of

F \circ I

. Recall from above that

\beta_U: (F \circ I)(U) \to S

is defined as

\beta_U(h:U \to \mathbb{R}) = [h]

, where

[h]

is the equivalence class of

h

according to the equivalence relationship

\sim

To show that

\beta

is a cocone of

F \circ I

, it suffices to show that any naturality square of

\beta: F \circ I \to \Delta_S

commutes. Given a morphism

r:U \to V

in

O_x

(where

U

and

V

are open subsets of

X

containing

x

, with

V \subseteq U

), here is the corresponding naturality square:
naturality square

To show this square commutes, it suffices to show that

\beta_U = \beta_V \circ |_V

. At a particular element of

(F \circ I)(U)

, say

f:U \to \mathbb{R}

, that means that

\beta_U(f) = \beta_V \circ |_V(f)

To show this is true, it suffices to show that

f \sim f|_V

. Since

V

is an open subset of

X

containing

x

, and

f|_V = (f|_V)|_V

, we conclude that

f \sim f|_V

. Hence

\beta_U(f) = \beta_V \circ |_V(f)

for any

f\in (F \circ I)(U)

, and so

\beta_U = \beta_V \circ |_V

for any

U

and

V

We conclude that an arbitrary naturality square of

\beta

commutes, so that

\beta

is a natural transformation

:F \circ I \to \Delta_S

, and thus a cocone.

David Egolf (Apr 28 2024 at 17:48):

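Now, we are in a good spot to demonstrate a contradiction. Recall that we assumed that

\alpha_U(f) = \alpha_{U'}(f')

, while there is no open

V \subseteq U \cap U'

containing

x

on which

f

and

f'

restrict to the same function.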

We will now show that there is no natural transformation

\Delta_g: \Delta_{\Lambda(F)_x} \to \Delta_S

such that

\Delta_g \circ \alpha = \beta

. (Which would be a contradiction, because

\alpha

is supposed to be a colimit). For this equation to hold, it must hold at every component. In particular, we must have

(\Delta_g)_U \circ \alpha_U = \beta_U

and

(\Delta_g)_{U'} \circ \alpha_{U'} = \beta_{U'}

. Noting that every component of

\Delta_g

is just

g

, we have that

g \circ \alpha_U = \beta_U

and

g \circ \alpha_{U'} = \beta_{U'}

We also know that

\alpha_U(f) = \alpha_{U'}(f')

. Using all this, we conclude that

\beta_{U'}(f') = g(\alpha_{U'}(f')) = g(\alpha_{U}(f)) = \beta_U(f)

. But this is a contradiction: by definition of

f

and

f'

we can't possibly have

\beta_{U'}(f') = \beta_U(f)

. (

\beta_{U'}(f') = \beta_U(f)

would imply that

f \sim f'

, which would imply that there is some open set containing

x

in the intersection of the domains of

f

and

f'

where they restrict to the same function - and we know this is false by the assumptions we have placed on

f

and

f'

.)

We conclude that if

\alpha: F \circ I \to \Delta_{\Lambda(F)_x}

is to be the colimit of

F \circ I

, then two functions with the same germ at

x

must have some open neighborhood in the intersection of their domains containing

x

such that they restrict to the same function!

David Egolf (Apr 28 2024 at 17:53):

Peva Blanchard (Apr 29 2024 at 08:46):

In case you are interested, here is, I think, a more direct proof. It amounts to showing that the quotient you suggest satisfies the universal property of the colimit.

spoiler

First, consider the disjoint union

$S(F)_x = \bigsqcup_{U \in O_x^{op}} F(U)$

over the open neighbourhoods of $x$ . An element of $S(F)_x$ is a pair $(U, f)$ with $U$ an open neighbourhood of $x$ , and $f \in F(U)$ .

We define the relation $\sim$ on $S(F)_x$ as

$(U, f) \sim (V, g) \text{ iff } \exists \text{ open } W \subseteq U \cap V, x \in W, f_{|W} = g_{|W}$

It is an equivalence relation. Moreover, if $U\subseteq V$ , then for any $f \in FV$ ,

$(U, f_{|U}) \sim (V, f) ~~~~ \text{(Eq 1)}$

Let $\Lambda(F)_x = S(F)_x / \sim$ be the quotient of $S(F)_x$ by this equivalence relation. For every open neighbourhood $U$ of $x$ , we have a function

$\begin{align*} \iota_U : FU &\rightarrow \Lambda(F)_x \\ f &\mapsto \text{the equivalence class of } (U, f) \end{align*}$

By (Eq 1), the following diagram commutes:
image.png

We show that $\Lambda(F)_x$ satisfies the universal property of the colimit. Let $A$ be a set with functions $FU \xrightarrow{a_U} A$ such that the following diagram commutes:
image.png

Let $a : S(F)_x \rightarrow A$ be the disjoint sum of the $a_U$ 's (universal property of the disjoint sum).
The commutativity of the diagram implies that $a$ is constant on elements belonging to the same equivalence class. Therefore, there exists a unique map $\alpha: \Lambda(F)_x \rightarrow A$ such that the following diagram commutes:
image.png

In other words, $\Lambda(F)_x$ satisfies the universal property of the colimit.

David Egolf (Apr 29 2024 at 16:06):

I think there might be a typo. If I understand correctly, we have that

U \subseteq V

, but the morphism from

FV

to

FU

is called

-|V

in the diagrams above. I would have expected it to be called something more like

-|U

, as I think it corresponds to restricting from

V

to

U

.

David Egolf (Apr 29 2024 at 16:13):

So, I think you start out by describing the cocone

\iota: F \circ I \to \Delta_{\Lambda(F)_x}

(to use the notation I was using above), where the component

\iota_U:(F \circ I)(U) \to\Lambda(F)_x

sends each element of

FU

to its equivalence class under

\sim

Then, to show this cocone satisfies the universal property of the colimit, you introduce another cocone

: F \circ I \to \Delta_A

having

U

-th component

a_U:(F \circ I)(U) \to A

David Egolf (Apr 29 2024 at 16:15):

Next, you define an

a:S(F)_x \to A

. Here,

S(F)_x

is the disjoint union of the sets

(F \circ I)(U)

as

U

varies over open sets containing

x

. Because the disjoint union is the coproduct in

\mathsf{Set}

, a collection of functions

a_U: (F \circ I)(U) \to A

induces a function

a:S(F)_x \to A

David Egolf (Apr 29 2024 at 16:23):

You then I think note that if

f:U \to \mathbb{R} \sim g:V \to \mathbb{R}

then

a(f) = a(g)

. If

f \sim g

, that means there is some

W \subseteq U \cap V

containing

x

where

f|_W = g|_W

. In this situation, this diagram commutes:
diagram

Since

|_W(f) = |_W(g)

, we have

a_W \circ |_W(f) = a_W \circ |_W(g)

. By commutativity of the diagram, this implies that

a_U(f) = a_V(g)

. Since

a

is induced using the universal property of disjoint unions, this implies that indeed

a(f) = a(g)

David Egolf (Apr 29 2024 at 16:32):

Now,

a:S(F)_x \to A

. We want to use

a

to induce a natural transformation from

\Delta_{\Lambda(F)_x}

to

\Delta_A

. To do this, we just need a morphism

\alpha:\Lambda(F)_x \to A

At this point, I think we want to use something like the "universal property of quotients" to induce our

\alpha

. I don't remember how that stuff goes very well right now... But I assume the basic idea is to set

\alpha([f]) = a(f)

We have to show this is well-defined. If

f \sim g

, then

\alpha([f]) = a(f)

and

\alpha([g]) = a(g)

, but since

f \sim g \implies a(f)=a(g)

, we learn that

\alpha([f]) = \alpha([g])

. So,

\alpha

is indeed well-defined.

David Egolf (Apr 29 2024 at 16:48):

David Egolf (Apr 29 2024 at 16:51):

To show that

\alpha

induces a morphism of cocones, we need to show that

\alpha \circ \iota_U = a_U

for all

U \in O_x

. For some

f \in (F \circ I)(U)

, we have

\alpha(\iota_U(f)) = \alpha([f]) = a(f) = a_U(f)

, as desired.

David Egolf (Apr 29 2024 at 16:53):

Finally, we want to show that

\alpha: \Lambda(F)_x \to A

is the unique morphism

:\Lambda(F)_x \to A

that induces a morphism of cocones from our cocone with tip

\Lambda(F)_x

to our cocone with tip

A

David Egolf (Apr 29 2024 at 17:25):

So, we just saw that we need

\alpha \circ \iota_U(f) = a_U(f) = a(f)

for all

U

. Since

\iota_U

projects to equivalence classes, this means we need

\alpha([f]) = a(f)

. As

U

varies, we'll obtain this condition for all equivalence classes. So, I think

\alpha([f]) = a(f)

for all

[f] \in \Lambda(F)_x

is forced, if

\alpha

is to be a morphism of our cocones.

I think that means we can conclude that

\alpha

does indeed induce the unique morphism from our cocone with tip

\Lambda(F)_x

to our cocone with tip

A

. We conclude that our cocone with tip

\Lambda(F)_x

is indeed initial, and so it is indeed the colimit of our diagram!

David Egolf (Apr 29 2024 at 17:26):

Thanks, @Peva Blanchard , for working out the direct proof! I found it interesting and helpful to review. :smile:

David Egolf (Apr 29 2024 at 17:35):

Starting to move in the direction of examples of germs, there is a nice example in the book "An Introduction to Manifolds" (by Tu), on page 12:

Peva Blanchard (Apr 29 2024 at 17:53):

David Egolf (Apr 30 2024 at 17:25):

Before moving on to the next puzzle, I'd like to try and visualize a germ for the presheaf

F: \mathcal{O}(\mathbb{R})^{\mathrm{op}} \to \mathsf{Set}

, which sends each open subset

U

of

\mathbb{R}

to the set of continuous real-valued functions

:U \to \mathbb{R}

To visualize a germ at

x \in \mathbb{R}

, (which is an element of

\Lambda(F)_x

), I'll draw a little cartoon of a bunch of continuous functions (defined on different open sets containing

x

) that correspond to the same germ. That is, they become the same function when restricted to a "small enough" open set containing

x

David Egolf (Apr 30 2024 at 17:31):

I'd be happy to talk more about examples of germs (e.g. in the continuous vs smooth vs analytic cases), but I don't really know how to go about comparing those. So I'll move on to the next puzzle. But if you have something you'd like to say regarding examples of germs, please feel welcome to share your thoughts here!

David Egolf (Apr 30 2024 at 17:37):

Kevin Carlson (Apr 30 2024 at 17:38):

One very important fact about analytic germs is that you know how to name all of them! In fact you probably learned how in a calculus course.

David Egolf (Apr 30 2024 at 17:52):

Thanks, @Kevin Carlson for your comment! It's been a while since I took a calculus course, and I can't remember if we ever used the word "analytic". But let me see if I can figure out what you're hinting at.

One way to put an equivalence relationship on a set

S

is to use a function

f:S \to P

where

f(s)

is some property of

s

. Then we let

s \sim s' \iff f(s) = f(s')

. If

S

is the set of real-valued analytic functions, with each element of

S

defined in some open set

U \subseteq \mathbb{R}

containing

x \in \mathbb{R}

, I want to try setting

f(s)

to be the Taylor series of

s

about

x

. Then I'm hoping that the equivalence relationship induced by

f

is the same as the equivalence relationship "belongs to the same germ". If that works out, I am hoping that would imply that the analytic germs at

x

are in bijection with the Taylor series about

x

that converge in some open subset of

\mathbb{R}

containing

x

David Egolf (Apr 30 2024 at 17:59):

If

f(s) = f(s')

, that implies that

s

and

s'

have the same Taylor series about

x

. Because

s

and

s'

are analytic,

f(s)

and

f(s')

both have a positive radius of convergence about

x

. I think that means that

s

and

s'

become equal when restricted to this region of convergence about

x

. And this restricted function is still analytic, so I think this implies that

s

and

s'

belong to the same analytic germ at

x

David Egolf (Apr 30 2024 at 18:01):

Conversely, if

s

and

s'

belong to the same analytic germ at

x

, then they are both analytic and have some common analytic restriction to some open subset about

x

. That restriction, being analytic, can be expressed as a Taylor series in some region with a positive radius of convergence about

x

. And so,

s

and

s'

have the same Taylor series about

x

when we are "close enough" to

x

. I am hoping that implies that

s

and

s'

must have the same Taylor series about

x

, so that

f(s) = f(s')

David Egolf (Apr 30 2024 at 18:03):

Well, I feel rather shaky on this stuff. Any corrections or clarifications would be appreciated! :smile:

Kevin Carlson (Apr 30 2024 at 18:05):

That’s the idea! Sounds like you’re still just a little stuck on whether having the same Taylor series on a small enough neighborhood of a point means you have the same Taylor series at that point. But there’s no difference between “my Taylor series near

a

” and “my Taylor series at

a

”, because, recall, the Taylor series is calculated by calculating all the derivatives of

s

at

a.

So if two analytic functions agree near

a

, they have the same Taylor series there. And conversely, since you compute the functions by actually plugging into the Taylor series where it converges! Hopefully that wasn’t handing you anything it would’ve been more fun to figure out on your own, just trying to help remind you of some old calculus stuff.

David Egolf (Apr 30 2024 at 18:13):

Thanks for clarifying! That makes sense: since a Taylor series is computed entirely using information "extremely close" to

a

(by computing

s(a), s'(a), s''(a), \dots

), if two analytic functions agree on some open set containing

a

, they must have the same Taylor series at

a

. (All the derivatives are computed using limits which only care about behaviour as we get "really close" to

a

: we'll eventually get inside the open set where these two functions agree during the limiting process). In particular, if two analytic functions obtain the same Taylor series at

a

when we restrict both of them to some open set about

a

(which means they agree on some open set containing

a

), then the two original analytic functions must have the same Taylor series at

a
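The argument in this exchange can be written out as a short display. The symbol T_a(s) for the Taylor series of s at a is notation introduced here for convenience, not something used earlier in the thread.

```latex
% Taylor series of an analytic function s at the point a
% (T_a is notation assumed for this sketch):
\[
  T_a(s)(x) \;=\; \sum_{n=0}^{\infty} \frac{s^{(n)}(a)}{n!}\,(x-a)^n .
\]
% Agreement near a forces equal derivatives at a, hence equal Taylor series:
\[
  s|_V = s'|_V \ \text{for some open } V \ni a
  \;\Longrightarrow\;
  s^{(n)}(a) = (s')^{(n)}(a) \ \text{for all } n \ge 0
  \;\Longrightarrow\;
  T_a(s) = T_a(s').
\]
% Conversely, analytic functions equal their Taylor series near a, so
% T_a(s) = T_a(s') gives s = s' on some open set containing a.
```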

Peva Blanchard (Apr 30 2024 at 18:30):

f(x) = \begin{cases} 0 &\text{if } x \le 0 \\ e^{-\frac{1}{x^2}} &\text{otherwise} \end{cases}

John Baez (May 01 2024 at 07:18):

Yes, the disjoint union (aka "coproduct"). You'd never want to say two germs at two different points

x

are equal.

John Baez (May 01 2024 at 07:21):

John Baez (May 01 2024 at 07:23):

If you haven't thought much about analytic functions, it might help to know that @Peva Blanchard is giving the standard example to show how the concept is a bit subtle. This is a function that has an

n

th derivative at

x = 0

for all

n = 0, 1,2, \dots

, which is still not analytic. In fact all these derivatives are zero, yet the germ of this function at

x = 0

is nonzero!

John Baez (May 01 2024 at 07:34):

Puzzle. Find a function that vanishes at

x = 0

, along with its first million derivatives:

f(0) = 0, \frac{df}{dx}(0) = 0, \frac{d^2 f}{dx^2}(0) = 0, \dots, \frac{d^{1,000,000} f}{dx^{1,000,000}}(0) = 0

John Baez (May 01 2024 at 07:38):

Peva's example is much stranger, because we don't stop at a million or any finite number - all the derivatives of this function are well-defined for all

x

, and they all vanish at

x = 0

, but the function is nonzero for all

x > 0
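A quick numerical sanity check may help make this plausible. The sketch below is only a floating-point illustration, not a proof: it evaluates Peva's function divided by a power of x at a few sample points, and the shrinking ratios reflect the estimate behind the vanishing of every derivative at 0. The names `flat` and `ratios` and the sample points are choices made for this example.

```python
import math

def flat(x):
    """Peva's example: 0 for x <= 0, exp(-1/x^2) for x > 0."""
    return 0.0 if x <= 0 else math.exp(-1.0 / x**2)

def ratios(n, xs):
    """flat(x) / x**n for each x in xs (all x must be positive)."""
    return [flat(x) / x**n for x in xs]

xs = [0.5, 0.25, 0.125]  # sample points approaching 0 from the right
for n in (1, 5, 10):
    # Each list shrinks rapidly as x decreases, even for large n.
    print(n, ratios(n, xs))
```

The function beats every power of x on the way down to 0, yet `flat(x)` itself stays strictly positive for x > 0, which is why its germ at 0 differs from the germ of the zero function.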

John Baez (May 01 2024 at 18:02):

The point of Peva's example is that if you have a function

f \colon \mathbb{R} \to \mathbb{R}

that is infinitely differentiable, its germ at x = 0 can contain more information than all its derivatives at x = 0. But for analytic functions, all the information about the germ is contained in the derivatives - since you can recover the function from its power series, at least in some neighborhood of x = 0.

David Egolf (May 01 2024 at 18:38):

Thanks to both of you for your comments! I'm taking a little break today from this thread, but I hope to return to it tomorrow. The idea that a smooth function can have more information in its germ at a point (in addition to the values of all its derivatives at that point) is interesting, and I look forward to responding in more detail to your comments soon.

David Egolf (May 02 2024 at 16:05):

The first idea that comes to mind for me is to try

f(x)=x^n

for

n

big enough. Each derivative we take reduces the exponent of

x

by

1

. I think this implies that the first

n-1

derivatives are all zero at

x=0

. (Eventually though, after we take

n

derivatives, we get

f^{(n)}(x) = n!

which is non-zero at

x=0

.) I think setting

n=1,000,000+1

gives us a function

f(x)=x^{1,000,001}

that meets the requirements of the puzzle.
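The pattern can be checked mechanically on a small exponent. The sketch below differentiates a monomial symbolically as a (coefficient, power) pair; the helper names are made up for this example, and n = 6 stands in for 1,000,001 only to keep the numbers readable.

```python
def differentiate(coeff, power):
    """Derivative of coeff * x**power, returned as a (coeff, power) pair."""
    if power == 0:
        return (0, 0)  # the derivative of a constant is zero
    return (coeff * power, power - 1)

def derivatives_at_zero(n, how_many):
    """Values at x = 0 of x**n and of its first `how_many` derivatives."""
    values = []
    term = (1, n)  # start with 1 * x**n
    for _ in range(how_many + 1):
        coeff, power = term
        # coeff * 0**power is coeff when power == 0 and 0 otherwise
        values.append(coeff if power == 0 else 0)
        term = differentiate(coeff, power)
    return values

print(derivatives_at_zero(6, 7))
```

The list has zeros in positions 0 through 5, then 6! = 720 in position 6, then zero again. The same reasoning with n = 1,000,001 gives a function whose value and first million derivatives vanish at 0.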

David Egolf (May 02 2024 at 16:09):

I was fairly sure we don't ever want to consider two germs at different points to be equal, but I started slightly worrying about this issue because the

\bigcup

symbol was used instead of the

\coprod

symbol in the blog post:
notation

Actually, I suppose that each of

\Lambda(F)_x

is only defined up to isomorphism if we just require each

\Lambda(F)_x

to be (part of) a colimit of an appropriate diagram. From that perspective, it seems bad to take the union of these

\Lambda(F)_x

as

x

varies, because the union is an operation that cares about the equality of elements of the different sets we are taking a union of. (And we can change which elements in different

\Lambda(F)_x

are equal by swapping out isomorphic copies of some

\Lambda(F)_x

.)

David Egolf (May 02 2024 at 16:14):

Huh! I suppose this function "takes off" from zero so slowly that all its derivatives at

0

don't even notice! So we have two smooth functions (this one, and the function constant at zero) that have the same Taylor series at

0

, but there is no open set containing

0

in which those two functions restrict to the same function!

In this example, we see that computing all the derivatives at a point

x

of a smooth function doesn't always determine uniquely which smooth germ of

x

that function belongs to.

David Egolf (May 02 2024 at 16:22):

I find myself wondering what additional information (in addition to the value of all the derivatives) is needed to determine the germ that a smooth function

f

belongs to at some point

x

. I suppose we'd like to find some information that determines

f

on some small enough neighborhood of

x

. We just saw that all the values of the derivatives of

f

at

x

aren't always going to be enough to do this! So we need some additional information.

But I'm unsure how we could go about discovering what that additional information is.

John Baez (May 02 2024 at 16:56):

John Baez (May 02 2024 at 16:59):

In a sense the difficulty of this question is why the concept of 'germ' is so useful: the germ of a function is the tautological answer to this question!

Peva Blanchard (May 02 2024 at 17:03):

I'm wondering how we could "measure" the "complexity" of the set of germs at

x

. For instance, the analytic germs at

x

seem to form a vector space of countable dimension (I think).

Kevin Carlson (May 02 2024 at 17:07):

It's not exactly of countable dimension, in the usual linear algebra sense, since the space of infinite sequences has uncountable dimension (you can't get any Taylor series with infinitely many nonzero coefficients as a linear combination of $$x^n$$s!) But it's "countable-dimensional" in the functional analysis sense, which is that there's a countable "basis" when you allow for convergent infinite sums from that basis, or similarly, the linear span of that countable "basis" is dense. So one way of taking David's interesting question is, whether we can find an explicit basis for the germs of smooth functions at a point. I don't know the answer but I have an intuition that we cannot!

Peva Blanchard (May 02 2024 at 17:25):

It is tricky because this requires (at least) a topology on the set

\Lambda(F)_x

of germs. There is the coarsest topology making the projection

p: \bigsqcup_{x\in X} \Lambda(F)_x \rightarrow X

continuous. But, this is not enough: the induced topology on the subset

\Lambda(F)_x

is trivial (I think).

Peva Blanchard (May 02 2024 at 17:32):

We would need to "topologize" the sheaf of continuous/smooth/analytic functions: each

F(U)

is a topological space (instead of just a set) for every open subset

U \subseteq X

Peva Blanchard (May 02 2024 at 17:32):

John Baez (May 03 2024 at 07:21):

If you want a vector space of germs that's of countably infinite dimension, the nicest choice is the sheaf of polynomial functions on the real line, or the complex plane...

... or any algebraic variety, which is roughly a space described by a bunch of polynomial equations, like the space of solutions of

x^2 = y^3 + y

. But people call polynomial functions on algebraic varieties regular functions.

Algebraic varieties are the traditional object of study of algebraic geometry, and the sheaf of regular functions on an algebraic variety became the star of algebraic geometry: people call it

\mathcal{O}

You can define algebraic varieties over various fields, but the most traditional case uses

\mathbb{C}

. For any 'smooth'

n

-dimensional complex algebraic variety, the germ of its sheaf of regular functions at any point is isomorphic to the germ of the sheaf of polynomial functions on

\mathbb{C}^n.

After algebraic varieties were quite well understood and Grothendieck started chafing at their limitations, he defined the concept of 'scheme', which is roughly a topological space equipped with a sheaf that acts like the sheaf of regular functions.

I'm not giving a precise definition here, but it's very notable that the concept of scheme explicitly involves the concept of sheaf! So modern algebraic geometry, which uses schemes, is heavily reliant on sheaves.

John Baez (May 03 2024 at 07:26):

If we start exploring sheaves that are like the sheaf of analytic functions, we are moving in a somewhat different direction. There's an important concept of complex manifold, which is a space covered by 'charts' that are copies of

\mathbb{C}^n

for some

n

, with transition functions that are analytic. Any such manifold has a sheaf of analytic functions on it... and the germ of this sheaf at any point is isomorphic to the germ of analytic functions at any point of

\mathbb{C}^n

John Baez (May 03 2024 at 07:33):

Just as we can have algebraic varieties that aren't smooth, like the space of solutions of

x^2 = y^3

(which has a sharp 'cusp' at the origin), we can also define complex analytic varieties, which generalize complex manifolds but don't need to be smooth. I know almost nothing about these, but they're again defined using sheaves.

John Baez (May 03 2024 at 08:53):

Right. But it's very peculiar. It's like starting your car so smoothly that at first you don't accelerate at all.

For the nth derivative to become bigger than zero, the (n+1)st derivative needs to be bigger than zero first... and here that happens for all n, yet all these derivatives start at zero.

John Baez (May 03 2024 at 08:59):

As you increase

x

from 0 to

+\infty

this function goes from 0 to 1. Its first derivative goes up to about 1/2 and then goes down. But before that, its second derivative goes up to 3... later it goes down. And before that, its third derivative goes up to about 20. And before that, its fourth derivative goes up to about 200. And so on.

John Baez (May 03 2024 at 09:01):

So while you may feel the function takes off very gently, because all its derivatives are zero at

x = 0

, in fact there's a huge flurry of activity going on for arbitrarily small

x

David Egolf (May 03 2024 at 17:23):

Looking at those examples of germs was interesting! And it's cool to learn that sheaves get used in all kinds of places. It's always a nice bonus when learning about one thing makes it a bit easier to learn some other things!

I want to return my attention to the next puzzle, which has to do with putting a topology on

\Lambda(F) = \coprod_{x \in X} \Lambda(F)_x

, the set of all our germs for our sheaf

F

of continuous real-valued functions on

X

. I'm still trying to understand the topology described in the blog post - but I'll write out my understanding so far.

We'd really like to have a "germ bundle"

p: \Lambda(F) \to X

that sends a particular germ

p

to the point

x \in X

that it is associated with. (Each germ in

\Lambda(F)

is associated with exactly one point

x \in X

, as it belongs to exactly one of

\Lambda(F)_x

as

x

ranges over

X

). If we could construct a bundle from a sheaf in this way, then we'd be able to think about sheaves from the perspective described here ("Sheaves in Geometry and Logic", page 64):

David Egolf (May 03 2024 at 17:27):

Now, for

p: \Lambda(F) \to X

to really be a bundle, it needs to be a continuous function. To talk about its continuity, we need to put a topology on

\Lambda(F)

. However, referencing pages 84-85 of "Sheaves in Geometry and Logic", this isn't the only function that we want to be continuous when we select an appropriate topology for

\Lambda(F)

We also have some other interesting functions which we'd like to be continuous, so that they can be sections of our bundle. Given some

s \in F(U)

, so in our case

s:U \to \mathbb{R}

in

\mathsf{Top}

, we define a function

g(s):U \to \Lambda(F)

("g" refers to "germ") defined by

g(s)(x) = [s]_x

, where by

[s]_x

I mean the germ of

s

at the point

x \in U

. Note that

p \circ g(s)(x) = p([s]_x) = x

, so that if

g(s):U \to \Lambda(F)

was continuous, it would provide a section of our bundle

p: \Lambda(F) \to X

. In this way, we are hoping to associate each element of a sheaf set

F(U)

(which is a set of continuous real-valued functions in the case of our

F

) to a corresponding section of our germ bundle

p: \Lambda(F) \to X

. To make this happen, we need to choose the topology on

\Lambda(F)

appropriately, so that

g(s):U \to \Lambda(F)

is continuous (for each

s \in F(U)

as

U

varies).
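As a compact summary of what is being asked of the topology on the set of germs, here is the section condition in one display. The name incl_U for the inclusion of U into X is an assumption of this sketch; saying that g(s) is a section over U means that composing with p gives back that inclusion.

```latex
% g(s) sends each point of U to the germ of s at that point.
\[
  g(s) \colon U \to \Lambda(F), \qquad g(s)(x) = [s]_x \in \Lambda(F)_x ,
\]
% p forgets the germ and remembers only the point, so on U:
\[
  p \circ g(s) = \mathrm{incl}_U \colon U \hookrightarrow X ,
  \qquad\text{since}\qquad
  p\bigl(g(s)(x)\bigr) = p\bigl([s]_x\bigr) = x \quad \text{for all } x \in U .
\]
% The topology on \Lambda(F) must make both p and every g(s) continuous.
```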

David Egolf (May 03 2024 at 17:32):

I'll stop here for today. Next time, I'm planning to look at the minimal collection of open sets we need to put in our topology for

\Lambda(F)

so that

p: \Lambda(F) \to X

becomes continuous. Then I think I want to check that a given

g(s):U \to \Lambda(F)

still has a hope of being continuous, even after we've declared those subsets of

\Lambda(F)

to be open.

John Baez (May 03 2024 at 20:57):

By the way, I'd like to know any sort of answer to this question. I was optimistic when I saw this question on Math Stack Exchange:

but the answers were completely useless (except for one answer, which told the original questioner that he was asking about the germ of a smooth function: he hadn't known this concept had a name). I think there should be interesting things to say about this question even if a fully satisfying answer is not known.

David Egolf (May 03 2024 at 23:56):

One very rough idea that comes to mind: instead of taking the limit of something like

(f(x)-f(0))/x

as

x

approaches

0

, maybe we could consider "taking a limit" of the truth values of a bunch of propositions like "

f(x)>0

" as

x

approaches 0.

When

f(x) = e^{-1/x^2}

for

x>0

, we get a sequence of truth values that looks like: true, true, true... as we assess the truth value of "

f(x)>0

" as

x

approaches

0

. By contrast, if

f(x)=0

for all

x

, then the sequence of truth values we get from "

f(x)>0

" is false, false, false... as we assess the truth value of "

f(x)>0

" as

x

approaches zero.

I'm not sure how useful this is... I was just trying to think of "measurements really close to 0" that determine that the zero function is different from our "slow takeoff" function which at

x=0

is zero and has all derivatives equal to zero.
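A rough numerical sketch of the comparison above, under some assumptions: the names `slow_takeoff`, `zero` and `sample_points` are made up for illustration, and the sample points stop at 0.05 because in double-precision floating point the value of e^{-1/x^2} underflows to 0.0 for x much smaller than that. It only illustrates that the truth values of "f(x)>0" differ for the two functions while the difference quotients of the slow takeoff function shrink toward 0.

```python
import math

def slow_takeoff(x: float) -> float:
    """0 for x <= 0 and exp(-1/x^2) for x > 0."""
    return math.exp(-1.0 / (x * x)) if x > 0 else 0.0

def zero(x: float) -> float:
    """The constant zero function."""
    return 0.0

# Hand-picked points approaching 0 from the right (an assumption for illustration).
sample_points = [0.5, 0.2, 0.1, 0.05]

def truth_values(f, points):
    """Truth value of the proposition f(x) > 0 at each sample point."""
    return [f(x) > 0 for x in points]

def difference_quotients(f, points):
    """(f(x) - f(0)) / x at each sample point."""
    return [(f(x) - f(0.0)) / x for x in points]

flat_truths = truth_values(slow_takeoff, sample_points)
zero_truths = truth_values(zero, sample_points)
flat_quotients = difference_quotients(slow_takeoff, sample_points)

print(flat_truths)     # truth values for the slow takeoff function
print(zero_truths)     # truth values for the zero function
print(flat_quotients)  # difference quotients shrinking toward 0
```

The derivative-style measurement cannot tell the two functions apart in the limit, while the sign-style measurement does so at every sample point to the right of 0.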

David Egolf (May 04 2024 at 00:00):

I also wonder how one could formalize this "huge flurry of activity". Maybe that could be helpful for distinguishing these functions from one another using some kind of measurement involving a limiting process which approaches

x=0

Peva Blanchard (May 04 2024 at 00:04):

Let

C_{\infty}

be the sheaf of smooth functions on the unit interval

X = [0,1]

, and

F

be any sub-sheaf of

C_{\infty}

Given a smooth function

f

, I want to consider its derivatives all at once. So we can consider the

\mathbb{N}

-fold power of

F

, namely, the sheaf

\prod\limits_{n\in\mathbb{N}}F

. We have natural transformation

\eta : F \rightarrow \prod\limits_{n\in\mathbb{N}}F

whose component over an open subset

U\subseteq X

is given by

We have another linear function

\epsilon : \Lambda(F)_x\rightarrow \mathbb{R}

given by evaluating a function at

x

. Then we have a linear function

\Lambda(F)_x \xrightarrow{\eta} \prod\limits_{n\in\mathbb{N}} \Lambda(F)_x \xrightarrow{\epsilon^{\mathbb{N}}} \mathbb{R}^{\mathbb{N}}

which maps the germ of a function

f

at

x

to the sequence

(f^{(n)}(x))_{n\in\mathbb{N}}

of values of its derivatives at

x

. Finally, we can consider the kernel

K(F)_x

of this linear function.

It seems that

\eta

is injective, while

\epsilon

is surjective (hence

\epsilon^{\mathbb{N}}

too).

When

F

is a sub-sheaf of the sheaf of analytic functions, the kernel is trivial,

K(F)_x = 0

. This is because the germ of an analytic function at

x

is entirely determined by the values of its derivatives at the point

x

Question: Is the converse true? I.e., if $K(F)_x = 0$ for all $x$ then is $F$ a sub-sheaf of the sheaf of analytic functions?

We can go further and try to describe the kernel

K(C_{\infty})_x

for the sheaf of smooth functions.

To build a germ in

K(C_{\infty})_x

, we must first choose a sequence

(U_n, f_n)_{n\in\mathbb{N}}

with

U_n

an open neighborhood of

x

, and

f_n

a smooth function such that

f_n(x) = 0

. This data already gives quite a lot of freedom. The tricky condition is to ensure that:

A strategy would be to start from something that does not care about this condition, and iterate so that in the limit the tricky condition holds. (I'm being hand-wavy here because I haven't figured it out yet)

John Baez (May 04 2024 at 06:49):

I had an idea that seems related to @Peva Blanchard's. There's a sheaf

C^\infty

of smooth real-valued functions on

\mathbb{R}

, and its stalk (the set of germs) at

0

is some vector space

\Lambda(C^\infty)_0

. We want to understand this space. Peva has described a map, I'll abbreviate it as

\phi: \Lambda(C^\infty)_0 \to \mathbb{R}^{\mathbb{N}}

sending the germ of any smooth function

f: \mathbb{R} \to \mathbb{R}

to the list of derivatives

(f(0), f'(0), f''(0), \dots )

This is well-defined and this is the 'understandable aspect' of

\Lambda(C^\infty)_0

. So we really want to understand the kernel

\mathrm{ker}(\phi)

This raises the question: can we extract any real numbers from the germ of a smooth function at

0

in a linear way, other than by taking derivatives of that function at

0

Question. Can we explicitly describe any nonzero linear map

\ell : \mathrm{ker}(\phi) \to \mathbb{R}

Since we know

\ker(\phi)

is infinite-dimensional, there exist infinitely many linearly independent linear maps

\ell : \mathrm{ker}(\phi) \to \mathbb{R}

. But this does not imply that we can get our hands on any of them, because it's possible that my last sentence can only be proved using the axiom of choice (or some weaker nonconstructive principle)! There are some famous examples of this frustrating situation in analysis.

I've asked this question on MathOverflow and will see if it gets any useful answers.

John Baez (May 04 2024 at 06:53):

Theorem. The map

\phi: \Lambda(C^\infty)_0 \to \mathbb{R}^{\mathbb{N}}

sending the germ of any smooth function

f: \mathbb{R} \to \mathbb{R}

to its list of derivatives

(f(0), f'(0), f''(0), \dots )

is surjective.

John Baez (May 04 2024 at 06:55):

Puzzle. Find a smooth function

f: \mathbb{R} \to \mathbb{R}

whose nth derivative at

0

is

2^{n!}

. The Taylor series of such a function at 0 has zero radius of convergence. But such a function does exist! As a clue, I'll say that to construct it, it helps to use the fact that there exists a smooth function that's zero for

x \ge 1

and

x \le 0

, and positive for

0 < x < 1
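For concreteness, here is a minimal sketch of one standard function with the property mentioned in the clue. The thread does not say which function is meant, so the formula exp(-1/(x(1-x))) and the name `bump` are assumptions, and this is not a solution to the puzzle itself.

```python
import math

def bump(x: float) -> float:
    """Zero for x <= 0 and x >= 1, positive for 0 < x < 1.

    On (0, 1) this is exp(-1 / (x * (1 - x))), which is smooth there and
    tends to 0 (together with all its derivatives) at both endpoints.
    """
    if x <= 0.0 or x >= 1.0:
        return 0.0
    return math.exp(-1.0 / (x * (1.0 - x)))

for x in (-1.0, 0.0, 0.1, 0.5, 0.9, 1.0, 2.0):
    print(x, bump(x))
```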

John Baez (May 04 2024 at 07:00):

By the way, I know I'm digressing from the main theme of this discussion, which is sheaves. But it's hard to resist, because I've spent a lot of time teaching analysis, and the difference between the sheaf of smooth functions and the sheaf of analytic functions is pretty interesting, not only as an example of how different sheaves work differently, but because mathematicians and physicists spend a lot of time working with smooth and analytic functions.

Morgan Rogers (he/him) (May 04 2024 at 10:25):

You could take any sequence tending to

0

and ask about the limit of a sequence derived from the values of the function at those points. For instance, you could ask about the limit of

f(1/n)\cdot n!

. The hard part is guaranteeing that such a functional will converge and isn't simply a function of the derivatives at

0

. Or you could ask about the relative measure of points at which the function is 0 on a sequence of intervals tending to 0. That is, take the limit as

a \to 0

of

\mu(\{-a < x < a \mid f(x) = 0\})/2a

. This is bounded, at least, but there's again no guarantee of convergence (at least a priori; maybe there's some slick analytic argument proving that this converges)

John Baez (May 04 2024 at 13:23):

This raises a good issue, namely, how fast can a smooth function f grow for small x if all its derivatives vanish at x=0. Your proposed quantity will be finite if for all such f there exists C with

This seems unlikely since all I know is that for all such f and all natural numbers k there exists C with

for all large enough n. This follows from the first k derivatives of f vanishing at x=0.

John Baez (May 04 2024 at 13:30):

Unfortunately there is no slowest growing function that grows faster than all polynomials! So we can probably show no candidate like what you suggested can work: it'll either be zero for all smooth f whose derivatives all vanish at 0, or infinite for some such f.
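To make the dichotomy concrete, here is a small sketch for the candidate f(1/n)·n! applied to two functions whose derivatives all vanish at 0 (both extended by 0 for x ≤ 0). The second function, exp(-1/sqrt(x)), is not from the thread; it is my own choice of a flat function that decays more slowly, and the helper names are hypothetical. Everything is computed in logarithms to avoid overflow and underflow.

```python
import math

def log_candidate(log_f_at_inverse_n, n: int) -> float:
    """log(f(1/n) * n!), using lgamma(n + 1) = log(n!)."""
    return log_f_at_inverse_n(n) + math.lgamma(n + 1)

def log_fast_flat(n: int) -> float:
    """log f(1/n) for f(x) = exp(-1/x^2), x > 0, which equals -n^2."""
    return -float(n * n)

def log_slow_flat(n: int) -> float:
    """log f(1/n) for f(x) = exp(-1/sqrt(x)), x > 0, which equals -sqrt(n)."""
    return -math.sqrt(n)

ns = [10, 50, 100, 500]
fast = [log_candidate(log_fast_flat, n) for n in ns]
slow = [log_candidate(log_slow_flat, n) for n in ns]

print(fast)  # heads to -infinity, so f(1/n) * n! tends to 0
print(slow)  # heads to +infinity, so f(1/n) * n! blows up
```

For the first function the candidate tends to 0, and for the second it is infinite, which matches the two outcomes described above.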

David Egolf (May 04 2024 at 15:55):

I'm not following in detail, but I just wanted to highlight a strategy that I noticed Peva Blanchard and John Baez use above. (Which I thought was really cool!) We're interested in information besides the derivatives of a function that can help us determine which germ at a point a smooth function belongs to. The strategy - to my understanding - goes like this:

Morgan Rogers (he/him) (May 04 2024 at 15:56):

Hmm so we need something that converges for all such

f

(so that

\ell

is well-defined) but isn't forced to be

0

. Well that's fun to think about. I'll let you get back to sheaves now :)

David Egolf (May 04 2024 at 16:15):

Now, we saw earlier that

f(x)

(which is

0

for

x \leq 0

and

e^{-1/x^2}

for

x > 0

) and the zero function

0

are not in the same germ. That means that

[f]

and

[0]

are different elements of

V

. But, they have the same derivatives, so that

\phi([f]) = \phi([0])

(

\phi

is defined in that question to be the function that takes the germ of a smooth function to the derivatives of that function). This means that

[f]-[0]

is in the kernel of

\phi

I'd like to define a non-zero linear real-valued map

\ell:\ker(\phi) \to \mathbb{R}

from the kernel of

\phi

. To define such a map, I think it suffices to specify the value of the map on each element of a set of basis vectors for

\ker(\phi)

. I am hoping that we could find just two linearly independent vectors in

\ker(\phi)

and say what

\ell

does to those, and then just let

\ell

send all vectors that aren't a linear combination of those two to zero.

I think we already have one nonzero vector

[f]-[0]

in

\ker(\phi)

. If we could just find another one, that is linearly independent from this one, maybe we could construct an

\ell

from that? So, I'm wondering if we can think of more examples of pairs of smooth real-valued functions (defined on some open set containing

0

) that have the same derivatives at

0

, but don't belong to the same germ at zero.

(I wonder if

f^2

also has all derivatives equal to zero at zero, and if it belongs to a different germ from

f

...)

Morgan Rogers (he/him) (May 04 2024 at 18:26):

You can multiply

f

by any function which is bounded at

0

to get another potentially linearly independent function, I think.

David Egolf (May 04 2024 at 19:12):

I suppose that defining some

\ell

in the way I sketched above wouldn't really help us that much. That's because such an

\ell

would assign a real number to each germ at zero, but it wouldn't directly provide this "measurement" for smooth functions. So, although such an

\ell

I think could tell certain germs apart (which can't be distinguished using derivatives), it seems like we'd need something more to determine which smooth functions with the same derivative values at a point don't belong to the same germ at that point.

John Baez (May 04 2024 at 21:00):

How does this work? Think about a simpler case: trying to define a linear function

\ell \colon \mathbb{R}^3 \to \mathbb{R}

that maps

(1,0,0)

to

1

,

(0,1,0)

to

2

, and all vectors that aren't a linear combination of those two to zero. What should

\ell(x,y,z)

be?

David Egolf (May 04 2024 at 22:49):

Hmm, well we want

\ell

to be

\mathbb{R}

-linear. So,

\ell(x,y,z) = \ell(x,0,0) + \ell(0,y,0) + \ell(0,0,z) = x \ell(1,0,0) + y \ell(0,1,0) + z \ell (0,0,1) = x + 2y + z \ell(0,0,1)

. Since

(0,0,1)

isn't a linear combination of

(1,0,0)

and

(0,1,0)

, we set

\ell(0,0,1)=0

. So we find

\ell(x,y,z) = x + 2y

John Baez (May 05 2024 at 06:36):

Okay, that's a linear function, but it's not doing what you said. You said all vectors that aren't a linear combination of the first two should be sent to zero. But

(1,1,1)

is not a linear combination of

(1,0,0)

and

(0,1,0)

, and

\ell(1,1,1)

is not zero.

John Baez (May 05 2024 at 06:37):

What you in fact did is choose one vector that's not a linear combination of the first two, and decree that

\ell

of it is zero. You chose the vector

(0,0,1)

. If you'd chosen the vector

(1,1,1)

, for example, and decreed that

\ell

of that is zero, you'd get a different linear map

\ell
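Here is a small sketch of how the choice of the third basis vector changes the resulting map. The helper names `functional_from_basis` and `apply` are made up for illustration; the coefficients are found by Cramer's rule with exact fractions.

```python
from fractions import Fraction

def det3(m):
    """Determinant of a 3x3 matrix given as three rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def functional_from_basis(basis, values):
    """Coefficients c of the unique linear map l(v) = c . v with l(basis[k]) = values[k]."""
    d = det3(basis)
    if d == 0:
        raise ValueError("the three vectors do not form a basis")
    coeffs = []
    for k in range(3):
        # Cramer's rule: replace column k of the basis matrix by the prescribed values.
        m = [list(row) for row in basis]
        for r in range(3):
            m[r][k] = values[r]
        coeffs.append(Fraction(det3(m), d))
    return tuple(coeffs)

def apply(coeffs, v):
    """Evaluate the linear map with the given coefficients on the vector v."""
    return sum(c * x for c, x in zip(coeffs, v))

# Same values on (1,0,0) and (0,1,0), but a different third vector sent to zero.
ell_a = functional_from_basis([(1, 0, 0), (0, 1, 0), (0, 0, 1)], (1, 2, 0))
ell_b = functional_from_basis([(1, 0, 0), (0, 1, 0), (1, 1, 1)], (1, 2, 0))

print(ell_a, apply(ell_a, (1, 1, 1)))
print(ell_b, apply(ell_b, (1, 1, 1)))
```

Decreeing that (0,0,1) goes to zero gives x + 2y, while decreeing that (1,1,1) goes to zero gives x + 2y - 3z. Both maps agree on the first two vectors.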

John Baez (May 05 2024 at 06:42):

Returning to the actual problem, it's this arbitrary choice that makes defining a nonzero linear function from

\mathrm{ker}(\phi)

to

\mathbb{R}

so difficult! And

\mathrm{ker}(\phi)

is not just 3-dimensional, it's infinite-dimensional, so the choice requires a lot more thought - and it seems nobody knows how to do it, except by resorting to the axiom of choice.

John Baez (May 05 2024 at 06:45):

You have a vector space

V

and you're trying to define a nonzero linear map

\ell : V \to \mathbb{R}

. You know a couple of vectors

v_1, v_2 \in V

(you might know more) and you say you want

You can do this if

v_1

and

v_2

are linearly independent. By a general theorem, which relies on the axiom of choice, we know such an

\ell

exists. If

\mathrm{dim}(V) \gt 2

many such

\ell

exist. But getting your hands on one is an entirely different matter!

John Baez (May 05 2024 at 06:47):

You can get your hands on one if you can find a linear subspace

W \subset V

such that

2) Every vector in

V

is a linear combination of

v_1, v_2

and some vector in

W

John Baez (May 05 2024 at 06:50):

for all

w \in W

. At this point we've gotten our hands on

\ell

. But

\ell

depends on our choice of

W

How do we know there always exists

W

obeying conditions 1) and 2)? There's a theorem saying that there exists a basis of

V

that starts with

v_1

and

v_2

and continues with some other vectors

w_i

. Then we can define

W

to be the space of all linear combinations of these other vectors

w_i

However, to prove this theorem you need a version of the axiom of choice: in general there's no 'procedure' to choose these vectors

w_i

. You chose the vector

(0,0,1)

because you only needed one and it was staring you in the face. But in our actual example the dimension of

\mathrm{ker}(\phi)

is uncountably infinite and - to the best of my knowledge - nobody knows a basis for it. That's why my MathOverflow problem seems to be hard.

John Baez (May 05 2024 at 07:00):

There might still be some other way to define a nonzero linear map

\ell : \mathrm{ker}(\phi) \to \mathbb{R}

John Baez (May 05 2024 at 07:26):

This discussion may seem like a huge digression from sheaves, and in a way it is. But the issue of how relying on the axiom of choice makes it difficult to get your hands on things you want is a big deal in analysis, so it's nice that we've bumped into an example. And a good way to do math without the axiom of choice is topos theory, which is what we're supposed to be learning here!

David Egolf (May 05 2024 at 15:39):

David Egolf (May 05 2024 at 15:55):

But, potentially, although there's no procedure in general for completing a basis for an arbitrary infinite dimensional vector space, there could maybe be such a procedure in this particular case (?). It just might be hard to find (if it exists) I guess!

David Egolf (May 05 2024 at 16:06):

On a related note, it's a weird feeling to know that many examples of a certain kind of thing exist, but at the same time we may not be able to name any examples :astonished:!

John Baez (May 05 2024 at 17:09):

This is actually quite common in mathematics, and there are even many situations where we know that the probability of a number having some property is 1 yet we don't know if most familiar numbers have that property (though surely they must).

John Baez (May 05 2024 at 17:14):

It's sort of like how knowing there are lots of ants doesn't mean you know any of their names: they are numerous yet anonymous.

John Baez (May 05 2024 at 21:07):

There are other ways to define linear maps than saying what they do on each member of a basis, and often they are easier to work with, e.g. taking the derivative at x is a linear map from germs of smooth functions at x to real numbers, and we don't need to pick a basis to define it! But I don't see how to use an approach like that for this problem, either. It may be lack of cleverness, or it may be a deep issue.

Peva Blanchard (May 05 2024 at 21:45):

There is another example that I find fascinating. The set of computable real numbers is countable. This implies that almost all real numbers are not computable: there is no (finitely described) algorithm to enumerate their digits. To put it in more colloquial terms: there is no "reasonable" way to poke inside them.

It does not mean we cannot define one though. For instance, Chaitin's constant is the probability that a random Turing machine will halt. This constant is well defined, and we know it is not computable.

John Baez (May 06 2024 at 07:23):

Another related example: we say a real number is normal in base 10 if in its decimal expansion every string of n digits appears with frequency

1/10^n

, which is what you'd expect of a 'random' number. More generally we can talk about normal numbers in any base. A number that's normal in every base is called uniformly normal.

The set of numbers that are not normal in base

b

has measure zero, and the countable union of sets of measure zero again has measure zero, so the set of numbers that are not uniformly normal has measure zero.
In simple rough terms: the probability that a number is uniformly normal is 1.

For this reason, and because people have actually done computer calculations to check, everyone believes

\pi, e, \sqrt{2}

, and other famous irrational numbers are uniformly normal. But nobody has been able to show this for any interesting examples.
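As a sketch of the kind of computer check mentioned here, the following counts single-digit frequencies in the first 2000 decimal digits of the square root of 2, obtained exactly with integer square roots. The function names and the choice of 2000 digits are assumptions for illustration, and a count like this is only empirical evidence about finitely many digits, not a step toward a proof.

```python
from collections import Counter
from math import isqrt

def sqrt2_digits(n_decimals: int) -> str:
    """Digits of floor(sqrt(2) * 10**n_decimals): a leading '1' followed by n_decimals decimals."""
    return str(isqrt(2 * 10 ** (2 * n_decimals)))

def digit_frequencies(digits: str) -> dict:
    """Relative frequency of each decimal digit in the given string."""
    counts = Counter(digits)
    return {d: counts[d] / len(digits) for d in "0123456789"}

digits = sqrt2_digits(2000)[1:]  # drop the leading 1, keep the decimal part
freqs = digit_frequencies(digits)

for d, f in freqs.items():
    print(d, round(f, 4))  # normality in base 10 would need each to approach 0.1
```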

John Baez (May 06 2024 at 07:24):

There is some slight hope that people can show

\pi

is normal in base 16 (and thus base 2), because there's a cool formula that makes it easy to compute individual base 16 digits of

\pi

without computing all the previous digits. But people haven't succeeded yet.

John Baez (May 06 2024 at 07:25):

While we're digressing, I got this interesting email about my 'germs of smooth functions' question:

John Baez (May 06 2024 at 07:28):

In case this is too hard to understand, one thing he's saying is that we can define 'bounded' sets in the vector space I called

\mathrm{ker}(\phi)

, so we can talk about linear functions

\ell: \mathrm{ker}(\phi) \to \mathbb{R}

that map bounded sets to bounded sets... but the only such

\ell

is zero.

I guess this means it'll be hard to find an explicit such

\ell

... though "hard to find" is a touchy-feely concept.

Kevin Carlson (May 06 2024 at 17:31):

Another closely related thing he said that might be unclear is that the dual of the whole space of germs is the space of “distributions generated by the Dirac delta and its derivatives”, which means that we weren’t missing any nice linear functions on the space of germs of smooth functions—they’re really only the differentiation operations.

Kevin Carlson (May 06 2024 at 17:31):

David Egolf (May 07 2024 at 16:29):

I feel like it's a good time to work a bit on the next puzzle, again. Recall that this involves thinking about the topology on

\Lambda(F) = \coprod_{x \in X}\Lambda(F)_x

, where

\Lambda(F)_x

is the set of germs at

x

for our sheaf

F

, which sends an open set

U

of

X

to the set of continuous real-valued functions

:U \to \mathbb{R}

There is a function

p:\Lambda(F) \to X

that sends each germ to the point it is associated with. We would like this function to be continuous. To ensure that, we need

p^{-1}(U) \subseteq \Lambda(F)

to be open in

\Lambda(F)

, for each open set

U \subseteq X

However, we're not done yet, as we want certain functions to

\Lambda(F)

to be continuous as well. Given some

s \in F(U)

(a continuous function from

U

to

\mathbb{R}

), we define the function

g(s): U \to\Lambda(F)

that acts by

g(s): x \mapsto [s]_x

, where

[s]_x

is the germ at

x

that

s

belongs to. We have that

p \circ g(s)(x) = p([s]_x) = x

, so that if

g(s):U \to \Lambda(F)

was continuous, it would be a section of our bundle

p:\Lambda(F) \to X

David Egolf (May 07 2024 at 16:41):

How can we ensure that

g(s): U \to \Lambda(F)

is continuous for all

s \in F(U)

and all open

U

? We need

g(s)^{-1}(V) \subseteq U

to be open for every open set

V \subseteq \Lambda(F)

. Earlier, we declared that certain subsets of

\Lambda(F)

need to be open: namely the preimage of the open sets in

X

under our projection mapping

p:\Lambda(F) \to X

. Given some open subset

U' \subseteq X

, that means that

p^{-1}(U') = \coprod_{x \in U'}\Lambda(F)_x

needs to be open.

So, let us consider an open set in

\Lambda(F)

of this form. That is, we let

V = \coprod_{x \in U'}\Lambda(F)_x

for some open subset

U'

of

X

. What is

g(s)^{-1}(V)

? These are the points in

U

that map to

V

under

g(s)

. Since

V

consists exactly of germs associated to points in

U'

, the part of

U

that maps to

V

under

g(s)

is

U \cap U'

. Therefore,

g(s)^{-1}(V) = g(s)^{-1}(p^{-1}(U')) = U \cap U'

. Since

U

and

U'

are both open in

X

,

U \cap U'

is open too. We conclude that declaring enough subsets of

\Lambda(F)

to be open so that

p:\Lambda(F) \to X

becomes continuous is compatible with the "germ assignment" functions

g(s):U \to \Lambda(F)

being continuous.

David Egolf (May 07 2024 at 16:48):

Now, we actually still aren't done, I believe. I think we want to declare as many subsets of

\Lambda(F)

to be open as possible, while preserving the continuity of

p:\Lambda(F) \to X

and

g(s):U \to \Lambda(F)

(for every

s \in F(U)

and for every

U

). (Although I'm not sure why we'd want to do this.)

Let's consider some particular

g(s): U \to \Lambda(F)

, which sends

x

to

[s]_x

. Without knowing anything extra about

U

, we know that in the subspace topology on

U

these two subsets are open: (1) the empty subset and (2) the subset that is all of

U

. Making use of the fact that

U

is open, if we declare

g(s)(U)

to be open in

\Lambda(F)

, then

g(s)^{-1}(g(s)(U)) = U

is open, and so the continuity of

g(s)

is not disrupted.

David Egolf (May 07 2024 at 16:54):

I'll stop here for now, but it still remains to show that declaring

g(s)(U)

to be open for each

s \in F(U)

(as

U

varies) preserves the continuity of every "germ assigning function"

g(s)

(I hope to finish up thinking about the topology on

\Lambda(F)

soon... I recognize it's probably not the easiest thing to have a conversation around!)
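The two checks discussed above can be tried out by brute force on a toy model. This is only a sketch under loud assumptions: the space is a three-point space with a chain of open sets, and the sheaf is all {0,1}-valued functions rather than continuous real-valued ones, because on a finite space a germ at x is just the restriction of a section to the smallest open set containing x. All names (`OPENS`, `sections`, `germ`, and so on) are hypothetical.

```python
from itertools import product

# Toy space (an assumption): points 0, 1, 2 with a chain of open sets.
X = frozenset({0, 1, 2})
OPENS = [frozenset(), frozenset({0}), frozenset({0, 1}), X]

def minimal_open(x):
    """Smallest open set containing x (it exists because the space is finite)."""
    return min((U for U in OPENS if x in U), key=len)

def sections(U):
    """F(U): all functions U -> {0, 1}, stored as dicts."""
    pts = sorted(U)
    return [dict(zip(pts, vals)) for vals in product((0, 1), repeat=len(pts))]

def germ(s, x):
    """[s]_x, encoded as x together with s restricted to the minimal open set at x."""
    return (x, frozenset((y, s[y]) for y in minimal_open(x)))

def p(g):
    """Projection sending a germ to its base point."""
    return g[0]

def image(s, U):
    """g(s)(U), a candidate basic open set of Lambda(F)."""
    return {germ(s, x) for x in U}

def preimage(s, U, V):
    """g(s)^{-1}(V), as a subset of U."""
    return frozenset(x for x in U if germ(s, x) in V)

ALL_GERMS = {germ(s, x) for U in OPENS for s in sections(U) for x in U}

def check_p_preimages():
    """g(s)^{-1}(p^{-1}(U2)) equals U & U2 for all choices of U, s, U2."""
    for U in OPENS:
        for s in sections(U):
            for U2 in OPENS:
                V = {g for g in ALL_GERMS if p(g) in U2}
                if preimage(s, U, V) != U & U2:
                    return False
    return True

def check_basic_opens():
    """g(s)^{-1}(g(t)(W)) is open for all choices of U, s, W, t."""
    for U in OPENS:
        for s in sections(U):
            for W in OPENS:
                for t in sections(W):
                    if preimage(s, U, image(t, W)) not in OPENS:
                        return False
    return True

print(len(ALL_GERMS), check_p_preimages(), check_basic_opens())
```

In this toy model both checks pass, which is at least consistent with the hope that declaring every g(s)(U) open keeps each germ assigning function continuous.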

John Baez (May 07 2024 at 18:14):

John Baez (May 07 2024 at 19:27):

1) In topology I tend to think visually, so I find it hard to start solving this puzzle until I draw a picture of

\Lambda F

and the open neighborhoods described here. I'd probably try to take the sheaf of smooth real-valued functions on

\mathbb{R}

, and try to draw one of these open sets

U

. The picture might not be accurate, but it would somehow help me think about whether

p

is continuous.

John Baez (May 07 2024 at 19:37):

2) Here's an example of how it helps: in the process of thinking about this picture, I'm instantly led to remember that continuity can be studied locally. A function

f

is continuous iff it is continuous at each point

a

in its domain, and this in turn is true iff

f^{-1}

of every open set

U

contained in some neighborhood

V

of

f(a)

is open. We discussed the first fact earlier here somewhere, but I forget if we discussed the second fact. It comes to mind now because we're trying to show

p: \Lambda(F) \to X

is continuous and our picture of the open sets of

\Lambda(F)

is a local one.

David Egolf (May 08 2024 at 15:42):

Thanks for the suggestion! I felt like I was making progress with what I typed out above, but it wasn't feeling very intuitive. Drawing a picture sounds like it may help with gaining some intuition. So, I think I'll shift over to working on this, next. (I may go back to thinking about the continuity of the "germ assigning" functions

g(s)

later).

David Egolf (May 08 2024 at 15:47):

Alright, let me try to draw a picture of

\Lambda F

and its open neighborhoods. The elements of

\Lambda F

are germs of

F

at various points. And our open neighborhoods in the topology described in the puzzle are unions of the sets

g(s)(U)

as

s

varies over

FU

and as

U

varies over the open sets of

X

So, let's pick some particular

s \in FU

to be some

s:U \to \mathbb{R}

. If I let

X = \mathbb{R}

, then I can draw a picture of this

s

. That seems like a place to start.

David Egolf (May 08 2024 at 15:50):

So then, here's a picture of some

s:U \to \mathbb{R}

. The open set

U

is indicated in red.
picture of s

David Egolf (May 08 2024 at 15:57):

Now, let's consider

g(s)(U)

. For each

x \in U

,

g(s)(x) = [s]_x

, the germ of

s

at

x

. So,

g(s)(U)

is the set of germs of

s

. Each germ at

x

I think is roughly like a "local shape" that functions can have at

x

. In general, a germ of a continuous function contains more information than just its derivatives at that point. But to get a picture, I'll pretend that the germ of

s

at

x

is determined just by the slope of

s

at

x

. (I'm assuming that this particular

s

is differentiable, too).

I'll organize my drawing of

g(s)(U)

, which is to be an open set of

\Lambda F

, by thinking of

\Lambda F

as having a collection of "local shapes" (germs) for each point which "hover over" each point

x \in X

David Egolf (May 08 2024 at 16:05):

Here's my attempted visualization of

g(s)(U)

, which picks out the "local shape" (germ) of

s

at each point in

U

:
visualizing germs of s

This whole 2D region is part of

\Lambda(F)

. So, for each point

x \in X

we have a collection of shapes hovering above (and below) the

x

-axis corresponding to different germs at that point. The blue point at some

x

is

g(s)(x)

, which in this simplified drawing is supposed to (partially) describe the local shape of

s

at

x

using its first derivative there. Notice that the first derivative really doesn't provide enough information to reconstruct our function about some point (in particular it forgets "vertical shifting"), but this is at least some of the information that describes our

s

about each point.

This is not what I expected an open set of

\Lambda(F)

to look like! My picture might be too inaccurate and approximate for it to give good intuition, but maybe not! It seems interesting.

David Egolf (May 08 2024 at 16:11):

Now, let's consider our

p: \Lambda(F) \to X

in this example (which is the function we wish to show is continuous). We need

p^{-1}(U)

to be open in

\Lambda(F)

for each open subset

U

of

X

. Let's take

U

to be the open subset of

X = \mathbb{R}

indicated in red above. Then a point in

\Lambda(F)

maps to some

x \in U

if it is a germ associated to

x \in U

. In our picture, this will correspond to the points hovering above (and below)

U

David Egolf (May 08 2024 at 16:15):

Here's a picture of

p^{-1}(U)

, which is a subset of

\Lambda(F)

:
preimage of an open set

U

is indicated by the red line segments, and

p^{-1}(U)

is indicated by the shaded light red regions. Intuitively, this preimage is the disjoint union of all the "local behaviours/shapes" possible for continuous real-valued functions (as provided by

F

) at each point.

If

p: \Lambda(F) \to X

is to be continuous, this

p^{-1}(U)

needs to be open. With our proposed topology, that means it needs to be the union of the image of some "germ assigning" functions (one of which we visualized in blue in a drawing above). I guess that means that for any germ

\lambda

at some

x \in U

, there needs to be at least one function which belongs to that germ, so that its behaviour about

x

is described by

\lambda

. If that's right, the continuity of

p: \Lambda(F) \to X

might correspond roughly to the idea that "every possible local behaviour at

x

occurs for at least one element

s

of a sheaf set

F(U)

, for some

U

containing

x

".

David Egolf (May 08 2024 at 16:21):

I don't think we directly discussed the second fact, although I may just be forgetting. Next time, I'll plan to start by proving that fact! Then I'll try to connect that fact to the pictures I've drawn above.

Although, thinking it over a bit, I think I might have an idea of how to solve the puzzle already... I guess I'll see what I feel like trying out tomorrow!

John Baez (May 08 2024 at 19:38):

It seems like a pretty good picture to me. It's really important to realize that for many familiar sheaves

F

on

\mathbb{R}

, like the sheaf of smooth functions, the corresponding space of germs

\Lambda(F)

is not very easy to draw or visualize. And I think the best way to realize this is to try to draw it. You've drawn a kind of 'approximation' to it - and by thinking about the information your drawing leaves out, you're starting to get a sense for how peculiar this space is!

One thing that's strange about this space is that

\Lambda(F)

is not 'Hausdorff'. This means you can find two different germs

g, g' \in \Lambda(F)

that can't be separated by open sets: i.e., you can't find disjoint open sets

U, U' \subseteq \Lambda(F)

with

g \in U, g' \in U'

. That germ at zero of the weird function @Peva Blanchard described cannot be separated by open sets from the germ at zero of the constant function 0. That's because these functions are equal at all points slightly left of zero. (For a proof of the similar fact about continuous functions see this).
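The claim about these two germs can be spelled out as a short worked argument. This is a sketch that writes f for the function equal to 0 for x ≤ 0 and e^{-1/x^2} for x > 0, uses the g(s)(U) notation from earlier in the thread, and works in the sheaf F of smooth functions on the real line. The symbols a, b, s, t are introduced here for illustration.

```latex
% f(x) = 0 for x <= 0 and e^{-1/x^2} for x > 0; 0 denotes the zero function.
\begin{align*}
&\text{Any open set containing } [f]_0 \text{ contains some } g(s)(U)
  \text{ with } s \in F(U),\ [s]_0 = [f]_0,
  \text{ so } s = f \text{ on some } (-a, a) \subseteq U. \\
&\text{Any open set containing } [0]_0 \text{ contains some } g(t)(V)
  \text{ with } t \in F(V),\ [t]_0 = [0]_0,
  \text{ so } t = 0 \text{ on some } (-b, b) \subseteq V. \\
&\text{For } -\min(a, b) < x < 0: \quad s = f = 0 = t \text{ near } x
  \ \Longrightarrow\ [s]_x = [t]_x \in g(s)(U) \cap g(t)(V).
\end{align*}
```

So every pair of open sets around the two germs shares a germ at some point slightly left of zero, and no disjoint pair exists.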

Well, I'm probably getting ahead of myself here, so I should stop. But my main point is, you're doing a fine job of attempting to draw a space that's impossible to draw in a fully accurate way... and I've found such attempts very useful!

Peva Blanchard (May 08 2024 at 21:57):

This is really nice. Thanks to your detailed exposition David, I corrected a very wrong picture I had.

Indeed, I thought that the topology on

\Lambda(F)

was the coarsest topology making the projection

p : \Lambda(F) \rightarrow X

continuous. This means that any open set in

\Lambda(F)

is a union of sets of the form

p^{-1}(U)

for open subsets

U

of

X

.

But this topology is not enough (too coarse). Because we also want to think about a section

s \in F(U)

of

p

as a continuous function

g(s) : U \rightarrow \Lambda(F)

Peva Blanchard (May 08 2024 at 21:57):

Now, maybe I can share the mental picture I have now about the required topology on

\Lambda(F)

. (Hopefully, it is correct). I find it easier to deal with neighborhoods instead of open sets. Given a germ

a \in \Lambda(F)_x

at

x

, what does it mean for another germ

b \in \Lambda(F)_y

at a different base point

y

to be "in the neighborhood of

a

"? We can answer that question by providing a witness of the fact that they are close to each other. Such a witness is a pair

(U, s)

where

U

is an open set containing both

x

and

y

, and

s \in F(U)

is a section such that

I picture this witness as providing a connecting path between

a

and

b

. With this picture, we see that for every

z \in U

, the germ

[s]_z

of

s

at

z

is in the neighborhood of

a

. In other words, a pair

(U, s)

with

s \in F(U)

and

U \ni x = p(a)

encodes a specific neighborhood of

a

Peva Blanchard (May 08 2024 at 22:29):

To continue with this picture, we can interpret the separation of two points

a

and

b

in

\Lambda(F)

(the "Hausdorff" property as explained by @John Baez )

These points are separated if we can find two disjoint neighborhoods of

a

and

b

respectively. Informally, this means that we have a neighborhood

(U, s)

of

a

, and a neighborhood

(V, t)

of

b

, and such that

s

and

t

never "agree" over

U \cap V

For instance, let's consider the germ

a

of the funny function

e^{-\frac{1}{x^2}}

at

0

, and the germ

b

of the constant zero function at

0

. In that case,

Peva Blanchard (May 08 2024 at 22:32):

By the way, does it mean that the topology of

\Lambda(F)

is Hausdorff when

F

is the sheaf of analytic functions? (I'll think about it, no need to answer right away)

David Egolf (May 09 2024 at 17:05):

Thanks to both of you for your interesting comments! Thinking about whether

\Lambda(F)

is Hausdorff is interesting. (Side note: somehow the open sets on

\Lambda(F)

remind me of the closed sets in the Zariski topology...which I seem to recall is not (usually?) Hausdorff either.)

I'm going to take a break from this thread today, to rest up, but I hope to get back to it tomorrow!

Peva Blanchard (May 10 2024 at 09:09):

I think the answer is yes. I wasn't sure if it would digress too much, so I opened another topic.

Peva Blanchard (May 10 2024 at 09:48):

By the way, something clicked for me about "evaluating a function at some point".

Because of my

Set

-based math education, I am used to thinking about a function

f : X \rightarrow Y

as being a graph, i.e., the set of pairs

(x, y) \in X \times Y

with

y = f(x)

. In that case, the evaluation of

f

at

x

is just picking out the second component of this pair, namely, the value

f(x)

But, with our previous discussion, it turns out this evaluation procedure is actually very narrow. When we deal with a continuous map

f

, another evaluation procedure is given by "taking the germ

[f]_x

of

f

at

x

". The mental picture I have in mind, is a sequence of open neighborhoods

U_n

of

x

that converges towards

x

, and over which we take the restrictions of

f

. This is like distilling to get the most concentrated information about

f

at

x

.

This reminds me of the way we define a distribution as an element of the dual of a space

\mathcal{S}

of test functions. Formally, a distribution

T

is a linear map

\mathcal{S} \rightarrow \mathbb{R}

. I picture a test function

\phi

as some kind of smooth bump around a point somewhere, so that the value

T(\phi)

sums up the "behavior of

T

around that point". In a way, test functions play the same role as the open subsets of

X

in the previous paragraph, and the map

\phi \mapsto T(\phi)

is analogous to the restriction map

U \mapsto f_{|U}

. We can evaluate a distribution at a point

x

by considering a sequence

\phi_n

of test functions that "converge towards

x

", and taking the limit of the

T(\phi_n)

's.
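As a hedged illustration of this "evaluation by test functions", here is the simplest case, where the distribution comes from a continuous function. The names T_f and the specific shrinking-support conditions on the test functions are assumptions introduced for this sketch; for a general distribution the limit need not exist.

```latex
% Assumptions (not from the thread): f is continuous near x, and T_f is the
% distribution induced by f. The test functions \phi_n are nonnegative bumps
% of total mass 1 whose supports shrink to the point x.
\[
T_f(\phi) = \int_{\mathbb{R}} f(y)\,\phi(y)\,dy,
\qquad
\phi_n \ge 0,\quad
\int_{\mathbb{R}} \phi_n(y)\,dy = 1,\quad
\operatorname{supp}\phi_n \subseteq \left[x - \tfrac{1}{n},\, x + \tfrac{1}{n}\right].
\]
% Since \int \phi_n = 1, we can write f(x) = \int f(x)\,\phi_n(y)\,dy, so
\[
\left| T_f(\phi_n) - f(x) \right|
= \left| \int_{\mathbb{R}} \bigl(f(y) - f(x)\bigr)\,\phi_n(y)\,dy \right|
\le \sup_{|y - x| \le 1/n} \left| f(y) - f(x) \right|
\xrightarrow[n \to \infty]{} 0 .
\]
```

So in this special case the limit of the values on shrinking test functions recovers the ordinary value f(x).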

David Egolf (May 10 2024 at 16:29):

Wow, there is a lot of interesting stuff to catch up on here :sweat_smile:! Today, I'll try to understand what you both are saying regarding the fact that

\Lambda(F)

is not Hausdorff, for

F

our sheaf of continuous real-valued functions on open subsets of

\mathbb{R}

To show that

\Lambda(F)

is not Hausdorff, we need to find two points (germs)

f,f' \in \Lambda(F)

so that there are no open sets

U

and

U'

with

f \in U

,

f' \in U'

and

U \cap U' = \emptyset

. In other words, for any open set

U

containing

f

and any open set

U'

containing

f'

, the intersection

U \cap U'

is always non-empty.

John Baez (May 10 2024 at 16:33):

Right! And you can find an example of this! It's a lot easier to find one for the sheaf of continuous functions than with smooth functions, where a sneaky example Peva described comes to our aid.

David Egolf (May 10 2024 at 17:00):

This is feeling tricky for me today. But, referencing this page, I think we want to consider this situation:

We have this situation with

U = \mathbb{R}

,

x=0

,

s

the zero function, and

s'

the function that is zero for

x \leq 0

and

e^{-1/x^2}

for

x > 0

. The sections

s

and

s'

have different germs at zero, but any open set

V

that contains

0

also contains some negative numbers with some "breathing space" around them. So, we can pick some negative number

x'

in

V

: then

s

and

s'

must have the same germ at

x'

. (That is because they both restrict to the zero function for sufficiently small open intervals about a negative number

x'

.)
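To make this example concrete, here is a minimal worked version of the two sections and their germs. It takes the second section to be zero for all nonpositive inputs, so that it is defined and continuous at 0; the interval radius \delta is a name introduced here only for illustration.

```latex
% The two sections over U = \mathbb{R}:
\[
s(x) = 0,
\qquad
s'(x) =
\begin{cases}
0 & x \le 0, \\
e^{-1/x^2} & x > 0 .
\end{cases}
\]
% At a negative point x', take \delta = |x'|/2 (an illustrative choice).
% The interval (x' - \delta, x' + \delta) contains only negative numbers, so
\[
s|_{(x' - \delta,\, x' + \delta)} = s'|_{(x' - \delta,\, x' + \delta)} = 0
\quad\Longrightarrow\quad
[s]_{x'} = [s']_{x'} \text{ for every } x' < 0 .
\]
% At 0, every interval (-\epsilon, \epsilon) contains the point \epsilon/2, where
% s'(\epsilon/2) = e^{-4/\epsilon^2} > 0 = s(\epsilon/2), so s and s' agree on
% no open set containing 0:
\[
[s]_0 \neq [s']_0 .
\]
```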

David Egolf (May 10 2024 at 17:09):

So, we can form two sequences of open sets in

\Lambda(F)

, by taking

g(s)(V_i)

and

g(s')(V_i)

as

V_i

becomes a smaller and smaller open neighborhood containing

0

. We can form a sequence

x'_i

such that

x'_i

is some negative number in

V_i

. Applying

g(s)

and

g(s')

to this sequence gives us two sequences

[s]_{x'_i}

and

[s']_{x'_i}

. But these two sequences are actually equal, because

s

and

s'

have the same germs at any negative point

x'_i

Now, intuitively, the sequence

[s]_{x'_i}

should converge to

[s]_x

and the sequence

[s']_{x'_i}

should converge to

[s']_x

. We just noted that both of these sequences are equal... but our proposed limits of them are different (as

[s]_x \neq [s']_x

)! So it seems like we might have a situation where limits aren't unique, which I think would relate to

\Lambda(F)

being non-Hausdorff, referencing this page.

There's probably a simpler way to do this, explained here. I'll look at that next.

David Egolf (May 10 2024 at 17:37):

We will aim to show that

[s]_x

and

[s']_x

are points of

\Lambda(F)

that can't be separated by open sets. That is, there are no open subsets

\lambda_s

and

\lambda_{s'}

\Lambda(F)

with

[s]_x \in \lambda_s

and

[s']_x \in \lambda_{s'}

with

\lambda_s \cap \lambda_{s'}= \emptyset

To obtain a contradiction, let us assume that

\Lambda(F)

is Hausdorff, so that there are such disjoint open sets

\lambda_s

and

\lambda_{s'}

. By definition of the topology on

\Lambda(F)

,

\lambda_s = \cup_i g(s_i)(U_i)

for some

s_i

and

U_i

and similarly

\lambda_{s'} = \cup_j g(s'_j)(U_j')

for some

s'_j

and

U'_j

Since

[s]_x \in \lambda_s

, we have

[s]_x \in g(s_i)(U_i)

for some particular

i

, where

x \in U_i

. Since germs can only be equal if they are associated to the same point, this implies that

[s]_x = [s_i]_x

. Similarly,

[s']_x \in g(s'_j)(U'_j)

for some particular

j

, where

x \in U'_j

, so that

[s']_x = [s'_j]_x

Since

\lambda_s

and

\lambda_{s'}

are assumed disjoint, that means that

g(s_i)(U_i)

and

g(s'_j)(U'_j)

are also disjoint. Thus, for any

x' \in U_i \cap U'_j

, we have

[s_i]_{x'} \neq [s'_j]_{x'}

David Egolf (May 10 2024 at 17:43):

Since

s_i

and

s

have the same germ at

x

, there is some open subset containing

x

where these two functions restrict to the same function. Similarly, there is some open subset containing

x

where

s_j'

and

s'

restrict to the same function. Taking the intersection of these two open sets, we get an open set

W

containing

x

where for each point

x' \in W

we have

[s_i]_{x'} = [s]_{x'}

and

[s_j']_{x'}=[s']_{x'}

. Now, we know that

[s_i]_{x'} \neq [s_j']_{x'}

for any

x' \in W

. Therefore,

[s]_{x'} \neq [s']_{x'}

for any

x' \in W

However, we saw above that there is always some (negative) point in any open subset containing

x

where

s

and

s'

have the same germ. Thus, we have obtained a contradiction. We got this contradiction by assuming that

\Lambda(F)

was Hausdorff. We conclude that

\Lambda(F)

must not be Hausdorff!

Whew! Hopefully I did that correctly. There is still a lot more of catching up for me to do in this thread, but I'll stop here for today.
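For reference, the argument above can be condensed into one chain of implications. This is only a summary sketch of the steps already given, using the same symbols; the one added observation is that W lies inside the intersection of the two basic open sets' base sets, since the sections can only agree where both are defined.

```latex
\begin{align*}
&[s]_x \in g(s_i)(U_i) \subseteq \lambda_s,
 \qquad
 [s']_x \in g(s'_j)(U'_j) \subseteq \lambda_{s'}, \\
&\lambda_s \cap \lambda_{s'} = \emptyset
 \;\Longrightarrow\;
 [s_i]_{x'} \neq [s'_j]_{x'} \text{ for all } x' \in U_i \cap U'_j, \\
&[s_i]_{x'} = [s]_{x'} \text{ and } [s'_j]_{x'} = [s']_{x'}
 \text{ for all } x' \in W \subseteq U_i \cap U'_j
 \;\Longrightarrow\;
 [s]_{x'} \neq [s']_{x'} \text{ for all } x' \in W, \\
&\text{but } W \ni 0 \text{ is open, so it contains some } x' < 0
 \text{ with } [s]_{x'} = [s']_{x'}, \text{ a contradiction.}
\end{align*}
```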

Peva Blanchard (May 11 2024 at 22:17):

I think the proof is correct. The proposition can be used to present concrete examples of continuous functions

s, s'

, simpler than

0

and

e^{-\frac{1}{x^2}}

, that cannot be separated by open sets in

\Lambda(F)

(the sheaf of continuous functions).

By the way, @John Baez gave a very neat puzzle on the other thread. (spoiler alert, I gave a proof there).

David Egolf (May 13 2024 at 17:44):

The next thing I'm hoping to do in this thread is to prove this:
John Baez said:

Unfortunately, I don't have the energy in the tank to work on this today. Once I have energy, I hope to return to this thread and work on what I just described.

David Egolf (May 14 2024 at 16:48):

Before I start on this topology exercise, I wanted to mention that I rather like @Peva Blanchard's mental picture regarding the topology on

\Lambda(F)

. I think the basic idea is this: two germs

a,b

are in the same open set

g(s)(U)

if they are germs of the function

s \in F(U)

for some points

x,x' \in U

. So, in a way, this particular continuous function

s:U \to \mathbb{R} \in F(U)

provides a "bridge" that lets us "connect" two germs, in the sense that its set of germs is an open set containing both

a

and

b

David Egolf (May 14 2024 at 16:57):

This gets me wondering if we can define a category

C

using this intuition. Let the objects of

C

be the germs of

F

, the elements of

\Lambda(F)

. And let us put a morphism

s:U \to \mathbb{R} \in F(U)

from

a

to

b

if

a

and

b

are both germs of

s

at some points in

U

. We'll also want to put a morphism

s:U \to \mathbb{R}

from

b

to

a

in this case, because the condition we are checking is symmetric in

a

and

b

To make a category from this, we'd need to define composition. I'm not immediately sure if there's a nice way to do this... and I don't want to get too sidetracked, so I'll stop here.

David Egolf (May 14 2024 at 17:00):

We've already seen above that a "function

f

is continuous iff it is continuous at each point

a

in its domain". We want to show that these conditions are equivalent to the condition that

f^{-1}

of every open set

U

contained in some neighborhood

V

of

f(a)

is open.

These kinds of statements still intimidate me a bit, so I'll try to draw a picture to illustrate what we're trying to prove.

David Egolf (May 14 2024 at 17:13):

Here,

V

is an open set containing

f(a)

, and

U

is an open set with

U \subseteq V

. I could have alternatively drawn

U

so that it includes

f(a)

, but since that isn't required I chose not to.

EDIT: I need to update the picture... the result to be shown is slightly different than what I listed above.

John Baez (May 14 2024 at 17:14):

John Baez (May 14 2024 at 17:16):

So, the result to be shown is "f is continuous at

a

if for some neighborhood

V

of

f(a)

, the inverse image of every open subset

U \subseteq V

containing

f(a)

is an open set containing

a

".

John Baez (May 14 2024 at 17:18):

Compare this to the definition of "continuous at

a

":

f

is continuous at

a

iff the inverse image of every open set

U

containing

f(a)

contains an open set containing

a

.

John Baez (May 14 2024 at 17:19):

So the difference is saying it's enough to look at open sets

U

containing

f(a)

that "aren't too big".

John Baez (May 14 2024 at 17:21):

Intuitively this makes sense, since we're talking about continuity "at

a

". This should only depend on what's going on near

a

, and near

f(a)

David Egolf (May 14 2024 at 17:35):

Oh, I like that! That does help make it more intuitive. I'll draw a new picture now, and I'm hopeful that this intuition will be reflected in that picture as well.

David Egolf (May 14 2024 at 17:42):

Here's a picture illustrating the condition "for some neighborhood

V

of

f(a)

, the inverse image of every open subset

U \subseteq V

containing

f(a)

is an open set containing

a

":
picture

David Egolf (May 14 2024 at 17:51):

We'd like to show that if

f

satisfies this condition in the picture, then

f

is continuous at

a

. That is, we'd like to show there is some open set

N

containing

a

such that

f|_N

is continuous. My first guess was to try and set

N= f^{-1}(V)

. The problem with this is that

f^{-1}(V)

isn't necessarily open.

David Egolf (May 14 2024 at 17:51):

Oh, wait, yes

f^{-1}(V)

does have to be open! That's because

V \subseteq V

is an open set containing

f(a)

, and so

f^{-1}(V)

is an open set containing

a

David Egolf (May 14 2024 at 17:55):

Alright, so let's set

N = f^{-1}(V)

and try to show that

f|_N

is continuous. We have that

f|_N:N \to V

, where

N

and

V

both have the subspace topology. Let's consider some open subset

U

of

V

. We'd like to show that

f|_N^{-1}(U)

is open. Now, if

f(a) \in U

, we know that

f^{-1}(U)

is open and hence

f^{-1}(U) \cap N = f^{-1}(U)

is open in

N

It remains to consider the case where

U \subseteq V

is an open subset of

V

that doesn't contain

f(a)

. I don't immediately see how to show that

f|_{N}^{-1}(U)

is still an open subset of

N

David Egolf (May 14 2024 at 18:03):

Well, I think I'm stuck here for the moment, but at least some progress was made. I'll stop here for today!

John Baez (May 14 2024 at 18:12):

You mean "doesn't contain

f(a)

", not "doesn't contain

a

". But more importantly....

John Baez (May 14 2024 at 18:13):

I don't think this case matters. Only stuff around

a

can possibly matter. Today I accidentally wrote down a bogus definition of "continuous at

a

", but then I fixed it. Here's the fixed version:

John Baez (May 14 2024 at 18:14):

John Baez (May 14 2024 at 18:19):

So note, we're not demanding that

f^{-1}(U)

is open, which would be too much since parts of

U

might be very far from

a

. We're just demanding that

f^{-1}(U)

contain an open neighborhood of

a

David Egolf (May 14 2024 at 20:38):

I was working from this definition of "continuous at

a

":

f:X \to Y

is continuous at

a

exactly if there is some open set

N \subseteq X

containing

a

such that

f|_N:N \to Y

is continuous.

Maybe next time I'll try to show that the definition I was using is equivalent to the definition which you provided:
John Baez said:

John Baez (May 14 2024 at 20:48):

John Baez (May 14 2024 at 20:50):

It should be equivalent to the one I gave, but I don't mean to be overwhelming you with the task of showing lots of definitions are equivalent!

Graham Manuell (May 15 2024 at 03:54):

I don't think these two conditions are equivalent. David's is stronger. It means

f

is continuous in a neighbourhood of

a
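A standard example (not taken from this thread) that separates the two conditions is sketched below: a function that is continuous at a single point in the "inverse image of every neighbourhood is a neighbourhood" sense, but is not continuous on any open set containing that point.

```latex
% Illustrative example, f : \mathbb{R} \to \mathbb{R}:
\[
f(x) =
\begin{cases}
x & x \in \mathbb{Q}, \\
0 & x \notin \mathbb{Q}.
\end{cases}
\]
% Continuity at a = 0 in the pointwise sense: |f(x)| \le |x| for all x, so
% for every \epsilon > 0,
\[
(-\epsilon, \epsilon) \subseteq f^{-1}\bigl((-\epsilon, \epsilon)\bigr),
\]
% i.e. the inverse image of each neighbourhood of f(0) = 0 contains an open
% set around 0.
% Failure of the stronger condition: at any point c \neq 0, rationals near c
% give values near c while irrationals near c give the value 0, so f is
% discontinuous at c. Every open N containing 0 contains such a point c, hence
\[
f|_N : N \to \mathbb{R} \text{ is not continuous for any open } N \ni 0 .
\]
```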

David Egolf (May 15 2024 at 04:36):

I wondered if Schechter's "Handbook of Analysis and Its Foundations" talked about this. On page 417, it defines a function

f:X \to Y

to be continuous at the point

x_0

if this condition is satisfied: the inverse image of each neighborhood of

f(x_0)

is a neighborhood of

x_0

. This reminds me of the definition that @John Baez gave above. It should be noted that Schechter uses the term "neighborhood" in a way he defines on page 110:

S

is a neighborhood of a point

z

if

z \in G \subseteq S

for some open set

G

Schechter also touches on a condition similar to one I described above, saying on page 418 that a mapping

f:X \to Y

is continuous iff

f

is "locally continuous" in the sense that each point in

X

has a neighborhood

N

such that

f|_N:N \to Y

is continuous. He doesn't use the phrase "continuous at a point" in this context.

David Egolf (May 15 2024 at 04:41):

Schechter also says (on page 417) that the following two conditions are equivalent for a function

f:X \to Y

between two topological spaces:

David Egolf (May 15 2024 at 04:42):

Although this is all somewhat tangential to sheaves, I am pleased that - I think - I am slowly starting to get some of this topology stuff straight! :sweat_smile:

At this point, I might just assume that everything Schechter says here is true, to better focus on the main topic of this thread. Namely, by assuming the things I just listed above are true, I'd like to see if I can then prove that

p:\Lambda(F) \to X

is continuous.

John Baez (May 15 2024 at 07:35):

Sure! Schechter is organizing these things better than I am, by the way. I hadn't realized how many subtly different ways there are to say "continuous at a point", all of which are equivalent. Apparently I just make one up each time I need this concept.

David Egolf (May 15 2024 at 16:24):

Alright, let's again consider our projection function

p: \Lambda(F) \to X

which sends each germ to the point it is associated with. To show

p

is continuous, we have a few different equivalent conditions available to us now. If we can prove any of these conditions are true for

p

, then

p

is continuous:

Schechter also provides several more equivalent conditions for the continuity of a function, but hopefully one of the conditions I've listed will be helpful for solving this puzzle.

David Egolf (May 15 2024 at 16:34):

I'm going to try using condition (2), because it's least familiar to me and I'm curious about it. :laughing:
So, let's consider some point

[s]_x \in \Lambda(F)

. This is a germ associated to the point

x \in X

, consisting of an equivalence class of real-valued continuous functions which are each defined on some open set containing

x

.(Recall that two such functions are equivalent exactly if they agree on some open set containing

x

). In particular, we're considering the equivalence class of some continuous function

s:U \to \mathbb{R}

with

x \in U

Now, let us introduce a neighbourhood

N

of

p([s]_x) = x

. This is a subset of

X

containing an open set

N'

so that

x \in N'

. We wish to show that

p^{-1}(N)

is a neighborhood of

[s]_x

David Egolf (May 15 2024 at 16:38):

By definition,

p^{-1}(N) = \{\lambda \in \Lambda(F) | p(\lambda) \in N\}

. That is, this preimage consists exactly of all the germs associated to points in

N

. Since

p([s]_x) =x \in N

, we do have

[s]_x \in p^{-1}(N)

. It remains to show that we can find some open subset of

p^{-1}(N)

which contains

[s]_x

David Egolf (May 15 2024 at 16:42):

We've already got

[s]_x \in p^{-1}(N)

, and we're looking to build up an open set in

\Lambda(F)

about

[s]_x

consisting only of germs associated to points in

N

. To build this open set, we need to find some points in

\Lambda(F)

that are "near" to

[s]_x

. By definition of the topology of

\Lambda(F)

, we know that

g(s)(U)

is an open subset of

\Lambda(F)

. I think we can use this to get some germs "nearby"

[s]_x

that also sit in

p^{-1}(N)

David Egolf (May 15 2024 at 16:47):

To do this, let's restrict

s:U \to \mathbb{R}

(with

x \in U

). We know that

N' \subseteq N

is open and contains

x

. Hence

U \cap N' \subseteq U

is an open set containing

x

. Then,

s|_{N' \cap U}: {N' \cap U} \to \mathbb{R}

and

g(s|_{N' \cap U})(N' \cap U)

is an open set of

\Lambda(F)

. Since

N' \cap U

is a subset of

N' \subseteq N

,

g(s|_{N' \cap U})(N' \cap U)

is an open subset of

p^{-1}(N)

containing

[s]_x

I think we have found an open set containing

[s]_x

that is a subset of

p^{-1}(N)

! That is, I think we've shown that

p^{-1}(N)

is a neighbourhood of

[s]_x

whenever

N

is a neighbourhood of

x = p([s]_x)

. Thus,

p

is continuous at any

[s]_x \in \Lambda(F)

, and hence it is continuous!

John Baez (May 15 2024 at 22:14):

I'm a bit confused because I thought you were solving this puzzle, which is not about

\mathbb{R}

-valued functions, but rather an arbitrary sheaf

F

on a topological space

X

John Baez (May 15 2024 at 22:15):

Are you doing the special case where

F

is the sheaf of continuous

\mathbb{R}

-valued functions on a topological space

X

?

David Egolf (May 15 2024 at 22:35):

Yes, I was doing that special case. But I think I'll plan to next give this a try for an arbitrary sheaf

F

on a topological space

X

! I am hoping that the pattern of the argument will be similar.

John Baez (May 16 2024 at 05:31):

I think it should be almost identical! Working with a sheaf of continuous functions makes things easier to visualize, so it's a good test case.

David Egolf (May 16 2024 at 16:59):

Let

F

be a sheaf on a topological space

X

. Then we wish to show that our map

p: \Lambda(F) \to X

is continuous. (Recall that

p

sends each germ in

\Lambda(F)_x

to

x

). Following the argument above - which was carried out for a special case - let's consider some point

[s]_x \in \Lambda(F)

, which is a germ associated to the point

x \in X

. This is the germ in

\Lambda(F)_x

that some sheaf element

s \in F(U)

belongs to, where

x \in U

Now, let us introduce a neighbourhood

N

of

p([s]_x)=x

. This is a subset of

X

containing an open set

N'

so that

x \in N'

. We wish to show that

p^{-1}(N)

is a neighbourhood of

[s]_x

David Egolf (May 16 2024 at 17:04):

By definition

p^{-1}(N)

consists exactly of all the germs associated to points in

N

. Since

p([s]_x)=x \in N

, we do have

[s]_x \in p^{-1}(N)

. It remains to show that we can find some open (in

\Lambda(F)

) subset of

p^{-1}(N)

which contains

[s]_x

We've already got

[s]_x \in p^{-1}(N)

, and we're looking to build up an open set in

\Lambda(F)

about

[s]_x

consisting only of germs associated to points in

N

. To build this open set, we need to find some points in

\Lambda(F)

that are "near" to

[s]_x

David Egolf (May 16 2024 at 17:08):

For our

s \in F(U)

, let

g(s)(U)

denote the set of all germs of

s

over the various points of

U

. By definition of the topology of

\Lambda(F)

, this is an open set. And we know that

g(s)(U)

contains

[s]_x

, as

x \in U

Now, from this, we wish to construct an open set of

\Lambda(F)

that contains

[s]_x

and is a subset of

p^{-1}(N)

David Egolf (May 16 2024 at 17:13):

To do this, we will "restrict"

s

to the open set

U \cap N' \subseteq N

which contains

x

. Since we are working in the general case, this restriction is more abstract than just restricting the domain of a function. However, since

F

is a presheaf, we have a restriction function

r|_{U \to (U \cap N')}: F(U) \to F(U \cap N')

available to us, so our restriction of

s \in F(U)

is simply

r|_{U \to (U \cap N')}(s)

. By definition of the topology of

\Lambda(F)

,

g(r|_{U \to (U \cap N')}(s))(U \cap N')

is an open set of

\Lambda(F)

David Egolf (May 16 2024 at 17:20):

It remains to show that

g(r|_{U \to (U \cap N')}(s))(U \cap N')

(1) is a subset of

p^{-1}(N)

and (2) contains

[s]_x

. To show (1), note that this set consists only of germs associated to

r|_{U \to (U \cap N')}(s)

over the points of

U \cap N'

. Since

U \cap N' \subseteq N

, all of these germs belong to points in

N

, and so

g(r|_{U \to (U \cap N')}(s))(U \cap N')

is a subset of

p^{-1}(N)

To show (2), we note that restricting a presheaf element does not change its germ at a point. This is because each germ set

\Lambda(F)_x

is the tip of a cocone for the diagram consisting of the various

F(V)

with

V

varying over the open sets of

X

containing

x

, together with the restriction functions between them. So,

[s \in F(U)]_x =[r|_{U \to (U \cap N')}(s) \in F(U \cap N')]_x

. Hence

[s]_x \in g(r|_{U \to (U \cap N')}(s))(U \cap N')

David Egolf (May 16 2024 at 17:25):

We conclude that

g(r|_{U \to (U \cap N')}(s))(U \cap N')

is an open set of

\Lambda(F)

containing

[s]_x

, and that it is also a subset of

p^{-1}(N)

. Thus, if

N

is a neighbourhood of

p([s]_x)=x

, then

p^{-1}(N)

is a neighborhood of

[s]_x

.

So,

p

is continuous at any point

[s]_x \in \Lambda(F)

. Hence,

p:\Lambda(F) \to X

is continuous!
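The whole argument can be compressed into a single chain of inclusions, sketched below with the thread's notation for the restriction map. The only extra fact used is that every germ in a basic open set built over a given open set of X projects to a point of that open set.

```latex
% Given a neighbourhood N of x, an open N' with x \in N' \subseteq N,
% and a representative s \in F(U) with x \in U:
\[
[s]_x
= \bigl[\, r|_{U \to (U \cap N')}(s) \,\bigr]_x
\;\in\;
g\bigl(r|_{U \to (U \cap N')}(s)\bigr)(U \cap N')
\;\subseteq\;
p^{-1}(U \cap N')
\;\subseteq\;
p^{-1}(N).
\]
% The middle set is open in \Lambda(F) by definition of the topology, so
% p^{-1}(N) contains an open set around [s]_x, i.e. it is a neighbourhood of [s]_x.
```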

David Egolf (May 16 2024 at 17:31):

I don't think I used the fact that

F

was a sheaf anywhere, although I did make use of the fact that

F

was a presheaf. So, I think the same result should hold for

F

an arbitrary presheaf over some topological space

X

David Egolf (May 16 2024 at 17:34):

Assuming the above is correct, we have now shown that we can make a bundle

p:\Lambda(F) \to X

from a presheaf

F

on

X

! The next puzzle asks us to upgrade this process to get a functor

\Lambda:\widehat{\mathcal{O}(X)} \to \mathsf{Top}/X

. Here

\widehat{\mathcal{O}(X)}

is the category of presheaves on

X

and

\mathsf{Top}/X

is the category of bundles over

X

To define our functor

\Lambda

, as a first step we'll need to show how to get a morphism of bundles from a morphism of presheaves. (I'll stop here for today!)

John Baez (May 16 2024 at 17:47):

Great! All this looks good, and I especially like how you "psychoanalyzed" your proof and noticed that it works for presheaves. I probably should have posed the puzzle for presheaves.

While I forget exactly what I did in the course notes, I imagine soon we'll do something like this:

1) get a functor from presheaves on

X

to bundles over

X

, sending each presheaf to its bundle of germs
2) get a functor from bundles on

X

to sheaves over

X

, sending each bundle to its sheaf of sections
3) compose these functors to get a functor from presheaves to sheaves, called sheafification
4) show that sheafification is left adjoint to the obvious forgetful functor from sheaves on

X

to presheaves on

X
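The plan in these four steps can be drawn as a composite of functors. In this sketch the names \Gamma (sheaf of sections), \mathsf{Sh}(X) (sheaves on X) and i (the forgetful functor) are labels assumed here for illustration; only \Lambda and the two categories on the left appear earlier in the thread.

```latex
\[
\widehat{\mathcal{O}(X)}
\xrightarrow{\ \Lambda\ }
\mathsf{Top}/X
\xrightarrow{\ \Gamma\ }
\mathsf{Sh}(X)
\xrightarrow{\ i\ }
\widehat{\mathcal{O}(X)}
\]
% Step 3: sheafification is the composite \Gamma \circ \Lambda.
% Step 4: the adjunction to be shown, for a presheaf F and a sheaf G on X:
\[
\mathrm{Hom}_{\mathsf{Sh}(X)}\bigl(\Gamma\Lambda F,\; G\bigr)
\;\cong\;
\mathrm{Hom}_{\widehat{\mathcal{O}(X)}}\bigl(F,\; iG\bigr),
\quad \text{naturally in } F \text{ and } G .
\]
```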

Peva Blanchard (May 17 2024 at 05:13):

What a nice thread. I really like how the calm pace of this discussion leads to upbeat non-trivial concepts like sheafification.

David Egolf (May 17 2024 at 17:32):

"Sheafification" is such a fun word... it reminds me of another fun word I learned recently: "rectangulation"! Peeking ahead in the blog post, I suppose we're probably going to have an "étalification" functor too, given by composing our functors in the opposite order (so we get a functor that converts each bundle to an étale bundle).

David Egolf (May 17 2024 at 17:39):

Anyways, to get there, we first need to show we really do have a functor

G: \widehat{\mathcal{O}(X)} \to \mathsf{Top}/X

which converts presheaves and presheaf morphisms to bundles and bundle morphisms. (This functor is called

\Lambda

in the blog post, but I'll call it

G

for the moment, to avoid confusion due to the fact that

\Lambda(F)

means the space of all the germs of

F

.)

We just saw that we can get a bundle

G(F)

from a presheaf

F

on a topological space

X

by forming the bundle of germs

G(F):\Lambda(F) \to X

, which sends each germ to the point it belongs to. Now, let's assume we have a morphism of presheaves on

X

, namely

\alpha: F \to F'

. We wish to construct a morphism of bundles from

G(F):\Lambda(F) \to X

to

G(F'):\Lambda(F') \to X

. That means we're looking for a continuous map

G(\alpha): \Lambda(F) \to \Lambda(F')

so that

G(F') \circ G(\alpha) = G(F)

. Strictly speaking,

G(\alpha)

has source of

G(F)

and target of

G(F')

, but I'll use the same symbol to refer to its underlying continuous map from

\Lambda(F)

to

\Lambda(F')

. Hopefully this won't be too confusing!

David Egolf (May 17 2024 at 17:52):

David Egolf (May 17 2024 at 17:55):

For this diagram to commute, we must have that

G(\alpha)

maps germs of

F

associated to

x \in X

to germs of

F'

associated to

x

. So, we can consider the function

G(\alpha)

as being formed from multiple functions, one for each

x \in X

. I'll call the

x

-th function

G(\alpha)_x:\Lambda(F)_x \to \Lambda(F')_x

, where

\Lambda(F)_x

is the set of germs of

F

associated to

x

I'm not sure how to define

G(\alpha)_x

. But maybe we can start by considering

\alpha_U:F(U) \to F'(U)

as

U \subseteq X

becomes a smaller and smaller open set that contains

x

. Intuitively, I'd like to set

G(\alpha)_x

to be some kind of "limit" of

\alpha_U

as

U

approaches

x

David Egolf (May 17 2024 at 18:14):

I'm wondering if there is some way to define

G(\alpha)_x

as some colimit, analogous to how

\Lambda(F)_x

is (part of) a colimit. This is the picture I've been staring at:
picture

Maybe we could try to define

G(\alpha)_x

as the (hopefully unique) function that makes this diagram commute? I've only drawn part of the full diagram I have in mind; we should have

F(U)

and

F'(U)

present in the full diagram as

U

varies over all open sets containing

x

David Egolf (May 17 2024 at 18:47):

Trying to draw the full picture, I thought of a diagram involving functors and natural transformations:
picture 2

In this picture, the functors map to

\mathsf{Set}

from the full subcategory of

\mathcal{O}(X)^{\mathrm{op}}

given by taking only the open sets containing

x

, and

I

is the inclusion functor from this full subcategory. The natural transformations pointing down the page correspond to our colimit co-cones. Finally,

\Delta_{\Lambda(F)_x}

is the functor constant at the set

\Lambda(F)_x

, and

\Delta_{\Lambda(F')_x}

is defined similarly.

The idea is that

G(\alpha)_x

could (hopefully) be defined in terms of the (hopefully) unique natural transformation making this diagram commute.

I'm not sure if this is a good direction to explore... I'll stop here for today. Any hints or thoughts relating to

G(\alpha)

or

G(\alpha)_x

would be most welcome!

Peva Blanchard (May 17 2024 at 21:13):

Trying to define

G(\alpha)_x

as the "colimit of the

\alpha_U

's" reveals a good mental picture, but maybe too involved for a formal proof.

Instead, I suggest looking at how

G(\alpha)

acts on a specific germ

[s]_x

at

x

, e.g., choosing a representative

s \in FU

for some open neighborhood

U

of

x

. How would you define the germ

G(\alpha)([s]_x)

?

Peva Blanchard (May 17 2024 at 22:48):

I got interested in that. I think I found a proof that we have an adjunction

\Lambda \dashv \Gamma

. If true, this implies that sheafification is a monad, while étalification is a comonad.

Peva Blanchard (May 18 2024 at 09:30):

(@Eric M Downes It took me a few seconds to understand the emoji "Grothenwoke" :D)

Eric M Downes (May 18 2024 at 09:52):

David Egolf (May 18 2024 at 20:05):

I just realized that where I left off above and your hint here are (I think) quite related. If the diagram I drew above is to commute, then it must commute in particular at each component of the natural transformations involved. Requiring commutativity at the

U

-th component, we then want this diagram to commute:
diagram at U

If this is to commute, then it must in particular commute at each element. So, pick some

s \in (F \circ I)(U)

. (Note that

x \in U

automatically, by definition of the subcategory

I

is mapping from). Then we need

[\alpha_U(s)]_x = G(\alpha)_x([s]_x)

. There is still some work left to define

G(\alpha)

from this, but this feels like progress.

(In general, I wonder if this kind of thing can provide an interesting strategy for trying to induce a map between two colimits of different diagrams).

Peva Blanchard (May 18 2024 at 21:21):

Yes that's right. I think you can already define

G(\alpha) : \Lambda(F) \rightarrow \Lambda(F')

point-wise.

You can use the formula you inferred from the naturality condition: for any germ of the form

[s]_x

, with

s \in FU

and

U

an open neighborhood of

x

, set

G(\alpha)([s]_x) = [\alpha_U(s)]_x

.

But, first, you need to prove that this is well-defined, i.e., that this definition is invariant when we choose another representative

[s]_x = [t]_x

for some other

t \in FV

over another neighborhood

V

of

x

John Baez (May 19 2024 at 06:12):

Yes, I would be inclined to define the map between etale spaces over a space

X

coming from a map between presheaves on

X

by saying what it does to each germ. I'd do that by treating a germ as an equivalence class of sections, then doing the standard trick of choosing a representative of that equivalence class, writing down some formula that parses, and then checking that the answer doesn't depend on the representative.

I consider all of this "follow your nose" mathematics: writing down the only guess you can easily think of given the data available, then checking it works. I never considered David's more thoughtful approach of working explicitly with colimit diagrams. Probably it's because I consider that more "bulky", and harder to do calculations with. So while I applaud David's approach in spirit I would unthinkingly have taken Peva's approach, and I think in practice that's the easier one.

David Egolf (May 19 2024 at 18:21):

I'm all for "following my nose"... the only problem with the nose-following approach is that sometimes my nose doesn't know the right way to go :sweat_smile:. But I suppose that mostly comes with experience.

David Egolf (May 19 2024 at 18:24):

...I realized that composing the morphisms along the top and right-hand side of this diagram gives us a natural transformation from

F \circ I

to a functor that is constant at a particular object. That is, these morphisms compose to give us a co-cone under

F \circ I

. Then the unique existence of

G(\alpha)_x

follows by the fact that our set of germs of

F

at

x

is the tip of a colimit cocone (and hence is initial among cocones under

F \circ I

).
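Spelled out, the universal-property argument looks roughly like this. The names \iota_U and \iota'_U for the colimit legs (sending a section to its germ) are introduced here for illustration and do not appear in the thread.

```latex
% Colimit legs, for each open U containing x:
\[
\iota_U : F(U) \to \Lambda(F)_x,\quad s \mapsto [s]_x,
\qquad
\iota'_U : F'(U) \to \Lambda(F')_x,\quad s' \mapsto [s']_x .
\]
% The composites \iota'_U \circ \alpha_U form a cocone under F \circ I:
% for open sets x \in W \subseteq U,
\begin{align*}
\iota'_W \circ \alpha_W \circ F(r_{U \to W})
&= \iota'_W \circ F'(r_{U \to W}) \circ \alpha_U
 && \text{(naturality of } \alpha\text{)} \\
&= \iota'_U \circ \alpha_U
 && \text{(the } \iota' \text{ form a cocone).}
\end{align*}
% Since \Lambda(F)_x is the colimit, there is a unique map G(\alpha)_x with
\[
G(\alpha)_x \circ \iota_U = \iota'_U \circ \alpha_U
\quad \text{for every open } U \ni x,
\qquad \text{i.e.} \quad
G(\alpha)_x\bigl([s]_x\bigr) = [\alpha_U(s)]_x .
\]
```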

David Egolf (May 19 2024 at 18:28):

I still want to check "by hand" that setting

G(\alpha)([s]_x) = [\alpha_U(s)]_x

is "well-defined". (Although I suspect it must be, at this point, in light of the paragraph immediately above this one).

We need to check that if

[s]_x = [t]_x

for some

t \in F(V)

for

x \in V

, then

[\alpha_U(s)]_x = [\alpha_V(t)]_x

David Egolf (May 19 2024 at 18:52):

I want to use the fact that restricting a sheaf element doesn't change the germ it belongs to. So, I'm aiming to show that

\alpha_U(s)

and

\alpha_V(t)

restrict to the same thing on some open set containing

x

Now, we know that

[s]_x = [t]_x

, with

s \in F(U)

and

t \in F(V)

. I think we proved a while ago that this implies there is some open set

W

containing

x

so that

s|_W = t|_W

David Egolf (May 19 2024 at 18:56):

I'll use this fact, together with the naturality of

\alpha

, referencing this diagram:
diagram

David Egolf (May 19 2024 at 19:01):

We start with

s \in F(U)

and

t \in F(V)

. The open set

W

was chosen so that we have

F(r_{U \to W})(s) = F(r_{V\to W})(t)

. Consequently,

\alpha_W \circ F(r_{U \to W})(s) = \alpha_W \circ F(r_{V\to W})(t)

. Since the left and right "trapezoids" of our diagram commute (because

\alpha

is a natural transformation), we have that

F'(r_{V \to W}) \circ \alpha_V = \alpha_W \circ F(r_{V\to W})

and similarly

F'(r_{U\to W}) \circ \alpha_U = \alpha_W \circ F(r_{U \to W})

Putting this all together, we find that

F'(r_{U \to W})(\alpha_U(s)) =F'(r_{V \to W})(\alpha_V(t))

. Thus,

\alpha_U(s)|_W = \alpha_V(t)|_W

David Egolf (May 19 2024 at 19:08):

Since

\alpha_U(s) \in F'(U)

and

\alpha_V(t) \in F'(V)

restrict to the same thing on

W

(which is an open set containing

x

), and since two sheaf elements have the same germ at

x

if they restrict to the same thing in some open set containing

x

, we conclude that

[\alpha_U(s)]_x = [\alpha_V(t)]_x

So, if

[s]_x = [t]_x

for some

t \in F(V)

with

x \in V

, we have that

[\alpha_U(s)]_x = [\alpha_V(t)]_x

as desired. We conclude that setting

G(\alpha)([s]_x) = [\alpha_U(s)]_x

actually defines a function!
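
As a compact summary, the whole well-definedness check can be written as one chain of equalities. This is only a sketch that reuses the symbols already introduced above (with W the open set containing x on which s and t agree), not a new argument:

```latex
\begin{align*}
\alpha_U(s)|_W
  &= F'(r_{U \to W})(\alpha_U(s)) \\
  &= \alpha_W\bigl(F(r_{U \to W})(s)\bigr) && \text{(naturality of $\alpha$)} \\
  &= \alpha_W\bigl(F(r_{V \to W})(t)\bigr) && \text{(since $s|_W = t|_W$)} \\
  &= F'(r_{V \to W})(\alpha_V(t))          && \text{(naturality of $\alpha$)} \\
  &= \alpha_V(t)|_W .
\end{align*}
```

Since x lies in W, the two ends of the chain have the same germ at x.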

David Egolf (May 19 2024 at 19:15):

It still remains to show that

G(\alpha):\Lambda(F) \to \Lambda(F')

is continuous. But I will leave that for another day!

David Egolf (May 19 2024 at 19:54):

This is a bit tangential, but the above has helped me realize how we can take the "limit" or "colimit" of a natural transformation between two diagrams with limits or colimits! This seems pretty cool because it lets us "condense" the data of a natural transformation (which could consist of many morphisms) to a single morphism.

David Egolf (May 19 2024 at 19:56):

Here

D

and

D'

are diagrams of the same shape and

\alpha:D' \to D

is a natural transformation.

u_D

and

u_{D'}

correspond to the limit cones over

D

and

D'

. Then composing

\alpha \circ u_{D'}

gives us a cone over

D

. Since

u_D

is the terminal such cone, there is a unique morphism

:\lim D' \to \lim D

which induces a natural transformation

:\Delta_{\lim D'} \to \Delta_{\lim D}

so that the diagram commutes.

For example, in a category with products, I expect that the "product"

f \times g

of two morphisms is a

\lim \alpha

, in the case where

D

and

D'

are discrete diagrams with two objects. In this case, the data of a natural transformation

\alpha

corresponds to two morphisms

f

and

g
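
A minimal sketch of this product example in the category of sets, where the product is the cartesian product. The names `pair_map`, `proj1`, `proj2` and the sample functions are hypothetical, chosen only for illustration. The loop checks on a few samples that the induced map is compatible with the projections, which is the commuting condition that pins it down uniquely:

```python
def pair_map(f, g):
    """Return the induced map f x g, sending (a, b) to (f(a), g(b))."""
    def fxg(pair):
        a, b = pair
        return (f(a), g(b))
    return fxg


def proj1(pair):
    """First projection out of a cartesian product."""
    return pair[0]


def proj2(pair):
    """Second projection out of a cartesian product."""
    return pair[1]


# Hypothetical components of a natural transformation between two
# discrete diagrams with two objects each.
f = lambda n: n + 1
g = lambda s: s.upper()
fxg = pair_map(f, g)

# Compatibility with the projections: proj_i after (f x g) equals
# the i-th component after proj_i.
samples = [(0, "a"), (41, "topos")]
for p in samples:
    assert proj1(fxg(p)) == f(proj1(p))
    assert proj2(fxg(p)) == g(proj2(p))
```

In this reading, the two functions are exactly the data of the natural transformation, and `fxg` is the single morphism they condense to.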

Peva Blanchard (May 19 2024 at 20:08):

Yes, I've been thinking about "condensing a natural transformation" too, and your "colimit of natural transformations" picture.

I will probably open another topic to discuss the details, but here is an overview.

There is a fact about presheaves on a topological space (or more generally on any category): the category they form is a closed category.

This means that if you have two presheaves

F, F'

on

X

, there is another presheaf

[F, F']

. Intuitively, the presheaf

[F, F']

represents (in the category of presheaves on

X

) all the natural transformations from

F

to

F'

.

Then, you can look at the associated bundle

\Lambda([F, F'])

over

X

. And there are interesting things.

For instance, the function

G(\alpha)_x

you were looking for a few messages above would correspond to a point of this bundle, and

G(\alpha)

itself to a section of this bundle (a global section, i.e., one over the entire space

X

).

~~Actually, there is a correspondance between natural transformations from $F$ to $F'$ and global sections of the bundle $\Lambda([F, F'])$ .~~

Actually, any natural transformation from

F

to

F'

yields a global section of the bundle

\Lambda([F, F'])

John Baez (May 20 2024 at 10:51):

That's true. But now that you've had an experience, I hope you see that this strategy counts as following your nose, each step following naturally from the one before:

John Baez (May 20 2024 at 11:05):

I would count this as sufficient, though there are certainly details one can unpack here, which you unpacked in your much more careful argument here.

David Egolf (May 21 2024 at 16:37):

Given a morphism of presheaves

\alpha: F \to F'

, where each presheaf is a presheaf on a topological space

X

, we were able to define a function from

\Lambda(F)

to

\Lambda(F')

, which sends each germ of

F

at a point to a germ of

F'

at that same point. Namely, we got the function

G(\alpha):\Lambda(F) \to \Lambda(F')

which acts by

G(\alpha)[s]_x = [\alpha_U(s)]_x

, where

s \in F(U)

and

x \in U

David Egolf (May 21 2024 at 16:48):

At this point, part of me wishes we had defined the topology on

\Lambda(F)

(and on

\Lambda(F')

) in a different but equivalent way, in terms of some universal property. I am guessing that doing that might help make it clearer why

G(\alpha)

needs to be continuous.

David Egolf (May 21 2024 at 16:50):

Before thinking about that, let me see how far I can get while working with the definition we've used so far. I will try to show that

G(\alpha)

is continuous at an arbitrary point

[s]_x \in \Lambda(F)

, where

s \in F(U)

and

x \in U

. Let

N

be a neighborhood of

G(\alpha)([s]_x)

. This is a subset of

\Lambda(F')

containing an open set

N'

so that

[\alpha_U(s)]_x \in N'

. We wish to show that

G(\alpha)^{-1}(N)

is a neighbourhood of

[s]_x

To show that

G(\alpha)^{-1}(N)

is a neighbourhood of

[s]_x

it suffices to find some open set that contains

[s]_x

and is a subset of

G(\alpha)^{-1}(N)

David Egolf (May 21 2024 at 17:12):

To get further, I want to find an open set containing

[\alpha_U(s)]_x

using

s

. Since

s \in F(U)

and

\alpha_U:F(U) \to F'(U)

, we have that

\alpha_U(s) \in F'(U)

. Hence, the set

g(\alpha_U(s))(U)

of all the germs of

\alpha_U(s)

over

U

forms an open set of

\Lambda(F')

. And since

x \in U

, we have

[\alpha_U(s)]_x \in g(\alpha_U(s))(U)

David Egolf (May 21 2024 at 17:17):

We've just seen that

g(\alpha_U(s))(U)

is an open subset of

\Lambda(F')

containing

[\alpha_U(s)]_x

. Next, I want to create an open set from this one, aiming to obtain a subset of our neighbourhood

N

. Since

N

is a neighbourhood of

[\alpha_U(s)]_x

, it contains an open set

N'

that contains

[\alpha_U(s)]_x

. Consequently,

g(\alpha_U(s))(U) \cap N'

is an open set containing

[\alpha_U(s)]_x

that is a subset of

N

David Egolf (May 21 2024 at 17:22):

This set

g(\alpha_U(s))(U) \cap N'

is some open set that is a subset of

g(\alpha_U(s))(U)

. Hence it is the union of sets of the form

g(s')(V)

, by definition of the topology of

\Lambda(F')

. Each of these sets, being a subset of

g(\alpha_U(s))(U)

, contains only germs belonging to

\alpha_U(s)

over some subset of

U

. So, each of these open sets is really of the form

g(\alpha_U(s))(V)

for

V \subseteq U

an open subset of

X

Since this set contains

[\alpha_U(s)]_x

, there is some open

V \subseteq X

containing

x

such that

[\alpha_U(s)]_x \in g(\alpha_U(s))(V)

. This

g(\alpha_U(s))(V)

is an open set of

\Lambda(F')

containing

[\alpha_U(s)]_x

, that is also a subset of

N

. Hence,

G(\alpha)^{-1}(g(\alpha_U(s))(V))

is a subset of

G(\alpha)^{-1}(N)

containing

[s]_x

. If we can show that

G(\alpha)^{-1}(g(\alpha_U(s))(V))

is open, then I think we will have shown that

G(\alpha)

is continuous at

[s]_x

David Egolf (May 21 2024 at 17:39):

G(\alpha)^{-1}(g(\alpha_U(s))(V))

is a bit of a mouthful, but I'm hoping working with it won't be too bad. I'll stop here for today though!

John Baez (May 22 2024 at 10:44):

Hmm, something seems 'heavy' about this discussion so far. Let me see if I can lighten it a bit. I'll follow my nose for a little while and see where it leads, but I won't go too far.

We're trying to show that

G(\alpha): \Lambda(F) \to \Lambda(F')

is continuous. So we should think about how we defined the topology on

\Lambda(F)

. In my course notes I said something like this:

John Baez (May 22 2024 at 10:45):

The description of the topology must determine the strategy for how we'll show

G(\alpha)

is continuous. Since inverse images automatically preserve unions, we don't need to check that the inverse image of a general open set under

G(\alpha)

is open. It's enough to check it for the open neighborhoods of the form described above. So let's make up some convenient notation for them.

We've already decided to call any point in

\Lambda(F)

something like

[s]_x

where

x \in X

and

s \in FU

, where

U

is any open neighborhood of

x

.

Above I described a basis of open neighborhoods of

[s]_x

, which are sets like this:

\{[s]_y \;\vert\; y \in U\}

The vertical bar means "such that". I used to use a colon to mean "such that", but I decided that was confusing.

This is an efficient notation for our basis of open neighborhoods, so we should be able to do computations with it fairly painlessly.

John Baez (May 22 2024 at 10:50):

Similarly, any point in

\Lambda(F')

will be an equivalence class like

[t]_x

where

x \in X

and

t \in F'U

, where

U

is any open neighborhood of

x

. And we get a basis of open neighborhoods of

[t]_x

that are sets like this:

\{[t]_y \;\vert\; y \in U\}

John Baez (May 22 2024 at 10:55):

Since we understand the topology on

\Lambda(F)

and

\Lambda(F')

in terms of a basis of open neighborhoods, to show

G(\alpha) : \Lambda(F) \to \Lambda(F')

is continuous we should check continuity at a point for every point

[s]_x \in \Lambda(F)

John Baez (May 22 2024 at 11:02):

We saw

G(\alpha)

maps

[s]_x

to

[\alpha(s)]_x

. So to check that

G(\alpha)

is continuous at the point

[s]_x

, in principle we need to check that the inverse image of any open neighborhood of

[\alpha(s)]_x

is an open neighborhood of

[s]_x

But since I'm a master of topology, I know it's enough to check that the inverse image of one of our "basis" open neighborhoods contains a "basis" open neighborhood. I realize now that this may be a potential stumbling block for you, @David Egolf - it's a trick one learns in a topology class.

John Baez (May 22 2024 at 11:13):

We've found a slick notation for these "basis" open neighborhoods. So let's write down one of these open neighborhoods of

[\alpha(s)]_x

. It will look like this:

\{[\alpha(s)]_y \;\vert\; y \in U\}

John Baez (May 22 2024 at 11:15):

Now let's figure out its inverse image and see if it contains an open neighborhood of

[s]_x

Well, I had better stop here... I may have already done too much, but I wanted to reach what I called the "potential stumbling block".

David Egolf (May 22 2024 at 17:45):

Wow, thanks! It will take me some time to work through what you just said, but it looks to be quite helpful! I didn't expect topology to come up quite so often in these blog posts :sweat_smile:. But it's good - I'm happy to be learning about these practical topology strategies!

Todd Trimble (May 22 2024 at 17:57):

John knows this of course, but this should be changed to read: every element in the inverse image of a basic open has a basic open neighborhood contained in the inverse image.

John Baez (May 22 2024 at 18:32):

Here's what I was trying to say. I was using the concept of "basis of open neighborhoods", which this link calls simply a "neighborhood basis":

(Check the link for the definition, David.) The reason is that in my lecture notes I described the topology on etale spaces in terms of a neighborhood basis, while avoiding the use of the jargon "neighborhood basis".

Say quite generally that we have a function

f: X \to Y

between topological spaces

X

and

Y

, and we're trying to show

f

is continuous at some point

x \in X

. Say we know a neighborhood basis for every point

x \in X

and every point

y \in Y

. Then to show

f

is continuous at

x \in X

, it's enough to check that for every

U

in the neighborhood basis of

f(x)

, the inverse image

f^{-1}(U)

contains a set in the neighborhood basis of

x

John Baez (May 22 2024 at 18:37):

By now it seems like I've slipped into acting like David understands the concept of "neighborhood basis", which is unreasonable. I guess I'm being a bad teacher and describing how I'd solve a homework problem by following my nose, without remembering just how long my nose has grown over the years, and how long this has taken.

John Baez (May 22 2024 at 18:41):

If I'd left him alone David would have solved the problem in his own way. I may have made the bad teacher's mistake of saying "oh, why don't you just do this?", where "this" is some trick only known to the teacher.

David Egolf (May 23 2024 at 15:30):

I'm very glad to learn about new strategies or tricks! People pointing out different ways to think about a problem is one of the things I hoped would happen when I started this thread. I've run across at least some of the topology concepts you're using above, but I'm excited to see how they can make a specific problem easier to solve.

David Egolf (May 23 2024 at 15:32):

This makes sense. Let

U

be an arbitrary open set of

\Lambda(F')

. Then we wish to show that its inverse image

G(\alpha)^{-1}(U)

is also open. But since we have a basis for the topology on

\Lambda(F')

, we know that

U

is the union of some

b_i

, where each

b_i

is in our basis. Then

G(\alpha)^{-1}(U) = G(\alpha)^{-1}(\cup_i b_i) = \cup_i G(\alpha)^{-1}(b_i)

. Since the union of open sets is open, we see that if the inverse image of each basis set is open, then the inverse image of arbitrary open sets is open.
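
Here is a small sanity check of the identity used above, that inverse images commute with unions. It is only an illustration on finite sets; the function `f`, the `domain`, and the pieces in `basis_sets` are hypothetical stand-ins, not anything from the blog posts:

```python
def preimage(f, domain, subset):
    """Return the set of points of `domain` that f sends into `subset`."""
    return {x for x in domain if f(x) in subset}


# Hypothetical data: a function on {0, ..., 9} and three "basis" pieces b_i.
domain = set(range(10))
f = lambda n: n % 4
basis_sets = [{0}, {1, 2}, {3}]

# Left side: inverse image of the union of the b_i.
union = set().union(*basis_sets)
lhs = preimage(f, domain, union)

# Right side: union of the inverse images of the b_i.
rhs = set().union(*(preimage(f, domain, b) for b in basis_sets))

assert lhs == rhs
```

The check passes for any choice of function and pieces, because a point lands in the union exactly when it lands in at least one piece.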

John Baez (May 23 2024 at 15:46):

That's the idea! Later, due to a remark by Todd, I started discussing the difference between a 'basis of open sets' and a 'basis of open neighborhoods of a point p'. The open sets I described are both of these, depending on whether you hold p (your germ) fixed or let it vary. The 'basis of open neighborhoods of a point' idea is especially nice for studying continuity at that point. But maybe you don't need to worry about this until you run into it on your own.

David Egolf (May 23 2024 at 15:47):

Next up, I'd like to review the concept of a "basis of open neighbourhoods", which you're using above. I think I've actually seen this before, but I haven't used it to solve problems yet. I referenced this and this.

I think this is the definition of a basis of open neighbourhoods at a point

p

in some topological space

T

: it is a collection of open subsets

B_p

of

T

, each containing

p

, such that for any open subset

U

that contains

p

there is some

V \in B_p

so that

V \subseteq U

. Intuitively, this is a collection of open sets "about

p

" that lets us get "arbitrarily close" to

p

David Egolf (May 23 2024 at 16:10):

Let me try to relate this condition for continuity at a point to the one I was using earlier. Above, I was trying to show continuity at

x

by checking that the inverse image of a (not necessarily open) neighbourhood of

f(x)

is a neighbourhood of

x

I'd like to think about how it is enough to consider the inverse image only of (open) neighbourhood basis sets, instead of the inverse image of arbitrary neighbourhoods of

f(x)

. Throughout, I assume we have a neighbourhood basis for

f(x)

and for

x

. Let

N

be an arbitrary neighbourhood of

f(x)

. It contains an open set

N'

that contains

f(x)

. Then, there is some

V

in our neighbourhood basis for

f(x)

so that

V \subseteq N' \subseteq N

. If

f^{-1}(V)

contains an open set containing

x

, then certainly

f^{-1}(N)

contains an open set containing

x

. So, if the inverse image of every (open) neighbourhood basis set for

f(x)

is a neighbourhood of

x

, then the inverse image of any neighbourhood of

f(x)

is a neighbourhood of

x

To show that the inverse image of a (open) neighbourhood basis set of

f(x)

is a neighbourhood of

x

, it suffices to show that its inverse image contains an open set containing

x

. If its inverse image contains some set in the neighbourhood basis of

x

, then its inverse image certainly contains an open set containing

x
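
The criterion worked out above can be stated compactly. This is a sketch in notation of my own choosing: the names for the two neighbourhood bases are assumptions, not notation from the blog posts.

```latex
\textbf{Claim.} Let $f : X \to Y$ be a function between topological spaces, let $x \in X$,
and let $\mathcal{B}_x$ and $\mathcal{B}_{f(x)}$ be neighbourhood bases of open sets
at $x$ and at $f(x)$. If
\[
  \forall\, V \in \mathcal{B}_{f(x)} \;\; \exists\, U \in \mathcal{B}_x : \quad U \subseteq f^{-1}(V),
\]
then $f$ is continuous at $x$.

\textbf{Reason.} Given any neighbourhood $N$ of $f(x)$, pick $V \in \mathcal{B}_{f(x)}$
with $V \subseteq N$, and then $U \in \mathcal{B}_x$ with $U \subseteq f^{-1}(V)$. Then
\[
  x \in U \subseteq f^{-1}(V) \subseteq f^{-1}(N),
\]
so $f^{-1}(N)$ contains an open set containing $x$, i.e. it is a neighbourhood of $x$.
```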

David Egolf (May 23 2024 at 16:29):

To use the above in our case, we need to figure out a neighbourhood basis for each

[s]_x \in \Lambda(F)

, and for each

G(\alpha)([s]_x) = [\alpha(s)]_x

. To do that, we need to figure out a strategy for getting "arbitrarily close" to these points.

To get a bunch of open sets containing

[s]_x

that "get arbitrarily close" to it, the first idea that comes to mind for me is to take the open set of germs of

s

as we restrict its domain to be smaller and smaller open sets about

x

. So, the sets in our proposed open neighbourhood basis for

[s]_x

are of the form

g(s)(U)

, as

U

becomes a smaller and smaller open set containing

x

. In alternate notation, they are of the form

\{[s]_y | y \in U\}

, as

U

ranges over all the open sets containing

x

. I think @John Baez was indicating that we really do get a (open) neighbourhood basis for

[s]_x \in \Lambda(F)

in this way. However, I don't immediately see how to prove this.

I'll stop here for today! Next time, I'm hoping to prove that we do get a neighbourhood basis in this way for a point

[s]_x \in \Lambda(F)

John Baez (May 23 2024 at 19:54):

In my course notes I defined the topology on

\Lambda(F)

in essentially this way, by specifying these neighborhood bases, so I see nothing to prove! If you have some other way to define the topology, then you can try to prove this.

John Baez (May 23 2024 at 19:59):

John Baez (May 23 2024 at 20:00):

David Egolf (May 24 2024 at 17:18):

David Egolf (May 24 2024 at 17:23):

This may be a situation where the thing to be proved is very fast and simple to prove once you know how to do it! But it seems to me that there is really something to be proved here.

David Egolf (May 24 2024 at 17:27):

Let

N

be an arbitrary neighbourhood of

[s]_x

in

\Lambda(F)

, where

s \in F(U)

. It then contains an open set

N'

that contains

[s]_x

. By definition of the topology of

\Lambda(F)

, we know that

N' = \cup_i b_i

where each

b_i \in B

. Since

[s]_x \in N'

, that implies that there is some specific

b_i \subseteq N'

so that

[s]_x \in b_i

David Egolf (May 24 2024 at 17:29):

Now, each basis element is the set of germs of some sheaf set element over some open set of

X

. Hence

b_i = \{[t]_y |y \in V\}

where

V

is an open set of

X

and

t \in F(V)

. Since

[s]_x \in b_i

, we have that

x \in V

. Hence

t \in F(V)

has the same germ at

x

as our

s \in F(U)

does.

We know that two sheaf set elements

t

and

s

have the same germ at a point

x

exactly if they restrict to the same sheaf set element on some open subset of

X

containing

x

. Thus, we have

t|_W = s|_W

for some open set

W

containing

x

. Note that

W \subseteq U

and

W \subseteq V

David Egolf (May 24 2024 at 17:37):

Now,

\{[t|_W]_y | y \in W\}=\{[s|_W]_y | y \in W\}

is an open set containing

[s]_x

and further it belongs to our proposed neighbourhood basis

B_p

. Since

W \subseteq V

, we have that

\{[s|_W]_y | y \in W\} \subseteq b_i \subseteq N' \subseteq N

. Hence, we have found an element of

B_p

that is a subset of an arbitrary neighbourhood of

[s]_x

We conclude that

B_p

really does form a neighbourhood basis of open sets for

[s]_x

David Egolf (May 24 2024 at 17:44):

This is interesting to me, as it intuitively says that we can "approach arbitrarily close" to a point

[s]_x \in \Lambda(F)

just by looking at the germs of various restrictions of

s

. This simplifies things: in this context we only have to think about the germs of the restrictions of a single sheaf element

s

, instead of all sheaf elements that happen to have the same germ as

s

at

x

.

I'll stop here for today. Next time, I'm hoping to use this neighbourhood basis to try and show the continuity of

G(\alpha): \Lambda(F) \to \Lambda(F')

John Baez (May 24 2024 at 20:43):

John Baez (May 24 2024 at 20:52):

Yes, that's a good thing to keep in mind with these etale spaces. So it's good you did that proof just now.

I just assumed it was obvious that if we have a bunch of open sets

\{O_\alpha\}

forming a basis for a topology, the sets in that basis containing a particular point

p

form an open neighborhood basis for

p

Let's see if I was fooling myself. It suffices to show that if

V

is any open set containing

p

, there exists

\alpha

such that

p \in O_\alpha \subseteq V

Since

\{O_\alpha\}

is a basis for the topology,

V

is a union of some collection of these sets:

V = \bigcup_{\alpha \in S} O_\alpha

so at least one of the

O_\alpha

for

\alpha \in S

contains

p

, and for this one we have

p \in O_\alpha \subseteq V

David Egolf (May 24 2024 at 21:01):

I had been thinking along these lines, but then I realized that there can be a lot more open sets in our basis for the topology on

\Lambda(F)

that include

[s]_x

, besides those of the form

\{[s]_y | y \in V\}

for

V \subseteq U

some open set containing

x

(where

s \in F(U)

is not allowed to vary). For example, for some

t \in F(V')

with

x \in V'

satisfying

[t]_x = [s]_x

, we have that

\{[t]_y | y \in V'\}

is an open set in our basis that contains

[s]_x

. And this open set is potentially different than the ones we can get just using

s

, I think.

So, the collection of sets

B_p

discussed above (which has sets given by the germs of

s\in F(U)

when restricted to various open subsets of

U

containing

x

) I think is smaller than the collection of sets from our basis that contain

[s]_x
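
To convince myself that the collection really can be smaller, here is a tiny worked example. The space and presheaf are chosen by me purely for illustration (they do not come from the blog posts):

```latex
Let $X = \{a, b\}$ with open sets $\emptyset$, $\{a\}$ and $X$, and let $F(U)$ be the set of
functions $U \to \{0,1\}$, with the usual restriction of functions.

Take $s \in F(\{a\})$ with $s(a) = 0$, and $t \in F(X)$ with $t(a) = 0$, $t(b) = 1$.
Then $t|_{\{a\}} = s$, so $[t]_a = [s]_a$.

The only open $V \subseteq \{a\}$ containing $a$ is $\{a\}$ itself, so for $p = [s]_a$
\[
  B_p = \bigl\{\, \{[s]_a\} \,\bigr\},
\]
whereas the basis set
\[
  \{[t]_y \mid y \in X\} = \{[s]_a,\ [t]_b\}
\]
also contains $p$ and is not a member of $B_p$.
```

So here the basis sets containing the germ form a strictly larger collection than the one built from restrictions of s alone.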

John Baez (May 25 2024 at 05:21):

You're right, so I was being sloppy! I'm glad you caught that. I had to run through an example in my mind to see that this collection

B_p

is really smaller. Luckily it's still a neighborhood basis, and it's a much more convenient neighborhood basis.

(My downfall was wanting to be extremely quick and informal in my course notes, and not use terms like "basis" or "neighborhood basis". I think it may save everyone work if I come out and clearly specify a neighborhood basis for each point. But let's see how things go.)

David Egolf (May 27 2024 at 16:32):

Ok, now that I understand this neighbourhood basis, let me see about trying to use it to prove that

G(\alpha):\Lambda(F) \to \Lambda(F')

is continuous. Recall that

G(\alpha)

is going to be a morphism of bundles induced by a morphism

\alpha:F \to F'

of presheaves on

X

. And

G(\alpha)

acts by

[s]_x \mapsto [\alpha(s)]_x

David Egolf (May 27 2024 at 16:36):

To show that

G(\alpha)

is continuous, we will aim to show it is continuous at an arbitrary point

[s]_x

in

\Lambda(F)

. To show this, we need to show that the inverse image of any neighbourhood of

G(\alpha)([s]_x) = [\alpha(s)]_x

is a neighbourhood of

[s]_x

. But we recently saw that it suffices to show that the inverse image of any set in a neighbourhood basis of open sets for

[\alpha(s)]_x

contains a set in a neighbourhood basis of open sets for

[s]_x

David Egolf (May 27 2024 at 16:45):

We recently saw that for a point

[s]_x

(with

s \in F(U)

and

x \in U

) we have a neighbourhood basis of open sets

B_{[s]_x}

having elements of the form

\{[s]_y | y \in U'\}

where

U'

is some open set containing

x

and contained in

U

Similarly, we have a neighbourhood basis of open sets

B_{[\alpha(s)]_x}

. An element of this neighbourhood basis is of the form

\{[\alpha(s)]_y |y \in U'\}

for

U'

some open set containing

x

and contained in

U

. Here,

\alpha(s)

is shorthand for

\alpha_U(s) \in F'(U)

David Egolf (May 27 2024 at 16:48):

So, let us consider the inverse image under

G(\alpha)

of some arbitrary set in our neighbourhood basis of open sets for

[\alpha(s)]_x

. Let's say we pick the neighbourhood basis element

\{[\alpha(s)]_y | y \in U'\}

, where

U'

is an open set of

X

containing

x

and contained in

U

. Given that

G(\alpha)([s]_y) = [\alpha(s)]_y

, what can we say about the inverse image of this neighbourhood basis element?

David Egolf (May 27 2024 at 16:51):

Well, we see that any

[s]_y

with

y \in U'

is in the inverse image. So, the set

\{[s]_y | y \in U'\}

is contained in the inverse image. But, since

U'

is an open set containing

x

and contained in

U

, this set is an element of our neighbourhood basis of open sets for

[s]_x

We conclude that

G(\alpha)

is continuous at an arbitrary point

[s]_x \in \Lambda(F)

, and hence it is continuous!
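
As a sanity check, the whole argument can be run by brute force on a tiny example. Everything below is a hypothetical toy setup of my own: the space is the two-point Sierpinski space, F(U) is the set of functions from U to {0, 1}, and alpha flips every value (so F' = F). In a finite space each point has a smallest open neighbourhood, so a germ can be stored as a point together with the restriction of a section to that neighbourhood.

```python
from itertools import product

# Hypothetical toy space: the Sierpinski space with points "a" and "b".
POINTS = ("a", "b")
OPENS = [frozenset(), frozenset({"a"}), frozenset({"a", "b"})]


def sections(U):
    """F(U): all functions U -> {0, 1}, stored as frozensets of (point, value)."""
    pts = sorted(U)
    return [frozenset(zip(pts, vals)) for vals in product((0, 1), repeat=len(pts))]


def restrict(s, W):
    """Restrict the section s to the smaller open set W."""
    return frozenset((p, v) for (p, v) in s if p in W)


def min_open(x):
    """Smallest open set containing x (exists because the space is finite)."""
    result = frozenset(POINTS)
    for U in OPENS:
        if x in U:
            result &= U
    return result


def germ(s, x):
    """[s]_x, represented by x and the restriction of s to min_open(x)."""
    return (x, restrict(s, min_open(x)))


def basic_open(s, U):
    """The set {[s]_y | y in U} for a section s over U."""
    return frozenset(germ(s, y) for y in U)


def alpha(s):
    """A natural transformation F -> F: flip every value of the section."""
    return frozenset((p, 1 - v) for (p, v) in s)


def G_alpha(g):
    """[s]_x |-> [alpha(s)]_x, computed on the stored representative."""
    x, s = g
    return (x, alpha(s))


ALL_GERMS = {germ(s, x) for U in OPENS for s in sections(U) for x in U}
BASIS = [basic_open(s, U) for U in OPENS for s in sections(U)]


def is_open(subset):
    """Open iff every point has a basic open neighbourhood inside the subset."""
    return all(any(g in b and b <= subset for b in BASIS) for g in subset)


def preimage(subset):
    """Inverse image of a set of germs under G_alpha."""
    return frozenset(g for g in ALL_GERMS if G_alpha(g) in subset)


# Continuity: the inverse image of every basic open set is open.
continuous = all(is_open(preimage(b)) for b in BASIS)
print(continuous)
```

This only confirms the statement in one small case, of course; the proof above is what covers the general situation.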

John Baez (May 27 2024 at 17:59):

Great! I guess it's clear now why I pushed you into this neighborhood basis idea. It makes this proof into a delicious downhill slide.

John Baez (May 27 2024 at 18:00):

John Baez (May 27 2024 at 18:33):

If we'd defined the topology in some other equivalent way from the start, some other proof might be good - but I haven't actually thought about other ways to define the topology on an etale space. You mentioned defining it using some universal property. One way might be to say: we give

\Lambda(F)

the weakest topology (fewest open sets) such that some class of maps out of it is continuous, or the strongest topology (most open sets) such that some class of maps into it is continuous. Maybe one of these works. But in this theorem we need to show continuity of maps

\Lambda(F) \to \Lambda(F')

- out of one etale space and into another.

We also want the projection

p

from

\Lambda(F)

to

X

to be continuous. I seem to recall you've already shown that? I think that's also easy with this neighborhood basis approach. If we give

\Lambda(F)

the weakest topology such that

p: \Lambda(F) \to X

is continuous, do we get the same topology we're using now? I don't know; it should be easy to figure out but not today.

Peva Blanchard (May 27 2024 at 20:44):

Actually, I made the mistake of taking this weakest topology

W

as the topology

T

on the bundle

\Lambda(F)

. A priori, definition-wise,

W

is coarser than

T

since

T

adds enough open sets to interpret sections of

F

as continuous functions. But, I haven't tried to exhibit an actual example where

W

would be strictly coarser than

T

John Baez (May 27 2024 at 21:14):

Okay - now I remember those comments of yours. I don't alas know such a counterexample, and when I try to visualize one I instantly think of this: one thing about etale spaces is that

p: \Lambda(F) \to X

is not only continuous, it's a [[local homeomorphism]], where a section

s: U \to \Lambda(F)

provides a continuous inverse to the projection restricted to the open neighborhood

\{[s]_y \; \vert \; y \in U\}

of the germ

[s]_x

. This seems relevant somehow. Maybe it prevents the existence of a counterexample? Or maybe we can use this condition as a kind of requirement that helps specify the topology of the etale space?

Peva Blanchard (May 27 2024 at 22:43):

I think I found an example showing that

W

(the coarsest topology on

\Lambda(F)

making the projection

\Lambda(F) \rightarrow X

continuous) is strictly coarser than

T

(the actual topology defined in John's blog post).

Take

X = [0,1]

and

F

the presheaf that maps any open subset

U \subseteq X

to the set

2 = \{0,1\}

. Then

\Lambda(F) = X \times 2

An open set in

X \times 2

, w.r.t

W

, is exactly a set of the form

U \times 2

, for some open subset

U

of

X

. This prevents

p

from being a local homeomorphism w.r.t.

W

But we know that

p

is a local homeomorphism w.r.t.

T

. So

W

is strictly coarser than

T
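
To spell out the comparison, here is a short sketch. It assumes, as in the example above, that the restriction maps of the presheaf are identities, so that the set of germs really is the product of X with 2:

```latex
For $i \in 2$ regarded as an element of $F(U)$, the corresponding basic open set is
\[
  \{[i]_y \mid y \in U\} = U \times \{i\},
\]
so $T$ is generated by the sets $U \times \{i\}$ with $U \subseteq X$ open, while
\[
  W = \{\, p^{-1}(U) \mid U \subseteq X \text{ open} \,\}
    = \{\, U \times 2 \mid U \subseteq X \text{ open} \,\}.
\]
Each $U \times 2 = (U \times \{0\}) \cup (U \times \{1\})$ lies in $T$, so $W \subseteq T$.
But $X \times \{0\} \in T$ and $X \times \{0\} \notin W$, hence $W \subsetneq T$.
```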

Peva Blanchard (May 27 2024 at 22:47):

Peva Blanchard (May 27 2024 at 22:52):

This is interesting. The requirement of being a local homeomorphism adds open subsets to the initial topology

W

~~But it looks like we cannot express the topology $T$ as the initial or final topology of some collection of functions.~~

Peva Blanchard (May 27 2024 at 23:11):

Oh, actually, it's quite possible that

T

is the final topology on

\Lambda(F)

making all the functions

x \mapsto [s]_x

continuous. I'll think about that.

Peva Blanchard (May 27 2024 at 23:11):

David Egolf (May 28 2024 at 17:00):

David Egolf (May 28 2024 at 17:02):

Yes, I proved that earlier in this thread. (I had to scroll a long ways back to check though!)

David Egolf (May 28 2024 at 17:03):

I think reflecting on the topology we put on

\Lambda(F)

is quite interesting, and that discussion on that topic is a good fit for this thread.

I'm hoping that we can imagine a strategy or goal that would have led us to put the topology on

\Lambda(F)

that we did. One of my goals in learning math is to better understand how to create nice mathematical situations/structures. (I think this is a bit different than learning how to prove that structures that other people have come up with are quite nice.)

David Egolf (May 28 2024 at 17:12):

With the topology we chose to put on

\Lambda(F)

, it will turn out that

\Lambda:\widehat{\mathcal{O}(X)} \to \mathsf{Top}/X

is left adjoint to the functor

\Gamma:\mathsf{Top}/X \to \widehat{\mathcal{O}(X)}

. In particular, that implies we have a natural isomorphism

\mathsf{Top}/X(\Lambda(F),-) \cong \widehat{\mathcal{O}(X)}(F,\Gamma(-))

for any presheaf

F

. That implies that for any bundle

p:Y \to X

we have a bijection:

\mathsf{Top}/X(\Lambda(F),p) \cong \widehat{\mathcal{O}(X)}(F,\Gamma(p))

David Egolf (May 28 2024 at 17:15):

That means if I pick some particular natural transformation

\alpha

from

F

to

\Gamma(p)

(which is the sheaf of sections of

p:Y \to X

), then there is some unique corresponding bundle morphism

\alpha':\Lambda(F) \to p

. If we let

\Lambda(F)

also denote the total space of the bundle

\Lambda(F)

(the space that maps down to

X

), then

\alpha'

corresponds to a continuous map from

\Lambda(F)

to

Y

.

David Egolf (May 28 2024 at 17:17):

Now, imagine that we hadn't yet set the topology on

\Lambda(F)

, but we want to set things up so that

\Lambda

is left adjoint to

\Gamma

. Then we are motivated in our choice of topology of

\Lambda(F)

: we need to choose our topology so that all of the induced

\alpha':\Lambda(F) \to Y

are continuous.

David Egolf (May 28 2024 at 17:21):

I am not confident in working with adjunctions yet, so actually using this idea to figure out what topology we'd need on

\Lambda(F)

sounds tricky to me currently. But my rough hope is this:

David Egolf (May 28 2024 at 17:23):

The ideas @Peva Blanchard and @John Baez sketched above relating to the topology on

\Lambda(F)

are also interesting. And one of those may be the way to go instead, I'm not sure!

But I like that this approach lets us imagine a goal that could have led us to our choice of topology on

\Lambda(F)

. Namely, this goal: try to define the topology on

\Lambda(F)

so that

\Lambda

is left adjoint to

\Gamma

Peva Blanchard (May 28 2024 at 21:41):

Indeed, given a natural transformation

\alpha : F \rightarrow \Gamma(p)

, we can define the set-function

\alpha' : \Lambda(F) \rightarrow p

with

s \in FU

and

U

an open neighborhood of

x

. Of course, this requires proving that it is well-defined, i.e., that it is independent of the chosen representative.

And your perspective leads to another candidate for the topology on

\Lambda(F)

. Namely, the coarsest topology on

\Lambda(F)

making all those

\alpha'

continuous.

John Baez (May 28 2024 at 21:58):

That's a good goal, @David Egolf! Not enough people explicitly make this a goal, but I think it's something that can be learned, and category theory can be seen as a huge toolbox of methods for doing exactly this, though every other branch of math is important too.

I'd try to use this method to get those open sets I love, the sets I call

\{ [s]_y \;\vert \; y \in U \}

, as inverse images of the sort you're talking about. Then you'd know that your topology has to at least contain those, which would be a big step forward.

David Egolf (May 29 2024 at 18:16):

I am realizing that it will be helpful to learn more about this adjunction before figuring out what topology it requires on

\Lambda(F)

. (For example, I don't yet know enough about the adjunction to check that an

\alpha'

ends up matching @Peva Blanchard's description). For that reason, I think I want to progress a bit further on the puzzles of the current blog post (and the next one) before thinking about this in more detail. But I have made a note to return to this question later, once we've worked through the discussion of the adjunction in the blog posts!

David Egolf (May 29 2024 at 18:24):

Starting with a natural transformation

\alpha: F \to F'

between presheaves on

X

, we saw above how to form a morphism of bundles from

\Lambda(F)

to

\Lambda(F')

. It remains to show that this process actually defines a functor. (However, I need to rest up today, so I will return to this hopefully tomorrow!)

David Egolf (May 30 2024 at 16:29):

Given

\alpha:F \to F'

, then the induced morphism of bundles

\Lambda(\alpha):\Lambda(F) \to \Lambda(F')

corresponds to the continuous function

[s]_x \mapsto [\alpha_U(s)]_x

, for

s \in F(U)

for

U

some open subset of

X

. This function maps from the topological space

\Lambda(F)

to the topological space

\Lambda(F')

. (Above, we called

\Lambda(\alpha)

by the name

G(\alpha)

.)

Here I am using

\Lambda(F)

to denote both the topological space of germs of

F

and the projection from that space to

X

, which sends each germ to the point it belongs to. Hopefully context will make it clear which usage I intend.

David Egolf (May 30 2024 at 16:31):

Next, let

1_F: F \to F

be the identity natural transformation from

F

to

F

. Then

\Lambda(1_F):\Lambda(F) \to \Lambda(F)

corresponds to the continuous function

[s]_x \mapsto [(1_F)_U(s)]_x

. But since each component of

1_F

is an identity function,

(1_F)_U(s)=s

. Thus, the induced map is

[s]_x \mapsto [s]_x

, which is the identity map from

\Lambda(F)

to

\Lambda(F)

. We conclude that

\Lambda

is preserving identity morphisms.

David Egolf (May 30 2024 at 16:41):

It remains to show that

\Lambda

preserves composition. Let us assume we have

\alpha:F \to F'

and

\beta:F' \to F''

. We wish to show that

\Lambda(\beta \circ \alpha) = \Lambda(\beta) \circ \Lambda(\alpha)

\Lambda(\beta \circ \alpha): \Lambda(F) \to \Lambda(F'')

corresponds to the continuous function

[s]_x \mapsto [(\beta \circ \alpha)_U(s)]_x

. Since

( \beta \circ \alpha)_U = \beta_U \circ \alpha_U

, we have that

[(\beta \circ \alpha)_U(s)]_x=[\beta_U (\alpha_U(s))]_x

But we notice that

\Lambda(\beta) \circ \Lambda(\alpha): \Lambda(F) \to \Lambda(F'')

corresponds to the continuous function

[s]_x \mapsto [\alpha_U(s)]_x \mapsto [\beta_U(\alpha_U(s))]_x

. So, we conclude that

\Lambda(\beta \circ \alpha) = \Lambda(\beta) \circ \Lambda(\alpha)

and so

\Lambda

preserves composition.
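The formulas above implicitly use that the assignment on germs does not depend on the chosen representative. As a reminder, here is a sketch of that check, assuming that two sections s in F(U) and s' in F(U') have the same germ at x, so that they agree on some open W with x in W and W contained in both U and U' (the names s', U' and W are introduced only for this sketch):

```latex
\begin{align*}
\alpha_U(s)|_W &= \alpha_W(s|_W) && \text{(naturality of } \alpha\text{)} \\
               &= \alpha_W(s'|_W) && \text{(since } s|_W = s'|_W\text{)} \\
               &= \alpha_{U'}(s')|_W && \text{(naturality of } \alpha\text{)}
\end{align*}
```

So the two images agree on W, which gives the same germ at x for both, and the map on germs is well defined.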

David Egolf (May 30 2024 at 16:48):

The blog post next begins to discuss the fact that not only can we get a bundle

p:\Lambda(F) \to X

from a presheaf

F

on

X

, but that this bundle has a nice property. Namely, each point

[s]_x

in

\Lambda(F)

has a neighbourhood

V

such that

p|_V:V \to p(V) \subseteq X

is a homeomorphism. Intuitively, each point in

\Lambda(F)

has some little region "near it" that looks (topologically) just like some neighbourhood of its image under

p

. This seems like it might be helpful for understanding

\Lambda(F)

intuitively "around some germ", provided that we know what

X

looks like.

Wikipedia gives a nice picture illustrating this kind of situation:
covering space

David Egolf (May 30 2024 at 16:51):

David Egolf (May 30 2024 at 16:56):

Technically, we wish to show that the restriction of

p

to

V

provides a homeomorphism from

V

to

U

. We define

p|_V: V \to U

by

p|_V([s]_y) = y

, and aim to show this is a homeomorphism. (Since each germ in

V

belongs to a point in

U

, this function really does map to the set

U

David Egolf (May 30 2024 at 16:57):

First, we note that

p|_V

is a bijection. That's because it has an inverse as a function, which I will call

(p|_V)^{-1}

. This inverse function acts by

y \mapsto [s]_y

David Egolf (May 30 2024 at 17:05):

It remains to show that

p|_V

and

(p|_V)^{-1}

are both continuous. We know that

p:\Lambda(F) \to X

is continuous, and that the inclusion function

i:V \to \Lambda(F)

is continuous. Hence

p \circ i:V \to X

is continuous. I seem to recall that

p \circ i: V \to U

is then also continuous, provided that

p \circ i

only takes values in

U

. If that's true, then

p|_V

is continuous.

David Egolf (May 30 2024 at 17:07):

I'll stop here for today. Next time, I'm hoping to finish showing that

p|_V

is a homeomorphism!

John Baez (May 30 2024 at 17:41):

Great, you're moving along nicely here. And your memory is right. The subspace topology on a subset

S

of a topological space

Y

is the one where the open sets of

S

are defined to be the sets

U \cap S

where

U

is open in

Y

. It then instantly follows that if

f: A \to Y

is any continuous function taking values in the subset

S

, it gives a continuous function

f: A \to S

where

S

is given the subspace topology.
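As a sketch of why this follows instantly: an open set of S has the form U ∩ S with U open in Y, and because f takes all its values in S, the preimage of S is all of A. So the preimage of an open set of S is just the preimage of an open set of Y:

```latex
\[
f^{-1}(U \cap S) \;=\; f^{-1}(U) \cap f^{-1}(S) \;=\; f^{-1}(U) \cap A \;=\; f^{-1}(U)
\]
```

The right-hand side is open in A because f is continuous as a map to Y.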

John Baez (May 30 2024 at 17:43):

By the way, some very careful people distinguish notationally between the function

f: A \to Y

in this situation and the function

f: A \to S

, and call the latter a corestriction of the former, by analogy with restriction - since we're shrinking the codomain of

f

instead of its domain.

John Baez (May 30 2024 at 17:44):

Peva Blanchard (May 30 2024 at 21:39):

I wanted to visualize a bit more what the étale space looks like in strange cases, and I thought it could be interesting to share.

Let

X

be the unit interval, and

x \in X

a point.
Let

F

be the presheaf such that, for every open subset

U\subseteq X

FU = \begin{cases} 2 & \text{if } x \not\in U \\ 1 & \text{otherwise} \end{cases}

Here

E

seems to just be the union of two line segments that cross at a point

p

over

x

. It seems to me that

p

has "essentially" two neighborhoods that are homeomorphic to

U

, depending on which line segment you choose.

John Baez (May 31 2024 at 06:04):

Nice! And I guess if you change your mind which line you choose, and get a 'broken line segment', that's not an open neighborhood. This clarifies the original French meaning of the term espace étalé.

Peva Blanchard (May 31 2024 at 09:22):

Actually, my conclusion might be wrong. I've been too sketchy when defining the presheaf

F

. I need to specify the restriction morphisms.

Peva Blanchard (May 31 2024 at 09:59):

Let's write

2 = \{l, r\}

and

1 = \{\bullet\}

. Let

V \subseteq U

be two open subsets in

X

When both

U,V

contain

x

, or when both do not contain

x

, we have

FU = FV

and we define the restriction morphism to be the identity.

The interesting case is when

x \in U

and

x \not\in V

. In that case, let's define

\begin{align*} FU &\rightarrow FV \\ \bullet &\mapsto \begin{cases} l &\text{if } V \text{ is on the left of } x \\ r &\text{if } V \text{ is on the right of } x \\ \end{cases} \end{align*}

I.e., we have three line segments, one of them goes through

p

, and the other two adhere to

p

without touching it.

So we have a unique open neighborhood of

p

that maps homeomorphically to

U

David Egolf (May 31 2024 at 14:59):

These are interesting pictures! Even if the first one is not accurate to

F

, I like how it illustrates this: it's possible to have multiple regions (in this case, lines) that "look the same about a point

p

" while only having the point

p

in common.

Regarding the restriction maps, I don't understand how you are defining

F(r_{U \to V}):F(U) \to F(V)

. You mention the condition "

V

is on the left of

x

" and the condition "

V

is on the right of

x

". But couldn't an open set

V

that doesn't contain

x

have elements both to the left and right of

x

? It wasn't clear to me what the restriction map

F(r_{U \to V})

would be in that case.

Peva Blanchard (May 31 2024 at 15:24):

Oh yes you're right! I am going too fast with the picture. I tend to draw and then formalize hastily in between two meetings. (I should definitely learn the moral lesson...)

The correct definition for

F

, in the case

x \in U

and

x \not\in V

should be (hopefully)

David Egolf (May 31 2024 at 15:29):

I like to keep careful track of the source and target of morphisms, so I suppose I aspire to be one of these "careful people". Thanks for reminding me how corestriction interacts with continuity!

David Egolf (May 31 2024 at 15:36):

I think we can now finish showing that

p|_V: V \to U

and

(p|_V)^{-1}

are continuous. We saw above that

p \circ i:V \to X

is continuous. Then, since

p|_V

is a corestriction of

p \circ i

and

U

has the subspace topology,

p|_V

is continuous.

It remains to show that

(p|_V)^{-1}:U \to V

is continuous. We recall that it acts by

y \mapsto [s]_y

David Egolf (May 31 2024 at 15:48):

To make typing this a little bit easier, I'm going to denote

(p|_V)^{-1}

using the symbol

g_s

. So,

g_s(y) = [s]_y

. I will aim to show that

g_s

is continuous at any point

y \in U

. We saw above that the following collection of sets is a neighbourhood basis of open sets for

[s]_y

: namely

\{[s]_w | w \in W\}

as

W

ranges over the open sets of

X

contained in

U

and also containing

y

. To show that

g_s

is continuous at

y

, it suffices to show that the inverse image under

g_s

of any set in our neighbourhood basis for

g_s(y)=[s]_y

contains an open set containing

y

David Egolf (May 31 2024 at 15:56):

So, let us consider some arbitrary set

b=\{[s]_w|w \in W\}

in our neighbourhood basis for

[s]_y

. Here

W

is an open set containing

y

and contained in

U

. The inverse image of

b

under

g_s

certainly contains

W

, and hence contains an open set containing

y

We conclude that

(p|_V)^{-1}=g_s:U \to V

is continuous at any point in

U

, and so

(p|_V)^{-1}

is a continuous function.
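In fact, the inverse image in this argument can be computed exactly. A sketch, using that a germ at one point is never equal to a germ at a different point (the set of germs is a disjoint union of stalks), and writing y' for a generic point of U:

```latex
\[
g_s^{-1}(b) \;=\; \{\, y' \in U \;\mid\; [s]_{y'} \in \{[s]_w \mid w \in W\} \,\} \;=\; W
\]
```

So the inverse image of each basic open set is itself open, which is slightly more than the pointwise argument needs.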

David Egolf (May 31 2024 at 16:22):

To remember what

V

is, I might instead denote

V

by

g_s(U)

- the set of germs of

s

associated to the points of

U

. Intuitively, "taking germs over

U

of a fixed

s \in F(U)

" produces a topological space just like

U

. Since there are potentially many different elements in

F(U)

, there are potentially many "copies" of

U

in

\Lambda(F)

, given by

g_t(U)

as

t

varies over the elements of

F(U)

Peva Blanchard (May 31 2024 at 16:39):

Yes it is exactly this observation that led me to play with strange cases. The trivial case is when all those copies are disjoint (e.g., like the parallel copies in the helix-like picture you posted before). Then I wondered what happens if we glue some of them at some specific point.

David Egolf (Jun 03 2024 at 16:04):

I believe we have now worked through the second blog post in the series! On to Part 3!

David Egolf (Jun 03 2024 at 16:19):

Part 3 begins by presenting a different way to specify the "sheaf condition" for a presheaf. Although this is not listed as an official puzzle, I would like to understand why this new formulation of the sheaf condition is equivalent to the formulation we've been using so far.

I was going to start by stating the new formulation of the sheaf condition. However, I don't understand it well enough to do so!

David Egolf (Jun 03 2024 at 16:21):

The new formulation involves a diagram that looks like this, where we have a collection of open sets

U_i \subseteq U

that cover the open set

U

of

X

, and

F

is a presheaf on

X

:
diagram

I believe I understand how

f

is defined. It is the function induced using the universal property of products via the collection of restriction functions

f_i:F(U) \to F(U_i)

. So,

f

sends each sheaf element on

U

to the tuple of its restrictions over the

U_i

that cover

U

. Intuitively, we are "decomposing" a sheaf element into smaller pieces.

David Egolf (Jun 03 2024 at 16:23):

I don't yet understand how

g

and

h

are defined. I think we will again be using the universal property of products, but it seems confusing at the moment... I will stop here for today!

John Baez (Jun 03 2024 at 16:54):

It may help to think in a low brow way. Think of an element of

\prod_i F(U_i)

as a list of elements

s_i \in F(U_i)

, one for each

i

. Think of an element of

\prod_{i,j} F(U_i \cap U_j)

as a list of elements

s_{ij} \in F(U_i \cap U_j)

, one for each pair

i,j

. (Don't take the word 'list' too seriously here: the order doesn't matter, etc.) To get a map

John Baez (Jun 03 2024 at 16:55):

John Baez (Jun 03 2024 at 17:01):

Peva Blanchard (Jun 03 2024 at 17:03):

\begin{align*} U_i &\xrightarrow{r_{i1}} U_i \cap U_1 \\ U_i &\xrightarrow{r_{i2}} U_i \cap U_2 \\ U_i &\xrightarrow{r_{i3}} U_i \cap U_3 \\ \end{align*}

On the other hand, if we fix

j \in \{1, 2, 3\}

, we also get 3 restriction morphisms

\begin{align*} U_1 &\xrightarrow{r_{1j}} U_1 \cap U_j \\ U_2 &\xrightarrow{r_{2j}} U_2 \cap U_j \\ U_3 &\xrightarrow{r_{3j}} U_3 \cap U_j \\ \end{align*}

By taking the relevant products, and using the associativity, you get the two ways.

John Baez (Jun 04 2024 at 08:28):

That's a clearer hint than mine. By the way, there's a highbrow way of thinking about this stuff in terms of the 'Cech nerve', 'descent' and the 'bar construction', which we discussed here, but I feel the lowbrow approach we're taking here is a good warmup for that highbrow approach.

David Egolf (Jun 04 2024 at 16:01):

Thanks to both of you for the hints! I will be referencing them as I try to better understand

g

and

h

We want to define some function

:\prod_iF(U_i) \to \prod_{i,j}F(U_i \cap U_j)

. We know by the universal property of products that such functions correspond bijectively to cones with tip

\prod_i F(U_i)

over the discrete diagram having objects of the form

F(U_i \cap U_j)

. So, to find a morphism

:\prod_iF(U_i) \to \prod_{i,j}F(U_i \cap U_j)

, it suffices to find one morphism from

\prod_k F(U_k)

to

F(U_i \cap U_j)

for each

(i,j)

David Egolf (Jun 04 2024 at 16:04):

Now, to describe a function, it suffices to say what the function does to each element. An arbitrary element of

\prod_kF(U_k)

is a tuple of presheaf elements, where we have one element associated to each

U_k

. So, given a tuple of presheaf elements associated to the open sets

U_k

in our cover, we want to determine some corresponding element in each

F(U_i \cap U_j)

What is an element of

F(U_i \cap U_j)

? It is a presheaf element associated to

U_i \cap U_j

. So, from a tuple of presheaf elements having one element for each open set in our open cover, we want to determine some presheaf element on

U_i \cap U_j

David Egolf (Jun 04 2024 at 16:15):

Let

t \in \prod_k F(U_k)

be such a tuple of presheaf elements.

t

has a presheaf element in it associated in particular to

U_i

, and it also has one associated to

U_j

. I will denote the presheaf element of

t

associated to

U_i

by

t_i

, and the one associated to

U_j

by

t_j

If we restrict

t_i \in F(U_i)

to

U_i \cap U_j

, we get an element of

F(U_i \cap U_j)

. We can build up a function

:\prod_kF(U_k) \to F(U_i \cap U_j)

using this idea. Namely, let the function act by

t \mapsto F(r_{U_i \to U_i \cap U_j})(t_i)

, where

F(r_{U_i \to U_i \cap U_j}): F(U_i) \to F(U_i \cap U_j)

is a restriction function provided by our presheaf.

David Egolf (Jun 04 2024 at 16:17):

We can use this approach to build a function

:\prod_kF(U_k) \to F(U_i \cap U_j)

for each

(i,j)

. These functions all together then induce a unique function

:\prod_k F(U_k) \to \prod_{i,j} F(U_i \cap U_j)

by the universal property of products.

David Egolf (Jun 04 2024 at 16:21):

Intuitively, this function takes a tuple

t

of presheaf elements over our open cover, and sends this tuple to a tuple of presheaf elements. The output tuple has one element associated to each pairwise intersection

U_i \cap U_j

of our open cover sets. Namely, it associates the restriction of

t_i \in F(U_i)

to

U_i \cap U_j

David Egolf (Jun 04 2024 at 16:25):

Now, we can also define a second function in a similar way. It will also be built up from functions

:\prod_k F(U_k) \to F(U_i \cap U_j)

as

(i,j)

varies. We define the

(i,j)

-th inducing function as

t \mapsto F(r_{U_j \to U_i \cap U_j})(t_j)

. Then, we can use the universal property of products to induce a unique function

:\prod_k F(U_k) \to \prod_{i,j}F(U_i \cap U_j)

David Egolf (Jun 04 2024 at 16:27):

In brief, this function takes a tuple

t

of presheaf elements over the sets of our open cover, and for each pairwise intersection

U_i \cap U_j

of those open sets it associates the restriction of

t_j \in F(U_j)

to

U_i \cap U_j

David Egolf (Jun 04 2024 at 16:31):

I think that's right. Assuming so, it appears that the functions I was looking for weren't all that complicated! Next time, building on this, I plan to work on understanding what it means for this to be an equalizer diagram:
diagram
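To make the three maps concrete, here is a minimal sketch in Python. It is only an illustration under assumptions that are not from the blog post: the presheaf is taken to be "all functions from U to {0, 1}" on a three-point set, the cover has two members, and the helper names (`sections`, `restrict`, `cover`, `f`, `g`, `h`) are made up for this example.

```python
from itertools import product

def sections(U, values=(0, 1)):
    """F(U): all functions U -> values, each stored as a frozenset of (point, value) pairs."""
    pts = sorted(U)
    return [frozenset(zip(pts, vals)) for vals in product(values, repeat=len(pts))]

def restrict(s, V):
    """Restriction map F(U) -> F(V) for V inside U: forget the points outside V."""
    return frozenset((x, v) for (x, v) in s if x in V)

# Hypothetical cover of U = {1, 2, 3} by U_1 = {1, 2} and U_2 = {2, 3}.
U = frozenset({1, 2, 3})
cover = {1: frozenset({1, 2}), 2: frozenset({2, 3})}
pairs = [(i, j) for i in cover for j in cover]

def f(s):
    """F(U) -> prod_i F(U_i): the tuple of restrictions of s."""
    return {i: restrict(s, cover[i]) for i in cover}

def g(t):
    """The (i, j) component restricts t_i to the intersection of U_i and U_j."""
    return {(i, j): restrict(t[i], cover[i] & cover[j]) for (i, j) in pairs}

def h(t):
    """The (i, j) component restricts t_j to the intersection of U_i and U_j."""
    return {(i, j): restrict(t[j], cover[i] & cover[j]) for (i, j) in pairs}

# A tuple whose two pieces disagree at the shared point 2.
t = {1: frozenset({(1, 0), (2, 0)}), 2: frozenset({(2, 1), (3, 1)})}
print(g(t)[(1, 2)], h(t)[(1, 2)])

# Every tuple in the image of f is sent to the same place by g and h.
print(all(g(f(s)) == h(f(s)) for s in sections(U)))
```

The tuple `t` is a case where `g` and `h` differ in the (1, 2) component, while anything of the form `f(s)` cannot tell them apart.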

David Egolf (Jun 04 2024 at 16:45):

(A side note: I just noticed that the phrase "a function is determined by its values on each of its inputs" can be thought of as an implicit reference to the universal property of coproducts in

\mathsf{Set}

. That's kind of fun!)

John Baez (Jun 04 2024 at 16:57):

Jean-Baptiste Vienney (Jun 04 2024 at 17:01):

Be aware it’s not exactly the same thing. The coproduct in

\mathbf{Set}

is the disjoint union of sets. Therefore a function

f:\underset{i \in I}{\bigsqcup}X_i \rightarrow Y

corresponds to family of functions

(f_i:X_i\rightarrow Y)_{i \in I}

“A function is determined by its values on each of its inputs” corresponds to the bijection

\mathbf{Set}[X,Y] \cong \underset{x \in X}{\prod}Y

. By the universal property of products, it gives you that a function

f:X \rightarrow Y

can be thought of as a family

(f(x))_{x \in X}

of elements of

Y

Jean-Baptiste Vienney (Jun 04 2024 at 17:21):

Oh, now I see it. You can also interpret this sentence using the universal property of coproducts. You can write

X \cong \underset{x \in X}{\bigsqcup}\{*\}

and the universal property of the coproduct gives you that a function

f:X\rightarrow Y

corresponds to a family

(f_x:\{*\} \rightarrow Y)_{x \in X}

that is a family

(f(x)=f_x(*))_{x \in X}

of elements of

Y
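Putting the two readings side by side, the chain of bijections being used can be sketched as follows (each step is one of the universal properties mentioned above, plus the fact that a function out of a one-point set is the same thing as an element):

```latex
\[
\mathbf{Set}[X, Y]
\;\cong\; \mathbf{Set}\Big[\bigsqcup_{x \in X} \{*\},\, Y\Big]
\;\cong\; \prod_{x \in X} \mathbf{Set}[\{*\}, Y]
\;\cong\; \prod_{x \in X} Y
\]
```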

John Baez (Jun 04 2024 at 18:14):

Yes, and we're also using a very special feature of the category of sets here, which is that every object is a coproduct of copies of the terminal object.

John Baez (Jun 04 2024 at 18:15):

This makes the category of sets "boringly simple" compared to other categories. But of course that boring simplicity is exactly why we like it!

Peva Blanchard (Jun 04 2024 at 18:18):

@John Baez Oh but I remember an article of yours, at the n-category café, which presented some very nice characterization of the category of sets. I'm trying to google it, but I can't remember the exact keywords. I think there was some stuff involving comonoids...

John Baez (Jun 04 2024 at 18:32):

Most relevant here is that

\mathsf{Set}

is the free category with coproducts on one object, and it then turns out that this object is the terminal object. But it sounds like you're thinking of something else.

John Baez (Jun 04 2024 at 18:36):

I don't know what article you're talking about, but maybe you mean that

\mathsf{FinSet}

is the free symmetric monoidal category on a commutative monoid object; it then turns out that this object is the terminal object.

John Baez (Jun 04 2024 at 18:36):

I probably did blog about this ages ago, as part of my "Coffee for Theorems" series.

Peva Blanchard (Jun 04 2024 at 18:37):

Yes, I was mostly reacting to "boringly simple" because I remember my impression that the category of Set can really be "non-boringly non-simple".

David Egolf (Jun 06 2024 at 15:43):

I wonder if, in a category with coproducts, it makes sense to think of the "elements" in that category as the objects that can't be written as a "non-boring" coproduct (that is, excluding things like

A \cong A \coprod 0

) of other objects. (The "indecomposable" objects, if that's the right term). I'm motivated in this intuition by this fact relating to the universal property of coproducts: a morphism from some object

A = \coprod_i B_i

is determined by corresponding morphisms from the

B_i

David Egolf (Jun 06 2024 at 15:44):

I was hoping to work out today what it means for our diagram above to be an equalizer. However, it turns out I need to rest up today. So, I hope to get back to that tomorrow!

John Baez (Jun 06 2024 at 15:52):

I don't know if it makes sense to think of indecomposable objects (yes, that's the right term) as "elements" - that's a sort of squishy question, so let me just say that I've never thought of them as "elements". However, they are important. They're especially important in categories where every object is a coproduct of indecomposables: then they serve as "building blocks" for general objects in a very nice way.

John Baez (Jun 06 2024 at 15:55):

For example, a functor from a group

G

to

\mathsf{Vect}

is called a representation of

G

, and there's a category

\mathsf{Rep}(G) = \mathsf{Vect}^G

with representations of

G

as objects and natural transformations as morphisms. There's a huge industry devoted to understanding

\mathsf{Rep}(G)

for various kinds of groups

G

John Baez (Jun 06 2024 at 15:57):

If

G

is a finite group, we have a wonderful theorem: every object in

\mathsf{Rep}(G)

is a coproduct of indecomposable objects! Moreover it's a coproduct in a unique way (up to isomorphism and permuting the guys you're taking the coproduct of).

John Baez (Jun 06 2024 at 15:59):

Even better, if we are using vector spaces over an algebraically closed field

k

\mathbb{C}

, then if we have two indecomposable objects

R, R' \in \mathsf{Rep}(G)

they are either isomorphic, in which case

John Baez (Jun 06 2024 at 16:01):

This should remind you of a "Kronecker delta"

\delta_{x,x'}

which is

1

if

x = x'

and

0

otherwise. It's a categorified Kronecker delta, where

\mathrm{hom}(R,R')

is 1-dimensional if

R \cong R'

and 0-dimensional otherwise!

John Baez (Jun 06 2024 at 16:04):

So these indecomposables are not only building blocks for all objects, they "don't talk to each other" - there are no interesting morphisms between nonisomorphic indecomposables.
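A small worked example may help here. It is only a sketch, assuming finite-dimensional representations over the complex numbers and taking G to be the two-element group with generator σ; the names for the two indecomposables below are chosen just for this illustration. The generator squares to the identity, so any representation splits into its +1 and -1 eigenspaces:

```latex
\begin{align*}
& \mathbb{C}_{+}: \ \sigma \text{ acts as } +1, \qquad \mathbb{C}_{-}: \ \sigma \text{ acts as } -1 \\
& V \;\cong\; \mathbb{C}_{+}^{\,a} \oplus \mathbb{C}_{-}^{\,b} \quad \text{for a unique pair } (a, b) \\
& \mathrm{hom}(\mathbb{C}_{+}, \mathbb{C}_{+}) \cong \mathbb{C}, \qquad
  \mathrm{hom}(\mathbb{C}_{-}, \mathbb{C}_{-}) \cong \mathbb{C}, \qquad
  \mathrm{hom}(\mathbb{C}_{+}, \mathbb{C}_{-}) = \mathrm{hom}(\mathbb{C}_{-}, \mathbb{C}_{+}) = 0 \\
& \mathrm{hom}\big(\mathbb{C}_{+}^{\,a} \oplus \mathbb{C}_{-}^{\,b},\ \mathbb{C}_{+}^{\,c} \oplus \mathbb{C}_{-}^{\,d}\big)
  \;\cong\; \mathrm{Mat}_{c \times a}(\mathbb{C}) \oplus \mathrm{Mat}_{d \times b}(\mathbb{C})
\end{align*}
```

The last line is the "don't talk to each other" statement in action: a morphism is just a pair of matrices, one for each type of building block, with no cross terms.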

John Baez (Jun 06 2024 at 16:05):

I'd call this a very "crunchy" situation - it's hard to put into words, but it's very much the opposite of a floppy, sloppy situation, it's very well-disposed to concrete calculations, and it makes the question of finding the indecomposable representations of a finite group incredibly important, because once you know them, you really know a lot!

John Baez (Jun 06 2024 at 16:08):

You said you wanted to learn how to create nice mathematical situations and structures. I guess one thing that helps is learning a bit about some of the classic examples of situations that mathematicians really love, and why they're loved.

\mathsf{Rep}(G)

for a finite group

G

is one of these. Any category where every object is a coproduct of indecomposables has a similar crunchy feel to it.

John Baez (Jun 06 2024 at 16:25):

I said I don't think of indecomposables as "elements", at least not in a sense similar to set-theoretic elements. But people often do think of them as similar to "atoms"... or elements in the chemical sense!

The discovery that every group representation is a coproduct of indecomposables in a unique way is very similar to the discovery that all molecules are made of atoms which can't be further decomposed (well, that's what they thought anyway) - and moreover, that this decomposition is unique.

The world would be a much more tricky place if I could decompose water into one oxygen atom and two hydrogen atoms but you, using another technique, could decompose it into a phosphorus atom and a carbon atom.

Peva Blanchard (Jun 06 2024 at 16:56):

If I understand correctly, in the case of sets, the atomic objects are the empty set and the singleton sets. So, there are only two iso classes of atomic objects,

0

and

1

. And, there is a unique arrow from

0

to

1

. This is in sharp contrast with

Rep(G)

Also, when we fix a singleton set

1

, an element of an object

A

is usually presented as an arrow

1 \rightarrow A

. And the set of elements of

A

corresponds to the hom set

Set(1, A)

So, in an arbitrary category with coproducts, it makes sense to discuss, on one hand, atomic objects (indecomposable as a coproduct), and, on the other hand, elements of an object. But it's not obvious to me how to relate the two.

John Baez (Jun 06 2024 at 17:08):

Yes,

\mathsf{Rep}(G)

is like

\mathsf{Vect}

, and very different from

\mathsf{Set}

, in that its initial object is also terminal.

John Baez (Jun 06 2024 at 17:09):

It's an [[abelian category]], and the property I'm talking about, that every object is uniquely a coproduct of indecomposables, is essentially the same as it being [[semisimple]].

Oscar Cunningham (Jun 06 2024 at 17:14):

I think it's best not to consider

0

to be indecomposable. For the same reason

1

isn't prime.

John Baez (Jun 06 2024 at 17:32):

John Baez (Jun 06 2024 at 17:45):

You need the initial object not to count as indecomposable if you want to get the kind of result I'm claiming happens in

\mathsf{Rep}(G)

for a finite group: that every object is uniquely a coproduct of indecomposables. Just like 1 mustn't be prime if you want every positive integer to be uniquely a product of primes.

John Baez (Jun 06 2024 at 17:48):

David Egolf (Jun 07 2024 at 16:15):

I look forward to resuming posting in this thread (and some others)! There's a lot of interesting posts various people have made that I look forward to responding to.

Unfortunately, I have been struggling more with my chronic fatigue the last few days. So, I think I need to rest up for several days and then hopefully I can return to posting in these threads. Here's wishing everyone a pleasant weekend!

Peva Blanchard (Jun 07 2024 at 16:42):

No worries, as far as I am concerned, no need to give reasons for why you are not posting. Rest well :)

David Egolf (Jun 16 2024 at 16:05):

It's interesting to me that this is a "nice mathematical situation". At first glance, I might have assumed it was "too boring" or "too simple"! But I suppose the idea roughly is that we have a bunch of "independent building blocks" in this category, which sounds handy.

David Egolf (Jun 16 2024 at 16:07):

Specifically, I am interested in understanding what it means for this diagram to be an equalizer. It's going to take me a minute to remember what we already said about this diagram!

David Egolf (Jun 16 2024 at 16:12):

David Egolf (Jun 16 2024 at 16:22):

Continuing the review, focusing now on the functions

f,g,h

in our diagram above:

John Baez (Jun 16 2024 at 16:25):

Your impression that this is "too simple" shows you have good intuitions. The field of homological algebra is heavily focused on how categories deviate from the simple behavior described here, so from that point of view categories of representations of finite groups are boring. But:

1) When you're building a house, you don't want the bricks to be interesting: you want them to behave in simple nice ways. So, like the category of sets or the category of vector spaces, the category of representations of a finite group is a great thing for building further structures!

John Baez (Jun 16 2024 at 16:33):

2) Knowing general facts about categories of representations of finite groups is not the end of learning about these categories - it's just the start. We want to know all the indecomposable representations of all finite groups, and we want to know everything about them. That's an endless task... but luckily, it's fascinating and leads to many mind-blowing discoveries.

John Baez (Jun 16 2024 at 16:36):

But I'll restrain myself. Back to why the sheaf condition is an equalizer condition!

David Egolf (Jun 17 2024 at 18:09):

Continuing to try and understand why the sheaf condition is an equalizer condition, I next want to think about this: what does it mean for some

t \in \prod_i F(U_i)

to satisfy

g(t) = h(t)

? For

g(t)=h(t)

, we must have

g(t)_{i,j} = h(t)_{i,j}

for each

(i,j)

. We know that

g(t)_{i,j}

is the restriction of

t_i \in F(U_i)

to

U_i \cap U_j

. And

h(t)_{i,j}

is the restriction of

t_j \in F(U_j)

to

U_i \cap U_j

. For these two to be equal, we must have that

(t_i)|_{U_i \cap U_j} = (t_j)|_{U_i \cap U_j}

. In other words, the two parts of

t

that "overlap" on

U_i \cap U_j

must agree on the overlap.

David Egolf (Jun 17 2024 at 18:14):

I know how to find an equalizer of a diagram in

\mathsf{Set}

. In this case, an equalizer of our diagram will be given by:

David Egolf (Jun 17 2024 at 18:19):

Combining the two paragraphs above, the object part of an equalizer of our diagram is given by the subset

A

of

\prod_i F(U_i)

consisting of elements

t \in \prod_i F(U_i)

such that

(t_i)|_{U_i \cap U_j} = (t_j)|_{U_i \cap U_j}

for all

(i,j)

. Intuitively, each element of this set

A

consists of a way to assign a presheaf element (provided by

F

) to each open subset

U_i

in our open cover for

U

, such that the data we pick "agrees on overlaps". Together, the entire equalizer set

A

corresponds to all the different ways in which we can do this.
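The description of A can be checked by brute force in a tiny case. The sketch below is an illustration only, under assumptions not taken from the blog post: the presheaf is "all functions from U to {0, 1}" on a three-point set, the cover has two members, and the helper names (`sections`, `restrict`, `agrees_on_overlaps`) are made up here.

```python
from itertools import product

def sections(U, values=(0, 1)):
    """F(U): all functions U -> values, stored as frozensets of (point, value) pairs."""
    pts = sorted(U)
    return [frozenset(zip(pts, vals)) for vals in product(values, repeat=len(pts))]

def restrict(s, V):
    """Restriction map: keep only the points of V."""
    return frozenset((x, v) for (x, v) in s if x in V)

# Hypothetical cover of U = {1, 2, 3} by {1, 2} and {2, 3}.
U = frozenset({1, 2, 3})
cover = [frozenset({1, 2}), frozenset({2, 3})]
idx = range(len(cover))

def agrees_on_overlaps(t):
    """True when t_i and t_j have the same restriction to each pairwise intersection."""
    return all(
        restrict(t[i], cover[i] & cover[j]) == restrict(t[j], cover[i] & cover[j])
        for i in idx for j in idx
    )

# The product of the F(U_i), and the equalizer subset A inside it.
all_tuples = list(product(*(sections(Ui) for Ui in cover)))
A = [t for t in all_tuples if agrees_on_overlaps(t)]

# f sends a section over U to its tuple of restrictions.
image_of_f = [tuple(restrict(s, Ui) for Ui in cover) for s in sections(U)]

print(len(all_tuples), len(A), len(sections(U)))   # 16 8 8
print(set(image_of_f) == set(A))                   # True
```

In this example there are 16 tuples in the product, of which 8 agree at the shared point, and those 8 are exactly the tuples that come from restricting a function defined on all of U.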

Todd Trimble (Jun 17 2024 at 18:25):

The only thing I would change in that is to change "to each open set of

X

" to "to each open set of the covering" (i.e., to each

U_i

). Otherwise, looks good.

David Egolf (Jun 17 2024 at 18:39):

David Egolf (Jun 17 2024 at 18:50):

It remains to consider the case in which

F(U)

together with

f

also provides an equalizer for our diagram. Since limits are unique up to isomorphism, if

F(U)

is the object part of an equalizer, that implies we have a bijection between

F(U)

and the set of all ways

A

to pick data from each open set in our cover such that the selected data agrees on overlaps.

Let

\alpha: A \to F(U)

be the canonical isomorphism (induced by the universal property of limits). Picking some data

t_i \in F(U_i)

for each

U_i

in our open cover such that

t_i

and

t_j

agree on

U_i \cap U_j

for all

(i,j)

corresponds to picking an element

t

of

A

. Then

\alpha(t)

is an element

\alpha(t) \in F(U)

such that

f(\alpha(t))=t

. That is, restricting

\alpha(t)

to each

U_i

gives us

t_i

David Egolf (Jun 17 2024 at 18:51):

David Egolf (Jun 17 2024 at 18:55):

In this case,

\alpha

corresponds to a "stitching together" process that takes in presheaf data on the open sets

U_i

covering

U

that agrees on overlaps, and returns a "global" presheaf element in

F(U)

that restricts to the data we selected on each

U_i

. The existence of

\alpha

together with the fact that

A

is (the object part of) an equalizer tells us that we can always perform this stitching process given appropriate input data.

(Note that

F(U)

together with

f

isn't always an equalizer for our diagram! There will be lots of presheaves for which this is not the case. But I'm considering here the case in which this data does provide an equalizer).
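For a concrete presheaf where this fails, here is a minimal Python sketch. It uses an illustrative example that is not from the blog post: the presheaf of constant {0, 1}-valued functions on a two-point discrete space, with the empty set carrying a single (empty) function. All helper names are made up for this example.

```python
from itertools import product

def constant_sections(U, values=(0, 1)):
    """F(U): constant functions U -> values; the empty set carries one empty function."""
    if not U:
        return [frozenset()]
    return [frozenset((x, v) for x in U) for v in values]

def restrict(s, V):
    """Restriction map: keep only the points of V (a constant function stays constant)."""
    return frozenset((x, v) for (x, v) in s if x in V)

# Hypothetical cover of U = {1, 2} by two disjoint open sets.
U = frozenset({1, 2})
cover = [frozenset({1}), frozenset({2})]
idx = range(len(cover))

def agrees_on_overlaps(t):
    """True when t_i and t_j have the same restriction to each pairwise intersection."""
    return all(
        restrict(t[i], cover[i] & cover[j]) == restrict(t[j], cover[i] & cover[j])
        for i in idx for j in idx
    )

# Compatible tuples (the equalizer of g and h) versus the image of f.
A = [t for t in product(*(constant_sections(Ui) for Ui in cover)) if agrees_on_overlaps(t)]
image_of_f = {tuple(restrict(s, Ui) for Ui in cover) for s in constant_sections(U)}

print(len(A), len(image_of_f))   # 4 2
missing = [t for t in A if t not in image_of_f]
print(missing)
```

Here the overlap is empty, so every pair of local constants is compatible, but a pair with different values on the two pieces cannot be stitched into a single constant function on U. The tuples in `missing` are exactly those pairs.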

David Egolf (Jun 17 2024 at 18:58):

John Baez (Jun 17 2024 at 22:24):

This is great, @David Egolf! You not only got the idea, you figured out how to explain it well, conveying the intuition. The phrase "stitching together" is very good for how we assemble an element of

F(U)

from elements of the

F(U_i)

that agree when restricted to the

U_i \cap U_j

David Egolf (Jun 19 2024 at 17:26):

Now, to show that

F

is acting like a sheaf with respect to

U

and its open cover of the

U_i

, there is still a little more work to do. We've seen that if

F(U)

and

f

are an equalizer (as above), then we can always form an element of

F(U)

from elements

t_i

of the

F(U_i)

that "agree on overlaps". It remains to show that in this case there is a unique element of

F(U)

that restricts to each of these elements

t_i

on the corresponding

U_i

David Egolf (Jun 19 2024 at 17:32):

Let

t \in A

be some tuple of

t_i \in F(U_i)

such that the

t_i

"agree on overlaps". We want to show there is a unique element

x

in

F(U)

that maps to

t

under

f

. Let us assume that we have the situation

f(x)=f(x')=t

for some

x,x' \in F(U)

. Since

\alpha

is a bijection, there is a unique

a \in A

so that

\alpha(a)=x

and similarly a unique

a' \in A

so that

\alpha(a')=x'

. Therefore,

f(x)=f(x')=t

implies that

f(\alpha(a)) = f(\alpha(a'))

. Since

m = f \circ \alpha

, this implies that

m(a) = m(a')

. Since

m

is injective, this implies that

a=a'

and hence

\alpha(a)=x=x'=\alpha(a')

. We conclude that there is indeed a unique element of

F(U)

that restricts to each

t_i

on

U_i
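The uniqueness argument above can be compressed into one line. This is only a restatement, assuming as in the thread that α is a bijection, that m is injective, and that m = f ∘ α.

```latex
\[
m = f \circ \alpha
\;\Longrightarrow\;
f = m \circ \alpha^{-1}
\]
% A composite of injective functions is injective, so
\[
f(x) = f(x') \;\Longrightarrow\; x = x' \qquad \text{for all } x, x' \in F(U).
\]
```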

David Egolf (Jun 19 2024 at 17:35):

So, if

F(U)

and the corresponding

f:F(U) \to \prod_i F(U_i)

are always an equalizer for the diagram under discussion, given any open set

U

of

X

and any open cover

U_i

of

U

(with each

U_i \subseteq U

), then I think that

F

is a sheaf.

David Egolf (Jun 19 2024 at 17:37):

It remains to show that if

F

is a sheaf, then

F(U)

and

f:F(U) \to \prod_i F(U_i)

provide an equalizer for any version (as

U

and its open cover vary) of the diagram discussed above. But I will leave that for next time!

David Egolf (Jun 20 2024 at 16:42):

Alright, we're in the home stretch now! Let's assume that

F

is a sheaf. We want to show that this diagram is then an equalizer diagram (for any open set

U

of

X

and any open cover of

U

using some

U_i

with each

U_i \subseteq U

):
diagram

David Egolf (Jun 20 2024 at 16:48):

To do this, let's suppose we have some other cone over

g

and

h

. We want to show there is a unique morphism from this cone to our proposed equalizer cone, in the category of cones over

g

and

h

:
morphism of cones

David Egolf (Jun 20 2024 at 16:52):

Now, each element

x

of

X

corresponds to some element

f'(x) \in \prod_i F(U_i)

, such that

g(f'(x)) = h(f'(x))

. As we saw earlier, this means that

f'(x)

picks out some data from each

F(U_i)

such that the data selected on

U_i

and

U_j

agree when restricted to

U_i \cap U_j

, for all

(i,j)

. So, we can think of each element

x

of

X

as being associated to some collection of data

f'(x)

on the

U_i

that "agrees on overlaps".

David Egolf (Jun 20 2024 at 16:56):

Since

F

is a sheaf, we know there is a unique element

\beta(x) \in F(U)

that restricts to each

f'(x)_i

on

U_i

, for all

i

. So, let's define

\beta

in that way: it sends

x \in X

associated to some collection

f'(x)

of "compatible on overlaps" data to the unique "stitched-together" element of

F(U)

induced by the data

f'(x)

David Egolf (Jun 20 2024 at 17:00):

We want to show that there is only this one way to define

\beta

. We need

f(\beta(x)) = f'(x)

for any

x \in X

. That is,

\beta(x)

must restrict on each

U_i

to

f'(x)_i

. Since

F

is a sheaf, there is only one such element of

F(U)

that satisfies this condition. So, indeed there is a unique morphism of cones to our cone involving

F(U)

and

f
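In symbols, the map β is pinned down as follows. Here X is the apex of the other cone and f' is its leg, as in the thread; the names s, t and β' for generic elements and maps are assumptions made for this sketch.

```latex
% Sheaf condition: every compatible family has exactly one gluing.
\[
\forall\, t \in A \quad \exists!\, s \in F(U) : \quad f(s) = m(t)
\]
% Each f'(x) is a compatible family, since g(f'(x)) = h(f'(x)). So define
\[
\beta : X \to F(U), \qquad \beta(x) = \text{the unique } s \in F(U) \text{ with } f(s) = f'(x),
\]
% which gives f \circ \beta = f', and forces any other such map to equal \beta:
\[
f \circ \beta' = f' \;\Longrightarrow\; \beta' = \beta .
\]
```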

David Egolf (Jun 20 2024 at 17:03):

We conclude that if

F

is a sheaf on

X

,

U

is an open set in

X

, and we have an open cover for

U

formed from sets

U_i

with each

U_i \subseteq U

, then this diagram is always an equalizer diagram:
diagram

David Egolf (Jun 20 2024 at 17:07):

David Egolf (Jun 26 2024 at 15:30):

By applying

\Gamma

after

\Lambda

, we can make a sheaf from any presheaf! The next order of business is to understand in detail how this works. [I've not had the energy for math the last little while, but when I have a bit more energy I plan to start working this exercise!]

John Baez (Jun 26 2024 at 17:48):

I will just say that this business of turning a presheaf into a sheaf is called "sheafification", and there's at least one very nice way to understand it other than merely saying it's the composite

\Gamma \circ \Lambda

. So this is a very worthwhile thing to think about. For example, I think you can see that it's the left adjoint of the forgetful functor from sheaves to presheaves.

David Egolf (Jul 11 2024 at 23:20):

David Egolf (Jul 11 2024 at 23:21):

I am unsure what is meant here by a section of a sheaf. (The sheafification of

F

is a sheaf).

In the blog posts so far, we've talked about sections of bundles, but not sheaves, as far as I can remember. However, we recently saw that there is an equivalence of categories, between the category of sheaves on

X

and the category of etale spaces over

X

. When we talk about "a section of a sheaf", is this perhaps a way to refer to a section of the sheaf's corresponding etale space (which is a bundle) under this equivalence?

David Egolf (Jul 11 2024 at 23:32):

I am guessing that this is the intended meaning in the snippet of Part 3 that I quoted above.

David Egolf (Jul 11 2024 at 23:39):

Under this assumption, we can rewrite the snippet I quoted above. (I use

F'

to refer to the sheafification of

F

.)

For example, if

F

is the presheaf of bounded continuous real-valued functions on open subsets of

\mathbb{R}

, then an element of

F'(\mathbb{R})

can be obtained by stitching together a bunch of bounded continuous real-valued functions, defined on various open subsets

U_i

of

\mathbb{R}

, provided that the

U_i

cover

\mathbb{R}

and the selected bounded continuous real-valued functions agree on overlaps. In particular, such an element is not necessarily bounded. So we see that the sheafification

F'

of a presheaf

F

can (at least sometimes) assign new data to an open set - data that the original presheaf

F

did not assign to that open set!
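Here is one concrete instance of this, as a sketch. The cover by the intervals U_n and the functions s_n are a hypothetical choice of example, not something taken from the blog post.

```latex
% Hypothetical example: cover R by bounded intervals, use the identity on each.
\[
U_n = (n-1,\, n+1), \qquad s_n : U_n \to \mathbb{R}, \quad s_n(x) = x \qquad (n \in \mathbb{Z})
\]
% Each s_n is continuous and bounded, since |s_n(x)| < |n| + 1 on U_n.
% On an overlap, s_n and s_m are both the identity, so they agree.
\[
s_n|_{U_n \cap U_m} = s_m|_{U_n \cap U_m}
\]
% Stitching gives the identity function on R, which is continuous but unbounded:
\[
s = \mathrm{id}_{\mathbb{R}}, \qquad s|_{U_n} = s_n \text{ for all } n, \qquad s \notin F(\mathbb{R}).
\]
```

So the stitched-together element of F'(ℝ) corresponds to a function that F(ℝ) itself does not contain.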

Peva Blanchard (Jul 12 2024 at 10:27):

Indeed, I was a bit surprised too, when reading papers, to see that an element

s \in F(U)

is also called a section of

F

over

U

. Because a section is usually defined w.r.t. a bundle

E \to B

I think there is a way to make this way of speaking rigorous by noticing that, as presheaves,

F

is included in its sheafification

F'

. The inclusion is given, for every open subset

U

, by

s \mapsto [s]

where

[s]

is the continuous function from

U

to the étale space of

F

that assigns to every

x \in U

the germ

[s]_x

of

s

at

x

The function

[s]

is a section of the étale space of

F

.
And the map

s \mapsto [s]

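is injective, at least when the presheaf is separated (that is, when its elements are determined by their germs).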
This is why I think we can speak of

s \in F(U)

as a section of

F

over

U

Kevin Carlson (Jul 12 2024 at 16:59):

Yes, that's right, in the settings where a sheaf can be seen as a local homeomorphism, its sections in

F(U)

are literally local sections of that map; in more general settings it's metaphorical.

John Baez (Jul 12 2024 at 19:05):

@David Egolf sorry for the sloppy use of language. Peva and Kevin guessed: at some point I started wanting to call an element of

F(U)

of a sheaf or even a presheaf

F

something a bit more evocative than an "element", so I started calling it a "section" - apparently without adequate warning.

John Baez (Jul 12 2024 at 19:05):

John Baez (Jul 12 2024 at 19:07):

For sheaves, it's quite safe to abuse language after one has internalized the fact that every sheaf is (isomorphic to) the sheaf of sections of its etale space.

John Baez (Jul 12 2024 at 19:10):

And one can even get away with it for presheaves, using the trick Peva explained: given a presheaf

G

we have a god-given inclusion

G(U) \to F(U)

where

F

is the sheafification of

G

, so we can think of elements of

G(U)

as some of the elements of

F(U)

, which we can think of as sections of the etale space of

F

John Baez (Jul 12 2024 at 19:11):

However, since this is supposed to be an introduction to the subject, I should not be forcing my readers into all this "thinking of X as Y" baloney.

John Baez (Jul 12 2024 at 19:12):

I will just go back and edit my blog posts to change the offending term "section" to "element" as needed.

John Baez (Jul 12 2024 at 19:12):

This is somewhat symbol-ridden compared to what I'd written - I was trying to talk like an ordinary bloke - but since I'd just said that

\Gamma \Lambda

is called 'sheafification', it should make sense in context.

David Egolf (Jul 12 2024 at 19:32):

John Baez (Jul 12 2024 at 19:34):

And it's spelled out in even more detail in the following puzzle, which I've also corrected:

Puzzle. Prove the above claim. Give a procedure for constructing an element of

(\Gamma \Lambda F)(U)

given open sets

U_i \subseteq U

covering

U

and elements

s_i \in F(U_i)

that obey

John Baez (Jul 12 2024 at 20:38):

If you spot more places where I do this, or any other problems, please let me know and I can fix them.

Peva Blanchard (Jul 12 2024 at 21:07):

This "way of thinking about

s \in FU

" reminds me of something that recently blew my mind.

Urs Schreiber gave a very nice talk at the Zulip CT Seminar, about higher topos theory in physics. I couldn't understand all the details, but he starts with a "way of thinking" that, I think, is accessible for CT beginners. Here it is.

When you have a space

X

that you want to study, e.g., the surface of the earth, it is easier if you have a plot. A plot could be, for instance, a function

p

that sends a tuple of coordinates

(x,y)

(e.g. the latitude and longitude) to a point in

X

. In other words, a plot can be seen as a function

U \xrightarrow{p} X

from some "nice" or familiar space

U

to the space

X

The key point is that the study of the space

X

is exactly the study of all the plots

U \xrightarrow{p} X

for every nice space

U

. To study the surface of the earth is the same as setting up all local charts and giving procedures to transform any one of them into another (provided they overlap).

This collection of charts can be described as follows: for every

U

, we have a set

FU

of plots

p : U \to X

. Moreover, these sets should be consistent with one another. If there is a nice morphism

U \to V

, then there is a function

FV \to FU

that maps any plot

V \to X

to its pre-composition with the nice morphism. And if there is a collection of plots that mutually agree on the pairwise intersections of their domains, then they can be glued together.

In other words, the study of

X

is exactly the study of a sheaf

F

defined on some category

C

of "nice" spaces.

And now the twist: if we follow this wanna-be equivalence, then any sheaf

F

on

C

can be thought of as the sheaf of plots into a generalized space. To emphasize even more the situation, we can use the name

F

to refer to this generalized space.
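As a sketch of what such an F looks like in symbols (the notation φ, ψ is an assumption for this illustration, and plots are taken to be simply the allowed maps from U to X):

```latex
\[
F(U) = \{\, p : U \to X \,\}, \qquad
F(\phi) : F(V) \to F(U), \quad F(\phi)(p) = p \circ \phi
\qquad \text{for a nice morphism } \phi : U \to V .
\]
% Contravariance: precomposition reverses the order of composition.
\[
F(\psi \circ \phi) = F(\phi) \circ F(\psi), \qquad F(\mathrm{id}_U) = \mathrm{id}_{F(U)} .
\]
```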

Peva Blanchard (Jul 12 2024 at 21:11):

ps: I am still processing all the twists involved. I hope I did not over-interpret this stuff.

Peva Blanchard (Jul 12 2024 at 21:19):

Note that, I am using the symbols

U

and

X

that have been used in this thread to denote an open subset

U

and a topological space

X

However, in Urs Schreiber's talk, the "nice" category is not the category of open subsets of a topological space. The first example he gave was something like the category of cartesian spaces

\mathbb{R}^n

. Apparently, this relates to the distinction between a petit and gros topos

John Baez (Jul 13 2024 at 06:37):

Right, that's an excellent explanation of an important direction sheaf theory goes after one learns the basic example of sheaves on a topological space! This direction is often attributed to Grothendieck. Unfortunately my blog articles don't get far enough to talk about this direction: I only managed to cover some very basic material. But Mac Lane and Moerdijk do talk about it.

The first step toward this direction is generalizing sheaves on a set with a topology to sheaves on a category with a Grothendieck topology. And this is the kind of sheaf their book is mainly about.

David Egolf (Jul 15 2024 at 17:57):

Since we are wishing to construct an element of

(\Gamma \Lambda F)(U)

, perhaps it will help to think about what the elements of

(\Gamma \Lambda F)(U)

are.

\Gamma \Lambda F

is the presheaf of sections of

\Lambda F

. So, the set

(\Gamma \Lambda F)(U)

is the set of sections of

\Lambda F

over

U

. Thus, an element of

(\Gamma \Lambda F)(U)

is a section of

\Lambda F

over

U

David Egolf (Jul 15 2024 at 17:59):

What is a section of

\Lambda F

over

U

? Well,

\Lambda F

is a bundle over

X

(where

X

is the topological space the open sets

U_i

are from), which I will write as

p:\Lambda F \to X

. (So I am using the symbol

\Lambda F

in two different ways now). We recall that each point

x \in X

has some set

\Lambda(F)_x

"hovering over" it, in the sense that

p^{-1}(x) = \Lambda(F)_x

for each

x \in X

. Thus, a section of

p = \Lambda F

over

U

is a continuous function

s:U \to \Lambda F

such that

s(x) \in \Lambda(F)_x

for each

x \in U

David Egolf (Jul 15 2024 at 18:01):

So, if we can construct a map

s:U \to \Lambda F

with

s(x) \in \Lambda(F)_x

for each

x \in U

, we will have constructed an element of

(\Gamma \Lambda F)(U)

. To do this, we first recall that

\Lambda(F)_x

is the colimit of the diagram given by the

F(V_i)

(and their restriction functions) as

V_i

ranges over all the open sets in

X

that contain

x

. In particular, this implies that there is a ("cone leg") function from

F(V_i)

to

\Lambda(F)_x

if

x \in V_i

David Egolf (Jul 15 2024 at 18:07):

David Egolf (Jul 15 2024 at 18:12):

Intuitively,

s:U \to \Lambda F

is a "local behaviour" function, that builds up some "global" data on all of

U

by describing how it behaves at each point.

s(x) = [s_i]_x

intuitively says that our global data at

x

will behave locally like how

s_i \in F(U_i)

does. So, we can view this definition of

s

as a sort of "gluing process" that forms global data from local data. This glued together data,

s

, is then an element of the set assigned to

U

by the sheafification of

F

David Egolf (Jul 15 2024 at 18:14):

It still remains to show that

s: U \to \Lambda F

defined by

s(x)=[s_i]_x

(for

x \in U_i

) is continuous. But I will stop here for today!
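One thing worth spelling out is why s(x) = [s_i]_x does not depend on which U_i containing x is picked. A sketch, assuming (as in the puzzle) that the s_i agree on overlaps, and using that restricting an element does not change its germ at a point of the smaller set:

```latex
% For x in U_i \cap U_j:
\[
[s_i]_x = [\,s_i|_{U_i \cap U_j}\,]_x = [\,s_j|_{U_i \cap U_j}\,]_x = [s_j]_x .
\]
```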

David Egolf (Jul 22 2024 at 16:24):

I will now try to prove that

s:U \to \Lambda F

defined by

s(x) = [s_i]_x

(when

x \in U_i

) is continuous. I hope to do this by showing that

s

is continuous at any point

x \in U

To show that

s

is continuous at

x \in U

, we need to show that the inverse image under

s

of each neighborhood of

s(x)

is a neighborhood of

x

. (I use the word "neighborhood" to mean a subset that contains the point in question, while having a subset that is open which also contains that point).

David Egolf (Jul 22 2024 at 16:29):

So, let

V \subseteq \Lambda F

be a neighborhood of

s(x)

. We wish to show that

s^{-1}(V)

is a neighborhood of

x

David Egolf (Jul 22 2024 at 16:42):

After spending some time dredging up my memories from the earlier parts of this thread, I think it suffices to show this: the inverse image of any neighbourhood from a neighbourhood basis of

s(x)

is a neighbourhood of

x

To see why this is sufficient, let

V

be an arbitrary neighbourhood of

s(x)

and let

B_{s(x)}

be a neighbourhood basis for

s(x)

. Then, by the definition of neighbourhood basis, there is some

V_b \subseteq V

with

V_b \in B_{s(x)}

, so that

s(x) \in V_b

. If

s^{-1}(V_b)

is a neighbourhood of

x

, then that means it contains an open set that contains

x

, which is further a subset of

s^{-1}(V)

. Thus, we can conclude that

s^{-1}(V)

is a neighbourhood of

x

David Egolf (Jul 22 2024 at 16:56):

Earlier in this thread, we discussed a convenient neighbourhood basis for a point

p \in \Lambda F

. First, since every element of

\Lambda F

is a germ, we note that

p = [s]_x

for some

s \in F(U)

, where

U \subseteq X

is an open set containing

x

. Then, we can form the collection of sets of the form

\{[s]_y \mid y \in U'\}

as

U' \subseteq U

varies over the open subsets of

U

that contain

x

If I'm reading the earlier part of this thread correctly, we saw that this collection of sets

B_{[s]_x}

forms a neighbourhood basis for

p=[s]_x

. Intuitively, each set in the collection

B_{[s]_x}

is a set of "local behaviours of

s

" on some open set containing

x

David Egolf (Jul 22 2024 at 17:04):

Returning to the current puzzle, to show that

s:U \to \Lambda F

is continuous at

x

, it suffices to show that the inverse image of any neighbourhood from our neighbourhood basis

B_{s(x)}

is a neighbourhood of

x

David Egolf (Jul 22 2024 at 17:07):

Now,

s(x) = [s_i]_x

by definition. Here,

s_i \in F(U_i)

is one of our provided presheaf elements, chosen so that

x \in U_i

In this setting, let us consider an arbitrary neighbourhood from our neighbourhood basis

B_{s(x)} = B_{[s_i]_x}

. It is of this form:

\{[s_i]_y | y \in U'\}

where

U' \subseteq U_i

is an open subset of

U_i

that contains

x

. I will denote this neighbourhood as

N_i

David Egolf (Jul 22 2024 at 17:14):

We wish to show that

s^{-1}(N_i)

is a neighbourhood of

x

. What is this preimage? Well,

s:y \mapsto [s_i]_y

. So, if we have some

[s_i]_y \in N_i

, its preimage is the collection of points

y'

such that

[s_i]_{y} = [s_i]_{y'}

For each

y \in U'

, we have that

[s_i]_y \in N_i

. Hence,

s^{-1}(N_i)

contains at least all of

U'

. Since

U'

is an open set containing

x

, we conclude that

s^{-1}(N_i)

really is a neighbourhood of

x

David Egolf (Jul 22 2024 at 17:21):

We conclude that the inverse image of an arbitrary set in our neighbourhood basis for

s(x)

is a neighbourhood of

x

. Therefore, the inverse image of an arbitrary neighbourhood for

s(x)

is a neighbourhood for

x

. That implies that

s

is continuous at

x

, for any

x \in U

. Thus,

s:U \to \Lambda F

is continuous, as desired!
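As a cross-check, the preimage can apparently be computed exactly. This sketch assumes that ΛF is the disjoint union of the stalks, so that germs at different points are never equal; the name z for a generic point of U is an assumption.

```latex
% N_i = { [s_i]_y : y in U' } with x in U' \subseteq U_i open.
\[
z \in s^{-1}(N_i)
\iff s(z) = [s_i]_y \text{ for some } y \in U'
\iff z \in U' ,
\]
% because s(z) lies in the stalk over z, and for z in U' we have s(z) = [s_i]_z.
\[
s^{-1}(N_i) = U' .
\]
```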

David Egolf (Jul 22 2024 at 17:23):

Intuitively, if we think of continuous functions as having output that changes gradually as their input changes gradually, then the "local behaviour" of our data on

U

specified by our

s: U \to \Lambda F

changes gradually as we move around in

U

(I suppose one way to formalize this intuition is to note that, by making use of

s

, any path

p:\mathbb{I} \to U

induces a path in

\Lambda F

. Namely, we get

s \circ p:\mathbb{I} \to U \to \Lambda F

.)

David Egolf (Jul 22 2024 at 17:33):

Whew! Hopefully I did that correctly. I'll wrap up for today by introducing the next puzzle:

David Egolf (Jul 22 2024 at 17:37):

John Baez (Jul 22 2024 at 20:49):

Either

\Gamma \Lambda

or

\Lambda \Gamma

has to be a typo above, so pick the one that makes sense.

John Baez (Jul 22 2024 at 20:58):

If you get stuck... I fixed the typo in my blog - thanks! And I also fixed another mistake where I spoke of a "section" of a presheaf instead of an "element". You've convinced me it's really bad to start using "section" in this other way.

David Egolf (Jul 25 2024 at 18:38):

Awesome! It's great you fixed those things! It's nice to know that this thread is helping make the blog posts even better for future readers.

David Egolf (Jul 25 2024 at 18:43):

Alright, my next goal is to show that for any presheaf

F

there is a morphism of presheaves

\eta_F:F \to \Gamma \Lambda F

. All the presheaves I consider here will be on some topological space

X

So, we are looking for a natural transformation

\alpha = \eta_F

from

F:\mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}

to

\Gamma \Lambda F: \mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}

David Egolf (Jul 25 2024 at 18:47):

Let

r: U \to U'

be a morphism in

\mathcal{O}(X)^{\mathrm{op}}

. Here

U

and

U'

are open sets of

X

, with

U' \subseteq U

. Associated to

r

, we have this naturality square:
naturality square

David Egolf (Jul 25 2024 at 18:48):

So, we need to specify a function

\alpha_U:F(U) \to \Gamma \Lambda F(U)

for each open set

U

of

X

David Egolf (Jul 25 2024 at 18:50):

In the previous puzzle, we saw a way to build an element of

\Gamma \Lambda F(U)

given a cover of open sets

U_i

for

U

, together with

s_i \in F(U_i)

for each

i

so that the

s_i

given agree on overlaps.

We can now use this procedure specialized to the case where our cover of open sets for

U

consists of just the single set

U

. If we are given some

s \in F(U)

, we should be able to construct some element of

\Gamma \Lambda F(U)

David Egolf (Jul 25 2024 at 18:54):

An element of

\Gamma \Lambda F(U)

is a section

s':U \to \Lambda F

, which is a continuous function

:U \to \Lambda F

such that

s'(x) \in \Lambda(F)_x

for each

x \in U

. (Here,

\Lambda(F)_x

is the set of germs of

F

at

x

.)

Given

s \in F(U)

, our recipe for constructing an

s':U \to \Lambda F

from the previous puzzle is this:

s'(x) = [s]_x

for each

x \in U

Intuitively, this

s'

is just like

s

: at each point

x \in U

it specifies a germ

s'(x)=[s]_x

, which describes the "local behaviour" of

s

at that point.

David Egolf (Jul 25 2024 at 18:59):

So, we can try setting

\alpha_U(s) = (x \mapsto [s]_x):U \to \Lambda F

for each

U

. It remains to check that this specifies a natural transformation.

David Egolf (Jul 25 2024 at 19:21):

Picking some

s \in F(U)

, let's see if the naturality square commutes.

|_{U'} \circ \alpha_U(s) = |_{U'}(x \mapsto [s]_x)

and

\alpha_{U'} \circ |_{U'}(s)

will be

x \mapsto [s|_{U'}]_x

for

x \in U'

. These two functions are the same because restricting a presheaf element

s \in F(U)

from

U

to

U'

leaves each of its germs in

U'

unchanged.

We conclude that an arbitrary naturality square (of the form pictured above) commutes, so we have indeed found a natural transformation

\alpha:F \to \Gamma \Lambda F
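The key step can be written as a short chain. This is a sketch using the thread's notation, where the vertical bar denotes restriction along U' ⊆ U, for either F or ΓΛF.

```latex
% For s in F(U) and x in U' \subseteq U:
\[
\big(\alpha_U(s)\big|_{U'}\big)(x) = [s]_x = [\,s|_{U'}\,]_x = \big(\alpha_{U'}(s|_{U'})\big)(x) .
\]
% The middle equality holds because s and s|_{U'} restrict to the same element
% on U' itself, which is an open set containing x.
```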

David Egolf (Jul 25 2024 at 19:29):

I'll stop there for now! But the next part of the puzzle is to show that we can build up a natural transformation

\eta : 1 \Rightarrow \Gamma \Lambda

in this way.

David Egolf (Jul 30 2024 at 16:45):

We are working towards showing that there is an adjunction between these two functors. An important feature of an adjunction is a "unit". If

L:C \to D

is left adjoint to

R:D \to C

, the unit is a natural transformation

\eta:1_C \to R \circ L

. Intuitively, a unit tells us something about what happens to an object or morphism of

C

that takes a "round trip" across the adjunction and back.

David Egolf (Jul 30 2024 at 16:49):

David Egolf (Jul 30 2024 at 16:55):

To construct such a natural transformation, we need to say what each of its components are. Any component

\eta_F

of

\eta

is a morphism from some presheaf

F

to its sheafification

\Gamma \Lambda (F)

So, for each

F

, we need a natural transformation

\eta_F:F \to \Gamma \Lambda (F)

David Egolf (Jul 30 2024 at 16:59):

Since

\eta_F

is a natural transformation between presheaves on

X

, it has one component for each open set of

X

. Using our work from last time, how should we set

(\eta_F)_U: F(U) \to \Gamma \Lambda (F)(U)

David Egolf (Jul 30 2024 at 17:03):

We can set

(\eta_F)_U: s \mapsto (x \mapsto [s]_x)

Note that

s \in F(U)

and

(x \mapsto [s]_x):U \to \Lambda F

. By

[s]_x

I mean the germ of

s \in F(U)

at

x \in U

(We recall that an element of

\Gamma \Lambda (F)(U)

is a section

:U \to \Lambda F

with respect to

p:\Lambda F \to X

which sends each germ to its associated point in

X

David Egolf (Jul 30 2024 at 17:07):

We saw last time that

\eta_F:F \to \Gamma \Lambda(F)

defined in this way is indeed a natural transformation from a presheaf to its sheafification. It remains to show that all the

\eta_F

assemble to form a natural transformation

\eta: 1_{\widehat{\mathcal{O}(X)}} \to \Gamma \circ \Lambda

David Egolf (Jul 30 2024 at 17:12):

To show that we get a natural transformation, we need to show that an arbitrary naturality square commutes. Let

\alpha: F \to G

be an arbitrary morphism in

\widehat{\mathcal{O}(X)}

. Our corresponding naturality square is:
naturality square

David Egolf (Jul 30 2024 at 17:14):

We want to show that

\Gamma \Lambda(\alpha) \circ \eta_F = \eta_G \circ \alpha

is true. Both sides of this equation are natural transformations. To show that two natural transformations are equal, it suffices to show that each of their components are equal.

Hence, we now aim to show that

(\Gamma \Lambda(\alpha))_U \circ (\eta_F)_U = (\eta_G)_U \circ \alpha_U

, where

U

is some arbitrary open set of

X

David Egolf (Jul 30 2024 at 17:17):

David Egolf (Jul 30 2024 at 17:19):

To show that this diagram commutes, we can trace around an arbitrary element

s \in F(U)

and see what happens to it. We want to show that

((\Gamma \Lambda(\alpha))_U \circ (\eta_F)_U)(s) = ((\eta_G)_U \circ \alpha_U)(s)

is true.

David Egolf (Jul 30 2024 at 17:26):

We don't know anything in particular about

\alpha_U

, so we can't further simplify

\alpha_U(s)

. However, we do know what

(\eta_G)_U

is like. From above, we have

(\eta_G)_U:s \mapsto (x \mapsto [s]_x)

. Each output is a function

:U \to \Lambda G

, which is in fact a section of

p:\Lambda G \to X

Thus,

((\eta_G)_U \circ \alpha_U)(s) = (\eta_G)_U(\alpha_U(s)) = (x \mapsto [\alpha_U(s)]_x)

David Egolf (Jul 30 2024 at 17:32):

We know what

(\eta_F)_U(s)

is. It is

(x \mapsto [s]_x):U \to \Lambda F

. Thus, the left-hand side is:

(\Gamma \Lambda(\alpha))_U(x \mapsto [s]_x)

David Egolf (Jul 30 2024 at 17:36):

Having re-written each side of the equation we wish to show is true, our new goal is to show this equation is true:

(\Gamma \Lambda(\alpha))_U(x \mapsto [s]_x) = (x \mapsto [\alpha_U(s)]_x)

(Here, each side of the equation is a section of

p:\Lambda G \to X

over

U

. In particular, each side of the equation is a function

:U \to \Lambda G

David Egolf (Jul 30 2024 at 17:41):

To go further with this, I think we need to re-write

(\Gamma \Lambda (\alpha))_U

, so that we can figure out what it does to the input

(x \mapsto [s]_x): U \to \Lambda F

To start doing that, let's start by recalling what

\Lambda(\alpha): \Lambda(F) \to \Lambda(G)

is.

David Egolf (Jul 30 2024 at 17:52):

David Egolf (Jul 30 2024 at 17:53):

\alpha(s)

here is shorthand for

\alpha_U(s)

where

s \in F(U)

. So we are making use of

\alpha_U:F(U) \to G(U)

David Egolf (Jul 30 2024 at 17:57):

Next, I want to work out what

\Gamma(\Lambda(\alpha)) = \Gamma([s]_x \mapsto [\alpha(s)]_x)

is.

To do this, it will be helpful to recall what

\Gamma

outputs given a morphism of bundles. After reviewing an earlier part of this thread, it appears that:

David Egolf (Jul 30 2024 at 18:14):

So,

\Gamma(\Lambda(\alpha))

is a natural transformation from

\Gamma(\Lambda(F))

to

\Gamma(\Lambda(G))

. These are both sheaves on

X

. The

U

-th component of this natural transformation

:\Gamma(\Lambda(F))(U) \to \Gamma(\Lambda(G))(U)

is then the function

t \mapsto \Lambda(\alpha) \circ t

. Here,

t:U \to \Lambda F

is a section of

p:\Lambda F \to X

. Since

\Lambda(\alpha): \Lambda(F) \to \Lambda(G)

, we have that

\Lambda(\alpha) \circ t:U \to \Lambda(G)

, as desired.

David Egolf (Jul 30 2024 at 18:18):

Intuitively, the

U

-th component of this natural transformation takes data described on

U

in terms of local behaviour at each point in

U

, and then transforms each piece of local behaviour (a point in

\Lambda F

) to a new piece of local behaviour (a point in

\Lambda G

David Egolf (Jul 30 2024 at 18:21):

In brief,

(\Gamma \Lambda(\alpha))_U: (\Gamma \Lambda (F))(U) \to (\Gamma \Lambda (G))(U)

and

(\Gamma \Lambda(\alpha))_U(t) = \Lambda(\alpha) \circ t

David Egolf (Jul 30 2024 at 18:26):

We are now in the position to try and rewrite this expression:

(\Gamma \Lambda(\alpha))_U(x \mapsto [s]_x)

. This can now be rewritten as:

\Lambda(\alpha) \circ (x \mapsto [s]_x)

David Egolf (Jul 30 2024 at 18:31):

We noted above that

\Lambda(\alpha):[s]_x \mapsto [\alpha(s)]_x

, where

\alpha(s)

is shorthand for

\alpha_U(s)

if

s \in F(U)

. Thus,

\Lambda(\alpha)([s]_x) = [\alpha_U(s)]_x

, and so:

\Lambda(\alpha) \circ (x \mapsto [s]_x) = (x \mapsto [\alpha_U(s)]_x)

David Egolf (Jul 30 2024 at 18:35):

We were able to rewrite each side of

((\Gamma \Lambda(\alpha))_U \circ (\eta_F)_U)(s) = ((\eta_G)_U \circ \alpha_U)(s)

to the expression

x \mapsto [\alpha_U(s)]_x

. Thus, this diagram commutes at any

s \in F(U)

, and so it commutes:
naturality square evaluated at U

Since

U

was arbitrary, we conclude that this diagram commutes for any open set

U

of

X

David Egolf (Jul 30 2024 at 18:36):

Thus,

\Gamma \Lambda(\alpha) \circ \eta_F = \eta_G \circ \alpha

is true and this naturality square commutes:
naturality square

Since

\alpha:F \to G

was an arbitrary natural transformation between presheaves on

X

, we conclude that this diagram commutes for any natural transformation between presheaves on

X

. So, any naturality square for

\eta

commutes.

David Egolf (Jul 30 2024 at 18:37):

Finally, we conclude that

\eta: 1_{\widehat{\mathcal{O}(X)}} \to \Gamma \circ \Lambda

is a natural transformation, as desired!
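Putting the steps of this argument side by side, the whole verification is one chain of equalities. This is a summary sketch in the thread's notation, for an arbitrary open set U and an arbitrary s in F(U).

```latex
\begin{align*}
\big((\Gamma \Lambda(\alpha))_U \circ (\eta_F)_U\big)(s)
  &= (\Gamma \Lambda(\alpha))_U\,(x \mapsto [s]_x) \\
  &= \Lambda(\alpha) \circ (x \mapsto [s]_x) \\
  &= (x \mapsto [\alpha_U(s)]_x) \\
  &= (\eta_G)_U(\alpha_U(s)) \\
  &= \big((\eta_G)_U \circ \alpha_U\big)(s).
\end{align*}
```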

David Egolf (Jul 30 2024 at 18:45):

This means we have a candidate

\eta

for the unit of an adjunction

\Lambda \dashv \Gamma

. Next time, I plan to think about what the counit of such an adjunction could look like!

John Baez (Jul 31 2024 at 09:02):

David Egolf (Aug 05 2024 at 17:37):

Next up, we wish to construct a candidate for the counit of an adjunction

\Lambda \dashv \Gamma

. This will be a natural transformation

\epsilon: \Lambda \circ \Gamma \to 1_{\mathsf{Top}/X}

To start with, let's pick some bundle

p:Y \to X

. The

p

-th component of

\epsilon

will be a morphism of bundles

\epsilon_p: \Lambda\Gamma(p) \to p

David Egolf (Aug 05 2024 at 17:39):

David Egolf (Aug 05 2024 at 17:45):

We want to define a morphism of bundles

\epsilon_p:\Lambda \Gamma(p) \to p

. I don't have the energy to figure out how to do that today. But I will quote the relevant part of the blog post to facilitate doing this on another day:

(Note that the blog post uses the notation

\Lambda \Gamma_p

where I use the notation

\Lambda \Gamma(p)

.)
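For orientation, the usual candidate for this counit evaluates a germ of a section at its base point. This is a sketch of the standard construction, not a quote from the blog post; t denotes a section of p over some open set containing x, and the formula is written on total spaces.

```latex
\[
\epsilon_p : \Lambda \Gamma(p) \to Y, \qquad \epsilon_p\big([t]_x\big) = t(x) .
\]
% Well defined: two sections with the same germ at x agree on a neighbourhood of x,
% so in particular they agree at x.
% Compatible with the projections to X, since p(t(x)) = x for a section t.
```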

John Baez (Aug 05 2024 at 20:56):

John Baez (Aug 05 2024 at 21:00):

I also fixed another mistake where I used

\eta

instead of

\epsilon

. It's both a blessing and a curse that Greek has two e-like letters.

Oscar Cunningham (Aug 06 2024 at 07:46):

The real trick is to use

\varepsilon

and

\epsilon

for two related variables.

David Egolf (Sep 05 2024 at 14:52):

Upon reflection, the recent discussion relating to the adjunction

\Lambda \dashv \Gamma

has felt rather "heavy". I think it will help if I start by summarizing exactly how

\Lambda

and

\Gamma

work. Then, contemplating

\Lambda \circ \Gamma

should feel a bit easier, I hope!

John Baez (Sep 05 2024 at 15:25):

Good! Ideally when you have a clear mental image of

\Lambda

and

\Gamma

you can sort of "see" (in a more or less literal sense) what

\epsilon: \Lambda \circ \Gamma \to 1

and

\eta : 1 \to \Gamma \circ \Lambda

should be - i.e., how close taking a round trip between presheaves and bundles comes to getting you back to where you started. To me, seeing what's going on is more important than writing up a proof with lots of symbols, since if I can do the former, I believe I can do the latter when pressed.

(This is after having done 8 years of homework assignments and taught years of courses that kept challenging that belief, in sometimes quite threatening ways. :upside_down:)

David Egolf (Sep 05 2024 at 15:47):

I like that perspective @John Baez ! I think I've slowly started to develop some intuition for

\Lambda

and

\Gamma

, but I have a ways to go still. I'll keep the goal of developing a clearer picture in mind as I work on this.

David Egolf (Sep 05 2024 at 16:05):

John Baez (Sep 05 2024 at 16:10):

Great! Just for fun, I'll say: I seem to have mentally compacted a lot of stuff to a little picture of a bundle

Y

over

X

(a rectangle sitting over a line), an open set

U \subseteq X

, and a bunch of sections of

Y

over

U

(some graphs of continuous functions defined on

U

, drawn in that rectangle I mentioned).

Then there's a fancier picture where I have two bundles

Y, Y'

over

X

and a bundle map

f: Y \to Y'

. I can see how it sends sections of

Y

over

U

to sections of

Y'

over

U

John Baez (Sep 05 2024 at 16:12):

Drawing the pictures would have taken one tenth the time it takes to describe them!

David Egolf (Sep 05 2024 at 16:23):

That's helpful! For a given bundle

p:Y \to X

, it makes sense to draw all the points that

p

sends to

x

as "hovering over"

x

. And that is nicely visualized using the rectangle you describe!

The low dimension of this picture is also nice, because it lets us easily visualize a section. As a slightly fancier version, one could also imagine a rectangular prism floating over a rectangle. But that would be harder to draw!

John Baez (Sep 05 2024 at 16:26):

Yes, it's funny how much advanced work on topology boils down to drawing pathetically simple 2-dimensional pictures and pretending they're higher-dimensional. Our retina is essentially 2-dimensional and we have to live with that.

David Egolf (Sep 05 2024 at 16:35):

This picture aims to illustrate how we get a section of

p'

from a section of

p

, given

f:Y \to Y'

satisfying

p' \circ f = p

. This condition on

f

basically ensures that each arrow describing how

f

maps points is vertical in this picture.

John Baez (Sep 05 2024 at 16:37):

Nice! Yes, this sort of picture is always hovering in my eyeballs as I work with bundles, sheaves and presheaves... guiding me.

David Egolf (Sep 05 2024 at 16:42):

I'll stop here for today. Next time, I'd like to do a similar thing for

\Lambda

. Namely, I hope to review how it acts on objects and morphisms, and maybe to draw a related picture.

John Baez (Sep 05 2024 at 16:44):

John Baez (Sep 05 2024 at 16:51):

David Egolf (Sep 06 2024 at 16:57):

David Egolf (Sep 06 2024 at 17:00):

John Baez (Sep 06 2024 at 17:14):

This is much tougher to draw. Do you want to hear how I draw a germ? It uses some 'artistic license', I'm afraid.

David Egolf (Sep 06 2024 at 17:15):

David Egolf (Sep 06 2024 at 17:27):

David Egolf (Sep 06 2024 at 17:31):

The horizontal line is our topological space

X

. The region indicated by the large oval is an open set

U

of

X

. Applying

F

to

U

gives us the set

F(U)

, which is represented here as a box floating above

U

The squiggly line in that box indicates an element

s \in F(U)

. A germ of

s

at

x

is indicated by a small circle around the part of

s

associated to

x \in U

. (Intuitively, this is given in the limit by restricting

s

to smaller and smaller open sets contained in

U

and containing

x

). Finally the bundle

\Lambda(F)

projects this germ back down to

x \in X

David Egolf (Sep 06 2024 at 17:33):

The space of germs

\Lambda(F)

does not appear in this picture, which is somewhat unsatisfying.

David Egolf (Sep 06 2024 at 17:36):

However, I like this about the above picture: it illustrates how going from a presheaf to a bundle of germs involves a transition. Namely,

David Egolf (Sep 06 2024 at 17:46):

Here's a fancier version of the above picture, aiming to illustrate part of how

\Lambda

sends a natural transformation of presheaves to a morphism of bundles:
picture

David Egolf (Sep 06 2024 at 17:50):

I've added a second box hovering over

U

, corresponding to the set

G(U)

. Since we have a natural transformation

\alpha: F \to G

, we have a function

\alpha_U:F(U) \to G(U)

. The squiggly line in the

G(U)

box indicates

\alpha_U(s)

. "Zooming in" on

\alpha_U(s)

at

x

gives us the germ

[\alpha_U(s)]_x

. Finally, the bundle

\Lambda(G):\Lambda(G) \to X

projects this germ back down to

x \in X
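
The action of Λ on morphisms described above can be written as a short formula. This is only a sketch: the names p_F and p_G for the projections of the two bundles of germs are notation introduced here, s and t are arbitrary elements of F(U) and F(V) with x in both U and V, and continuity of the resulting map is not checked.

```latex
% Sketch: how \Lambda sends \alpha : F \to G to a map of bundles over X.
% p_F and p_G denote the projections \Lambda(F) \to X and \Lambda(G) \to X
% (names assumed here for illustration).
\[
\Lambda(\alpha) : \Lambda(F) \to \Lambda(G), \qquad
\Lambda(\alpha)\bigl([s]_x\bigr) = [\alpha_U(s)]_x
\quad (s \in F(U),\ x \in U).
\]
% Well-definedness: if [s]_x = [t]_x, choose an open W \subseteq U \cap V
% with x \in W and s|_W = t|_W. Naturality of \alpha then gives
\[
\alpha_U(s)|_W = \alpha_W(s|_W) = \alpha_W(t|_W) = \alpha_V(t)|_W ,
\qquad \text{so } [\alpha_U(s)]_x = [\alpha_V(t)]_x .
\]
% Compatibility with the projections (each germ stays over the same point x):
\[
p_G\bigl(\Lambda(\alpha)([s]_x)\bigr) = x = p_F\bigl([s]_x\bigr).
\]
```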

David Egolf (Sep 06 2024 at 17:55):

These pictures aren't perfect, but I think making them has been helpful for developing some intuition about what's going on!

Next time, I plan to start thinking about the "round trips"

\Lambda \circ \Gamma

and

\Gamma \circ\Lambda

. It would be very cool if we could figure out some pictures to illustrate these round-trip functors!

John Baez (Sep 06 2024 at 18:46):

Hey, that's more or less how I draw a germ. Morally, the germ of

s

at

x

is like

s

restricted to an 'infinitesimal' open set containing

x

John Baez (Sep 06 2024 at 18:52):

Yes, the space of all germs (say of the sheaf of continuous or smooth real-valued functions) is too large to draw except in a completely oversimplified way. I don't see a way around that.

John Baez (Sep 06 2024 at 19:01):

Good! I find pictures helpful as long as I know their limitations, but maybe I haven't thought enough about how the mere process of trying to draw them makes me think about things in new ways.

One challenge is that the etale space of a sheaf of sections of a bundle is usually huge compared to the original bundle. But you can just draw it as a rectangle labeled 'huge'. :upside_down:

David Egolf (Sep 08 2024 at 17:16):

Today, I want to try and draw a picture to illustrate the "round-trip" functor

\Gamma \circ \Lambda:\widehat{\mathcal{O}(X)} \to \mathsf{Top}/X \to \widehat{\mathcal{O}(X)}

David Egolf (Sep 08 2024 at 17:16):

David Egolf (Sep 08 2024 at 17:18):

David Egolf (Sep 08 2024 at 17:58):

David Egolf (Sep 08 2024 at 18:01):

In the top part of the picture, I visualize part of a presheaf

F

on

X

. The left box represents the set

F(U)

of data attached to the open set

U

. Similarly, the right box represents the set

F(U')

of data attached to the open set

U'

. The wiggly lines in these boxes represent elements of these sets.

David Egolf (Sep 08 2024 at 18:02):

Because

F

is a presheaf, we can zoom in on these wiggly lines to various points, and get germs. Each germ here is represented by a small circle, intending to convey the idea of "zooming in" on a wiggly line at some point. There are many different ways our data can wiggle, and so there are a huge number of germs. I've drawn a big cloud of little circles to try and represent the space of germs very roughly.

The arrows coming down from the top of the diagram to the bottom intend to illustrate how the germs of the pictured wiggly lines would be included in this huge space of germs.

David Egolf (Sep 08 2024 at 18:04):

Next, here is a picture for the second half of the process, which involves applying

\Gamma

:
picture

David Egolf (Sep 08 2024 at 18:19):

\Gamma

sends a bundle to its sheaf of sections. A section in this case involves "flowing through" germ space in a continuous way, picking out a local behaviour at each point of some open set of

X

. Such a section now describes some data attached to an open set of

X

again!

In the picture, I've illustrated how we can "glue together" the two presheaf elements

s

and

s'

that we started with. We get

s''

, which is an element of the set attached to

U \cup U'

by our sheaf of sections of

\Lambda(F):\Lambda(F) \to X

David Egolf (Sep 08 2024 at 18:23):

If

F

already supported "gluing together" of elements that agree on overlaps (to a unique result), then

F

was in fact already a sheaf. But if

F

contained elements agreeing on overlaps that couldn't be glued together, then this "round trip" process will result in a sheaf-version of

F

! And this

(\Gamma \circ \Lambda) (F)

will be different than

F

in this case because it contains some additional "glued together" elements.

David Egolf (Sep 08 2024 at 18:26):

I'll stop here for today. Next time, I'm hoping to draw a picture illustrating the other round trip, namely

\Lambda \circ \Gamma: \mathsf{Top}/X \to \widehat{\mathcal{O}(X)}\to \mathsf{Top}/X

John Baez (Sep 08 2024 at 20:19):

That was very nice. You rose to the challenge of drawing countless germs without creating a complete mess!

David Egolf (Sep 10 2024 at 15:38):

I'm realizing I don't yet have clear intuition for the round trip functor

\Lambda \circ \Gamma: \mathsf{Top}/X \to \widehat{\mathcal{O}(X)}\to \mathsf{Top}/X

. To my understanding, this process converts any bundle over

X

to an étale space over

X

. (I will write "etale space" to mean "étale space", for ease of typing.)

I think we proved that earlier in the thread, but I would struggle to explain how this happens using a picture.

David Egolf (Sep 10 2024 at 15:39):

John Baez (Sep 10 2024 at 15:40):

So for

\Lambda \circ \Gamma

you start with a bundle over

X

, then form its sheaf of sections, then look at all the germs of sections and make these into the points of a whopping big new bundle over

X

John Baez (Sep 10 2024 at 15:41):

Any point in the whopping big new bundle gives a point in the original bundle, since a germ of a section

s

at a point

x

gives the point

s(x)
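
As a sketch, the map just described from the big bundle of germs back to the original bundle can be written down and checked to be well defined. The name \epsilon_Y is an assumption made here, chosen to match the \epsilon used earlier for the comparison map from the round trip back to the identity; continuity is not checked.

```latex
% Sketch: sending a germ of a section of p : Y \to X to a point of Y.
% \epsilon_Y is a name assumed here for illustration.
\[
\epsilon_Y : \Lambda\Gamma(Y) \to Y, \qquad
\epsilon_Y\bigl([s]_x\bigr) = s(x).
\]
% Well-definedness: equal germs come from sections agreeing near x.
\[
[s]_x = [s']_x
\ \Longrightarrow\ s|_W = s'|_W \text{ for some open } W \ni x
\ \Longrightarrow\ s(x) = s'(x).
\]
% The map lies over X, because p \circ s is the inclusion of the domain of s:
\[
p\bigl(\epsilon_Y([s]_x)\bigr) = p(s(x)) = x .
\]
```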

John Baez (Sep 10 2024 at 15:42):

I hope I didn't ruin things just now - I usually try to play coy and let you figure out almost everything yourself!

David Egolf (Sep 10 2024 at 15:49):

Thanks! I don't fully follow what you said (yet), but I will try to draw a picture of what you just said and see what happens!

I do enjoy figuring things out, but in this case a nudge in the right direction is appreciated!

David Egolf (Sep 10 2024 at 16:30):

To draw a picture, I need to choose a bundle to start with. I would like to choose a bundle that is not an etale space, so I can see how

\Lambda \circ \Gamma

upgrades it to an etale space.

David Egolf (Sep 10 2024 at 16:33):

Now, a bundle

p:Y \to X

is an etale space exactly if

p:Y \to X

is a local homeomorphism. A local homeomorphism

f:A \to B

is a continuous map so that about each point

a

in the domain there is some open set

U

so that

f|_U:U \to f(U)

is a homeomorphism (where

U

and

f(U)

are both equipped with the subspace topology) and

f(U)

is open in

B

.

David Egolf (Sep 10 2024 at 16:35):

A homeomorphism is in particular an open map: it sends open sets to open sets. The inverse of a homeomorphism is also a homeomorphism, and so it will also send open sets to open sets. Because homeomorphisms are bijections, if we have a homeomorphism

f|_U:U \to f(U)

, we get an induced bijection of open sets, going between the open sets of

U

and the open sets of

f(U)

David Egolf (Sep 10 2024 at 16:37):

So, to show a bundle

p:Y \to X

is NOT an etale space, it would suffice to find some point

y \in Y

so that any open neighborhood

U

of

y

has "too many" open sets compared to the image of

U

under

p

. To be more precise, it would suffice to show that there is no open set

U

containing

y

such that

p|_U:U \to p(U)

induces a bijection of open sets.

David Egolf (Sep 10 2024 at 16:45):

David Egolf (Sep 10 2024 at 16:46):

Here,

Y

is a subspace of

\mathbb{R}^2

, equipped with the subspace topology.

X

is the real line, and

p

is the continuous map given by composing the inclusion of

Y

into

\mathbb{R}^2

and the projection of

\mathbb{R}^2

down to the

x

-axis.

David Egolf (Sep 10 2024 at 16:49):

U

is an open set containing

y

, where

U

is contained in the "vertical section" of

Y

. Notice that

y

has many open neighborhoods in

U

, given for example by various "vertical" open intervals containing

y

. However, the image of

U

under

p

is just a single point. Viewed as a subspace of

X

, this image only has two open sets: the empty set and the set containing the single point

p(U)

. Hence, there can be no bijection between the open sets of

U

and the open sets of

p(U)

(And I suspect that this is true in fact for any open set

U

containing

y

: indeed, any such open set contains a small (and hence "vertical") open interval about

y

, which is an open set that gets "collapsed" when passing to the image of

p

EDIT: More simply,

p

restricted to any open set containing

y

will be non-injective. Hence, this restriction of

p

can't be a homeomorphism.

David Egolf (Sep 10 2024 at 16:51):

We conclude that

p:Y \to X

is not a local homeomorphism, and so

Y

is not an etale space over

X

David Egolf (Sep 10 2024 at 16:57):

Next time, I'm hoping to draw a picture illustrating the process of applying

\Gamma

to this bundle

p:Y \to X

. I'm curious to see how the resulting sheaf of sections will reflect the fact that

p

is not a local homeomorphism!

David Egolf (Sep 10 2024 at 17:21):

After writing the above, I realized that the "local non-injectivity" of

p

is what stops

p

from being a local homeomorphism. With that in mind, I think this bundle is also not an etale space:
picture

David Egolf (Sep 10 2024 at 17:22):

The idea is that

Y

is some space that "branches" at some point. If we pick

y \in Y

right at the branching point, then any open neighborhood of

y

will contain points from both the "upper" and "lower" branches. And hence the projection

p

won't be injective even when restricted to really small neighborhoods of

y

, like the pictured

U

. Thus, this

p

can't be a local homeomorphism.

Peva Blanchard (Sep 10 2024 at 17:32):

Interesting. So maybe the following holds: if the bundle

p: Y \to X

is étale, then all fibers must be in bijection with one another.

John Baez (Sep 10 2024 at 17:34):

That's right! Most bundles that you can actually draw are not etale spaces! For example the bundle I always draw when someone asks me to draw a bundle:

The only etale spaces

p: E \to B

I can easily draw are the 'covering spaces', where every point

b \in B

has a neighborhood

U

where

p^{-1}(U) \cong U \times S

for some discrete space

S

John Baez (Sep 10 2024 at 17:54):

This is certainly true if

X

is connected and

p: Y \to X

is a covering space. So the challenge is to think about etale spaces that aren't covering spaces.

Hmm, now I see that some such etale spaces are quite easy to draw, like this:

X

is the real line, and

Y

is the open interval

(0,1)

, and

p: Y \to X

is the inclusion of

(0,1)

in the line.

David Egolf (Sep 11 2024 at 15:52):

Yes! Looking on Wikipedia, I see that if

U

is an open subset of

X

, then the inclusion

i:U \to X

is a local homeomorphism, provided that

U

is equipped with the subspace topology. (So in this case

U

is an etale space over

X

That same article also notes that if

U

is an open subset of

\mathbb{R}^n

, then these two conditions on a continuous map

f:U \to \mathbb{R}^n

are equivalent:

John Baez (Sep 11 2024 at 16:37):

Thanks! I peeked at the Wikipedia page and I see that to prove the second condition implies the first they use a substantial theorem in algebraic topology, called invariance of domain, proved by Brouwer. This is one of the results they dole out in a first course on homology theory, to prove you can use it to settle questions that aren't obviously about homology theory.

David Egolf (Sep 11 2024 at 16:46):

For this condition on fibers to hold, we can also note that we need

p

to be surjective, at least if

Y

is non-empty. Otherwise, we'll have at least one non-empty fiber and at least one empty fiber.

David Egolf (Sep 11 2024 at 16:48):

(In the case mentioned above by @John Baez, a covering space

p:Y \to X

on a connected space

X

is always surjective).

John Baez (Sep 11 2024 at 16:55):

Your parenthetical claim actually doesn't follow from Wikipedia's definition of covering space. According to that definition

p

can have empty fibers, e.g. we can have

Y = \emptyset

, because the "discrete space"

D_x

mentioned in that definition can be empty.

John Baez (Sep 11 2024 at 16:57):

It's good to allow empty fibers in that definition since ruling out the empty set by fiat tends to produce categories with worse properties: e.g. the empty covering space of

X

is the initial object in the category of covering spaces of

X

John Baez (Sep 11 2024 at 16:58):

However, people are vastly more interested in covering spaces where the fibers are nonempty, and then

p: Y \to X

is surjective.

John Baez (Sep 11 2024 at 16:59):

Wikipedia says "some authors" require surjectivity in the case where

X

is not connected. Those authors should probably require it even when

X

is connected, since they obviously don't like covering spaces with empty fibers!

Kevin Carlson (Sep 11 2024 at 17:02):

Grumble. Reminds me of an old-fashioned professor I TA'd linear algebra for who wasn't quite convinced that the zero vector space has a basis.

John Baez (Sep 11 2024 at 17:06):

David Egolf (Sep 11 2024 at 17:12):

I admit I didn't try to prove this myself! I just read this in that Wikipedia article:

Is that claim in the article incorrect? (As far as I can tell, that Wikipedia article doesn't actually try to prove this claim.)

John Baez (Sep 11 2024 at 17:20):

Take the Wikipedia definition of 'covering space' and see if

p: Y \to X

is a covering space when

Y = \emptyset

and

p

is the unique map to

X

. If this is a covering space by their definition, then it can't be true that

This would then be a good time to start your career of correcting Wikipedia pages. :upside_down:

But it's possible I didn't read their definition carefully enough, and for some reason it rules out the case

Y = \emptyset

David Egolf (Sep 11 2024 at 17:22):

Alright, let me see what happens when we consider

p:Y \to X

when

Y

is empty! (Time to put Wikipedia to the test!)

David Egolf (Sep 11 2024 at 17:28):

David Egolf (Sep 11 2024 at 17:30):

In the next sentence the article elaborates on this, and indicates that each

V_d

is to be open (presumably as a subset of

Y

David Egolf (Sep 11 2024 at 17:39):

Alright. Now, consider the case where

Y

is empty. Then each

V_d \subseteq Y

is also empty. Let us assume that

X

is non-empty, and let

U_x

be a non-empty open neighborhood of

x \in X

. For

p:Y \to X

to be a covering, we need

p|_{V_d}:V_d \to U_x

to be a homeomorphism for each

d \in D

. However, in this case any

p|_{V_d}

is mapping from an empty set

V_d

to a non-empty set

U_x

EDIT:
I think the following is wrong, but I leave it here for context:
[Hence

p|_{V_d}

can't be a homeomorphism, and

p

can't be a covering. We conclude that Wikipedia's definition of a covering

p:Y \to X

excludes

Y

from being empty, at least when

X

is non-empty.]

David Egolf (Sep 11 2024 at 17:44):

David Egolf (Sep 11 2024 at 17:46):

Strictly speaking, we are not required to have a homeomorphism

p|_{V_d}:V_d \to U_x

. Instead, we are required that for every

d \in D_x

we have a homeomorphism

p|_{V_d}:V_d \to U_x

. So, if

D_x

is empty, this condition can still hold trivially!

David Egolf (Sep 11 2024 at 17:49):

Let us consider

p^{-1}(U_x)

. Since

Y

is empty,

p^{-1}(U_x)

is also empty. We want to write the empty space as a coproduct

\coprod_{d \in D_x}V_d

where

D_x

is empty. What is the empty coproduct in

\mathsf{Top}

David Egolf (Sep 11 2024 at 17:52):

I expect that the empty coproduct is the colimit of the empty diagram, which is the initial object of

\mathsf{Top}

. So, the empty coproduct should be the empty space.

John Baez (Sep 11 2024 at 17:53):

Yes, that's how the empty set tricks people: a lot of things are vacuously true about it.

John Baez (Sep 11 2024 at 17:55):

Right. Maybe you can see why the editors of this page, who are probably not category theorists, slipped up around here.

David Egolf (Sep 11 2024 at 18:04):

So, it seems that

Y

can indeed be empty in Wikipedia's definition of a covering

p:Y \to X

. To summarize the case when

Y

is empty and

X

is non-empty:

David Egolf (Sep 11 2024 at 18:06):

So I indeed stand corrected! When using Wikipedia's definition of a covering, one can have a non-surjective covering

p:Y \to X

even when

X

is connected. The empty map

p:Y \to X

provides such a covering when

Y

is empty and

X

is connected.

David Egolf (Sep 11 2024 at 18:08):

(Nothing like a bit of fun with the empty set to start out the day :sweat_smile: !)

David Egolf (Sep 11 2024 at 18:21):

Before wrapping up for today, I wanted to draw a picture. Specifically, I want to start thinking about the sheaf of sections

\Gamma(p)

of this bundle

p:Y \to X

:
bundle

I'm quite curious to see how the local non-injectivity of

p

at

y

gets removed as we apply

\Lambda \circ \Gamma: \mathsf{Top}/X \to \widehat{\mathcal{O}(X)} \to \mathsf{Top}/X

David Egolf (Sep 11 2024 at 18:23):

Intuitively, the local non-injectivity of this map comes from the following fact: the two "branches" to the right of

y

have points arbitrarily close to

y

. I am guessing that by applying the above process we will end up with a bunch of germs associated to

y

, but where each of these germs will only be "really close" to some germs associated to one of the two branches.

David Egolf (Sep 11 2024 at 18:28):

Here's a picture illustrating two sections of

p:Y \to X

, namely

s:V \to Y

and

s':V \to Y

:
picture

Each of

s

and

s'

is a section of

p:Y \to X

over the open set

V \subseteq X

. So,

s,s' \in \Gamma(p)(V)

, where

\Gamma(p):\mathcal{O}(X)^{\mathrm{op}} \to \mathsf{Set}

is the sheaf of sections of

p

David Egolf (Sep 11 2024 at 18:40):

Notice that the section

s'

goes along the upper branch, while

s

goes along the lower branch. I strongly suspect that the germ of

s'

at

y

will be different from the germ of

s

at

y

. Intuitively that would reflect the fact that each section is passing through

y

in a certain way!

If this intuition is right, we can begin to see the single point

y

split into multiple germs at that point, including the germs I just described

[s']_y

and

[s]_{y}

. I suspect that

[s']_y

is close to germs from

s'

near

y

- for example, germs of

s'

on the upper branch. And similarly

[s]_y

I suspect is close to germs from

s

near

y

- for example germs of

s

on the lower branch.

Intuitively, we are getting multiple germs associated to

y

. And I suspect that each of these germs is only arbitrarily near to germs from a section that flows along on one of the two branches splitting off from

y

. So we can perhaps begin to see how our bundle of germs of sections of

p

could be locally injective!

David Egolf (Sep 11 2024 at 18:50):

John Baez (Sep 11 2024 at 20:18):

Okay, so we see eye to eye. Wikipedia's definition looks correct to me, and with this definition a covering space

p:X\to Y

is surjective if

X

is nonempty and

Y

is connected.

John Baez (Sep 11 2024 at 20:28):

To prove this particular fact, note (or show) that two sections of a bundle, say

s

and

s'

, give two elements of its sheaf of sections, and these two elements have the same germ at a point

x

iff

s

and

s'

become equal when restricted to some open neighborhood of

x

. But in your picture

s

and

s'

are not equal on any open neighborhood of

x = p(y)
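
The criterion above, and the way it applies to the branching picture, can be sketched as follows. The second display relies on assumptions about a picture not reproduced here: X is drawn as a line, the two branches lie to the right of y, and s runs along the lower branch while s' runs along the upper one.

```latex
% Germ equality for two sections s, s' \in \Gamma(p)(V), with x \in V:
\[
[s]_x = [s']_x
\iff
\text{there is an open } W \text{ with } x \in W \subseteq V
\text{ and } s|_W = s'|_W .
\]
% Assumed picture: the branches separate immediately to the right of y,
% so s and s' differ at every point of V to the right of p(y).
\[
\text{for every open } W \ni p(y):\quad
\exists\, x' \in W \cap V,\ x' > p(y),\ s(x') \neq s'(x')
\ \Longrightarrow\
[s]_{p(y)} \neq [s']_{p(y)} .
\]
```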

Peva Blanchard (Sep 12 2024 at 21:56):

I've been thinking about this. I did not manage to prove that all fibers must be in bijection with one another yet, but I made a small step: when the bundle

p : Y \to X

is étale, then every fiber is discrete (w.r.t. the subspace topology).

Indeed, let

u \in F_x = p^{-1}(x)

be a point in the fiber over

x

. Since

p

is étale, there exists an open neighborhood

U

of

u

s.t.

p_{|U} : U \to p(U)

is a homeomorphism. By definition of the subspace topology,

U \cap F_x

is open in

F_x

. But,

p_{|U}

being a homeomorphism implies that

U \cap F_x = \{u\}

. Hence, every singleton in

F_x

is open. I.e.,

F_x

is discrete.
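
The one step in this argument that is stated without justification, namely that U meets the fiber only in u, can be spelled out as a short sketch. It uses only the injectivity of p on U; u' is a name introduced here for an arbitrary point of the intersection.

```latex
% Why U \cap F_x = \{u\}: only injectivity of p on U is needed.
\begin{align*}
u' \in U \cap F_x
&\implies p(u') = x = p(u) \text{ with } u, u' \in U \\
&\implies u' = u \quad \text{(since } p|_U \text{ is injective).}
\end{align*}
% Hence \{u\} = U \cap F_x, which is open in F_x for the subspace topology.
```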

John Baez (Sep 13 2024 at 01:20):

A while ago in this thread I mentioned a simple counterexample that works even when

X

is connected. (I didn't say it's a counterexample, but it clearly is.)

You can also find a counterexample that's a covering space when

X

is not connected.

John Baez (Sep 13 2024 at 01:23):

Peva Blanchard (Sep 13 2024 at 06:18):

Oh indeed! I've been unknowingly assuming that every fiber was not empty. The inclusion

(0,1) \subseteq \mathbb{R}

provides a counter-example. From there, we can build other counterexamples with fibers of arbitrary different sizes.

E.g.,

E = F_1 \times (0,1) + F_2 \times (2,3) + F_3 \times (4,5) + \dots

with

p: E \to \mathbb{R}

defined as the second projection on every term of the disjoint sum, and the

F_i

's being arbitrary discrete spaces.

Peva Blanchard (Sep 13 2024 at 06:27):

I should probably reformulate my original goal then. "If

p: E \to X

is an étale bundle, with

X

connected, and

p

surjective, then all fibers are in bijection with one another". I'll think about it.

John Baez (Sep 13 2024 at 15:24):

Alas, that conjecture is false too - and I think with your ability to create etale spaces with fibers of different sizes, it should be easy to disprove.

John Baez (Sep 13 2024 at 15:26):

Maybe you should try to prove that if

p: E \to X

is a covering space, and

X

is connected, then all fibers are in bijection with one another.

John Baez (Sep 13 2024 at 15:32):

Etale spaces seem too flexible for a good result of this sort unless we essentially require that they're covering spaces.

However, I now see, hovering before my eyes, an etale space

p: E \to X

where

X

is connected and all fibers are in bijection with one another, which is not a covering space.

David Egolf (Sep 13 2024 at 17:02):

I think this result provides some good intuition! I saw somewhere an analogy between an etale space and a pastry having many thin layers. I think one could view this result as saying that each point in a given fibre is "apart" from the other points in that fibre. And this could be viewed as saying that each point in any given fibre is in a separate "layer" from the others in that fibre.

Peva Blanchard (Sep 13 2024 at 17:02):

Oh yes. Actually, if

C

is any collection of open subsets of

X

, then we can form the coproduct

When

C

is the set of all open subsets of

X

, it looks like the corresponding bundle is initial among all the étale spaces over

X

Peva Blanchard (Sep 13 2024 at 17:04):

I had the exact same picture in mind! It's called "mille-feuille" in French. And it's quite crispy. The only difference is that étale spaces seem to lack cream. Étale spaces taste very dry.

David Egolf (Sep 13 2024 at 17:14):

If I understand correctly, in particular this lets us have layers that overlap in terms of their projection:
picture

The projection map is not injective, but it is locally injective. Some fibres have two elements, and some fibres have a single element. So, the fibres aren't in bijection with one another, even though the projection is surjective and

X

is connected.

John Baez (Sep 13 2024 at 18:52):

Peva Blanchard (Sep 13 2024 at 20:04):

Fix a point

x_0

in the base. There is an open neighborhood

U_0

of

x_0

such that the pre-image is homeomorphic to

U_0 \times E_{x_0}

where

E_{x_0}

is the fiber over

x_0

. In particular, for every

x \in U_0

, the fiber

E_x

is in bijection with

E_{x_0}

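Let

W = \{x \in X \mid E_x \cong E_{x_0}\}

be the set of points whose fiber is in bijection with

E_{x_0}

. We want to prove that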

W = X

. Since

X

is connected and

W

is not empty, it suffices to show that

W

is both open and closed.

Clearly,

U_0 \subseteq W

. For any

x \in W

, there exists a neighborhood

U

of

x

such that

p^{-1}U \cong U \times E_x \cong U \times E_{x_0}

. Then,

U \subseteq W

. This proves that

W

is a neighborhood of each of its points, i.e.,

W

is open.

Let

y \in X - W

, i.e.,

E_y \not\cong E_{x_0}

. Using the defining property of the covering, there exists a neighborhood

V

of

y

such that

p^{-1}V \cong V \times E_y

. In particular, for every

y' \in V

we have

E_{y'} \cong E_y \not\cong E_{x_0}

. Thus,

V \subseteq X - W

. I.e.,

X - W

is open, and

W

is closed. qed.

John Baez (Sep 13 2024 at 20:21):

Great! I wasn't sure how to prove it, I just knew it was true. :upside_down: But this looks like the best way to prove it.

John Baez (Sep 13 2024 at 20:27):

So, I think the idea of "all fibers being in bijection with one another" is not really something we should expect of etale spaces, unless they are covering spaces of connected spaces.

But here's the example I was imagining of an etale space where all fibers are in bijection with each other even though it's not a covering space!

Start with the map

p: \mathbb{R}^2 \to \mathbb{R}

that projects onto the first coordinate:

Then let

Y \subseteq \mathbb{R}^2

be the subset that's the union of all horizontal lines

Restricting

p

to

Y

we get a map I'll abusively call

p: Y \to \mathbb{R}

. This is an etale space over a connected space, and all the fibers are in bijection with each other (since they're all countably infinite), but it's not a covering space.

Oscar Cunningham (Sep 13 2024 at 21:32):

Another example would be to take two copies of

\mathbb{R}

and quotient them together on

(0,\infty)

. This gives a space which has fibers of cardinality

2

above

(-\infty,0]

and fibers of cardinality

1

above

(0,\infty)

. So if we add a disjoint copy of

(0,\infty)

then the fibers will have cardinality

2

everywhere.
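
For concreteness, here is one way to write this example down. It is a sketch only: the labels a and b for the two copies of the real line, and the names \pi, Y and Y', are assumptions introduced for illustration.

```latex
% Two copies of \mathbb{R}, glued along (0, \infty); labels a, b are assumed names.
\[
Y = \bigl(\mathbb{R} \times \{a, b\}\bigr) / {\sim},
\qquad (t, a) \sim (t, b) \text{ for } t \in (0, \infty),
\qquad \pi([t, i]) = t .
\]
% Fiber sizes: the classes [t, a] and [t, b] stay distinct exactly when t \le 0.
\[
|\pi^{-1}(t)| =
\begin{cases}
2 & \text{if } t \le 0, \\
1 & \text{if } t > 0 .
\end{cases}
\]
% Adding a disjoint copy of (0, \infty), mapped to \mathbb{R} by inclusion:
\[
Y' = Y \sqcup (0, \infty), \qquad
|\pi'^{-1}(t)| = 2 \ \text{ for every } t \in \mathbb{R} .
\]
```

Note that the point 0 is not glued, so there are two distinct points [0, a] and [0, b] over it.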

John Baez (Sep 13 2024 at 21:36):

Thanks - that's a much more exciting example, because it doesn't use the dirty trick

\aleph_0+ 1 = \aleph_0

Oscar Cunningham (Sep 13 2024 at 21:38):

Kevin Carlson (Sep 13 2024 at 23:42):

That’s the kind of thing I was thinking of, but then gave up because it isn’t a local homeomorphism at the branch point.

Kevin Carlson (Sep 13 2024 at 23:44):

I suspect you can’t actually get an example of a local homeomorphism with fibers of constant finite size over a connected base that isn’t a covering.

John Baez (Sep 13 2024 at 23:54):

What if we define a topology such that the branch point y has open neighborhoods U containing just some of the lower branch at left (together with some of the stuff at right) and open neighborhoods V containing just some of the upper branch at left (together with some of the stuff at right)? Then their intersection needs to be open, and it doesn't 'look' open, but let's accept that.

Kevin Carlson (Sep 14 2024 at 00:07):

I was wondering about that too but if you intersect two of those neighborhoods you see that there’s an open subset to the right of the branch point that looks like a half-open interval. In other words near the branch point I think the space we’ve specified is just a disjoint union of two open intervals and a half open interval, and so again this isn’t actually a local homeomorphism!

Kevin Carlson (Sep 14 2024 at 00:08):

If you allow things continuing into just the upper left to be open but not just the lower left, then maybe…

Kevin Carlson (Sep 14 2024 at 00:09):

Oh, eek, Oscar hasn’t specified a branch point in the way I thought since he glued over

(0,\infty),

not

[0,\infty)

John Baez (Sep 14 2024 at 00:09):

Kevin Carlson (Sep 14 2024 at 00:10):

Kevin Carlson (Sep 14 2024 at 00:13):

John Baez (Sep 14 2024 at 00:18):

Okay, I hadn't really understood what Oscar's example actually was. I thought there was one point where the two branches merge, but there are two, and that saves the day, apparently.

John Baez (Sep 14 2024 at 01:23):

Though of course that's necessary to accomplish what we're looking for: the same number of points in every fiber!

Oscar Cunningham (Sep 14 2024 at 05:58):

Right. The way I think about it is that bundles over

\mathbb{R}

can split

1\to n

, merge

n\to 1

, begin

0\to 1

or end

1\to 0

. But whichever way they go the bit with

1

above it has to be an open set. That's why I quotiented by

(0,\infty)

above, leaving two origins.

Kevin Carlson (Sep 14 2024 at 06:16):

John Baez (Sep 14 2024 at 14:40):

Here's another way to think about this. Start with a bundle

Y \to X

like this:

This is the bundle Kevin and I originally thought Oscar was talking about: the fiber over the arrow has just one point, while each fiber to the left of that has two, and each fiber to the right has one.

John Baez (Sep 14 2024 at 14:44):

Then apply the functor

\Lambda \circ \Gamma

that @David Egolf has been investigating. So: take the sheaf of sections of our bundle

Y \to X

, and then form the bundle of germs of that sheaf. This new bundle

\Lambda \Gamma(Y) \to X

is an etale space!

John Baez (Sep 14 2024 at 14:47):

The sheaf of sections of the original bundle has two different germs at the arrow, and two at each point to the left, and one at each point to the right.

John Baez (Sep 14 2024 at 14:48):

So our etale space has two points above the arrow, while our original bundle had just one!

John Baez (Sep 14 2024 at 14:51):

The counit

\Lambda \Gamma(Y) \to Y

must collapse those two points down to one.

John Baez (Sep 14 2024 at 14:53):

So, it maps what Oscar was trying to talk about, down to what Kevin and I originally thought he was talking about.

Peva Blanchard (Sep 15 2024 at 10:19):

Just to be sure that I understand the example correctly, especially the reason why it is not a covering space, I'm drawing it like this
image.png

If it were a covering, then the portion of the total space enclosed in the two yellow dashed lines would be homeomorphic to

U \times 2

From the picture, we can see that it does not look like two disjoint copies of

U

. But I have trouble formulating a precise argument that rules out the existence of a homeomorphism to

U \times 2

I thought about connectedness, but if I'm not mistaken the white part is connected, hence

\pi^{-1}U

has 2 connected components. So the invariant "number of connected components" is not enough to distinguish

\pi^{-1}U

and

U \times 2

Oscar Cunningham (Sep 15 2024 at 13:14):

Just looking at the number of connected components isn't enough, but you can use the fact that there are points in the base space for which both points above them are in the same component. That can't happen with

U\times 2

Peva Blanchard (Sep 15 2024 at 15:09):

Ah yes, I see, thank you! My mistake was in thinking of

\pi^{-1}U \cong U \times 2

as a homeomorphism between topological spaces, whereas it should be an isomorphism of bundles over

U

Peva Blanchard (Sep 15 2024 at 15:13):

Or, more precisely: as a mere topological space,

\pi^{-1}U

is indeed homeomorphic to

U \times 2

. But, they are not isomorphic as bundles over

U

, thanks to your argument.

Oscar Cunningham (Sep 15 2024 at 15:48):

They're not isomorphic as bundles over

U

. But I don't think they are homeomorphic either. It's just harder to write down an invariant that proves that they're not homeomorphic.

Oscar Cunningham (Sep 15 2024 at 15:48):

Peva Blanchard (Sep 15 2024 at 15:53):

Oh yes, the two origins cannot be separated in your example. I was wrong,

\pi^{-1}U

and

U \times 2

are not even homeomorphic as mere topological spaces

John Baez (Sep 15 2024 at 15:56):

That's true. But the really important lesson here is that given bundles

p: E \to B

and

p': E' \to B

, they are only isomorphic if there's a homeomorphism

f: E \to E'

with

p = p ' \circ f

. That last equation makes isomorphism of bundles much more than mere homeomorphism of their total spaces.
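
A tiny sanity check of this point, as a sketch: the finite discrete spaces below are an illustrative choice of mine, not an example from the thread. The total spaces are literally the same space, yet the bundles are not isomorphic, because an isomorphism of bundles must restrict to a bijection on each fiber.

```latex
% Illustrative example (assumed, not taken from the discussion above).
% Base space: B = \{0,1\} with the discrete topology.
% Total spaces: E = E' = \{a,b,c\}, also discrete.
\[
  p(a) = 0,\quad p(b) = p(c) = 1,
  \qquad
  p'(a) = p'(b) = 0,\quad p'(c) = 1.
\]
% Any homeomorphism f : E \to E' with p = p' \circ f would restrict to a
% bijection p^{-1}(0) \to (p')^{-1}(0). But the fibers have different sizes:
\[
  |p^{-1}(0)| = 1 \neq 2 = |(p')^{-1}(0)|,
\]
% so p and p' are not isomorphic as bundles over B, even though E = E'.
```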

John Baez (Sep 15 2024 at 16:00):

This means that there needs to be a subject of "invariants of bundles" which goes beyond the subject of "invariants of topological spaces". Algebraic topology provides lots of both. For vector bundles, the most famous invariants are the [[Chern classes]] (for complex vector bundles) and [[Pontryagin classes]] and [[Stiefel-Whitney classes]] (for real vector bundles). All these can be defined using [[classifying spaces]], which we've been talking about in another thread.

John Baez (Sep 15 2024 at 16:02):

There are also "invariants of sheaves", which are especially well developed for sheaves of vector spaces - sheaves

F

where the set

F(U)

attached to any open set

U

is actually a vector space, and the restriction maps

F(U) \to F(V)

are linear. (Or, in algebraic geometry, sheaves of modules of the so-called "structure sheaf" - probably not worth explaining here. Grothendieck was especially involved in studying these, and his studies of such sheaves eventually led him to topos theory.)

David Egolf (Sep 15 2024 at 18:43):

This is helping me understand why our counit natural transformation needs to go from

\Lambda \circ \Gamma

to

1_{\mathsf{Top}/X}

and not from

1_{\mathsf{Top}/X}

to

\Lambda \circ \Gamma

. If we focus on a single point "hovering" in some bundle, and then apply

\Lambda \circ \Gamma

, we get out the germs of sections that go through that point. It's very natural to define a function that collapses each of these germs back to the original point. By contrast, I don't see a nice way to define a function that would send our original point to some particular germ of a section that goes through that point. (I don't see a nice way to pick some distinguished germ that is most deserving of being mapped to by our original point).

John Baez (Sep 16 2024 at 06:00):

Indeed, I doubt there's a way to pick a distinguished germ in general. It could be good to try to prove this by showing there exists no natural transformation

\alpha: 1_{\mathsf{Top}/X} \to \Lambda \circ \Gamma

. I think one can prove this by considering a single bundle that has lots of automorphisms, like the bundle

Each automorphism gives a naturality square that needs to commute, and I think one can show they can't all commute, no matter how one chooses

\alpha

David Egolf (Sep 22 2024 at 16:59):

I feel more comfortable now with

\Gamma

and

\Lambda

, thanks to the discussion and picture-drawing above. Building on this understanding, I now want to return to the counit for our adjunction

\Lambda \dashv \Gamma

David Egolf (Sep 22 2024 at 17:01):

We are looking for a natural transformation

\epsilon: \Lambda \circ \Gamma \to 1_{\mathsf{Top}/X}

. Given a particular bundle

p:Y \to X

,

\Gamma(p)

is its sheaf of sections, and then

\Lambda(\Gamma(p))

is the bundle of germs of that sheaf of sections. Thinking of our bundle as some geometry "hovering over"

X

, then the sections are ways to "travel through" parts of that geometry, and we can get multiple germs at some point if there are sections with different germs passing through that point.

David Egolf (Sep 22 2024 at 17:07):

Given this intuition, I am hoping we can define a morphism of bundles

\epsilon_p:(\Lambda \circ \Gamma)(p) \to p

that sends each germ of a section associated to some point

y \in Y

back to

y

. Intuitively we are "collapsing" the cloud of germs associated to a point back to that point.

David Egolf (Sep 22 2024 at 17:08):

Let

s:U \to Y

be a section of

p:Y \to X

over the open set

U \subseteq X

, having germ

[s]_x

at

x \in U

. Then we want

\epsilon_p

to send

[s]_x

to

s(x)

David Egolf (Sep 22 2024 at 17:12):

The first order of business is to check that

\epsilon_p: (\Lambda \circ \Gamma) (p) \to p

really is a morphism of bundles. I will denote the corresponding function as

\epsilon_p: (\Lambda \circ \Gamma)_p \to Y

, where

(\Lambda \circ \Gamma)_p

is the space of germs of the sheaf

\Gamma(p)

This function needs to preserve fibers and it also needs to be continuous. By "preserving fibers" I mean that it maps any data hovering over

x \in X

to data hovering over

x \in X

, for any

x \in X

David Egolf (Sep 22 2024 at 17:16):

First, let's check that

\epsilon_p: (\Lambda \circ \Gamma)_p \to Y

preserves fibers. We have

\epsilon_p([s]_x) = s(x)

for any germ of a section

[s]_x

. But both

[s]_x

and

s(x)

"hover over"

x

in their respective bundles, so

\epsilon_p

does preserve fibers.

David Egolf (Sep 22 2024 at 17:22):

I next want to show that

\epsilon_p: (\Lambda \circ \Gamma)_p \to Y

is continuous. We know that the projection of germs to their "base" point

(\Lambda \circ \Gamma)(p):(\Lambda\circ \Gamma)_p\to X

is continuous. And we know that

s:U \to Y

is continuous, for any section

s

. I'm hoping to somehow use these facts to prove that

\epsilon_p

is continuous.

David Egolf (Sep 22 2024 at 17:28):

Let

[s]_x \in (\Lambda \circ \Gamma)_p

, where

s:U \to Y

is a section of

p:Y \to X

over the open set

U \subseteq X

. I'd like to find an open set containing

[s]_x

such that the projection of the germs in that open set all land in

U

I seem to recall that the set of all the germs of

s

over

U

is an open set containing

[s]_x

in

(\Lambda \circ \Gamma)_p

David Egolf (Sep 22 2024 at 17:31):

I'll use

[s]_U

to refer to this open set containing

[s]_x

. Then, we get a restriction of our projection (for germs of sections) as

(\Lambda \circ \Gamma(p))_{[s]_U}:[s]_U \to U

. This is a continuous function, because restricting a continuous function in this way always yields a continuous function.

David Egolf (Sep 22 2024 at 17:32):

Then,

s \circ (\Lambda \circ \Gamma(p))_{[s]_U}:[s]_U \to U \to Y

acts by

[s]_x \mapsto x \mapsto s(x)

. And this function is continuous, because it is the composite of continuous functions.

Further, we note that this function is the restriction of

\epsilon_p: (\Lambda \circ \Gamma)_p \to Y

to

[s]_U

David Egolf (Sep 22 2024 at 17:36):

Since each germ

[s]_x

is the germ of some section

s:U \to Y

for some open set

U \subseteq X

, we can perform a similar procedure at each point in

(\Lambda \circ \Gamma)_p

. The various

[s]_U

form an open cover for

(\Lambda \circ \Gamma)_p

, and our restriction of

\epsilon_p

to each of these open sets is continuous.

We conclude that

\epsilon_p

is continuous, because we can always "glue together" continuous functions that agree on overlaps to make a continuous function.
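
The continuity argument above can be condensed into two lines. This is only a sketch: the name \pi for the projection of germs to their base points is shorthand introduced here, and the union runs over all pairs (U, s) with s a section over the open set U.

```latex
% \pi := (\Lambda \circ \Gamma)(p) : (\Lambda \circ \Gamma)_p \to X  (shorthand, assumed name)
\[
  \epsilon_p\big|_{[s]_U} \;=\; s \circ \pi\big|_{[s]_U}
  \;:\; [s]_U \longrightarrow U \longrightarrow Y,
  \qquad [s]_{x'} \mapsto x' \mapsto s(x').
\]
% Each restriction is a composite of continuous maps, and the sets [s]_U
% form an open cover of the space of germs:
\[
  (\Lambda \circ \Gamma)_p \;=\; \bigcup_{(U,s)} [s]_U ,
\]
% so \epsilon_p is continuous, since continuity can be checked on an open cover.
```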

David Egolf (Sep 22 2024 at 17:44):

David Egolf (Sep 22 2024 at 17:47):

I just realized I didn't check that

\epsilon_p:[s]_x \mapsto s(x)

is really a function. We need to show that if

[s]_x = [t]_x

then

s(x) = t(x)

. But if

[s]_x = [t]_x

that implies that

s

and

t

are equal on some small enough open neighborhood of

x

. In particular, they are equal when evaluated at

x

. So,

\epsilon_p:[s]_x \mapsto s(x)

really is a function.
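
Written out as a chain of implications (a sketch; U' here is an assumed name for the domain of the second section t):

```latex
% s : U \to Y and t : U' \to Y are sections with x \in U \cap U'.
\[
  [s]_x = [t]_x
  \;\iff\; \exists\, W \text{ open},\ x \in W \subseteq U \cap U',\ s|_W = t|_W
  \;\implies\; s(x) = t(x).
\]
% Hence the value s(x) does not depend on which representative of the germ we pick.
```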

John Baez (Sep 22 2024 at 19:31):

David Egolf (Sep 23 2024 at 16:32):

To wrap up this puzzle, it remains to show that our components

\epsilon_p:(\Lambda \circ \Gamma)(p) \to p

assemble to form a natural transformation

\epsilon: \Lambda \circ \Gamma \to 1_{\mathsf{Top}/X}

David Egolf (Sep 23 2024 at 16:34):

To check this, we consider a naturality square associated to a morphism

f:p \to p'

in

\mathsf{Top}/X

:
naturality square

David Egolf (Sep 23 2024 at 16:36):

This diagram lives in

\mathsf{Top}/X

, so each of the morphisms here is a morphism of bundles. To show that this diagram commutes, it suffices to show that the corresponding functions commute. So, we consider this square of topological spaces and continuous functions:
square

David Egolf (Sep 23 2024 at 16:39):

Now, two functions with the same source and target are equal iff they agree when evaluated at any element. So, let's trace an element of

(\Lambda \circ \Gamma)_p

around this square, and see what we get via the top right path as compared to the bottom left path.

David Egolf (Sep 23 2024 at 16:41):

Our space

(\Lambda \circ\Gamma)_p

is the space of germs of sections of the bundle

p:Y \to X

. So, an element of this space is of the form

[s]_x

, which refers to the germ of some section

s:U \to Y

of

p

at the point

x \in U

, for some open set

U \subseteq X

David Egolf (Sep 23 2024 at 16:41):

Going around the bottom left path,

[s]_x \mapsto s(x) \mapsto f(s(x))

, where we have used the definition of

\epsilon_p

discussed above.

David Egolf (Sep 23 2024 at 16:54):

Going around the top right path, I need to recall how

\Lambda \circ \Gamma

acts on a morphism

f:p \to p'

in

\mathsf{Top}/X

First,

\Gamma

converts this morphism

f

of bundles to a natural transformation between sheaves. This natural transformation at component

U

(where

U

is an open subset of

X

) sends a section of

p

to a section of

p'

by post-composition with

f

. That is, a section

s:U \to Y

gets mapped to a section of

Y'

given by

f \circ s:U \to Y \to Y'

. So, the

U

-th component function acts by post-composition with

f

Then we apply

\Lambda

, which needs to take this natural transformation and produce a morphism of bundles. The bundles in question are specifically bundles of germs of sections. Given a germ

[s]_x \in (\Lambda \circ \Gamma)_p

, with

s:U \to Y

a section over

U

, we want to get a germ "hovering over"

x

in

(\Lambda \circ \Gamma)_{p'}

. We do this by applying our

U

-th component function to

s

, to get the germ

[f \circ s]_x

David Egolf (Sep 23 2024 at 16:56):

We can now trace

[s]_x

around the top-right path in our square. We get

[s]_x \mapsto [f \circ s]_x \mapsto (f \circ s)(x) = f(s(x))

. We conclude that the square commutes, and so our original square of bundle morphisms also commutes.

Thus, an arbitrary naturality square for

\epsilon: \Lambda\circ \Gamma \to 1_{\mathsf{Top}/X}

commutes, and so

\epsilon

is indeed a natural transformation!
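
For reference, the whole naturality check fits in one line, using only the definitions of the two paths traced above:

```latex
\[
  \bigl(\epsilon_{p'} \circ (\Lambda \circ \Gamma)(f)\bigr)([s]_x)
  \;=\; \epsilon_{p'}\bigl([f \circ s]_x\bigr)
  \;=\; (f \circ s)(x)
  \;=\; f(s(x))
  \;=\; \bigl(f \circ \epsilon_p\bigr)([s]_x).
\]
```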

David Egolf (Nov 18 2024 at 17:06):

I want to start on Part 4 of the blog post series today! (My motivation to work through this series remains fairly high; it's just a matter of finding the energy to do so.)

David Egolf (Nov 18 2024 at 17:14):

The goal in Part 4 is to learn how to "pull back" sheaves along a continuous map. First, we review how to push them forward along a continuous map.

Given a continuous map

f:X \to Y

we get an induced (preimage) map from the open sets of

Y

to the open sets of

X

. This in turn induces a functor from

\mathcal{O}(Y)^{\mathrm{op}}

to

\mathcal{O}(X)^{\mathrm{op}}

, which we'll call

f^{-1}

. By precomposing with

f^{-1}

we can start with a presheaf on

X

and end up with a presheaf on

Y

David Egolf (Nov 18 2024 at 17:14):

The resulting presheaf is the "pushforward" of our original presheaf

F

along

f

, denoted

f_*F

, so that we have

f_*F = F \circ f^{-1}

. This process can be extended to morphisms in a functorial way, so we end up with a functor from presheaves on

X

to presheaves on

Y

. In fact, this also gives a functor from sheaves on

X

to sheaves on

Y
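
Spelled out on objects and morphisms, as a sketch: \alpha : F \to G below is an assumed name for a morphism of presheaves on X.

```latex
% On open sets V' \subseteq V of Y:
\[
  (f_*F)(V) = F\bigl(f^{-1}(V)\bigr),
  \qquad
  (f_*F)(V) \to (f_*F)(V')
  \ \text{ is the restriction }\
  F\bigl(f^{-1}(V)\bigr) \to F\bigl(f^{-1}(V')\bigr),
\]
% which makes sense because V' \subseteq V implies f^{-1}(V') \subseteq f^{-1}(V).
% On a morphism of presheaves \alpha : F \to G, take components at preimages:
\[
  (f_*\alpha)_V = \alpha_{f^{-1}(V)} : F\bigl(f^{-1}(V)\bigr) \to G\bigl(f^{-1}(V)\bigr).
\]
```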

David Egolf (Nov 18 2024 at 17:16):

Now we want to go in the opposite direction: from sheaves on

Y

to sheaves on

X

, given a continuous function

f:X \to Y

. As a mini-challenge to myself, I'm going to see if I can guess how we might do this before the blog post gives the answer...

David Egolf (Nov 18 2024 at 17:21):

A presheaf on

Y

, such as

F:\mathcal{O}(Y)^{\mathrm{op}} \to \mathsf{Set}

, intuitively attaches some information to each open set of

Y

. However, we've seen before that we can associate information to each point of

Y

using a presheaf on

Y

. Namely, we can form a full subcategory of

\mathcal{O}(Y)^{\mathrm{op}}

by including exactly the open sets that contain some point

y

of interest, apply

F

to that to get a diagram in

\mathsf{Set}

, and then take the colimit of that diagram in

\mathsf{Set}

. In this way, we can associate a set to each point of

Y

using

F
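
In symbols, this colimit has a concrete description. The following is a sketch; the notation F_y for the resulting set is assumed here, since the thread has not named it at this point.

```latex
\[
  F_y \;=\; \operatorname*{colim}_{U \ni y} F(U)
  \;=\; \Bigl(\coprod_{U \ni y} F(U)\Bigr) \Big/ \sim ,
\]
% where two elements are identified when they agree on some smaller
% open neighbourhood of y:
\[
  (U, s) \sim (U', s')
  \;\iff\;
  \exists\, W \text{ open},\ y \in W \subseteq U \cap U',\ s|_W = s'|_W .
\]
```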

John Baez (Nov 18 2024 at 17:32):

Nice! For those who haven't read all > 1000 comments on this thread, you're now alluding to how any presheaf over a topological space has a 'stalk' at each point of that space, the stalk being the set of 'germs' of the presheaf at that point.

David Egolf (Nov 18 2024 at 17:36):

Yes! We saw earlier that this information can be organized as a "bundle" on

Y

; as a continuous function to

Y

. Specifically, we get a continuous function

p:\Lambda(F) \to Y

where the set (the set of germs, the "stalk") associated to

y \in Y

is given as

p^{-1}(y)

David Egolf (Nov 18 2024 at 17:40):

Now, I want to associate a set to each point of

X

. I notice that applying

f:X \to Y

to a point of

X

gives me a point of

Y

, and so I could associate to

x \in X

the set which is already associated (using

F

) to

f(x)

David Egolf (Nov 18 2024 at 17:46):

What I really want is to produce a bundle on

X

. (That's because I could then convert that bundle on

X

to a presheaf on

X

!) To do that, I think we can use a pullback:
getting a bundle on X

David Egolf (Nov 18 2024 at 17:48):

Since this diagram lives in

\mathsf{Set}

[edit: this is wrong - see discussion below], we have some hope to explicitly compute this pullback. It should be the subset of

X \times \Lambda(F)

consisting of pairs

(x, g)

such that

f(x) =p(g)

David Egolf (Nov 18 2024 at 17:50):

What set are we attaching to some point

x \in X

? This is the set of pairs

(x, g)

such that

f(x)=p(g)

. So, we are indeed attaching to

x

the data attached by

F

to

f(x) \in Y

. That matches our intuitive guess from above!
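
The fiber computation can be written in one display. This is a sketch at the level of underlying sets only; the topology is dealt with separately.

```latex
\[
  X \times_Y \Lambda(F) = \{\, (x,g) \in X \times \Lambda(F) : f(x) = p(g) \,\},
  \qquad p'(x,g) = x,
\]
\[
  (p')^{-1}(x) = \{\, (x,g) : p(g) = f(x) \,\} \;\cong\; p^{-1}\bigl(f(x)\bigr),
\]
% and p^{-1}(f(x)) is the set of germs of F at the point f(x).
```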

David Egolf (Nov 18 2024 at 17:53):

We can then convert this bundle

p'

on

X

to a presheaf on

X

, by taking the presheaf of sections of

p'

. So we obtain a presheaf on

X

from a presheaf on

Y

, as was our goal!

David Egolf (Nov 18 2024 at 17:57):

Ah, a correction - the diagram I drew above lives in

\mathsf{Top}

, the category of topological spaces. So that complicates things a little bit.

John Baez (Nov 18 2024 at 17:59):

This strategy sounds great! Maybe we can polish it up a bit. There are ways to turn presheaves into bundles and bundles back into presheaves. These processes are adjoint functors. But even better, I think we've seen there's an equivalence between the really nice presheaves on a topological space

X

, namely the sheaves, and the really nice bundles over

X

, namely the etale spaces.

So given a map of spaces

f: X \to Y

we can take a sheaf on

Y

, convert it to an etale space over

Y

, pull it back to

X

, and convert it back to a sheaf. This process exists since we can pull back a bundle and get a bundle. But if we can pull back an etale space and get an etale space, this process will be even nicer, since we'll never be leaving the world of etale spaces and sheaves, which are just two equivalent ways of talking about the same thing.

David Egolf (Nov 18 2024 at 18:01):

Oh that's a great point! We want to not only pull back presheaves to get presheaves - we want to pull back sheaves to get sheaves. So it will be even better if we can show that not only does pulling back a bundle give a bundle, but pulling back an etale space (which closely relates to a sheaf) gives us an etale space.

David Egolf (Nov 18 2024 at 18:03):

I still think it's a reasonable first step to finish showing that we can pull back a bundle to get a bundle. That basically amounts to showing that

\mathsf{Top}

has pullbacks.

John Baez (Nov 18 2024 at 18:04):

Definitely that's the right first step, especially since etale spaces are bundles with a mere extra property - so you can then go ahead and see whether pulling back bundles preserves that property.

David Egolf (Nov 18 2024 at 18:05):

I know that the forgetful functor

U:\mathsf{Top} \to \mathsf{Set}

is a right adjoint, and hence it preserves limits. So if

\mathsf{Top}

has a pullback of some diagram, its underlying set and functions are given by the corresponding pullback of the diagram's image in

\mathsf{Set}

. That immediately gives us a candidate for the underlying set and functions of a pullback in

\mathsf{Top}

David Egolf (Nov 18 2024 at 18:07):

John Baez (Nov 18 2024 at 18:07):

I can't resist giving a hint (which you probably already know): in Set, a pullback is a subset of a product.

David Egolf (Nov 18 2024 at 18:16):

Oh, that's a helpful hint! I'll draw a diagram to illustrate the general situation:
pullback in Top

In

\mathsf{Set}

, we have that

A \times_B A'

is a subset of

A \times A'

. So in

\mathsf{Top}

, we could put the subspace topology on

A \times_B A'

. That topology is the coarsest one so that the inclusion

i:A \times_B A' \to A \times A'

is continuous.

David Egolf (Nov 18 2024 at 18:23):

We want to show that for any other cone over the

A \xrightarrow{f} B \xleftarrow{f'} A'

diagram, we have a unique continuous function

g

that makes this diagram commute:
universal property

David Egolf (Nov 18 2024 at 18:25):

If this diagram is to commute in

\mathsf{Top}

its image under

U:\mathsf{Top} \to \mathsf{Set}

must certainly commute. This determines

g

uniquely by the universal property of pullbacks in

\mathsf{Set}

David Egolf (Nov 18 2024 at 18:28):

Specifically, we must have

p(g(c)) =q(c)

and

p'(g(c)) = q'(c)

for any

c \in C

. So,

g(c) = (q(c), q'(c))

for any

c \in C

. It remains to show that this function is continuous.

David Egolf (Nov 18 2024 at 18:32):

Using the functions

q:C \to A

and

q':C \to A'

we get, using the fact that products exist in

\mathsf{Top}

, a corresponding continuous function

:C \to A \times A'

that acts by

c \mapsto (q(c),q'(c))

. The function

g

is then given by co-restricting this induced function.

David Egolf (Nov 18 2024 at 18:33):

I seem to recall that if one has a continuous function

f:A \to B

with

f(A) \subseteq C \subseteq B

and one co-restricts

f

to get a function

:A\to C

where

C

has the subspace topology, then this co-restricted function is continuous. If that's true, then

g

is continuous, as desired.

David Egolf (Nov 18 2024 at 18:36):

(Along the way, I assumed that

\mathsf{Top}

has products. I'm content to assume this for now, unless someone thinks it would be good at this point to prove.)

David Egolf (Nov 18 2024 at 18:38):

Since

\mathsf{Top}

has pullbacks, in particular we can pull back a bundle to get another bundle. That ability, combined with the fact that we can convert between bundles and presheaves, gives us the ability to pull back a presheaf on

Y

along

f:X \to Y

to get a presheaf on

X

David Egolf (Nov 18 2024 at 18:40):

We next want to show that we can pull back sheaves to get not just a presheaf but a sheaf. To do that, it suffices to show that pulling back an etale space gives us an etale space (because we can convert between etale spaces and sheaves).

David Egolf (Nov 18 2024 at 18:44):

An etale space amounts to a local homeomorphism

f':A' \to B

. Recalling the definition of local homeomorphism,

f'

is a continuous map such that each point of

A'

has an open neighborhood

U

such that

f'(U)

is open and the restriction

f'|_{U}:U \to f'(U)

is a homeomorphism, where

U

and

f'(U)

are equipped with the subspace topology.

David Egolf (Nov 18 2024 at 18:46):

I want to show that pulling back an etale space

f':A' \to B

along a continuous map

f:A \to B

gives an etale space

p:A \times_B A' \to A

. We've already seen that the pulled back map

p

is continuous when

A \times_B A'

is equipped with the subspace topology induced by

A \times_B A' \subseteq A \times A'

, but it remains to show that

p

is a local homeomorphism.

David Egolf (Nov 18 2024 at 18:57):

It might help to draw a picture to visualize the pulling back of an etale space. But I'll leave that for next time.

John Baez (Nov 18 2024 at 21:42):

Yes! And it's even better than that. If we give

C \subseteq B

the subspace topology, a function

f: A \to C

is continuous if and only if it's the corestriction of some continuous function

f: A \to B

whose image lies in

C

So to get a pullback in

\mathsf{Top}

we just take the pullback of the underlying diagram in

\mathsf{Set}

and give the resulting set the subspace topology coming from the product space (as you explained).
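
In the notation of the diagrams earlier in this discussion, the recipe reads as follows (a sketch that collects what was said above, with (q, q') an arbitrary cone):

```latex
\[
  A \times_B A' = \{\, (a,a') \in A \times A' : f(a) = f'(a') \,\}
  \subseteq A \times A'
  \quad \text{(subspace topology)},
\]
\[
  p(a,a') = a, \qquad p'(a,a') = a',
\]
% and for any cone q : C \to A, q' : C \to A' with f \circ q = f' \circ q',
% the unique mediating map is
\[
  g : C \to A \times_B A', \qquad g(c) = \bigl(q(c),\, q'(c)\bigr).
\]
```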

David Egolf (Nov 19 2024 at 18:43):

Ah, this rings a bell! I think you're mentioning what I've seen called the "characteristic property" of the subspace topology.

David Egolf (Nov 19 2024 at 18:46):

I next want to show that pulling back a local homeomorphism in

\mathsf{Top}

along any continuous function gives us a local homeomorphism. If we can do this, that'll mean that we can pull back a sheaf to get a sheaf.

David Egolf (Nov 19 2024 at 18:47):

This appears to be another example of something I've noticed earlier: learning this stuff has involved more topology than I expected :sweat_smile:! But the topology involved is good to learn too, so I don't mind too much.

David Egolf (Nov 19 2024 at 18:54):

I'll assume that

f':A' \to B

is a local homeomorphism, and I want to show that

p:A \times_B A' \to A

is then a local homeomorphism.

David Egolf (Nov 19 2024 at 18:56):

A function

g:X \to Y

is a local homeomorphism if these two conditions are met:

David Egolf (Nov 19 2024 at 18:58):

Let's show that

p

meets condition (1).
We recall that

A \times_B A'

has as points the pairs

(a,a')

such that

f(a)=f'(a')

, and that

p

projects down to the first coordinate. Thus,

p(A \times_B A')

is the subset of

A

consisting of points

a \in A

such that there is some

a' \in A'

with

f(a) = f'(a')

David Egolf (Nov 19 2024 at 18:59):

That is,

p(A \times_B A')

is the subset of

A

that is mapped by

f

to somewhere in the image of

f':A' \to B

. This is the preimage under

f

of

f'(A')

David Egolf (Nov 19 2024 at 19:01):

Since

f'

is a local homeomorphism,

f'(A')

is open. And since

f

is continuous, its preimage of

f'(A')

is also open. Since this is exactly the image of

p

, we conclude that

p(A \times_B A')

is open in

A

David Egolf (Nov 19 2024 at 19:03):

It remains to show that

p

meets condition (2).
Given some point

(a,a') \in A \times_B A'

, we need to show there's some open neighbourhood

U

containing that point so that

p

restricts to a homeomorphism on

U

David Egolf (Nov 19 2024 at 19:05):

Since

f':A' \to B

is a local homeomorphism, we know there is some open set

V \subseteq A'

containing

a'

so that

f'

restricts to a homeomorphism on

V

. Maybe we can use

V

to create our open set

U \subseteq A \times_B A'

containing

(a,a')

David Egolf (Nov 19 2024 at 19:21):

David Egolf (Nov 19 2024 at 19:27):

Referencing the picture, if we have some

V

around

a'

where

f'

restricts to a homeomorphism, we can try forming the analogous open set around

(a,a')

. This still feels tricky.

I think I need to "spread out" in the

A

direction as well as the

A'

direction. We could try doing this by taking

f^{-1}(f'(V)) \subseteq A

David Egolf (Nov 19 2024 at 19:36):

David Egolf (Nov 19 2024 at 19:38):

For brevity, let

Y = f^{-1}(f'(V))

. What are the points "hovering over"

Y

in

A \times_B A'

near

(a,a')

?

David Egolf (Nov 19 2024 at 19:41):

To specify a point in

A \times_B A'

we need to specify an element of

A

and an element of

A'

. The

A'

coordinates in our subset of interest intuitively should all belong to

V

. So let

v \in V

. What is the corresponding element of

A

? Presumably it should be

f^{-1}(f'(v))

However, the problem with this is that

f

is not necessarily injective, so

f^{-1}(f'(v))

could have more than one point.

David Egolf (Nov 19 2024 at 19:44):

But that's okay, maybe. We can consider points of the form

(x,y)

where

y \in V

and

x \in f^{-1}(f'(y))

Is such a point

(x,y)

an element of

A \times_B A'

? We just need

f(x) = f'(y)

, which is true. So, such a point is indeed an element of

A \times_B A'

David Egolf (Nov 19 2024 at 19:47):

Next, I would want to show that the collection of such points

(x,y)

forms an open subset of

A \times_B A'

. I'm not sure how to do that.

David Egolf (Nov 19 2024 at 19:52):

Maybe there is a simpler way to do this. We know that the projection on the second coordinate

p':A \times_B A' \to A'

is continuous. Pick some point

(a,a') \in A \times_B A'

around which we wish to find an open neighbourhood such that

p

is a homeomorphism when restricted to that neighbourhood.

Then

p'(a,a') = a'

has some open set

V \subseteq A'

where

f'

restricts to a homeomorphism. Since

p'

is continuous,

(p')^{-1}(V)

is an open subset of

A \times_B A'

David Egolf (Nov 19 2024 at 19:57):

What is

(p')^{-1}(V)

like? It consists of points

(x,y)

such that

y \in V

and

f(x) = f'(y)

David Egolf (Nov 19 2024 at 19:59):

Actually, I think

(p')^{-1}(V)

is exactly the set I had arrived at by considering the pictures above! That's pretty cool! And now we know that set is an open subset of

A \times_B A'

David Egolf (Nov 19 2024 at 20:09):

We let

f':A' \to B

be a local homeomorphism and we want to show that this implies that

p

is a local homeomorphism too. We already saw that the image of

p

is open, and it remains to show that for any point

(a,a') \in A \times_B A'

there exists an open set

U

containing

(a,a')

such that

p

restricts to a homeomorphism on

U

After some thought, we have arrived at a strategy for showing there exists such a

U

. Given

(a,a') \in A \times_B A'

, we know that

a' \in A'

has an open set

V

containing it such that

f'

restricts to a homeomorphism on

V

. We then take the preimage of

V

with respect to

p'

to obtain an open subset of

A \times_B A'

containing

(a,a')

David Egolf (Nov 19 2024 at 20:15):

Since

p

is continuous, its restriction to

U

is continuous. It remains to show that (1) its restriction to

U

is bijective and (2) its inverse as a function is also continuous.

David Egolf (Nov 19 2024 at 20:18):

A point in

U

is of the form

(x,y)

where

y \in V

and

f(x) = f'(y)

. Our map

p

returns the first coordinate.

p

is certainly surjective onto its image, but we still need to show that

p

is injective. That amounts to showing that if

(x,y)

and

(x,y')

are both in

U

, then

y=y'

David Egolf (Nov 19 2024 at 20:20):

If

(x,y)

and

(x,y')

are both in

U

, that implies that

f(x) = f'(y) = f'(y')

. But note that

y,y' \in V

, where

f'

restricts to a homeomorphism - and in particular where

f'

restricts to an injective function. Thus,

y=y'

as desired. So,

p

is injective when restricted to

U

We conclude that

p

restricts to a continuous bijection on

U

. It remains to show that the inverse (as a function) of this restricted function is also continuous.

David Egolf (Nov 19 2024 at 20:23):

Let's call the function that is an inverse (as a function) to

p

by the name

q

. So

q:p(U) \to U

. This sends an

x \in A

to the pair

(x,y)

where

y

is the unique

y \in V \subseteq A'

such that

f'(y) = f(x)

David Egolf (Nov 19 2024 at 20:28):

We recall that

f'

restricts to a homeomorphism on

V

. In particular it has a continuous inverse

(f')^{-1}:f'(V) \to V

. So we can compute our map

q:p(U) \to U

which sends

x \mapsto (x,y)

by using these two functions:

(Here,

i_A

is the inclusion map

:p(U) \to A

,

f

refers to a restricted and corestricted version of

f

, and

i_{A'}

is the inclusion map

:V \to A'

.)

Each of these two functions is continuous, and thus the induced map to

A \times A'

is continuous. And, the corestrictions of this map to

A \times_B A'

and to

U

are both continuous. So

p

has a continuous inverse when restricted to

U
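
Reading off the description of the maps involved, the two functions and the resulting inverse are presumably the following. This is a reconstruction and a sketch only; it writes f'|_V for the restriction of f' to V, and uses that every x in p(U) satisfies f(x) \in f'(V).

```latex
% The two continuous maps out of p(U):
\[
  i_A : p(U) \to A,
  \qquad
  i_{A'} \circ (f'|_V)^{-1} \circ f : p(U) \to f'(V) \to V \to A'.
\]
% The induced map into A \times A' lands in U, and is the inverse of p|_U:
\[
  q(x) = \bigl(x,\ (f'|_V)^{-1}(f(x))\bigr),
  \qquad
  p(q(x)) = x .
\]
```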

David Egolf (Nov 19 2024 at 20:32):

I like this approach! It is more work in some ways, but it's really nice to have motivation to learn some topology - and it's fun to see the topology in action. (I find it hard to get motivated to work on point set topology unless some other topic I care about makes use of it in a way I know about!)

David Egolf (Nov 19 2024 at 20:34):

I think I've shown above that the pullback of a local homeomorphism is a local homeomorphism. So we now have a way to pull back a sheaf to get another sheaf:

David Egolf (Nov 19 2024 at 20:37):

Consulting the current blog post, I see that we next have this puzzle, which will expand our understanding of how pullbacks of bundles work:

David Egolf (Nov 19 2024 at 20:37):

John Baez (Nov 19 2024 at 20:58):

Yes, because I wanted to introduce sheaves and topoi through the classical and 'familiar' example of sheaves on topological spaces. All my students had to have taken a year of topology (one quarter of point-set topology, one of differential topology and one of algebraic topology). So, I could build on that. Also, most applications of sheaves in math still use sheaves on topological spaces, though in his work on algebraic geometry (esp. etale cohomology, to prove Weil's conjectures) Grothendieck introduced sheaves on more general sites.

A more 'postmodern' approach might dive straight into sheaves on sites, but I prefer explaining math in a way that doesn't cut off the roots.

John Baez (Nov 19 2024 at 20:59):

John Baez (Nov 19 2024 at 21:01):

Some students find point set topology interesting for its own sake, but a lot of it was developed for applications - e.g. to real and complex analysis, and thus to understanding integrals and differential equations and things like that. Developed as a subject in its own right it's like "baby category theory" - the study of a very particular class of posets.

David Egolf (Nov 20 2024 at 18:42):

The next goal is to show that pulling back along any continuous function

f:X \to Y

in

\mathsf{Top}

extends to a functor

f^*:\mathsf{Top}/Y \to \mathsf{Top}/X

I am surprised that this is true, and curious as to whether it is the special case of a more general situation. Apparently it is! The nLab notes that in any category

C

with pullbacks a morphism

f:X \to Y

induces a pullback functor

f^*:C/Y \to C/X

, which is a sort of "base change".

Kevin Carlson (Nov 20 2024 at 18:49):

David Egolf (Nov 20 2024 at 18:58):

I guess, on first impression, it strikes me as an impressive coincidence that taking pullbacks defines not only one but two functors! (The functor I was previously aware of is the one that maps an appropriately shaped diagram to its pullback.)

I now wonder if other "take the limit" functors have additional functors associated to them in a similar way, or if the "take the pullback" functor is special in this regard. I suppose we should at least expect the "take the pushout" functor to have a corresponding "pushforward" functor.

David Egolf (Nov 20 2024 at 19:22):

Upon contemplating this diagram, I had an idea for how

f^*

should act on morphisms:
diagram

David Egolf (Nov 20 2024 at 19:23):

I think we can define

f^*

on an arbitrary morphism of bundles over

Y

, say

g:A \to B

, by defining

f^*g:X \times_Y A \to X \times_Y B

by

(x,a) \mapsto (x,g(a))

. Notice that, referencing our diagram, we have

f^*b \circ f^*g = f^*a

, because

f^*b

and

f^*a

just grab the first coordinate, which is unchanged by

f^*g

The identity morphism

1_A:A \to A

gets mapped to

(x,a) \mapsto (x,1_A(a)) = (x,a)

, which is the identity morphism between the pulled back bundles involved.

f^*

respects composition because

f^*(h) \circ f^*(g):(x,a) \mapsto (x,g(a)) \mapsto (x,h(g(a))) = (x, (h \circ g)(a)) = f^*(h \circ g)(x,a)
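Here is a minimal sketch of the same check in finite sets, ignoring topology entirely. Bundles are modelled as Python dicts sending each point of the total space to its base point; the names `pullback` and `pullback_map` and all the toy data are assumptions made up for illustration, not anything from the blog post.

```python
# Minimal sketch in finite sets (no topology): a bundle over Y is a dict
# mapping each element of the total space to its base point in Y.

def pullback(f, a):
    """Pull the bundle a: A -> Y back along f: X -> Y.

    Returns the set X x_Y A = {(x, s) : f(x) == a(s)}.
    Its projection to X is just (x, s) -> x.
    """
    return {(x, s) for x in f for s in a if f[x] == a[s]}

def pullback_map(f, a, b, g):
    """Action of f^* on a bundle morphism g: A -> B over Y."""
    # g is a morphism over Y exactly when b . g == a.
    assert all(b[g[s]] == a[s] for s in a), "g must commute with the projections"
    return {(x, s): (x, g[s]) for (x, s) in pullback(f, a)}

# Hypothetical toy data.
f = {"x0": "y0", "x1": "y0", "x2": "y1"}   # f: X -> Y
a = {"a0": "y0", "a1": "y1"}               # bundle a: A -> Y
b = {"b0": "y0", "b1": "y1", "b2": "y1"}   # bundle b: B -> Y
c = {"c0": "y0", "c1": "y1"}               # bundle c: C -> Y
g = {"a0": "b0", "a1": "b2"}               # g: A -> B over Y
h = {"b0": "c0", "b1": "c1", "b2": "c1"}   # h: B -> C over Y

fg = pullback_map(f, a, b, g)              # f^*(g)
fh = pullback_map(f, b, c, h)              # f^*(h)
hg = {s: h[g[s]] for s in a}               # h . g
f_hg = pullback_map(f, a, c, hg)           # f^*(h . g)

composite = {p: fh[fg[p]] for p in fg}     # f^*(h) . f^*(g)
identity = pullback_map(f, a, a, {s: s for s in a})
```

Comparing `composite` with `f_hg`, and `identity` with the identity on `pullback(f, a)`, mirrors the two functor laws checked above.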

David Egolf (Nov 20 2024 at 19:42):

Assuming this is correct (hopefully it is!), we now have our desired functor

f^*:\mathsf{Top}/Y \to \mathsf{Top}/X

induced by a continuous function

f:X \to Y

. This also gives us a functor from presheaves on

Y

to presheaves on

X

, and a functor from sheaves on

Y

to sheaves on

X

John Baez (Nov 20 2024 at 19:52):

Nice diagram! Contemplating this diagram, I immediately want to define

f^\ast g

using the universal property of the pullback

X \times_Y B

. Let's see: we've got morphisms

X \times_Y A \to X

and

X\times_Y A \to B

visible in the diagram, and they obey the necessary commutative square condition to make

X \times_Y A

into a 'competitor' of

X \times_Y B

, so there exists a unique map

X \times_Y A \to X \times_Y B

such that yada yada....

So yes, that works, but it should agree with your 'concrete' description of

f^\ast g

John Baez (Nov 20 2024 at 19:54):

There is some advantage to avoiding the 'concrete' description of

f^\ast g

in terms of ordered pairs, because this fact - that in any category

C

with pullbacks a morphism

f:X \to Y

induces a pullback functor

f^*:C/Y \to C/X

- holds even in contexts where pullbacks have nothing to do with ordered pairs.

John Baez (Nov 20 2024 at 19:58):

By the way, this fact is important all over the place, and so is the fact that in many contexts, like any topos, the pullback functor

f^\ast

has both a left and a right adjoint. We may even run into those in our course someday.

David Egolf (Nov 20 2024 at 20:22):

That makes sense. Let me see if I can understand how you used the universal property of pullbacks. A pullback cone is final among all the cones over the diagram involved. So if we can set up

X \times_Y A

to be the apex of a cone over the appropriate diagram, the universal property will guarantee a unique morphism of cones exists, which involves a morphism

:X \times_Y A \to X \times_Y B

David Egolf (Nov 20 2024 at 20:22):

For this to really be a cone, we need

f \circ f^*a = b \circ (g \circ p_A)

. We have

b \circ (g \circ p_A) = (b \circ g) \circ p_A = a \circ p_A = f \circ f^*a

as desired!

David Egolf (Nov 20 2024 at 20:23):

So then we are set up to use the universal property of pullbacks to find our morphism of interest

:X \times_Y A \to X \times_Y B

. Great!

John Baez (Nov 20 2024 at 20:37):

Good! Whenever you need to map something to a pullback, like

X \times_Y B

, you should feel a Pavlovian instinct to find maps from that something to

X

and to

B

David Egolf (Nov 21 2024 at 18:20):

I think we've now done all the puzzles/exercises from Part 4. So it's on to Part 5!

In this part, we're going to talk about why the category of presheaves on a given topological space forms an elementary topos. We'll work in a more general setting: apparently the category of presheaves on any category forms an elementary topos!

David Egolf (Nov 21 2024 at 18:23):

Let

C

be a category, so that the category of presheaves on

C

is the functor category

[C^{\mathrm{op}}, \mathsf{Set}]

. In this category, each object is a functor

:C^{\mathrm{op}} \to \mathsf{Set}

and the morphisms are natural transformations. So an object of this category attaches a set to each object of

C

David Egolf (Nov 21 2024 at 18:24):

For

[C^{\mathrm{op}}, \mathsf{Set}]

to be an elementary topos it needs to have, among other things, finite colimits. I thought it could be a good challenge to try to show that

[C^{\mathrm{op}}, \mathsf{Set}]

has finite colimits, before reading the part of the blog post that discusses this.

David Egolf (Nov 21 2024 at 18:30):

Roughly speaking, I think the intuition is that this category of presheaves inherits colimits from

\mathsf{Set}

in a way analogous to how a set of functions

:A \to \mathbb{R}

inherits a notion of addition "pointwise" from

\mathbb{R}

. For example, if

F,G:C^{\mathrm{op}} \to \mathsf{Set}

, then I expect that their coproduct

F+G

satisfies

(F+G)(c) \cong F(c) +G(c)

for each

c \in C
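A tiny sketch of this pointwise intuition, using finite sets in Python. The presheaves, the category (a single non-identity arrow from d to c), and the helper names `coproduct` and `coproduct_map` are all hypothetical and only meant to illustrate the idea.

```python
def coproduct(A, B):
    """Disjoint union of two sets, with tags 0 and 1 keeping the summands apart."""
    return {(0, x) for x in A} | {(1, y) for y in B}

def coproduct_map(m, n):
    """The map m + n between disjoint unions, acting summand by summand."""
    out = {(0, x): (0, y) for x, y in m.items()}
    out.update({(1, x): (1, y) for x, y in n.items()})
    return out

# Hypothetical presheaves on a category with one non-identity arrow d -> c,
# so each presheaf has one restriction map from its value at c to its value at d.
F = {"c": {1, 2}, "d": {1}}
G = {"c": {2}, "d": {1, 3}}
F_res = {1: 1, 2: 1}   # F(c) -> F(d)
G_res = {2: 3}         # G(c) -> G(d)

# (F + G)(obj) is computed object by object, and so is its restriction map.
F_plus_G = {obj: coproduct(F[obj], G[obj]) for obj in F}
res = coproduct_map(F_res, G_res)   # (F + G)(c) -> (F + G)(d)
```

Note that the element 2 appears in both F(c) and G(c), and the tags keep the two copies distinct in the sum.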

David Egolf (Nov 21 2024 at 18:38):

Let's consider the general case. Let

J

be a small category, and let

D:J \to [C^{\mathrm{op}}, \mathsf{Set}]

be a

J

-shaped diagram in

[C^{\mathrm{op}}, \mathsf{Set}]

. This is a bunch of presheaves related by natural transformations, potentially required to satisfy certain equations. I'll call the colimit of this diagram (should it exist) by the name

\mathrm{colim}(D):C^{\mathrm{op}} \to \mathsf{Set}

David Egolf (Nov 21 2024 at 18:40):

For any object

c \in C

we need to determine a set

\mathrm{colim}(D)(c)

. Intuitively, we can do this by grabbing the part of our diagram

D

concerned with

c

. This gives a diagram in

\mathsf{Set}

, and then we can take the colimit of that diagram to get

\mathrm{colim}(D)(c)

David Egolf (Nov 21 2024 at 18:45):

Starting from

D

, how can we get a diagram in

\mathsf{Set}

associated to the object

c

of

C

? Intuitively, we can do it like this:

David Egolf (Nov 21 2024 at 18:47):

I'd like to express this process as a functor from

C^\mathrm{op}

to the category of

J

-shaped diagrams in

\mathsf{Set}

, which I'll call

[J, \mathsf{Set}]

. So, we're looking for some functor

F:C^\mathrm{op} \to [J, \mathsf{Set}]

Peva Blanchard (Nov 21 2024 at 18:53):

Yes, I also find it helpful to think of

D

as a functor

C^{op} \times J \to \text{Set}

. I.e., I have a set

D(c,j)

which is contravariant in

c

and covariant in

j

David Egolf (Nov 21 2024 at 18:59):

That does sound helpful! If I understand correctly, you are using this adjunction:

\mathsf{Cat}(A,[B,C]) \cong \mathsf{Cat}(A \times B, C)

. In our case, this becomes:

\mathsf{Cat}(C^{\mathrm{op}},[J,\mathsf{Set}]) \cong \mathsf{Cat}(C^{\mathrm{op}} \times J, \mathsf{Set})

So, a functor

F:C^{\mathrm{op}} \to [J, \mathsf{Set}]

is associated by this adjunction to some unique

F':C^{\mathrm{op}} \times J \to \mathsf{Set}

Or, working with

D:J \to [C^{\mathrm{op}}, \mathsf{Set}]

we can similarly get a corresponding functor

D':J \times C^{\mathrm{op}} \to \mathsf{Set}

Peva Blanchard (Nov 21 2024 at 19:02):

Yes, also, the analogy with linear algebra is interesting.
Let's momentarily think of

J

and

C^{op}

as finite sets.

We also have a function

X \to \mathbb{R}^X

that sends an element

x

to the "vector" that is 1 on

x

and 0 everywhere else. And this function looks like the Yoneda embedding

C \to \text{Set}^{C^{op}}

(one difference is that there is no action of arrows, so

\_^{op}

means nothing here)

David Egolf (Nov 21 2024 at 19:30):

I think we now have the tools in place to find our functor

F:C^{\mathrm{op}} \to [J, \mathsf{Set}]

, starting with

D:J \to [C^{\mathrm{op}},\mathsf{Set}]

. Moving

D

across the adjunction mentioned above, we get

D':J \times C^{\mathrm{op}} \to \mathsf{Set}

. Precomposing with the isomorphism

s:C^{\mathrm{op}} \times J \to J \times C^{\mathrm{op}}

, we get

D' \circ s:C^{\mathrm{op}} \times J \to \mathsf{Set}

. Moving this across the adjunction discussed above, we get

(D' \circ s)':C^{\mathrm{op}} \to [J, \mathsf{Set}]

, which I suspect is the

F

I was looking for.
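On objects only, this "uncurry, swap, curry" recipe can be sketched with nested dicts in Python. This is just an illustration under the assumption that J and C are finite; the helper names and the toy diagram are made up, and the action on morphisms is ignored.

```python
def uncurry(D):
    """Turn D: j -> (c -> set) into D': (j, c) -> set."""
    return {(j, c): D[j][c] for j in D for c in D[j]}

def swap(Dp):
    """Precompose with the swap isomorphism: (c, j) -> D'(j, c)."""
    return {(c, j): value for (j, c), value in Dp.items()}

def curry(E):
    """Turn E: (a, b) -> set back into a -> (b -> set)."""
    out = {}
    for (a, b), value in E.items():
        out.setdefault(a, {})[b] = value
    return out

# Hypothetical diagram: J has objects 0 and 1, C has objects "u" and "v".
D = {
    0: {"u": {1, 2}, "v": {1}},
    1: {"u": {7}, "v": {7, 8}},
}

# F(c)(j) = D(j)(c): the diagram of sets obtained by evaluating D at c.
F = curry(swap(uncurry(D)))
```

So `F["u"]` is the J-shaped family of sets obtained by evaluating every presheaf in the diagram at "u".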

David Egolf (Nov 21 2024 at 19:33):

Now, I seem to recall that there is a "take the colimit" functor

T:[J, \mathsf{Set}] \to \mathsf{Set}

. Assuming this is the case, we can form

T \circ (D' \circ s)':C^{\mathrm{op}} \to \mathsf{Set}

. I suspect that this is the (object part of the) colimit of our diagram in

[C^{\mathrm{op}},\mathsf{Set}]

that we started with.

David Egolf (Nov 21 2024 at 19:45):

Returning to the blog post, we have a related puzzle. Changing its notation to match what I'm using here, we have:

Here

D'

is the functor

D':J \times C^{\mathrm{op}} \to \mathsf{Set}

corresponding to

D:J \to [C^{\mathrm{op}}, \mathsf{Set}]

. Also,

D'(-,c)

refers to the functor

D'(-,c):J \to \mathsf{Set}

The "resulting functor" on objects I believe acts like

c \mapsto \mathrm{colim}D'(-,c)

David Egolf (Nov 21 2024 at 19:47):

John Baez (Nov 21 2024 at 21:06):

That sounds right if

J

is a small category (which is what you typically assume for a category being used as a "diagram shape".)

David Egolf (Nov 21 2024 at 22:47):

On a bit of a side note, there's a weird thing about

T

. Namely, it seems to require making a choice of colimit for each diagram, even when we have multiple isomorphic options available. This makes me wonder if there could be a nicer way to think about

T

or something similar to

T

John Baez (Nov 22 2024 at 00:19):

The same issue happens already when we write down "the" product functor

C \times C \to C

when

C

is a category with products. One solution is to use an [[anafunctor]], which maps an object not to an object but to the universal property of an object. Another, I believe, is to switch to homotopy type theory.

Josselin Poiret (Nov 22 2024 at 10:34):

yet another solution is to consider that "having colimits" is actually not property but structure, and that such categories should be equipped with a specific colimit-producing functor. This is the same approach as split vs. non-split Grothendieck fibrations: one shouldn't throw away structure by squashing it

Morgan Rogers (he/him) (Nov 22 2024 at 14:35):

You can come up with variants where this works to an extent, but rarely will you encounter any instances as wide-ranging as pullbacks!

David Egolf (Nov 24 2024 at 19:16):

To review, the current goal is to show that

\mathrm{colim}D'(-,c):C^{\mathrm{op}} \to \mathsf{Set}

is a functor, where

D':J \times C^{\mathrm{op}} \to \mathsf{Set}

is the functor corresponding to our

J

-shaped diagram of presheaves

D:J \to [C^{\mathrm{op}}, \mathsf{Set}]

, and

D'(-,c):J \to \mathsf{Set}

David Egolf (Nov 24 2024 at 19:19):

I think I'll start by trying to show that

D'(-, c):J \to \mathsf{Set}

really is a functor.

David Egolf (Nov 24 2024 at 19:20):

David Egolf (Nov 24 2024 at 19:22):

Then we can construct a functor

(1_J,\Delta_c):J \to J \times C^{\mathrm{op}}

using the fact that

\mathsf{Cat}

has products. This functor acts on objects by

j \mapsto (j,c)

David Egolf (Nov 24 2024 at 19:22):

We notice that

D'(-,c):J \to \mathsf{Set}

is the same thing as

D' \circ (1_J, \Delta_c):J \to J \times C^{\mathrm{op}} \to\mathsf{Set}

, and is therefore a functor.

David Egolf (Nov 24 2024 at 19:31):

We have obtained a diagram in

\mathsf{Set}

, which is the same shape as the diagram we started out with in

[C^{\mathrm{op}}, \mathsf{Set}]

. The

j

-th set in our diagram is

D'(j,c)

, which is what? I think

D'(j,c) = D(j)(c)

. Here

D(j)

is the

j

-th presheaf in our original diagram, and

D(j)(c)

is obtained by evaluating that

j

-th presheaf at

c \in C

So, our diagram in

\mathsf{Set}

has this as its

j

-th set: the set attached by the

j

-th presheaf to

c \in C

. Intuitively, this diagram is obtained by evaluating our original diagram of presheaves at

c

David Egolf (Nov 24 2024 at 19:35):

What is

\mathrm{colim}D'(-,c)

? This is the colimit of the diagram discussed above. So, it is obtained by evaluating each presheaf in our original diagram

D

at

c

to get a diagram of sets, and then taking the colimit of that resulting diagram in

\mathsf{Set}

David Egolf (Nov 24 2024 at 19:36):

Given all this context, we want to show that

c \mapsto \mathrm{colim} D'(-,c)

defines a functor

G:C^{\mathrm{op}} \to \mathsf{Set}

David Egolf (Nov 24 2024 at 19:53):

We've already said what

G

does on objects: for

c \in C

, it takes in our diagram

D

of presheaves, evaluates it at

c

, and then takes the colimit of the resulting diagram in

\mathsf{Set}

David Egolf (Nov 24 2024 at 19:54):

Let

f:c \to c'

be a morphism in

C^{\mathrm{op}}

. We need to dream up some function from the colimit of

D

when evaluated at

c

, to the colimit of

D

when evaluated at

c'

David Egolf (Nov 24 2024 at 20:01):

I want to use the universal property of a colimit to do this.

G(c)

is the tip of a cone under the diagram

D

when evaluated at

c

, and

G(c')

is the tip of a cone under the diagram

D

when evaluated at

c'

. If we can somehow get a cone under

D

evaluated at

c

with tip

G(c')

, we'll be in business.

David Egolf (Nov 24 2024 at 20:03):

David Egolf (Nov 24 2024 at 20:05):

Each

P_i

is a presheaf in our diagram

D

. We have a morphism

P_i(f):P_i(c) \to P_i(c')

for each

i

. I am hoping that we can compose these

P_i(f)

with the morphisms in the cone under

D

evaluated at

c'

to get a cone under

D

evaluated at

c

with tip

G(c')

David Egolf (Nov 24 2024 at 20:09):

To do this, I think it suffices to show that a morphism

f:c \to c'

in

C^{\mathrm{op}}

induces a morphism of diagrams of shape

J

in

\mathsf{Set}

. Then a cone under a diagram of shape

J

is also a morphism of diagrams of shape

J

, and thus the composition of these two morphisms is as well.

David Egolf (Nov 24 2024 at 20:11):

Basically, we want to show that there is a functor

:C^{\mathrm{op}} \to [J, \mathsf{Set}]

that acts on objects by sending

c \in C

to a

J

-shaped diagram in

\mathsf{Set}

given by evaluating

D

at

c

David Egolf (Nov 24 2024 at 20:14):

We already have

D':J \times C^{op} \to \mathsf{Set}

. We saw above that we can use an adjunction and the "swapping" isomorphism between

C^{\mathrm{op}} \times J

and

J \times C^{\mathrm{op}}

to get such a functor. So we have some functor

H:C^{\mathrm{op}} \to [J, \mathsf{Set}]

. Thus we are assured that any morphism in

C^{\mathrm{op}}

induces a morphism of certain

J

-shaped diagrams in

\mathsf{Set}

David Egolf (Nov 24 2024 at 20:20):

I just want to double check that

H

sends

c \in C

to our diagram

D

evaluated at

c

. Referencing the adjunction above, we have

H(c)(j)=D'(j,c) = D(j)(c)

. So, the

j

-position in our diagram

H(c)

is indeed given by evaluating our original diagram at location

j

and then at

c

. Thus,

H

indeed sends an object

c

to the diagram

D

evaluated at

c

David Egolf (Nov 24 2024 at 20:23):

Now we are in business! We can now say what the functor

c \mapsto \mathrm{colim}D'(-,c)

does on morphisms, recalling that it sends an object

c

to the colimit of our diagram

D

evaluated at

c

. Given

f:c \to c'

in

C^{\mathrm{op}}

, we get a morphism of

J

-shaped diagrams, namely

H(f):D(c) \to D(c')

, where

D(c)

refers to our diagram

D

evaluated at

c

and

D(c')

is defined similarly.

David Egolf (Nov 24 2024 at 20:25):

Then, a colimit

G(c)

has an associated cone

u_c:D(c) \to G(c)

, and a colimit

G(c')

has an associated cone

u_{c'}:D(c') \to G(c')

. Then we can form a cone

u_{c'} \circ H(f): D(c) \to D(c') \to G(c')

Then we can use the universal property of colimits to obtain a function from

G(c)

to

G(c')
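For finite diagrams of finite sets, this construction can be sketched concretely in Python. The colimit is computed as a quotient of the disjoint union (via a small union-find), and the induced function is read off from the equation G(f) ∘ u_c = u_{c'} ∘ H(f). Everything here (the helper names, the span-shaped J, the toy sets) is an assumption chosen for illustration.

```python
def colimit(sets, maps):
    """Colimit in Set of a finite diagram.

    sets: dict j -> set; maps: dict (i, j) -> dict from sets[i] to sets[j].
    Returns (classes, inj), where inj[j][x] is the class of (j, x),
    i.e. inj plays the role of the colimit cone u.
    """
    parent = {(j, x): (j, x) for j in sets for x in sets[j]}

    def find(p):
        while parent[p] != p:
            parent[p] = parent[parent[p]]   # path halving
            p = parent[p]
        return p

    # Glue (i, x) to (j, m(x)) for every arrow m: i -> j in the diagram.
    for (i, j), m in maps.items():
        for x in sets[i]:
            ra, rb = find((i, x)), find((j, m[x]))
            if ra != rb:
                parent[ra] = rb

    inj = {j: {x: find((j, x)) for x in sets[j]} for j in sets}
    classes = {r for j in inj for r in inj[j].values()}
    return classes, inj

def induced(u_c, u_c2, Hf):
    """G(f): colim D(c) -> colim D(c'), determined by G(f) . u_c = u_c' . H(f)."""
    Gf = {}
    for j in u_c:
        for x, cls in u_c[j].items():
            target = u_c2[j][Hf[j][x]]
            # Well-definedness relies on H(f) being natural in j.
            assert Gf.setdefault(cls, target) == target, "H(f) must be natural"
    return Gf

# Hypothetical data: J is the span 1 <- 0 -> 2, so colimits are pushouts.
D_c = {0: {"p"}, 1: {"p", "q"}, 2: {"p", "r"}}
maps_c = {(0, 1): {"p": "p"}, (0, 2): {"p": "p"}}
D_c2 = {0: {"*"}, 1: {"*"}, 2: {"*"}}
maps_c2 = {(0, 1): {"*": "*"}, (0, 2): {"*": "*"}}
Hf = {j: {x: "*" for x in D_c[j]} for j in D_c}   # H(f): D(c) -> D(c')

G_c, u_c = colimit(D_c, maps_c)
G_c2, u_c2 = colimit(D_c2, maps_c2)
Gf = induced(u_c, u_c2, Hf)
```

The `setdefault` assertion is where the universal property shows up: the value of `Gf` on a class must not depend on which representative is used.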

David Egolf (Nov 24 2024 at 20:28):

David Egolf (Nov 24 2024 at 20:32):

It remains to check that this

G

really is a functor, and that it is the colimit of our diagram

D

of presheaves.

David Egolf (Nov 24 2024 at 20:36):

In diagram form, this is our situation, where we are working in the category of

J

-shaped diagrams in

\mathsf{Set}

:
diagram

David Egolf (Nov 24 2024 at 20:39):

We want to show that

G(g \circ f) = G(g) \circ G(f)

. By definition,

G(g \circ f)

is the unique morphism that makes the outermost path in this diagram commute:
diagram

David Egolf (Nov 24 2024 at 20:41):

Since both of the inner rectangles commute, we can paste them together to get a larger commuting rectangle:

G(g) \circ G(f) \circ u_c = u_{c''} \circ H(g) \circ H(f)

. Since

H

is a functor, this implies that

(G(g) \circ G(f)) \circ u_c = u_{c''} \circ H(g \circ f)

. That is,

G(g) \circ G(f)

satisfies the condition that uniquely determines

G(g \circ f)

, so we must have

G(g \circ f) = G(g) \circ G(f)

David Egolf (Nov 24 2024 at 20:43):

The identity morphism

1_c:c \to c

induces the identity morphism from the diagram

D(c)

to itself, and consequently

G(1_c) = 1_{\mathrm{colim} D(c)}= 1_{G(c)}

David Egolf (Nov 24 2024 at 20:45):

Thus, we conclude that

G:C^{\mathrm{op}} \to \mathsf{Set}

is indeed a functor. It remains to show that

G

is really the colimit of our

J

-shaped diagram of presheaves,

D:J \to [C^\mathrm{op}, \mathsf{Set}]

David Egolf (Nov 24 2024 at 20:47):

To show that

G

is really the colimit, we can aim to show it satisfies the appropriate universal property. To do that, we first need to think about how we get a colimit cone under

D

with tip

G

David Egolf (Nov 24 2024 at 20:50):

The first idea that comes to mind is as follows. To get a natural transformation

\lambda:P \to G

, where

P

is some presheaf in our diagram

D

, we can try setting

\lambda

by specifying each component. We can try setting

\lambda_c:P(c) \to G(c)

using the corresponding part of the colimit cone (in

\mathsf{Set}

) under

D(c)

with tip

G(c)

David Egolf (Nov 24 2024 at 20:52):

If we set

\lambda_c:P(c) \to G(c)

for each

c \in C

in this way, do we really get a natural transformation

\lambda:P \to G

David Egolf (Nov 24 2024 at 21:15):

We want to show that this square commutes for any morphism

f:c \to c'

in

C^{\mathrm{op}}

:
naturality square

David Egolf (Nov 24 2024 at 21:18):

I am guessing that this is part of the big diagram we saw earlier. If we can figure out how, exactly, then the commutativity of our earlier diagram should imply the commutativity of this one.

David Egolf (Nov 24 2024 at 21:22):

David Egolf (Nov 24 2024 at 21:24):

The left diagram is in

\mathsf{Set}

, and expresses the (hopeful) naturality of

\lambda:P\to G

. The right diagram is in

[J, \mathsf{Set}]

and uses the fact that

G(c)

is a colimit of the diagram

D(c)

to induce a morphism from

G(c)

to

G(c')

David Egolf (Nov 24 2024 at 21:26):

We can think of the diagram on the right as a collection of (related) diagrams. For each

j\in J

we get a diagram in

\mathsf{Set}

by evaluating each functor at

j

Let's assume that

P

is in our diagram

D

at location

j

. So

D(j) = P

. Then we can form a new diagram from our diagram in

[J, \mathsf{Set}]

by precomposing with

j:1 \to J

. This is the functor from the category with a single object and morphism that sends the single object to

j \in J

David Egolf (Nov 24 2024 at 21:29):

Our new diagram replaces

D(c)

with

D(c)_j = P(c)

, and similarly

D(c')

with

D(c')_j = P(c')

. It also sends

u_c

to its

j

-th component

:P(c)=D(c)_j \to G(c)

, which is

\lambda_c

. Similarly, it sends

u_{c'}

to

\lambda_{c'}

David Egolf (Nov 24 2024 at 21:32):

In forming this new diagram, we also replace

H(f)

with its

j

-th component

:D(c)_j \to D(c')_j

, which goes from

P(c)

to

P(c')

. So, if

H(f)_j = P(f)

the new diagram we form from the one on the right is just the diagram on the left.

David Egolf (Nov 24 2024 at 21:42):

Intuitively,

H:C^{\mathrm{op}} \to [J,\mathsf{Set}]

takes in an object

c

of

C

and then creates a diagram in

\mathsf{Set}

by evaluating all our presheaves in

D

at

c

. So we get a bunch of diagrams, one associated to each object of

C

. Each diagram is a functor, and

H

maps each morphism to a natural transformation. We must have

H(f):H(c) \to H(c')

, so that

H(f)

is a natural transformation from

D(c)

to

D(c')

induced by

f

David Egolf (Nov 24 2024 at 21:48):

I could just assume that this works out, but I would prefer to prove that

H

acts in the way that I want. To figure out what

H

does on morphisms, we can first figure out what

D'

does on morphisms.

John Baez (Nov 25 2024 at 01:26):

I'm feeling a bit too busy to check what you just wrote, David, especially since I bet it's all fine. (If you're worried about something please say so!) Instead I want to add a remark to an earlier conversation:

I forgot to mention one way people usually deal with this. They show that if you choose colimits for every

J

-shaped diagram and get a functor

T

, and I do it some other way and get a functor

T'

, then there's a natural isomorphism between

T

and

T'

This reassures us that it doesn't matter which choice we make. At least, it doesn't matter if we refrain from doing anything 'evil' - i.e., something that works for one functor but not for some other naturally isomorphic functor!

This is a nice concrete example of why it's good to avoid 'evil'. It's not just a matter of esthetics. It means we can choose a take-the-colimit functor

T

without having to decide which one.

David Egolf (Nov 25 2024 at 01:53):

I think I'm on the right track, it's just taking me a while to get to my destination. But so far so good, I think! (In general, when I write something long like this I don't really expect anyone else to read it all. Despite that, I still like to document the learning process in case it could be useful to someone!)

That's reassuring! Although there are a bunch of different "take the colimit" functors in this context, they are all basically the same, so it doesn't really matter which one we pick.

I suppose in general we can avoid "evil" in this sense by only identifying an object we're working with up to isomorphism. This "blurriness" then would stop us from using any of the particular features of any specific choice, which might not be invariant across isomorphic alternatives. Although maybe this is often inconvenient!

John Baez (Nov 25 2024 at 03:14):

It's often inconvenient to only know the isomorphism class of an object: it's like holding a slippery pig that's twisting around so wildly that you can't point to any specific feature. But it's perfectly fine if you know an object up to a specified isomorphism, which is precisely what happens when the object is defined by a limit or colimit, or any other universal property. In this case, if you make a choice of this object, say

X

, and I make a choice, say

X'

, they are not merely isomorphic, we both get access to a specific isomorphism

\alpha \colon X \to X'

. This allows us to transfer any structure you may have on

X

to my

X'

, and vice versa.

Peva Blanchard (Nov 25 2024 at 20:09):

I'm not sure I understand precisely this point. I'll try to spell it out in the case of the representability of a presheaf

F

on a category

C

Choosing a representative of

F

amounts to choosing an object

X

and a natural isomorphism

\alpha : F \Rightarrow C(\_, X)

. Now, assume we have two such choices

(X, \alpha)

and

(X', \alpha')

. Then, we have an explicit natural isomorphism

\alpha' \alpha^{-1} : C(\_, X) \Rightarrow C(\_, X')

By the Yoneda lemma, this yields a specific isomorphism

r[\alpha' \alpha^{-1}] : X \to X'

Is that the kind of examples you have in mind when you mention "having access to a specific isomorphism"?

John Baez (Nov 26 2024 at 01:45):

Yes, exactly. And more generally, whenever anything is defined by a universal property like a limit, or colimit, or tensor product of modules, etc., we say X is a universal object with structure S if for any other X' with structure S, there exists a unique morphism from X to X' (or the other way around) making some diagrams commute. Then if both X and X' are universal objects with structure S, there are unique morphisms

\alpha \colon X \to X', \qquad \beta \colon X' \to X

making those diagrams commute, and we can use the uniqueness clause in a clever but completely standard way to show

\alpha

and

\beta

are inverses!
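Spelled out, the standard argument might go like this. It is a sketch only, writing α for the unique structure-respecting morphism from X to X' and β for the unique one from X' to X:

```latex
\begin{aligned}
&\beta \circ \alpha \colon X \to X \ \text{respects the structure } S,
  \ \text{and so does } 1_X \colon X \to X, \\
&\text{so the uniqueness clause for } X \text{ forces } \beta \circ \alpha = 1_X. \\
&\text{Symmetrically, the uniqueness clause for } X' \text{ forces } \alpha \circ \beta = 1_{X'}.
\end{aligned}
```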

John Baez (Nov 26 2024 at 01:47):

Thus, not only are

X

and

X'

isomorphic - which is enough to transfer any property from

X

to

X'

, or vice versa - but we also get a specific isomorphism between them, which lets us transfer any structure from

X

to

X'

or vice versa.

John Baez (Nov 26 2024 at 01:48):

For example, "having 7 elements" is a property, so if

X

is a set with 7 elements and

X'

is isomorphic to

X

then we know

X'

has 7 elements.

But "being a group" is a structure, so if the set

X

is made into a group in some particular way and we only know

X'

is isomorphic to

X

, we don't have enough to make

X'

into a group in some particular way. A specific isomorphism

\alpha \colon X \stackrel{\sim}{\longrightarrow} X'

would be enough.
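A small Python sketch of this point: transporting a group operation along a bijection depends on which bijection is used. The helper `transport` and the two bijections are hypothetical, chosen only to make the difference visible.

```python
def transport(op, alpha):
    """Transfer a binary operation on X to X' along a bijection alpha: X -> X'."""
    inverse = {v: k for k, v in alpha.items()}
    assert len(inverse) == len(alpha), "alpha must be injective"
    return lambda u, v: alpha[op(inverse[u], inverse[v])]

def add_mod3(x, y):
    """The group structure of Z/3 on X = {0, 1, 2}."""
    return (x + y) % 3

# Two different specified isomorphisms from X to X' = {"a", "b", "c"}.
alpha = {0: "a", 1: "b", 2: "c"}
beta = {0: "b", 1: "a", 2: "c"}

op_alpha = transport(add_mod3, alpha)   # identity element is "a"
op_beta = transport(add_mod3, beta)     # identity element is "b"
```

Both results make X' into a group isomorphic to Z/3, but they are different group structures on the same set, which is why merely knowing that some isomorphism exists does not pick one out.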

David Egolf (Nov 26 2024 at 03:37):

That's pretty interesting! It is interesting how the above argument relates these two things: being "extremal and concisely so" (attempting to convey some of the intuition associated with being a universal object) with respect to having certain structure, and being related by isomorphism. It makes me wonder if we could relax the "extremal and concisely so" condition and get a notion of morphism that is weaker than isomorphism but still indicates some measure of similarity is present.

David Egolf (Nov 26 2024 at 03:39):

For example, we could call

X

a "weak universal object with structure

S

" if for any other

X'

with structure

S

there exists at least one morphism from

X

to

X'

making some diagrams commute. Then perhaps we would obtain some notion of induced morphism between weakly universal objects with structure

S

, which indicates some degree of similarity?

John Baez (Nov 26 2024 at 03:43):

People do talk about weak limits and especially weak pullbacks, which have the weakened sort of universal property you mention. I've never used them. But I sometimes joke about one object being merely "morphic" to another, as opposed to isomorphic.

Alex Kreitzberg (Nov 26 2024 at 04:30):

Your argument for transferring structure just needs an isomorphism, not a unique isomorphism, right? Is there some other feature of limits that is preserved because it's a unique isomorphism?

My intuition is that there's a contractible groupoid of objects described by the limit, making the limit "unique", so "everything" gets translated even "stuff", because there's only one thing up to equivalence.

But the way you explained the above is making me wonder if that's not quite right, and whether I'm confused about some detail.

John Baez (Nov 26 2024 at 06:37):

It needs to be a specified isomorphism: that is, an isomorphism you actually know.

The existence and uniqueness clauses in the definition of any universal property guarantee that whenever we have two objects

X

and

X'

with the same universal property, we get a specified isomorphism between them. Without the existence clause we might not have any isomorphism at all; without uniqueness there wouldn't be a particular one.

There are other ways to specify isomorphisms between objects, but a universal property quickly and efficiently specifies an isomorphism between any two objects with that property!

John Baez (Nov 26 2024 at 06:43):

Let me think about that! That's an interesting point. My intuition was that since a functor

U: \mathsf{C} \to \mathsf{D}

that "forgets stuff" is not faithful, even if

\mathsf{D}

is a contractible groupoid,

\mathsf{C}

may not be.

Morgan Rogers (he/him) (Nov 26 2024 at 10:09):

The groupoid need not be contractible! There can be multiple isomorphisms between the objects, that's why we need to specify one.

Amar Hadzihasanovic (Nov 26 2024 at 13:17):

I guess that the precise statement would be that the category of limit cones over a diagram

F: J \to \mathsf{C}

is a contractible groupoid, and its objects are "labelled in objects of

\mathsf{C}

" via the functor that sends a cone to its tip. So perhaps "a contractible groupoid labelled in objects" rather than "contractible groupoid of objects" is the accurate rephrasing?

Morgan Rogers (he/him) (Nov 26 2024 at 14:01):

Yes, I wasn't being too deliberately pedantic, just underlining the point John was making about how much information you need about isomorphisms to transfer properties vs structure.

David Egolf (Nov 26 2024 at 19:08):

Returning to the topos theory blog posts, my current goal is as follows. We have an adjunction

- \times C^{\mathrm{op}} \dashv [C^{\mathrm{op}},-]

of functors

:\mathsf{Cat} \to \mathsf{Cat}

. In particular, this implies we have

\mathsf{Cat}(J \times C^{\mathrm{op}}, \mathsf{Set}) \cong \mathsf{Cat}(J, [C^{\mathrm{op}}, \mathsf{Set}])

This means that given a functor

D:J \to [C^{\mathrm{op}}, \mathsf{Set}]

there is a corresponding functor

D':J \times C^{\mathrm{op}} \to \mathsf{Set}

. I want to figure out how

D'

acts!

David Egolf (Nov 26 2024 at 19:18):

This feels like an important thing to know how to do, but I'm a bit unsure how to get started!

David Egolf (Nov 26 2024 at 19:31):

The idea of using the Yoneda lemma somehow vaguely occurs to me, but I don't quite see how that would help.

David Egolf (Nov 26 2024 at 19:57):

Maybe this is an idea: because we have the adjunction above, in particular we have a natural isomorphism

\mathsf{Cat}(- \times C^{\mathrm{op}}, \mathsf{Set}) \cong \mathsf{Cat}(-, [C^{\mathrm{op}},\mathsf{Set}])

. Expressing this in terms of the opposite category of

\mathsf{Cat}

we have a natural isomorphism

\mathsf{Cat}^{\mathrm{op}}( \mathsf{Set},- \times C^{\mathrm{op}}) \cong \mathsf{Cat}^{\mathrm{op}}([C^{\mathrm{op}},\mathsf{Set}],- )

David Egolf (Nov 26 2024 at 20:01):

The Yoneda lemma then tells us that this natural isomorphism corresponds to an element of

\mathsf{Cat}^{\mathrm{op}}( \mathsf{Set}, [C^{\mathrm{op}}, \mathsf{Set}] \times C^{\mathrm{op}})

, which is an element of

\mathsf{Cat}( [C^{\mathrm{op}}, \mathsf{Set}] \times C^{\mathrm{op}}, \mathsf{Set})

David Egolf (Nov 26 2024 at 20:02):

This special element I bet is going to be an "evaluation" functor, and working out exactly what it is will probably be useful.

John Baez (Nov 26 2024 at 20:02):

Are you familiar with [[cartesian closed categories]]? There are lots of categories where the functor "taking the product with the object x" has a right adjoint called [x, -], and Cat is one of those.

David Egolf (Nov 26 2024 at 20:05):

Somewhat! I've been mostly referring to the article [[closed monoidal category]]. As far as I understand, a cartesian closed category is a closed monoidal category where the monoidal product is given by taking the product.

David Egolf (Nov 26 2024 at 20:06):

Scrolling down in the article you linked above, I see a section called "Some basic consequences" which looks like it might be helpful.

John Baez (Nov 26 2024 at 20:07):

Right. I think you can find a formula for D' in terms of D just by writing down the simplest thing that parses and then checking it works - that's the way to solve 90% of problems in category theory. :smirk:

David Egolf (Nov 26 2024 at 20:11):

I figured that I could probably guess how

D'

should work in the way you describe, but I was somehow feeling that I wanted to understand more generally the process of hopping across an adjunction.

I think the article you linked gives an answer that makes me somewhat happy though: In particular, that article notes that we can get from a morphism

\phi:Z \to [X,Y]

to a morphism

\phi':Z \times X \to Y

as follows: we apply the evaluation morphism

:[X,Y] \times X \to Y

after the morphism

\phi \times 1_X:Z \times X \to [X,Y] \times X
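
Written as a single formula, the recipe above reads as follows. This is only a sketch in the notation of this message; the name \mathrm{ev} for the evaluation morphism is an assumption made here for illustration.

```latex
\phi' \;=\; \mathrm{ev} \circ (\phi \times 1_X)
\;:\; Z \times X \xrightarrow{\;\phi \times 1_X\;} [X,Y] \times X \xrightarrow{\;\mathrm{ev}\;} Y
```

In \mathsf{Set}, for instance, this would send a pair (z, x) to \phi(z)(x).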

David Egolf (Nov 26 2024 at 20:13):

So, I can just spell out what the evaluation morphism does, and I should be in business!

I also find it satisfying to note that the evaluation morphism probably comes from applying the Yoneda lemma to the adjunction in question, as I was beginning to investigate above.

David Egolf (Nov 26 2024 at 20:19):

Alright, we want some "evaluation morphism"

e:[X, Y] \times X \to Y

. This should be a functor. On objects, intuitively it will map

(F,x) \mapsto F(x)

. On morphisms, it needs to send a pair

(\alpha, f)

where

\alpha:F \to G

and

f:x \to y

to some morphism from

F(x)

to

G(y)

David Egolf (Nov 26 2024 at 20:20):

Since

\alpha:F \to G

is a natural transformation, we have this commutative square:
naturality square

David Egolf (Nov 26 2024 at 20:21):

So I'm going to guess that

e

sends

(\alpha:F \to G, f:x \to y)

to

G(f) \circ \alpha_x = \alpha_y \circ F(f)

. This morphism goes from

F(x)

to

G(y)

as required.

David Egolf (Nov 26 2024 at 20:29):

e(1_{(F,x)})=e((1_F:F \to F, 1_x:x\to x)) = F(1_x) \circ (1_F)_x = F(1_x)\circ 1_{F(x)} = 1_{F(x)} = 1_{e(F,x)}

, so

e

acts as it should on identity morphisms.

David Egolf (Nov 26 2024 at 20:35):

It remains to show that

e

preserves composition. But I will leave that for next time!

Peva Blanchard (Nov 26 2024 at 21:43):

This is interesting, I wasn't expecting the use of the Yoneda lemma to highlight the evaluation functor.
When exercising with adjunctions, I found it interesting to describe their units and counits.
I don't want to interrupt your flow here, so I'll just write in a "spoiler" box :)

Let's work with $\text{Set}$ , a cartesian closed monoidal category that is simpler, for me, to apprehend than $\text{Cat}$ .

Given any set $Y$ , we have an adjunction $\_ \times Y \dashv [Y, \_]$ , as illustrated by the bijection

$\text{Set}(X \times Y, Z) \cong \text{Set}(X, [Y, Z])$

natural in $X$ , and where $[Y, Z] = \text{Set}(Y, Z)$ is the set of functions from $Y$ to $Z$ .

This bijection works as follows

$\begin{align*} \text{Set}(X \times Y, Z) &\to \text{Set}(X, [Y, Z]) \\ f &\mapsto \Big(x \mapsto \big(y \mapsto f(x,y)\big)\Big) \\ \text{Set}(X, [Y, Z]) &\to \text{Set}(X \times Y, Z) \\ g &\mapsto \Big((x,y) \mapsto g(x)(y) \Big) \end{align*}$

The first map is called currying, while the second is called uncurrying.

The unit of the adjunction is $\eta : Id \Rightarrow [Y, \_ \times Y ]$ given by

$\begin{align*} \eta_X : X &\to [Y, X \times Y] \\ x &\mapsto (y \mapsto (x,y)) \end{align*}$

The (more interesting) counit of the adjunction is $\epsilon : [Y, \_] \times Y \Rightarrow Id$ , given by

$\begin{align*} \epsilon_Z : [Y, Z] \times Y &\to Z \\ (f, y) &\mapsto f(y) \end{align*}$

In other words, the counit is the evaluation.
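
For anyone who likes to see this run, here is a minimal Python sketch of the same bijection for ordinary functions. The names `curry`, `uncurry`, `unit`, `counit` and the sample sets are assumptions chosen for illustration, and finite lists stand in for sets.

```python
from itertools import product

def curry(f):
    """Send f : X x Y -> Z to the function x |-> (y |-> f(x, y))."""
    return lambda x: lambda y: f(x, y)

def uncurry(g):
    """Send g : X -> [Y, Z] to the function (x, y) |-> g(x)(y)."""
    return lambda x, y: g(x)(y)

def unit(x):
    """Unit component eta_X : X -> [Y, X x Y], x |-> (y |-> (x, y))."""
    return lambda y: (x, y)

def counit(f, y):
    """Counit component epsilon_Z : [Y, Z] x Y -> Z, i.e. evaluation."""
    return f(y)

# Hypothetical sample data: two small finite sets and an arbitrary function f.
X, Y = [0, 1, 2], ["a", "b"]
f = lambda x, y: f"{x}{y}"

# Currying followed by uncurrying gives back f on every input pair.
round_trip_ok = all(uncurry(curry(f))(x, y) == f(x, y) for x, y in product(X, Y))

# Uncurrying g amounts to "apply g in the first slot, then evaluate".
g = curry(f)
via_counit_ok = all(counit(g(x), y) == uncurry(g)(x, y) for x, y in product(X, Y))

# One triangle identity: evaluating the unit at y returns the pair (x, y).
triangle_ok = all(counit(unit(x), y) == (x, y) for x, y in product(X, Y))
```

The second check is the Set-level version of the recipe "apply the evaluation morphism after \phi \times 1_X" discussed earlier in the thread.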

David Egolf (Nov 27 2024 at 18:55):

So let us consider a composite morphism

(\alpha',f') \circ (\alpha, f):(F,x) \to (G,y) \to (H,z)

. Here

f:x \to y

,

f':y \to z

,

\alpha:F \to G

and

\alpha':G \to H

. This morphism is

(\alpha' \circ \alpha:F \to H, f' \circ f:x \to z)

, and

e

maps this to:

H(f' \circ f) \circ (\alpha' \circ \alpha)_x = H(f') \circ H(f) \circ (\alpha')_x \circ \alpha_x

David Egolf (Nov 27 2024 at 18:56):

That morphism is the bottom left path from top left to bottom right in the following diagram:
diagram

David Egolf (Nov 27 2024 at 18:57):

Every (small) square in this diagram is a naturality square, so any path from the top left to the bottom right composes to form the same morphism.

David Egolf (Nov 27 2024 at 18:59):

Now we want to compute

e(\alpha',f') \circ e(\alpha, f)

. We have

e(\alpha, f) = G(f) \circ \alpha_x

and

e(\alpha':G \to H, f':y \to z) = H(f') \circ \alpha'_y

So this composite is

e(\alpha',f') \circ e(\alpha, f) = (H(f') \circ \alpha'_y) \circ (G(f) \circ \alpha_x)

This is another path from top left to bottom right in our diagram, so this is equal to

H(f' \circ f) \circ (\alpha' \circ \alpha)_x

. We conclude that

e

does indeed preserve composition!
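
The path-chasing argument can also be written as a chain of equalities. This is a sketch using only the definition of e from this thread and the naturality square of \alpha' at f:

```latex
\begin{aligned}
e(\alpha',f') \circ e(\alpha,f)
  &= H(f') \circ \alpha'_y \circ G(f) \circ \alpha_x \\
  &= H(f') \circ H(f) \circ \alpha'_x \circ \alpha_x
     && \text{(naturality of } \alpha' \text{ at } f\text{)} \\
  &= H(f' \circ f) \circ (\alpha' \circ \alpha)_x
     && \text{(functoriality of } H\text{, componentwise composition)} \\
  &= e(\alpha' \circ \alpha,\, f' \circ f).
\end{aligned}
```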

David Egolf (Nov 27 2024 at 19:03):

Thanks for pointing that out! Now I feel more incentivized to think about the unit and counit of adjunctions that I may run across in the future...

David Egolf (Nov 27 2024 at 19:07):

Now, the reason I did all this was because I wanted to figure out how

D':J \times C^{\mathrm{op}} \to \mathsf{Set}

acts on morphisms, given a

J

-shaped diagram

D:J \to [C^{\mathrm{op}}, \mathsf{Set}]

of presheaves on

C

. We should be able to spell out

D'

in detail now!

David Egolf (Nov 27 2024 at 19:11):

We start with

D:J \to [C^{\mathrm{op}}, \mathsf{Set}]

. Using

D

, we'll first form

D \times 1_{C^{\mathrm{op}}}: J \times C^{\mathrm{op}} \to [C^{\mathrm{op}}, \mathsf{Set}] \times C^{\mathrm{op}}

. Then we'll apply our evaluation functor

e:[C^{\mathrm{op}}, \mathsf{Set}] \times C^{\mathrm{op}} \to \mathsf{Set}

to end up in

\mathsf{Set}

David Egolf (Nov 27 2024 at 19:28):

So let's consider

D' = e \circ (D \times 1_{C^{\mathrm{op}}}):J \times C^{\mathrm{op}} \to \mathsf{Set}

On morphisms, it acts like this:

(f:j \to j', g:c \to c') \mapsto (D(f):D(j) \to D(j'), g:c \to c')

\mapsto D(j')(g) \circ D(f)_{c}

David Egolf (Nov 27 2024 at 19:29):

David Egolf (Nov 27 2024 at 19:30):

Hopefully I did that right. It feels like it would be easy to make a mistake here.

David Egolf (Nov 27 2024 at 19:33):

Scrolling way back, I think my reason for spelling out how

D'

works was to figure out how

H: C^{\mathrm{op}} \to [J, \mathsf{Set}]

works. This would involve "hopping across" the adjunction again :sweat_smile:!

David Egolf (Nov 27 2024 at 19:34):

That sounds like a lot of work, so I might instead take some time to rethink my strategy. I'll stop here for today.

John Baez (Nov 27 2024 at 19:43):

It's always useful to start by guessing what the answer must be: in problems of this sort, there is usually an "obvious best guess".

John Baez (Nov 27 2024 at 19:44):

The physicist John Wheeler gave some advice that really affected me, even though I'd already half-known it before. Namely: never do a calculation unless you already know the answer.

John Baez (Nov 27 2024 at 19:46):

(It's actually enough to think you know the answer - then the calculation will prove you wrong.)

John Baez (Nov 27 2024 at 19:52):

So, when you get time you might just write down what you think

H

should be, without calculating it.

David Egolf (Nov 27 2024 at 19:53):

That's an interesting perspective! I think I sometimes use calculations as a sort of "extension ladder" to reach beyond what my intuition is telling me. But if I start reaching too far beyond it, things get a bit wobbly. So the idea of consistently grounding calculation in a specific guess or intuition sounds potentially quite helpful!

David Egolf (Nov 27 2024 at 19:54):

John Baez (Nov 27 2024 at 19:55):

Yes, Wheeler was exaggerating for effect; both perspectives on calculation are important!

Alex Kreitzberg (Nov 28 2024 at 06:42):

Do you have a cute story/anecdote that insists on calculating the answer when you believe you already know it? The temptation to say "I already know this, what's the point of the calculation?" feels far more lethal to me. (Of course all advice is tailored to who is receiving it)

John Baez (Nov 28 2024 at 18:25):

I don't have a specially cute story like that: I just know I'm pretty much unable to do a serious calculation correctly unless I have a good idea about where it's going. Here I'm generally talking about calculations that involve integrals, algebraic equations, etc. - since those are the most complicated calculations I've done. So typically what happens is that I calculate rather quickly but make lots of copying mistakes, where, when copying from one line of text to the next and doing some manipulations, a double minus sign turns into a single minus sign, I forget to distribute a factor over all the terms in a sum, and so on. If I know where the calculation should be going, I can tell when something is going wrong, so I can diagnose these errors. But if I have no idea what the result should be, it takes a long time, because I tend to become 'blind' to these errors: I can look at them over and over, and still not see them.

John Baez (Nov 28 2024 at 18:27):

The same general principles apply to category-theoretic computations, especially with enormous commutative diagrams.

However, what I like about category theory is that it's harder to make computational mistakes, because in many situations there's only one possible expression that can possibly parse: if you write down the wrong thing, you get something that has the wrong type or is undefined. I like to say that category theory is 'rigid', not flexible: if you accidentally bend things a bit, they tend to break completely, so you can tell.

Physicists try to get themselves into similar situations by relentlessly using dimensional analysis. Then many mistakes can be spotted by noticing that what you wrote has the wrong dimensions. This amounts to replacing the ring of real numbers by a graded commutative ring, graded in some abelian group, where you're not allowed to add things of different grades.

James Dolan noticed that graded commutative rings of this sort can also be seen as categories called 'dimensional categories'... so using dimensional analysis in physics gives it more of the 'rigidity' we expect from category theory:

John Baez (Nov 28 2024 at 18:47):

Recently I spent two or three weeks trying to correctly do some computations in statistical mechanics, essentially taking the limit of an integral, and I screwed up about 100 times before getting it right! The problem was that I really didn't know the right answer ahead of time: I had a rough idea of it, but I quickly discovered that rough idea was wrong and then I was lost at sea. In the end I had to do a very concrete example of these calculations, rather than the general abstract calculations, before I discovered a conceptual error I was making:

I actually enjoyed these few weeks very much, since it's been a long time since I've done calculations that were so involved, and so deeply reliant on ideas from physics. When I finally straightened out all the mistakes it was glorious!

David Egolf (Nov 28 2024 at 18:57):

That rings true for me as well! Certainly in the context of math it often helps me to try and consider a specific example, but that's also true in the case of writing a program. I've spent weeks slowly debugging a program that is supposed to reconstruct images, where my only clue that something is wrong is that the image just doesn't look right at all. Without that clue, just reading the code, I would have been very hard pressed to identify what was wrong!

David Egolf (Nov 28 2024 at 19:03):

Let me see if I can dream up a guess for how the functor

H: C^{\mathrm{op}} \to [J, \mathsf{Set}]

should work. The context is that we have a

J

-shaped diagram

D:J \to [C^{\mathrm{op}}, \mathsf{Set}]

of presheaves on

C

On objects, I expect

H

to send

c \in C

to the

J

-shaped diagram in

\mathsf{Set}

given by evaluating each presheaf in our original diagram at

c

David Egolf (Nov 28 2024 at 19:06):

On morphisms, I don't have a guess for what

H

should do yet. If

f:c \to c'

is a morphism in

C^{\mathrm{op}}

, then we need

H(f):H(c) \to H(c')

. This is to be a natural transformation from

H(c):J \to \mathsf{Set}

to

H(c'):J \to \mathsf{Set}

. To specify it, it suffices to specify its components.

David Egolf (Nov 28 2024 at 19:12):

So let

\lambda:j \to k

be a morphism in

J

. Here's the naturality square for

\lambda

:
naturality square

David Egolf (Nov 28 2024 at 19:14):

Let's see if we can figure out a guess for

H(f)_j:H(c)(j) \to H(c')(j)

. Now,

H(c):J \to \mathsf{Set}

is a diagram of sets obtained by evaluating each presheaf at

c

. If we just grab the

j

-th set from that diagram, this should just be what we get when we evaluate the

j

-th presheaf at

c \in C

David Egolf (Nov 28 2024 at 19:15):

H(f)_j:H(c)(j) \to H(c')(j)

should be a function from (the set obtained by evaluating the

j

-th presheaf in

D

at

c

) to (the set obtained by evaluating the

j

-th presheaf in

D

at

c'

)

David Egolf (Nov 28 2024 at 19:21):

The

j

-th presheaf is

D(j):C^{\mathrm{op}} \to \mathsf{Set}

. This is a functor, so we have

D(j)(f): D(j)(c) \to D(j)(c')

. I think we have

D(j)(c) = H(c)(j)

and

D(j)(c') = H(c')(j)

. So,

D(j)(f):H(c)(j) \to H(c')(j)

David Egolf (Nov 28 2024 at 19:24):

I can now form a guess for what

H:C^{\mathrm{op}} \to [J,\mathsf{Set}]

does on morphisms. It takes a morphism

f:c \to c'

in

C^{\mathrm{op}}

and sends it to the natural transformation from

H(c):J \to \mathsf{Set}

to

H(c'):J \to \mathsf{Set}

with

j

-th component given by

D(j)(f):H(c)(j) \to H(c')(j)

David Egolf (Nov 28 2024 at 19:28):

I'd next want to check that this guess makes the above naturality square commute. But I'll stop here for today.
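
For reference, here is a sketch of how that check might go. It assumes (this is not stated explicitly above) that H(c) acts on a morphism \lambda:j \to k of J by taking the c-th component of the natural transformation D(\lambda), that is, H(c)(\lambda) = D(\lambda)_c.

```latex
\begin{aligned}
H(f)_k \circ H(c)(\lambda)
  &= D(k)(f) \circ D(\lambda)_c \\
  &= D(\lambda)_{c'} \circ D(j)(f)
     && \text{(naturality of } D(\lambda):D(j) \to D(k) \text{ at } f\text{)} \\
  &= H(c')(\lambda) \circ H(f)_j .
\end{aligned}
```

Under that assumption, the naturality square for H(f) is just the naturality square of D(\lambda) read the other way around.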

John Baez (Nov 28 2024 at 23:03):

David Egolf (Nov 28 2024 at 23:07):

Yes, @John Baez that is what I'm trying to do! I agree that

D(j)(c) = H(c)(j)

for objects

c,j

, but I hadn't considered that the same equation could hold for morphisms!

John Baez (Nov 28 2024 at 23:09):

Okay. It's good to notice that when you have a two-variable functor like

D(-)(--)

or

H(--)(-)

, it makes sense when both

-

and

--

are objects, when both of them are morphisms, and also when one is an object and another is a morphism!

John Baez (Nov 28 2024 at 23:10):

So, it's good to do as many computations as you can while remaining noncommittal about whether the variables are objects or morphisms. Then you can effectively do multiple computations at once.

David Egolf (Nov 28 2024 at 23:13):

That sounds cool! I'm not quite understanding yet how that equation can make sense when both the things we feed in are morphisms.

Let's say

f:c \to c'

in

C^{\mathrm{op}}

. Then

H(f)

is a natural transformation between two functors in

[J, \mathsf{Set}]

. I'm not seeing how it makes sense to then feed a morphism in

J

to

H(f)

; I don't think of

H(f)

as something that takes in morphisms - it's just a bunch of component functions.

John Baez (Nov 28 2024 at 23:13):

John Baez (Nov 28 2024 at 23:16):

Now maybe you're at the stage before you're convinced that Cat is cartesian closed. Once you're convinced, you know that

and this has no trouble eating a morphism in

C^{\text{op}}

and a morphism in

J

and producing a morphism in

\mathsf{Set}

John Baez (Nov 28 2024 at 23:18):

eats a morphism in

C^{\text{op}}

and a morphism in

J

and produces a morphism in

\mathsf{Set}

David Egolf (Nov 28 2024 at 23:21):

Scrolling way back, I reached a point where I needed to know what

H

does to morphisms. That's what I would like to understand.

I already am convinced that

\mathsf{Cat}

is cartesian closed. So I agree with you when you talk about how the functor

H

is just another way of talking about some functor

\tilde{H}

. I think my problem is that I don't understand exactly how moving across this adjunction actually works.

David Egolf (Nov 28 2024 at 23:22):

It's one thing to know that I can exchange one functor for another using this adjunction. It's another thing to understand exactly what functor I get out from this exchange process.

David Egolf (Nov 28 2024 at 23:23):

It's possible my original approach was just a needlessly painful way to go about things, and that this could all be avoided with a different strategy.

John Baez (Nov 28 2024 at 23:24):

eats a morphism in

C^{\text{op}}

and a morphism in

J

and produces a morphism in

\mathsf{Set}

- that is, to get myself into trouble as you just have, and get back out- I need to remember more precisely how

eats a morphism

f : c \to c'

in

C^{\text{op}}

and a morphism

g: j \to j'

in

J

and produces a morphism in

\mathsf{Set}

On the one hand, we just take the morphism

(f,g) : (c,j) \to (c',j')

and feed it in

\tilde{H}

. But on the other hand, it's good to remember that

John Baez (Nov 28 2024 at 23:26):

This formula I just urged you to "remember", which may have some typos in it, allows you to break things down in a way where you don't bust your brain wondering what does a natural transformation do to a morphism???

John Baez (Nov 28 2024 at 23:26):

(What it does to a morphism, ultimately, is give a commutative square, meaning an equation of some sort... so, in a way it doesn't do much!)

John Baez (Nov 28 2024 at 23:32):

is a completely general formula that says: when you've got a morphism in a product of categories, you can always write it as

This is why, when you have a functor going out of a product category like

C^{\text{op}} \times J

, you never need to think about what it does to a pair of morphisms, one in

C^{\text{op}}

and one in

J

. You can always just think about what it does to a pair consisting of one object and one morphism.
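
Spelled out in the notation used elsewhere in this thread, the factorization and its consequence look roughly like this. It is a sketch that takes \tilde{H}:C^{\text{op}} \times J \to \mathsf{Set} to be the two-variable functor corresponding to H, with f:c \to c' and g:j \to j'.

```latex
\begin{aligned}
(f,g) &= (f, 1_{j'}) \circ (1_c, g) = (1_{c'}, g) \circ (f, 1_j), \\
\tilde{H}(f,g) &= \tilde{H}(f, 1_{j'}) \circ \tilde{H}(1_c, g) = H(f)_{j'} \circ H(c)(g), \\
\tilde{H}(f,g) &= \tilde{H}(1_{c'}, g) \circ \tilde{H}(f, 1_j) = H(c')(g) \circ H(f)_j .
\end{aligned}
```

The two expressions for \tilde{H}(f,g) agree exactly because H(f) is natural in j.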

John Baez (Nov 28 2024 at 23:33):

And I feel that allows us to avoid the problem you were facing (and ultimately also confront that problem and solve it).

David Egolf (Nov 28 2024 at 23:50):

David Egolf (Nov 28 2024 at 23:52):

By the way, I think I ran across earlier today a way that one can think of applying a natural transformation to a morphism. One starts by contemplating the naturality square for the morphism in question, which is

f:X \to Y

in the picture below:
naturality square

David Egolf (Nov 28 2024 at 23:52):

Then one can define

\alpha(f) = G(f) \circ \alpha_X = \alpha_Y \circ F(f):F(X) \to G(Y)

David Egolf (Nov 28 2024 at 23:53):

David Egolf (Nov 29 2024 at 00:01):

I like the formula

(f,g) = (f, 1_{j'}) \circ (1_c, g)

. I don't immediately see how to use it to figure out what

H

does to a morphism. Maybe something will occur to me when I give this another proper try, hopefully tomorrow.

John Baez (Nov 29 2024 at 00:04):

Well, I think you ran into trouble figuring out what

H(-)(--)

is when both

-

and

--

are morphisms, and this formula helps with that. If both of them are morphisms, you can use the formula I gave to reduce to the case where just one is an interesting morphism, and the other is an identity morphism (and thus essentially an object). Since

John Baez (Nov 29 2024 at 00:04):

David Egolf (Nov 29 2024 at 00:11):

That sounds helpful - thanks! Unfortunately the pieces aren't quite coming together for me right now. I'm not even sure I could clearly explain my point of confusion at the moment. I'll plan to sleep on this and give it a solid try tomorrow.

John Baez (Nov 29 2024 at 00:12):

Peva Blanchard (Nov 29 2024 at 12:49):

For every object

c

,

H(c)

is a functor from

J

to

\text{Set}

.
Hence, for every object

c \in C^{op}

and

j \in J

, we have

Now, let

g : c \to c'

be a morphism of

C

. Then

H(g)

should be a natural transformation from the functor

H(c') : J \to \text{Set}

to the functor

H(c) : J \to \text{Set}

. I find it useful to denote a natural transformation explicitly as a family of morphisms, here indexed by a variable

j

running over the objects of

J

. I.e.,

In other words,

H(g)(j)

is the component of the natural transformation

H(g)

at object

j

I find that these notations, and John's hint, should help define the functor

\tilde{H} : C^{op} \times J \to \text{Set}

on objects and morphisms.

David Egolf (Nov 29 2024 at 19:11):

Thanks to both of you, I feel that I understand this adjunction a lot better now!

David Egolf (Nov 29 2024 at 19:13):

I'm going to try and start this puzzle over again, using the recent discussion to help solve it. I'm going to try to keep my discussion of the puzzle much more concise this time.

John Baez (Nov 29 2024 at 22:10):

David Egolf (Nov 29 2024 at 22:52):

We start with a

J

-shaped diagram

D:J \to [C^{\mathrm{op}}, \mathsf{Set}]

of presheaves on

C

, where

J

is a small category. Our goals are to: (1) describe a candidate colimit for this diagram and (2) show that the candidate colimit really is a colimit.

David Egolf (Nov 29 2024 at 22:55):

We will make use of this adjunction:

\mathsf{Cat}(J \times C^{\mathrm{op}}, \mathsf{Set}) \cong \mathsf{Cat}(J, [C^{\mathrm{op}}, \mathsf{Set}])

. Given

D

we can use this bijection to know there is a corresponding

D':J \times C^{\mathrm{op}} \to \mathsf{Set}

A morphism in

J \times C^{\mathrm{op}}

is of the form

(g:j \to j', f:c \to c')

. We can rewrite this as

(g, 1_{c'}) \circ (1_j, f)

. So to describe what

D'

does on morphisms it suffices to describe

D'(g, 1_{c'})

and

D'(1_j,f)

We have

D'(g, 1_{c'}) = D(g)(c')

. Here

D(g)

is a natural transformation from

D(j)

to

D(j')

, which are both functors

:C^{\mathrm{op}} \to \mathsf{Set}

. By

D(g)(c')

we mean the

c'

component of

D(g)

We also have

D'(1_j, f) = D(j)(f)

. Here

D(j):C^{\mathrm{op}}\to \mathsf{Set}

and

f:c \to c'

is a morphism in

C^{\mathrm{op}}

, so it makes sense to directly supply

f

to

D(j)
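
Putting the two pieces together gives a formula for a general morphism. This is a sketch that writes D(g)(c') for the c'-component of D(g), as above:

```latex
\begin{aligned}
D'(g,f) &= D'(g, 1_{c'}) \circ D'(1_j, f) = D(g)(c') \circ D(j)(f) \\
        &= D(j')(f) \circ D(g)(c)
        && \text{(naturality of } D(g):D(j) \to D(j') \text{ at } f\text{)} .
\end{aligned}
```

The second form matches the earlier description of D' as evaluation applied after D \times 1_{C^{\mathrm{op}}}.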

David Egolf (Nov 29 2024 at 22:59):

Next, we introduce a functor

F_c:J \to J \times C^{\mathrm{op}}

, defined by

F_c = (1_J, \Delta_c)

, where

1_J:J \to J

is the identity functor and

\Delta_c:J \to C^{\mathrm{op}}

is the functor constant at

c \in C

We can then form

D' \circ F_c:J \to \mathsf{Set}

. Since

\mathsf{Set}

has colimits, we can take the colimit of this diagram to get some set,

\mathrm{colim~}(D' \circ F_c)
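
To make this colimit concrete, here is a small Python sketch of the standard construction of a colimit of finite sets: take the disjoint union and glue each element to its images along the arrows of the diagram. The function name `colimit`, the data layout, and the sample pushout are all assumptions made for illustration.

```python
def colimit(objects, arrows):
    """Colimit of a finite diagram of finite sets.

    objects: dict sending each index j to a list of elements (the set at j).
    arrows:  list of (j, k, func), where func is a dict from the set at j
             to the set at k.
    Returns (classes, inject): classes is a set of frozensets of tagged
    elements (j, x), and inject[j][x] is the class containing (j, x).
    """
    # Start from the disjoint union: tag each element with its index.
    parent = {(j, x): (j, x) for j, xs in objects.items() for x in xs}

    def find(p):
        # Follow parent pointers to the representative, shortening the path.
        while parent[p] != p:
            parent[p] = parent[parent[p]]
            p = parent[p]
        return p

    # Glue (j, x) to (k, func[x]) for every arrow of the diagram.
    for j, k, func in arrows:
        for x in objects[j]:
            parent[find((j, x))] = find((k, func[x]))

    # Collect the equivalence classes.
    groups = {}
    for p in parent:
        groups.setdefault(find(p), set()).add(p)
    class_of = {p: frozenset(groups[find(p)]) for p in parent}
    classes = set(class_of.values())
    inject = {j: {x: class_of[(j, x)] for x in xs} for j, xs in objects.items()}
    return classes, inject

# Hypothetical example: a pushout B <- A -> C with A = {0}, B = C = {0, 1}.
objects = {"A": [0], "B": [0, 1], "C": [0, 1]}
arrows = [("A", "B", {0: 0}), ("A", "C", {0: 1})]
classes, inject = colimit(objects, arrows)
```

Here the dictionaries `inject[j]` play the role of the legs of the universal cone under the diagram, and in this example the element 0 of B is glued to the element 1 of C.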

David Egolf (Nov 29 2024 at 23:01):

Intuitively,

D' \circ F_c

is the diagram obtained by evaluating each of our presheaves at

c

, and by taking the

c

-th component of each natural transformation

D(g)

as

g

ranges over morphisms in

J

David Egolf (Nov 29 2024 at 23:03):

Given a morphism

f:c \to c'

we define a natural transformation

\alpha(f):D' \circ F_c \to D' \circ F_{c'}

, by setting

\alpha(f)_j = D(j)(f)

. (To be concise, I won't spell out here the details involved with checking that this really is a natural transformation.)

David Egolf (Nov 29 2024 at 23:05):

For each morphism

f:c \to c'

, we then obtain a natural transformation

\mathrm{colim~}(D' \circ F_c) \to \mathrm{colim~}(D' \circ F_{c'})

by using the universal property of a colimit in

\mathsf{Set}

:
diagram

David Egolf (Nov 29 2024 at 23:06):

Here

u_c

and

u_{c'}

are the natural transformations corresponding to the universal cones with apex

\mathrm{colim~}(D' \circ F_c)

and

\mathrm{colim~}(D' \circ F_{c'})

under the diagrams

D' \circ F_c

and

D' \circ F_{c'}

, respectively.

I will use

(\mathrm{colim~}D)(f)

to refer to the unique dashed natural transformation that makes the above diagram commute.
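
In equations, the defining property of this dashed arrow is roughly the following. This is a sketch; the bracket notation [x] for the class of an element x \in D(j)(c) in the colimit is an assumption introduced here.

```latex
\begin{aligned}
(\mathrm{colim~}D)(f) \circ (u_c)_j &= (u_{c'})_j \circ \alpha(f)_j = (u_{c'})_j \circ D(j)(f)
  \quad \text{for every } j \in J, \\
(\mathrm{colim~}D)(f)\big([x]\big) &= \big[\, D(j)(f)(x) \,\big] .
\end{aligned}
```

Since each D(j) is a functor, \alpha(f' \circ f) = \alpha(f') \circ \alpha(f), and uniqueness of the induced map should then give (\mathrm{colim~}D)(f' \circ f) = (\mathrm{colim~}D)(f') \circ (\mathrm{colim~}D)(f).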

David Egolf (Nov 29 2024 at 23:10):

We can now complete the object part of goal (1). Here is the object part of our candidate colimit for

D

, which I'll call

\mathrm{colim~}D:C^{\mathrm{op}} \to \mathsf{Set}

David Egolf (Nov 29 2024 at 23:14):

To complete goal (1), we also need to give a universal cone under

D

with apex

\mathrm{colim~}D

. For each position

j

in our diagram

D

, we need a natural transformation

\lambda(j):D(j) \to \mathrm{colim~}D

We can achieve this by setting

\lambda(j)_c = (u_c)_j

for each

c \in C

, where

(u_c)_j

is the

j

-th component of the natural transformation

u_c:(D' \circ F_c) \to \mathrm{colim~}(D' \circ F_c)

. (To be concise, I will not spell out here the verification that

\lambda(j)

is really a natural transformation.)
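
For the record, the omitted naturality check appears to be a one-line consequence of how (\mathrm{colim~}D)(f) was defined. As a sketch, for f:c \to c' in C^{\mathrm{op}}:

```latex
(\mathrm{colim~}D)(f) \circ \lambda(j)_c
  = (\mathrm{colim~}D)(f) \circ (u_c)_j
  = (u_{c'})_j \circ D(j)(f)
  = \lambda(j)_{c'} \circ D(j)(f) .
```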

David Egolf (Nov 29 2024 at 23:28):

Defining each

\lambda(j):D(j) \to \mathrm{colim~}D

in this way does indeed define a cone under

D

with tip

\mathrm{colim~}D

. (Again, I won't spell out the verification here).
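
A sketch of the omitted cone check: for g:j \to j' in J we need \lambda(j') \circ D(g) = \lambda(j), and this can be tested one component at a time. It uses (D' \circ F_c)(g) = D'(g, 1_c) = D(g)_c together with the fact that u_c is a cone under D' \circ F_c.

```latex
\big(\lambda(j') \circ D(g)\big)_c
  = (u_c)_{j'} \circ D(g)_c
  = (u_c)_{j'} \circ (D' \circ F_c)(g)
  = (u_c)_j
  = \lambda(j)_c .
```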

So we have accomplished goal (1); we have found a colimit candidate for our diagram

D

David Egolf (Nov 29 2024 at 23:32):

It remains to show that

\mathrm{colim~}D

is really a colimit of

D

. So, for any other cone under

D

, we need to show there is a unique morphism to that from our candidate universal cone with tip

\mathrm{colim~}D

David Egolf (Nov 29 2024 at 23:34):

Referencing the diagram above, we have a cone under

D

with tip

X

and "legs" of the form

\alpha(j)

. We wish to show there is a unique natural transformation

f:\mathrm{colim~} D \to X

that induces a morphism of cones from our candidate colimit cone to the cone with tip

X

David Egolf (Nov 29 2024 at 23:37):

Earlier, we saw that there is an evaluation functor

e:C^{\mathrm{op}} \times [C^{\mathrm{op}}, \mathsf{Set}] \to \mathsf{Set}

. We also have a functor

[C^{\mathrm{op}}, \mathsf{Set}] \to C^{\mathrm{op}} \times [C^{\mathrm{op}}, \mathsf{Set}]

induced by the constant functor at

c \in C

and the identity functor of

[C^{\mathrm{op}}, \mathsf{Set}]

By composing these two, we obtain a functor

:[C^{\mathrm{op}}, \mathsf{Set}] \to \mathsf{Set}

that evaluates any presheaf at

c
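
Explicitly, and writing \mathrm{ev}_c for this composite (a name assumed here for illustration), the functor should act as follows on a presheaf P and on a natural transformation \theta:P \to Q:

```latex
\begin{aligned}
\mathrm{ev}_c(P) &= e(c, P) = P(c), \\
\mathrm{ev}_c(\theta) &= e(1_c, \theta) = Q(1_c) \circ \theta_c = \theta_c .
\end{aligned}
```

So evaluating at c sends a natural transformation to its c-th component.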

David Egolf (Nov 29 2024 at 23:40):

Since applying a functor preserves composition, we can "evaluate at

c

" the diagram above to get another commuting diagram. This tells us that the

c

-th component of

f

is forced to be the function

f_c

that makes the following diagram commute:
diagram

David Egolf (Nov 29 2024 at 23:44):

Thus, if

f

exists, it is unique. It remains to show that all these

f_c

assemble to form a natural transformation, and that this natural transformation corresponds to the morphism of cones pictured earlier.

David Egolf (Nov 30 2024 at 00:06):

We begin by checking that

f

is a natural transformation. So, for any

h:a \to b

in

C^{\mathrm{op}}

we want to show that this naturality square commutes:
naturality square

David Egolf (Nov 30 2024 at 00:08):

I'm running out of steam, so I'll stop here for now. I think I got further than I did last time, at least!

Next time, I may try to work out an example, to see how the

f_c

assemble to form a naturality square in practice. Hopefully that will help! But if I can't figure this out next time, I may give up and consult "Categories for the Working Mathematician".

David Egolf (Dec 01 2024 at 18:53):

This feels super close to working, and I had an idea. The morphisms

\mathrm{colim~}D(h)

and

f_b

are both induced by universal properties. I think it could be helpful to see if the morphism they compose to is also given by a universal property.

David Egolf (Dec 01 2024 at 19:01):

David Egolf (Dec 01 2024 at 19:02):

D(a)

means the diagram evaluated at

a

, and similarly for

D(b)

. To avoid overloading

\alpha

, I'm now using

v

to refer to the cone under

D

with tip

X

David Egolf (Dec 01 2024 at 19:03):

This diagram illustrates that we have a cone under

D(a)

with tip

X(b)

, given by composing the natural transformations

v_b \circ \alpha(h):D(a) \to X(b)

David Egolf (Dec 01 2024 at 19:06):

The universal property of

\mathrm{colim~}D(a)

then ensures that there is a unique morphism

!:\mathrm{colim~}D(a) \to X(b)

such that

! \circ u_a = v_b \circ \alpha(h)

. But because the rectangle and the triangle in the above square commute, this unique morphism must be

f_b \circ \mathrm{colim~}D(h)
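
As a sketch, the two commuting pieces combine like this:

```latex
\big(f_b \circ \mathrm{colim~}D(h)\big) \circ u_a
  = f_b \circ u_b \circ \alpha(h)
  = v_b \circ \alpha(h) .
```

The first equality is the defining square of \mathrm{colim~}D(h), and the second is the triangle f_b \circ u_b = v_b.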

David Egolf (Dec 01 2024 at 19:09):

My next idea is to try and show that

X(h) \circ f_a

also satisfies this condition, and hence by uniqueness is equal to

f_b \circ (\mathrm{colim~}D)(h)

David Egolf (Dec 01 2024 at 19:33):

We obtain a new diagram, where our goal is to show that

(X(h) \circ f_a) \circ u_a = v_b \circ \alpha(h)

:
diagram

David Egolf (Dec 01 2024 at 19:36):

If we can show that two certain sub-diagrams commute, we can paste them together to conclude that the outermost part of this diagram commutes. We want to show:

David Egolf (Dec 01 2024 at 19:40):

We immediately have

f_a \circ u_a = v_a

because

f_a

is by definition the unique morphism that makes this triangle commute.

David Egolf (Dec 01 2024 at 19:45):

The condition

X(h) \circ v_a = v_b \circ \alpha(h)

looks a lot like a naturality square condition.

This still feels tricky, but I think it's something I could ask a question about concisely. Maybe I'll start a new thread to ask a specific related question. [edit: I have now done so!]
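
One possible way to see it, offered as a sketch rather than as the argument from the other thread: write v(j):D(j) \to X for the j-th leg of the cone v (a notation assumed here), so that (v_a)_j = v(j)_a. Each leg is a natural transformation, and its naturality square at h:a \to b is the j-th component of the desired condition.

```latex
\begin{aligned}
\big(X(h) \circ v_a\big)_j &= X(h) \circ v(j)_a
  = v(j)_b \circ D(j)(h)
  = \big(v_b \circ \alpha(h)\big)_j , \\
(X(h) \circ f_a) \circ u_a &= X(h) \circ v_a = v_b \circ \alpha(h) .
\end{aligned}
```

If this is right, uniqueness in the universal property of \mathrm{colim~}D(a) would give X(h) \circ f_a = f_b \circ (\mathrm{colim~}D)(h).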

David Egolf (Dec 04 2024 at 19:15):

I better understand all of this now, I feel, following the discussion in #learning: questions > organizing related cones. I'm going to try and start this puzzle over again, using notation inspired by that thread. Hopefully I can take the argument to its conclusion this time.

John Baez (Dec 04 2024 at 21:11):

David Egolf (Dec 04 2024 at 23:38):

That's somewhat encouraging! I'm finding that this puzzle is really pushing me to get better at being precise and organizing my thoughts. There are just so many little things I've needed to check and keep track of.

I thought I had figured it out when I posted my message above, but on closer examination I realized I wasn't quite done yet!

David Egolf (Dec 04 2024 at 23:39):

I guess there's a certain amount of acclimatization required as one gets used to handling diagrams where the morphisms are natural transformations :sweat_smile: .

David Egolf (Dec 08 2024 at 21:28):

I think I figured it out! My argument has a couple assumptions that I can check later, but I want to first explain my approach. Hopefully I can manage to do that clearly and concisely!

John Baez (Dec 08 2024 at 21:35):

David Egolf (Dec 08 2024 at 21:47):

We start with a diagram

D:J \to [C^{\mathrm{op}}, \mathsf{Set}]

of presheaves. Our goal is to find a colimit for

D

, and to prove that we really found a colimit.

Since

\mathsf{Cat}

is a closed monoidal category, Proposition 3.1 on the nLab tells us that

[J, [C^{\mathrm{op}}, \mathsf{Set}]] \cong [J \times C^{\mathrm{op}}, \mathsf{Set}]

. Since

J \times C^{\mathrm{op}} \cong C^{\mathrm{op}} \times J

, we have that

(J \times C^{\mathrm{op}}, \mathsf{Set}) \cong (C^{\mathrm{op}} \times J, \mathsf{Set})

in

\mathsf{Cat}^{\mathrm{op}} \times \mathsf{Cat}

. Since

[-,-]

is a functor, there is an isomorphism

[J \times C^{\mathrm{op}}, \mathsf{Set}] \cong [C^{\mathrm{op}} \times J, \mathsf{Set}]

. Applying Proposition 3.1 again, we learn there is an isomorphism

[C^{\mathrm{op}} \times J, \mathsf{Set}] \cong [C^{\mathrm{op}},[J, \mathsf{Set}]]

David Egolf (Dec 08 2024 at 21:48):

Putting all these isomorphisms together, we conclude there is an isomorphism

t:[J ,[ C^{\mathrm{op}}, \mathsf{Set}]] \cong [C^{\mathrm{op}}, [J, \mathsf{Set}]]

in

\mathsf{Cat}

John Baez (Dec 08 2024 at 21:53):

Interrupting momentarily, in any symmetric monoidal closed category we always have a god-given isomorphism

and this is what you've really proved, since you didn't use anything that doesn't work at that level of generality. It's a good thing to know!

John Baez (Dec 08 2024 at 21:53):

The symmetric part is what let you switch

a

and

b

a \otimes b \cong b \otimes a
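
In the cartesian closed category of sets, the analogous isomorphism just swaps the order of the two arguments of a curried function. A minimal Python sketch, where the name `transpose` and the toy data are assumptions and only the object-level behaviour is modelled:

```python
def transpose(D):
    """Swap the order of the two arguments of a curried function.

    If D(j)(c) is defined for all j and c, then transpose(D)(c)(j) == D(j)(c).
    """
    return lambda c: lambda j: D(j)(c)

# Hypothetical toy data: D(j)(c) is just a label recording j and c.
D = lambda j: lambda c: f"D({j})({c})"
Dt = transpose(D)
```

Applying `transpose` twice gives back a function that agrees with `D` on every pair of arguments, which is the Set-level shadow of the isomorphism being invertible.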

David Egolf (Dec 08 2024 at 21:54):

David Egolf (Dec 08 2024 at 22:00):

Our goal is to find a universal cone

\beta: D \to \Delta_{\mathrm{colim~} D}

, which provides a colimit of

D

. The functor

\Delta_{\mathrm{colim~} D}:J \to [C^{\mathrm{op}}, \mathsf{Set}]

is the functor constant at some functor

\mathrm{colim~} D:C^{\mathrm{op}} \to \mathsf{Set}

To describe our proposed universal cone

\beta

, we first describe

\mathrm{colim~} D

. We will make use of our "transposing" isomorphism

t:[J ,[ C^{\mathrm{op}}, \mathsf{Set}]] \cong [C^{\mathrm{op}}, [J, \mathsf{Set}]]

to do this.

David Egolf (Dec 08 2024 at 22:05):

My first assumption is that

t(D) = D^t:C^{\mathrm{op}} \to [J, \mathsf{Set}]

is defined as follows:

David Egolf (Dec 08 2024 at 22:08):

This is a big mess of symbols. But intuitively,

D^t(c):J \to \mathsf{Set}

is simply our original diagram of presheaves evaluated at

c

David Egolf (Dec 08 2024 at 22:18):

Here

\mathrm{colim~} \alpha

is the function that induces the unique natural transformation

\mathrm{colim~} \alpha:\Delta_{\mathrm{colim~} F} \to \Delta_{\mathrm{colim~} F'}

such that

(\mathrm{colim~} \alpha) \circ u_F = u_{F'} \circ \alpha

, where

u_F

and

u_{F'}

are the colimit cones with tips

\mathrm{colim~} F

and

\mathrm{colim~} F'

. This situation is illustrated below:
diagram

David Egolf (Dec 08 2024 at 22:22):

Now we can define

\mathrm{colim~} D

. We define it as

\mathrm{colim~} D=G \circ D^t:C^{\mathrm{op}} \to [J, \mathsf{Set}] \to \mathsf{Set}

David Egolf (Dec 08 2024 at 22:29):

We next define our proposed universal cone

\beta:D \to \Delta_{\mathrm{colim~} D}

. This is a natural transformation between functors

:J \to [C^{\mathrm{op}}, \mathsf{Set}]

. Hence we need to provide one component for each

j \in J

David Egolf (Dec 08 2024 at 22:31):

It turns out that each

\beta_j

is a natural transformation and that

\beta

is also a natural transformation. So, it makes sense for us to propose

\beta:D \to \Delta_{\mathrm{colim~} D}

as our universal cone under

D

David Egolf (Dec 08 2024 at 22:36):

Let

\iota:D \to \Delta_X

be any cone under

D

, where

\Delta_X

is the functor

:J \to [C^{\mathrm{op}},\mathsf{Set}]

constant at

X:C^{\mathrm{op}} \to \mathsf{Set}

. We want to show that there is a unique

!:\Delta_{\mathrm{colim~} D} \to \Delta_X

that makes this diagram commute:
diagram

David Egolf (Dec 08 2024 at 22:42):

Since

t:[J ,[ C^{\mathrm{op}}, \mathsf{Set}]] \cong [C^{\mathrm{op}}, [J, \mathsf{Set}]]

is an isomorphism of categories, the above diagram commutes iff its image under

t

commutes. Further, there is a unique

!

that makes this diagram commute iff there is a unique

!^t

that makes the image of the diagram commute.

I indicate application of the isomorphism

t

with a superscript

t

. So, to show that

\beta

is a universal cone under

D

, it suffices to show that there is a unique

!^t

that makes this diagram commute:
"transposed" diagram

David Egolf (Dec 08 2024 at 22:55):

I next assume the definition of

(\Delta_{\mathrm{colim~} D})^t

, and

(\Delta_X)^t

. Hopefully this is one of the cases where the most obvious guess ends up being correct.
I assume that

(\Delta_{\mathrm{colim~} D})^t

is the functor

\Delta_{(\mathrm{colim~} D)(-)}:C^{\mathrm{op}} \to [J, \mathsf{Set}]

David Egolf (Dec 08 2024 at 22:55):

I assume that

(\Delta_X)^t

is the functor

\Delta_{X(-)}:C^{\mathrm{op}} \to [J, \mathsf{Set}]

David Egolf (Dec 08 2024 at 23:01):

I next assume the definition of

\beta^t

and

\iota^t

, again hoping that these assumptions are correct.

\beta^t

is a natural transformation, going between functors

:C^{\mathrm{op}} \to [J, \mathsf{Set}]

. Hence it has one component for each

c \in C^{\mathrm{op}}

. We define

\beta^t

as follows:

David Egolf (Dec 08 2024 at 23:01):

\iota^t

also needs one component for each

c \in C^{\mathrm{op}}

. We define

\iota^t

as follows:

David Egolf (Dec 08 2024 at 23:04):

Now, since

!^t \circ \beta^t = \iota^t

is an equation of natural transformations, it holds iff it holds at each component. Thus we are forced to define

(!^t)_c

as the unique morphism that makes the diagram below commute:
diagram

David Egolf (Dec 08 2024 at 23:05):

A unique morphism

(!^t)_c

that satisfies this condition exists, because

(\beta^t)_c = \gamma_c

is the universal cone under our diagram evaluated at

c

, and

(\iota^t)_c

is another cone under our diagram evaluated at

c

Thus, if

!^t

exists, each of its components is uniquely determined. So if

!^t

exists, it is the unique morphism that satisfies

!^t \circ \beta^t = \iota^t

David Egolf (Dec 08 2024 at 23:09):

It remains to show that the components

(!^t)_c

assemble to form a natural transformation. Since

!^t

is to be a natural transformation between functors

:C^{\mathrm{op}} \to [J, \mathsf{Set}]

, we consider a naturality square for

f:c \to c'

:
naturality square

David Egolf (Dec 08 2024 at 23:18):

David Egolf (Dec 08 2024 at 23:19):

To show that this diagram commutes, we contemplate this rather large diagram:
large diagram

David Egolf (Dec 08 2024 at 23:22):

We have

(\beta^t)_{c'} \circ D^t(f) = (\mathrm{colim~}D)(f) \circ (\beta^t)_c

, which I'll call the "rectangular equation", by the definition of

(\mathrm{colim~}D)(f)

. We have

(!^t)_{c'} \circ (\beta^t)_{c'} = (\iota^t)_{c'}

, which I'll call the "triangular equation," by the definition of

(!^t)_{c'}

David Egolf (Dec 08 2024 at 23:25):

We can combine the diagrams involved in the triangular and rectangular equations to get a larger diagram. Thus,

(!^t)_{c'} \circ (\mathrm{colim~}D)(f)

is the unique morphism that makes this larger diagram commute. There really is a unique morphism that makes this larger diagram commute, because

(\beta^t)_c

is the universal cone under our original diagram of presheaves evaluated at

c

David Egolf (Dec 08 2024 at 23:28):

If we can show that

X(f) \circ (!^t)_c

also makes this larger diagram commute, when substituted in the "bottom edge", then we can conclude that

X(f) \circ (!^t)_c = (!^t)_{c'} \circ ((\mathrm{colim~}D)(f))

, because there is a unique morphism that satisfies this condition.

David Egolf (Dec 08 2024 at 23:29):

So, we want to show that

X(f) \circ (!^t)_c \circ (\beta^t)_c = (\iota^t)_{c'} \circ D^t(f)

. To show this, it suffices to show that

(!^t)_c \circ (\beta^t)_c = (\iota^t)_c

and

X(f) \circ (\iota^t)_c = (\iota^t)_{c'}\circ D^t(f)

David Egolf (Dec 08 2024 at 23:31):

We immediately have

(!^t)_c \circ (\beta^t)_c = (\iota^t)_c

by definition of

(!^t)_c

. And

X(f) \circ (\iota^t)_c = (\iota^t)_{c'}\circ D^t(f)

is a naturality square for

\iota^t:D^t \to (\Delta_X)^t

:
naturality square

David Egolf (Dec 08 2024 at 23:34):

(we recall that

(\Delta_X)^t(c) = \Delta_{X(c)}

and

(\Delta_X)^t(f) = X(f)

)

David Egolf (Dec 08 2024 at 23:35):

But since

\iota:D \to \Delta_X

is a natural transformation,

\iota^t:D^t \to (\Delta_X)^t

is also a natural transformation. Thus the above naturality square commutes.

David Egolf (Dec 08 2024 at 23:36):

Consequently,

X(f) \circ (!^t)_c \circ (\beta^t)_c = (\iota^t)_{c'} \circ D^t(f)

and thus

X(f) \circ (!^t)_c = (!^t)_{c'} \circ ((\mathrm{colim~}D)(f))

. Hence, any arbitrary naturality square for

!^t

commutes, and hence

!^t

is a natural transformation.
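
For reference, the naturality argument above can be compressed into a single chain of equalities. This is only a summary sketch of the steps already given, using the same shorthand as above (in particular writing X(f) for the constant natural transformation it induces):

```latex
\begin{align*}
X(f) \circ (!^t)_c \circ (\beta^t)_c
  &= X(f) \circ (\iota^t)_c
  && \text{definition of } (!^t)_c \\
  &= (\iota^t)_{c'} \circ D^t(f)
  && \text{naturality of } \iota^t \\
  &= (!^t)_{c'} \circ (\beta^t)_{c'} \circ D^t(f)
  && \text{triangular equation at } c' \\
  &= (!^t)_{c'} \circ (\mathrm{colim~}D)(f) \circ (\beta^t)_c
  && \text{rectangular equation}
\end{align*}
```

Since the cocone with legs given by the first and last expressions factors uniquely through the universal cocone at c, the two morphisms being precomposed with its legs must agree, which is exactly the naturality square.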

David Egolf (Dec 08 2024 at 23:38):

Combining this existence result with our earlier uniqueness result, we conclude that our cone

\beta:D \to \Delta_{\mathrm{colim~}D}

is initial in the category of cones under

D

, and thus that a colimit of

D

exists - given by the universal cone

\beta

with tip

\mathrm{colim~}D

. So we conclude that our category of presheaves on

C

has small colimits.

David Egolf (Dec 08 2024 at 23:40):

I made several assumptions, and it's possible I made some mistakes. But I'm hopeful that this approach works! Assuming that this argument is at least mostly right in spirit, I'm feeling ready to move on to the next part of the blog post.

David Egolf (Dec 08 2024 at 23:42):

If someone else knows of a faster way to prove the above result, I'd also be quite interested in that!

John Baez (Dec 08 2024 at 23:58):

By the way, what was my original puzzle, which led you into this? You can just tell me the blog post number and puzzle number.

John Baez (Dec 08 2024 at 23:58):

David Egolf (Dec 08 2024 at 23:59):

The puzzle is in this blog post: here. It is the first puzzle in this blog post.

John Baez (Dec 09 2024 at 00:06):

Okay. Sorry, I've lost track of what's going on. Here's the puzzle in words: if we have a

D

-shaped diagram of presheaves on a category

C

, and we want to take its colimit, here's how: for each object of

C

we get a

D

-shaped diagram of sets, and we take the colimit of that. Since taking the colimit is functorial, the result is a presheaf on

C

. Prove this is the desired colimit.

John Baez (Dec 09 2024 at 00:08):

If we have a

J

-shaped diagram of presheaves on a category

C

, and we want to take its colimit, here's how: for each object of

C

we get a

J

-shaped diagram of sets, and we take the colimit of that. Since taking the colimit is functorial, the result is a presheaf on

C

. Prove this is the desired colimit.
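
To make the recipe concrete, here is a minimal Python sketch for finite data. It is an illustration under several assumptions that are not in the puzzle itself: the category C is the small "graph shape" with two objects V and E and two arrows s and t (so presheaves on it are directed graphs), the diagram shape J is a span, and all the names (`colimit_of_sets`, `pointwise_colimit`, the vertex and edge labels) are hypothetical. The colimit of sets is computed as a disjoint union modulo the relations imposed by the arrows of the diagram.

```python
def colimit_of_sets(sets, arrows):
    """Colimit of a finite diagram of finite sets.

    sets:   dict j -> list of elements of D(j)
    arrows: list of (j, j2, fn), with fn a dict encoding D(j) -> D(j2)
    Returns a dict sending each tagged element (j, x) to a chosen
    representative of its equivalence class.
    """
    parent = {(j, x): (j, x) for j, xs in sets.items() for x in xs}

    def find(e):
        while parent[e] != e:
            parent[e] = parent[parent[e]]  # path halving
            e = parent[e]
        return e

    # Identify (j, x) with (j2, fn(x)) for every arrow of the diagram.
    for j, j2, fn in arrows:
        for x in sets[j]:
            parent[find((j, x))] = find((j2, fn[x]))
    return {e: find(e) for e in list(parent)}


def pointwise_colimit(D, nat, objects=("V", "E"),
                      restrictions=(("s", "E", "V"), ("t", "E", "V"))):
    """Compute the colimit separately at each object of C, then define the
    restriction maps of the result on representatives."""
    cls = {}
    for c in objects:
        sets = {j: P[c] for j, P in D.items()}
        arrows = [(j, j2, comp[c]) for j, j2, comp in nat]
        cls[c] = colimit_of_sets(sets, arrows)
    colim = {c: sorted(set(cls[c].values())) for c in objects}
    for name, src, tgt in restrictions:
        # Apply the restriction map of D(j) to a representative (j, x),
        # then pass to the class of the result.
        colim[name] = {rep: cls[tgt][(rep[0], D[rep[0]][name][rep[1]])]
                       for rep in colim[src]}
    return colim


# Hypothetical example: glue two one-edge graphs along a single vertex.
D = {
    "j0": {"V": ["p"], "E": [], "s": {}, "t": {}},
    "j1": {"V": ["a", "b"], "E": ["e"], "s": {"e": "a"}, "t": {"e": "b"}},
    "j2": {"V": ["c", "d"], "E": ["f"], "s": {"f": "c"}, "t": {"f": "d"}},
}
# Components of the two natural transformations j0 -> j1 and j0 -> j2.
nat = [
    ("j0", "j1", {"V": {"p": "b"}, "E": {}}),
    ("j0", "j2", {"V": {"p": "c"}, "E": {}}),
]

G = pointwise_colimit(D, nat)
print(len(G["V"]), len(G["E"]))
```

In this example the result should be a path of two edges: the target of the first edge and the source of the second are identified. The restriction maps are well defined on classes only because the legs of the diagram are natural transformations, which is the same fact the proof relies on.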

John Baez (Dec 09 2024 at 00:10):

So it looks like there are two significant leaps of logic in what I just said: "taking the colimit is functorial" and "this is the resulting colimit".

John Baez (Dec 09 2024 at 00:15):

Maybe you could explain a bit, in words but preferably no math symbols, about how you tackled these two hard parts. I'll admit I'm having "can't see the forest for the trees" syndrome in reading what you just wrote. For that, it's always good to talk without math symbols, as if we were hiking through the woods.

David Egolf (Dec 09 2024 at 00:36):

John Baez (Dec 09 2024 at 00:40):

If you prefer, you can say just a little at a time and I can ask questions - no need to write a long essay. But if you want to write more, that's fine too!

David Egolf (Dec 09 2024 at 00:42):

Regarding "taking the colimit is functorial":
We could talk about the existence of a functor from (diagrams of sets) to sets, that sends each diagram of sets to its colimit. I didn't include a proof that this process is functorial above (if I recall correctly).

However, I did prove that this process is functorial in my offline notes. Basically, I used the universal property of colimits to say what our "take the colimit" functor should do to morphisms. Then I used that universal property again to show that this process respects composition and identities.

David Egolf (Dec 09 2024 at 00:43):

I'm not sure that this is exactly what you are hoping to discuss.
(I'm pausing here to give you a chance to ask questions, should you wish to do so!)

John Baez (Dec 09 2024 at 00:59):

Okay, that's great so far. So it sounds like the functoriality of colimits is the part you weren't explaining in your latest string of comments. So, by a process of exclusion, I guess you must have been talking about what I called the second significant leap of logic: the "Prove this is the desired colimit" part here:

John Baez (Dec 09 2024 at 01:13):

Category theorists would probably say that in this step you're proving "colimits in presheaf categories can be computed pointwise".

David Egolf (Dec 09 2024 at 01:19):

David Egolf (Dec 09 2024 at 01:20):

To check and see if some object satisfies the universal property of a colimit, we need to come up with a proposed "universal" cone which has as its tip our colimit object. Then we need to show that, for any other cone under our original diagram, there is a unique morphism from our proposed universal cone to that other cone.

David Egolf (Dec 09 2024 at 01:24):

So I started by describing my proposed universal cone. This needs to have one "leg" for each object in our shape-category

J

. Each leg needs to be a natural transformation between presheaves. To figure out the natural transformation in each "leg" of our proposed universal cone, I tried to relate this to the universal cone we get under our diagram evaluated at any object in

C

David Egolf (Dec 09 2024 at 01:25):

I checked that this really gave a natural transformation in each "leg", and that all the legs assemble to form a cone under our diagram of presheaves. So far this wasn't too difficult, I felt.

David Egolf (Dec 09 2024 at 01:25):

The really difficult part (for me at least!) was showing that this cone was initial among cones under our diagram of presheaves.

John Baez (Dec 09 2024 at 01:28):

I suppose you used the fact that your proposed colimit of presheaves on

C

had been computed "pointwise", so that for each object in

C

you had taken a colimit of sets, with its own initial cocone?

John Baez (Dec 09 2024 at 01:28):

An important fact about category theory is that "it takes money to make money" - so to get initiality you probably use initiality.

David Egolf (Dec 09 2024 at 01:29):

Yes, that was crucial. Being able to make use of the universal property of colimits in

\mathsf{Set}

was essential. Although I think a similar thing would have worked even if we were considering presheaves valued in a different category having small colimits.

David Egolf (Dec 09 2024 at 01:30):

I was glad I didn't ever have to use the specific description of colimits in

\mathsf{Set}

- because I always find that hard to remember :sweat_smile:. I just had to use the universal property.

John Baez (Dec 09 2024 at 01:31):

Yes, I bet it should have worked the same way for any cocomplete category replacing

\mathsf{Set}

- I doubt you had to stare hard enough at the objects of

\mathsf{Set}

to really notice they were sets.

David Egolf (Dec 09 2024 at 01:33):

So far all of this I think was about the same between my various attempts. At this point my efforts diverged a bit between my different attempts.

David Egolf (Dec 09 2024 at 01:34):

In the course of spinning in circles trying to show that my proposed universal cone really is initial, I seem to recall that I often wanted to work with a sort of "transposed" version of some natural transformations I already had.

David Egolf (Dec 09 2024 at 01:34):

That motivated my most recent attempt, which begins by transporting this whole problem across the functorial isomorphism we discussed earlier.

John Baez (Dec 09 2024 at 01:35):

(Btw, every time you said "cone" recently, I think you meant "cocone". If you turn an ice-cream cone upside-down it's a "cocone"... which is why most people, or at least ice cream vendors, don't talk about "cocones".)

David Egolf (Dec 09 2024 at 01:36):

(I use "cone under" to mean "cocone". I think Riehl does this in "Category Theory in Context". Sometimes when it's clear from context that I'm only talking about cones under a diagram I abbreviate this just to "cone" - perhaps that is confusing though!)

David Egolf (Dec 09 2024 at 01:38):

This is the isomorphism of categories I'm alluding to:

t:[J ,[ C^{\mathrm{op}}, \mathsf{Set}]] \cong [C^{\mathrm{op}}, [J, \mathsf{Set}]]

in

\mathsf{Cat}

. I'm not sure how to nicely talk about this without using math symbols... but it's nice because it lets us swap out natural transformations with components corresponding to objects in

J

for natural transformations having components corresponding to objects in

C

John Baez (Dec 09 2024 at 01:40):

To me this functor is so bland and vanilla-like (I must have ice-cream cones on my mind!) that I can scarcely get excited about it. It's like saying "if you've got a function

f(x,y)

you can make a new function

g(y,x) = f(x,y)

."
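
The analogy can be run literally on ordinary functions. The following is a minimal Python sketch, not anything from the blog posts: `switcheroo` and the toy function `h` are hypothetical names, with `h(j)(c)` standing in for "the j-th presheaf of the diagram evaluated at c".

```python
from typing import Callable, TypeVar

A = TypeVar("A")
B = TypeVar("B")
C = TypeVar("C")


def switcheroo(h: Callable[[A], Callable[[B], C]]) -> Callable[[B], Callable[[A], C]]:
    """Turn a curried function a -> (b -> c) into b -> (a -> c)."""
    return lambda b: lambda a: h(a)(b)


# Hypothetical stand-in for a diagram of presheaves: first pick j, then c.
def h(j: int) -> Callable[[str], str]:
    return lambda c: f"D({j})({c})"


h_t = switcheroo(h)
print(h_t("c0")(1))              # same value as h(1)("c0")
print(switcheroo(h_t)(1)("c0"))  # switching twice gives back h
```

Applying the switch twice returns a function with the same values as the original, which is the function-level shadow of the switcheroo being an isomorphism.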

David Egolf (Dec 09 2024 at 01:41):

Interesting! This functor is still new and a bit scary to me. Hopefully it can become bland to me as my understanding of it develops a bit further.

David Egolf (Dec 09 2024 at 01:43):

I don't even really understand what this functor does to functors and natural transformations. I made some guesses (which seem to make things work out!), but it sounds difficult to validate these guesses.

John Baez (Dec 09 2024 at 01:44):

It helps to note that it exists for all cartesian closed categories, e.g.

\mathsf{Set}

and

\mathsf{Cat}

. In all of these the following are the same:

where by "the same" I mean we have natural isomorphisms between the sets of such maps.

John Baez (Dec 09 2024 at 01:46):

You wound up proving this in the special case of

\mathsf{Cat}

, but as we noted the proof would work just as well in any cartesian closed category, or more generally any symmetric monoidal closed category (where we write

\otimes

instead of

\times

).

John Baez (Dec 09 2024 at 01:47):

So, since unlike you I haven't tried to write up an answer to this puzzle, I probably let this stuff fall into the "unspoken background".

David Egolf (Dec 09 2024 at 01:49):

I can sketch a possible process for proving that

t

acts in a way according to my guesses above.

David Egolf (Dec 09 2024 at 01:50):

John Baez (Dec 09 2024 at 01:51):

I'm not sure I want to think about the inner workings of

t

at all. It would be like cutting into my mattress to see that yes, it's made of a bunch of soft white stuff.

John Baez (Dec 09 2024 at 01:52):

More interesting to me is that you needed to think about this "transposition" process to get the job done.

David Egolf (Dec 09 2024 at 01:53):

I would be happy to not have to think about

t

in great detail! (That sounds like a lot of tedious work). So maybe we can talk about why I found myself using it.

John Baez (Dec 09 2024 at 01:54):

I guess it's about changing views from "a

J

-shaped diagram of sets for each object of

C

, and maps between them" to "a presheaf on

C

taking values in

J

-shaped diagrams of sets".

John Baez (Dec 09 2024 at 01:54):

John Baez (Dec 09 2024 at 01:55):

But somehow this change of view is needed to show that computing the colimit pointwise computes the colimit of the whole presheaf.

David Egolf (Dec 09 2024 at 01:57):

...I feel like an archaeologist looking through my notes on this. Let me see if I can find the point where I start encountering this transposition stuff.

John Baez (Dec 09 2024 at 01:58):

I think this is worthwhile, if only to "compact" the idea of the proof down to something more manageable.

David Egolf (Dec 09 2024 at 02:07):

I need to rest up for today. But I'll try and think about this some more another day!

Peva Blanchard (Dec 09 2024 at 06:28):

This thread feels like reading one of Plato's dialogues :smile: I'm learning as much about topos theory as about teaching maths.

David Egolf (Dec 09 2024 at 18:30):

I want to think a bit about why I used

t:[J ,[ C^{\mathrm{op}}, \mathsf{Set}]] \cong [C^{\mathrm{op}}, [J, \mathsf{Set}]]

. Hopefully this can help to "conceptually compact" the argument above, and maybe I'll discover a way to avoid using

t

David Egolf (Dec 09 2024 at 18:32):

The first place where I used

t

was to come up with the candidate for the colimit presheaf. I applied

t

to our original diagram of presheaves, and got a presheaf on

C

valued in

J

-shaped diagrams of sets. Then I composed with the "take the colimit" functor to convert this to a presheaf valued in sets.

David Egolf (Dec 09 2024 at 18:41):

The next place where a sort of "transposition" occurs in the argument above, is in defining the universal cone under our diagram of presheaves. Each "leg" of this cocone is a natural transformation, and all the legs assemble to form a natural transformation. So we have "two layers" of natural transformations: we have a natural transformation (the cocone) where each component (a leg) is also a natural transformation.

David Egolf (Dec 09 2024 at 18:43):

So, in a way, we have a "matrix" of natural transformation components in this situation. The components are indexed by two pieces of data: which "leg" they are associated to, and then which object in

C

they are associated to.

David Egolf (Dec 09 2024 at 18:45):

To define the universal cocone, I ended up "transposing" a different matrix of natural transformation components. Each component in this other matrix is indexed first by some object in

C

, and then by some object in

J

. In this case, the data is organized as follows: we have a universal cocone under our diagram of presheaves evaluated at some object in

C

, and then each "leg" of that cocone corresponds to some object of

J

David Egolf (Dec 09 2024 at 18:47):

This other matrix is very natural to start out with, because we build it working directly from the fact that the category of sets has colimits. Each universal cone under our diagram evaluated at some object of

C

lives in the category of sets.

David Egolf (Dec 09 2024 at 18:49):

John Baez (Dec 10 2024 at 00:14):

David Egolf (Dec 10 2024 at 21:16):

Great! I'll now explain the next step in my thought process that led to the use of our transposition isomorphism

t

David Egolf (Dec 10 2024 at 21:17):

We next want to show that our proposed universal cocone really is initial in the category of cones under our original diagram.

David Egolf (Dec 10 2024 at 21:19):

My strategy to do this was to try and construct a morphism of cocones from our proposed universal cocone to an arbitrary cone under our diagram of presheaves. And in the course of constructing this morphism, I hoped to show along the way that this morphism must be unique.

David Egolf (Dec 10 2024 at 21:23):

David Egolf (Dec 10 2024 at 21:25):

John Baez (Dec 11 2024 at 00:39):

Yes. Did you think about how this stage of the proof may be another case of the whole "transposition" business? I'm not sure it is, but it feels like it might be. Let's see. In the first stage you were using transposition to turn

a functor from

J

to the category of [functors from

\mathsf{C}^{\text{op}}

to

\mathsf{Set}

]

into

a functor from

\mathsf{C}^{\text{op}}

to the category of [functors from

J

to

\mathsf{Set}

]

where the square brackets are just for reading clarity (I'm not sure they help).

John Baez (Dec 11 2024 at 00:42):

a colimit of a

J

-shaped diagram in the category of [functors from

\mathsf{C}^{\text{op}}

to

\mathsf{Set}

]

a functor sending objects of

\mathsf{C}^{\text{op}}

to [colimits of

J

-shaped diagrams in

\mathsf{Set}

]

John Baez (Dec 11 2024 at 00:52):

So it seems this "switcheroo" might pervade the whole argument! Let's see. A colimit is a cocone with a universal property. So let's talk about cocones first, and worry about the universal property later. That seems to be what you're doing above.

A cocone in a category

X

is a functor

\text{cocone}(J) \to X

where

J

is the diagram that's the 'base' of the cocone and

\mathrm{cocone}(J)

is the result of adding a new terminal object to

J
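
For a finite shape this construction is easy to spell out. Here is a minimal Python sketch, assuming the shape is given by a list of object names and a list of generating arrows; the function name `cocone_shape`, the apex label `"*"` and the span example are all hypothetical. Identities and composites are left implicit: any composite ending at the apex has to equal the single leg from its source, because the apex is terminal.

```python
def cocone_shape(objects, arrows, apex="*"):
    """Add a new terminal object `apex` to a finite shape.

    objects: list of object names
    arrows:  list of (name, source, target) generating arrows
    Returns the enlarged object list and arrow list; the new arrows
    are the 'legs', exactly one from each old object to the apex.
    """
    if apex in objects:
        raise ValueError("apex name is already used by an object")
    legs = [(f"leg_{j}", j, apex) for j in objects]
    return list(objects) + [apex], list(arrows) + legs


# Hypothetical example: the span shape j1 <- j0 -> j2 used for pushouts.
span_objects = ["j0", "j1", "j2"]
span_arrows = [("u", "j0", "j1"), ("v", "j0", "j2")]

objs, arrs = cocone_shape(span_objects, span_arrows)
print(objs)
print(arrs)
```

A functor out of the enlarged shape then restricts to a diagram on the original shape, and its values on the legs are the legs of a cocone under that diagram.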

I'm sorry to be so fancy - I don't really think like this, but I feel it could be helpful today. So, maybe we want to get

a functor from

\text{cocone}(J)

to [functors from

\mathsf{C}^{\text{op}}

to

\mathsf{Set}

]

and turn it into

a functor from

\mathsf{C}^{\text{op}}

to [functors from

\text{cocone}(J)

to

\mathsf{Set}

]

John Baez (Dec 11 2024 at 00:54):

These certainly are 'secretly the same', thanks to your switcheroo trick, which is completely general.

John Baez (Dec 11 2024 at 00:55):

What I'd like to do is carry out your whole argument by showing that at every stage, even the part where we are proving our desired cocone is universal, all we are doing is applying the switcheroo trick to various things. I'm not sure I can quickly do this... I'll stop here.

David Egolf (Dec 11 2024 at 01:07):

Wow, that is fancy! That's an interesting different perspective on the definition of a cocone.

John Baez (Dec 11 2024 at 01:13):

Fancy, but obvious when you hear it - I just looked it up! I want to get everything in the format so that we can keep applying the same fact:

[a,[b,c]] \cong [b,[a,c]]

where

a,b,c \in \mathsf{Cat}

. Notice this transposition isomorphism will work not just for functors (objects in things like

[b,c]

) but also natural transformations (morphisms in things like

[b,c]

). So I'm hoping it can do everything you want.

David Egolf (Dec 11 2024 at 01:15):

Heh, it seems that we have moved from trying to avoid the "switcheroo" trick to embracing it more completely!

It's very cool that we can now apply the same trick to cocones. However, it will take me some time to see how this helps us out.

David Egolf (Dec 11 2024 at 01:17):

It would be very nice if we only need to use the fact that we have some isomorphism doing this switcheroo. Then we could avoid thinking about all the details of how this isomorphism works.

John Baez (Dec 11 2024 at 01:28):

Right, something like that. I don't want to peek into the black box unless we absolutely have to. This isomorphism has a bunch of nice and highly believable properties, all arising from the universal properties of

\times

and

[-,-]

, which means we can just fiddle around and act like the isomorphism does whatever we want. Then later, if we get an argument that seems to work, we can fill in those details.

David Egolf (Dec 11 2024 at 01:33):

Ah, right. So we would probably use a bit more than the fact that this isomorphism is an isomorphism - we would also potentially use some special properties of that isomorphism. But that's still conceptually cleaner than completely spelling out exactly how it works.

John Baez (Dec 11 2024 at 01:34):

John Baez (Dec 11 2024 at 01:38):

I'm not sure I can pull off this trick tonight... I need to do some actual work for a while and then have dinner. But my goal would be to formulate the existence of the desired colimit in a slick way: for example, using the fact that a colimit is an initial cocone. (See [[cocone]] for morphisms of cocones.) Then I'd try to get the desired cocone by some switcheroo magic.

David Egolf (Dec 11 2024 at 01:41):

That sounds like an idea! I'll probably try to explore that line of thought over the next day or two, and see how it goes.

David Egolf (Dec 11 2024 at 23:43):

One thing does catch my attention... I was hoping to apply our switcheroo to get an isomorphism:

[\mathrm{cocone}(J),[C^{\mathrm{op}}, \mathsf{Set}]] \cong [C^{\mathrm{op}}, [\mathrm{cocone}(J), \mathsf{Set}]]

. However, the objects in the category on the left are not all cocones under

D

. They are cocones under

J

-shaped diagrams in

[C^{\mathrm{op}}, \mathsf{Set}]

- but only some of these cocones are under

D

David Egolf (Dec 11 2024 at 23:45):

As a result, I suspect that the colimit cocone under

D

may not necessarily be an initial object in

[\mathrm{cocone}(J),[C^{\mathrm{op}}, \mathsf{Set}]]

David Egolf (Dec 12 2024 at 00:25):

Ah! One thing I can try is to repeat my argument above but without "looking in the black box". For example, instead of spelling out

D^t

explicitly, I can just say it is

t(D)

, where

t

is our "switcheroo" isomorphism.

Presumably at some point I'll encounter a problem doing this, which might give some direction when searching for relevant special properties of our switcheroo isomorphism.

John Baez (Dec 12 2024 at 01:20):

Yes. I just feel that the big and beautiful triangular prism diagram must somehow be avoidable - I feel it's the result of excessive concreteness.

John Baez (Dec 12 2024 at 01:20):

But I'm not sure if I've spotted the right approach. At some point I may break down and see what Mac Lane and Moerdijk say about this result.

David Egolf (Dec 12 2024 at 01:28):

Currently I'm working on trying to define our universal cocone under

D

by using our switcheroos. My approach at the moment is to use this isomorphism:

[\mathrm{cocone}(J),[C^{\mathrm{op}}, \mathsf{Set}]] \cong [C^{\mathrm{op}}, [\mathrm{cocone}(J), \mathsf{Set}]]

. So to specify our proposed universal cocone under

D

, we can specify a functor

\gamma:C^{\mathrm{op}} \to [\mathrm{cocone}(J),\mathsf{Set}]

. (Then we can try setting our universal cone to be

s(\gamma)

.)

David Egolf (Dec 12 2024 at 01:28):

I'm hoping to set

\gamma(c):\mathrm{cocone}(J) \to \mathsf{Set}

to be the diagram

D

evaluated at

c

together with a colimit cocone - which is a diagram in

\mathsf{Set}

. I'm now defining "our diagram evaluated at

c

" as the functor

t(D)(c)

, where

t:[J ,[ C^{\mathrm{op}}, \mathsf{Set}]] \cong [C^{\mathrm{op}}, [J, \mathsf{Set}]]

. It's taking some work to see how to define

\gamma

on morphisms, and to check that we indeed get a natural transformation

\gamma(f)

for each

f

in

C^{\mathrm{op}}

John Baez (Dec 12 2024 at 01:36):

If it helps at all, note that you can equally well specify a functor

J \times C^{\text{op}} \to \mathsf{Set}

. This is yet another way to specify the same thing, and may be the easiest.

Peva Blanchard (Dec 12 2024 at 22:42):

First, it can be helpful to consider the following definition of a colimit.
A colimit for a functor

F : A \to B

is an object

l

in

B

s.t. we have a bijection

B(l, z) \cong [A,B](F, \Delta_z)

natural in

z

. I.e., the

\text{Set}

-valued functor

z \mapsto [A,B](F, \Delta_z)

is represented by

l

Now, reusing the notations from the previous messages, a colimit for the functor

D : J \to [C^{op}, \text{Set}]

is an object

X

in

[C^{op}, \text{Set}]

s.t.

\begin{align*} [C^{op}, \text{Set}](X, Z) &\cong [J, [C^{op}, \text{Set}]](D, \Delta_Z) \\ &\cong [C^{op}, [J, \text{Set}]](D^t, \Delta^t_Z) \\ \end{align*}

[C^{op}, [J, \text{Set}]](D^t, \Delta^t_Z) \cong [C^{op}, \text{Set}](\text{colimit}_j~D(j,\_), Z)

ps: notice that when

C = 1

is the terminal category this last equation really amounts to the definition of a colimit.

David Egolf (Dec 13 2024 at 19:44):

Thanks for your comment @Peva Blanchard ! However, I don't yet know anything about ends/coends - despite putting in some effort, they remain mysterious to me. But between this puzzle and "Day convolution" (which I was curious about recently), apparently there are multiple reasons it would be useful to learn about ends/coends!

David Egolf (Dec 13 2024 at 19:46):

I've been having an enjoyable time revisiting my argument above while trying to avoid the concrete details of any given switcheroo. I'm not yet stuck - I just need to put in some more effort to progress what I have so far.

John Baez (Dec 13 2024 at 19:54):

I've been having too much fun working on another math side-project (involving random permutations and groupoids) to put time into this issue. So I decided to peek at Mac Lane and Moerdijk to see how they proved that colimits in a presheaf category

\mathsf{Set}^{\mathsf C^{\text{op}}}

are computed pointwise. It looks like they don't prove it: they just say it's true.

Seriously: if you continue to make progress polishing your argument, I will continue trying to read it and help a little, though not as much as I should.

David Egolf (Dec 13 2024 at 20:57):

I think "Category Theory in Context" (p.92) and "Categories for the Working Mathematician" (p.115) both discuss things relevant to this puzzle. For now I'm happy trying to explore our current approach, but I may consult one or both of those at some point in the future.

John Baez (Dec 13 2024 at 21:05):

I couldn't find the relevant bit in Categories for the Win. But yes, it's exactly the section you pointed to: page 115, section V.3. And hey: the proof of the main theorem contains a diagram shaped like a triangular prism! That's the shape I was complaining about in your argument! Maybe no coincidence? Anyway, it suggests your approach was not clunky compared to Mac Lane's.

David Egolf (Dec 19 2024 at 17:20):

So, today I'm going to try writing out the start of the argument (for showing that we inherit colimits in a presheaf category from

\mathsf{Set}

), while aiming to not use the low-level details of the "switcheroo" isomorphism.

David Egolf (Dec 19 2024 at 17:23):

We start with a diagram

D:J \to [C^{\mathrm{op}},\mathsf{Set}]

, where

J

is a small category. Our aim is to propose a colimit for this diagram, and to show that it really is a colimit.

David Egolf (Dec 19 2024 at 17:28):

The tip of the colimit cocone should be a presheaf. We will use two ingredients to make this proposed colimit object:

David Egolf (Dec 19 2024 at 17:30):

We form our proposed colimit presheaf

\mathrm{colim~} D:C^{\mathrm{op}} \to \mathsf{Set}

as

\mathrm{colim~} D = \mathrm{colim}_J \circ t(D)

David Egolf (Dec 19 2024 at 17:33):

Next, we want to propose a colimit cocone for

D

. To do this using another switcheroo, we introduce the category

\mathrm{cocone}(J)

. We define

\mathrm{cocone}(J)

as follows:

David Egolf (Dec 19 2024 at 17:35):

A cocone under

D

corresponds to a functor

:\mathrm{cocone}(J) \to [C^{\mathrm{op}}, \mathsf{Set}]

such that this functor agrees with

D

on all the objects and morphisms of

J

Inspired by this, we introduce our second switcheroo isomorphism

s:[\mathrm{cocone}(J),[C^{\mathrm{op}}, \mathsf{Set}]] \cong [C^{\mathrm{op}}, [\mathrm{cocone}(J), \mathsf{Set}]]

. This is an isomorphism in

\mathsf{Cat}

David Egolf (Dec 19 2024 at 17:48):

I was going to start defining a functor

\gamma:C^{\mathrm{op}} \to [\mathrm{cocone}(J), \mathsf{Set}]

, with the idea that our proposed colimit cocone would be

s^{-1}(\gamma)

. However, I just realized this: I don't know how to show that

s^{-1}(\gamma)

agrees with

D

on the objects and morphisms of

J

. At least, I don't immediately see how to do that without using low-level details about the switcheroo

s

- and I'm trying to avoid using those.

David Egolf (Dec 19 2024 at 17:51):

Maybe I need to start by applying the switcheroo

s

to our diagram

D

in some way, and then use that to build up the switcheroo-ed version of the colimit cocone.

John Baez (Dec 19 2024 at 17:52):

If you have two functors that have the same domain and the same codomain, built from the same basic ingredients, and you feel they should be naturally isomorphic, I recommend that you assume they are naturally isomorphic and see if you can complete your argument based on that assumption.

John Baez (Dec 19 2024 at 17:53):

If you have good intuitions, you'll be right about that assumption, and then you can go back and work out the details.

David Egolf (Dec 19 2024 at 18:01):

Alright, so let's define a

\gamma:C^{\mathrm{op}} \to [\mathrm{cocone}(J), \mathsf{Set}]

, hoping for now that

s^{-1}(\gamma)

agrees with

D

on the objects and morphisms of

J

David Egolf (Dec 19 2024 at 18:02):

Each

\gamma(c)

is a

\mathrm{cocone}(J)

-shaped diagram in

\mathsf{Set}

. We define it as follows:
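
A sketch of what this definition presumably looks like, reconstructed from the later messages. Here 1 is the object added to J to form cocone(J), !_j is a made-up name for the unique morphism from j to 1, and \iota^c_j is a made-up name for the legs of a chosen colimit cocone under t(D)(c):

```latex
\begin{align*}
\gamma(c)(j) &= t(D)(c)(j) && \text{for each object } j \text{ of } J, \\
\gamma(c)(g) &= t(D)(c)(g) && \text{for each morphism } g \text{ of } J, \\
\gamma(c)(1) &= \mathrm{colim}_J\big(t(D)(c)\big), \\
\gamma(c)(!_j) &= \iota^c_j : t(D)(c)(j) \to \mathrm{colim}_J\big(t(D)(c)\big).
\end{align*}
```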

David Egolf (Dec 19 2024 at 18:04):

The next order of business is to define

\gamma(f)

for any morphism

f:c \to c'

in

C^{\mathrm{op}}

. This should be a natural transformation

:\gamma(c) \to \gamma(c')

, where

\gamma(c)

and

\gamma(c')

are both functors

:\mathrm{cocone}(J) \to \mathsf{Set}

. Hence,

\gamma(f)

has one component for each object in

\mathrm{cocone}(J)

David Egolf (Dec 19 2024 at 18:06):

To figure out what

\gamma(f)_j

should be, for some

j \in J

, let us consider a naturality square for the morphism

g:j \to j'

in

\mathrm{cocone}(J)

, where

g

is also in

J

David Egolf (Dec 19 2024 at 18:26):

David Egolf (Dec 19 2024 at 18:28):

By definition, we have

\gamma(c)(j) = t(D)(c)(j)

and similarly

\gamma(c)(g) = t(D)(c)(g)

. So we can rewrite our naturality square as:
naturality square 2
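
In case the picture does not display, the rewritten square can be stated as an equation. This is a sketch that assumes the square has \gamma(f)_j and \gamma(f)_{j'} as its vertical sides:

```latex
\gamma(f)_{j'} \circ t(D)(c)(g) \;=\; t(D)(c')(g) \circ \gamma(f)_j
\qquad \text{as functions } t(D)(c)(j) \to t(D)(c')(j').
```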

David Egolf (Dec 19 2024 at 18:32):

Since

t(D):C^{\mathrm{op}} \to [J, \mathsf{Set}]

we have a natural transformation

t(D)(f):t(D)(c) \to t(D)(c')

for any

f:c \to c'

in

C^{\mathrm{op}}

. Noting that

t(D)(c),t(D)(c'):J \to \mathsf{Set}

we can recognize the above square as a naturality square for a natural transformation

:t(D)(c) \to t(D)(c')

. So if we set

\gamma(f)_j = t(D)(f)_j

for each

j \in J

, the above naturality square will commute for any

g:j \to j'

in

J

David Egolf (Dec 19 2024 at 18:33):

It still remains to define

\gamma(f)_1

, where

1

is the terminal object we added to

J

to make

\mathrm{cocone}(J)

David Egolf (Dec 19 2024 at 18:45):

We have a colimit cocone under

t(D)(c)

with tip

\gamma(c)(1)

and a colimit cocone under

t(D)(c')

with tip

\gamma(c')(1)

. Composing this cocone under

t(D)(c')

with the natural transformation

t(D)(f)

we get another cocone under

t(D)(c)

. Then by the universal property of colimits there is a unique morphism

:\gamma(c)(1) \to \gamma(c')(1)

that makes the above diagram commute. We define

\gamma(f)_1:\gamma(c)(1) \to \gamma(c')(1)

to be that unique morphism.
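
Written as an equation, and using the made-up names \iota^c_j and \iota^{c'}_j for the legs of the two colimit cocones, the defining property of this morphism can be sketched as:

```latex
\begin{align*}
&\iota^c_j : t(D)(c)(j) \to \gamma(c)(1), \qquad
 \iota^{c'}_j : t(D)(c')(j) \to \gamma(c')(1), \\
&\gamma(f)_1 \circ \iota^c_j \;=\; \iota^{c'}_j \circ t(D)(f)_j
 \quad \text{for every object } j \text{ of } J.
\end{align*}
```

The right-hand sides form a cocone under t(D)(c) because t(D)(f) is natural: for g: j \to j' in J we get \iota^{c'}_{j'} \circ t(D)(f)_{j'} \circ t(D)(c)(g) = \iota^{c'}_{j'} \circ t(D)(c')(g) \circ t(D)(f)_j = \iota^{c'}_j \circ t(D)(f)_j.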

David Egolf (Dec 19 2024 at 18:46):

This completes the definition of

\gamma(f)

. However, it remains to show that

\gamma(f)

is in fact a natural transformation.

David Egolf (Dec 19 2024 at 18:48):

I'll pause here. I'm a bit worried that what I have above is already not so conceptually clear. And there's a long ways to go still, too...

I'm considering abandoning ship with regard to this proof approach, and just seeing how Riehl does this in "Category Theory in Context"... I'm hoping that the proof there will be nice and compact, as the proofs in that book often are.

David Egolf (Dec 19 2024 at 21:41):

Unfortunately, it appears this proof is left as an exercise in Riehl - namely Exercise 3.3.vi. (That exercise has a hint, but the hint is concerned with the part of the exercise I already know how to do!)

David Egolf (Dec 19 2024 at 21:48):

However, thankfully both Leinster and Awodey give proofs (without using fancy triangular prism diagrams!). I may next try to understand one or both of these proofs.

John Baez (Dec 19 2024 at 21:49):

You could also check out the proof in Categories for the Working Mathematician! It's proved on page 115, section V.3. The proof is short but I have not tried to understand it.

David Egolf (Dec 19 2024 at 21:50):

I'm slightly biased against that proof because on first glance it seems to involve a triangular prism diagram, which I was trying to avoid. But perhaps it could be good to study that proof anyways!

John Baez (Dec 19 2024 at 21:53):

Of course it's good to study. I have nothing against triangular prisms, just proofs that I don't understand! I bet if you read all three of Leinster, Awodey and Mac Lane's proofs, without letting yourself get bogged down in details, you can synthesize an intuitive understanding of what's going on.

(Often beginners try to understand every step of a proof before getting the idea of a proof, and I'm afraid you might try to do that. I think it's much faster to relax and just get the idea before worrying about details. In math your subconscious mind is much more powerful than your conscious reasoning.)

David Egolf (Dec 20 2024 at 17:32):

Alright, today I'm going to try and read Mac Lane's proof, with the aim of trying to understand the idea/approach without focusing (yet) too much on the details. I'll do my best to summarize the idea here.

John Baez (Dec 20 2024 at 17:56):

It has the advantage of being super-short, so you can read it 20 times... and if you're anything like me, you'll probably need to. :smirk:

Todd Trimble (Dec 20 2024 at 18:00):

I glanced yesterday at page 115, and it looked like a variation on the following theme: given a functor

F: C \to D

, we are assured of having an adjunction

F \dashv G

if only we know that

D(F-, d)

is representable for every object

d \in D

. I'd better look again to be sure.

Anyhow, this fact is well worth knowing. The basic point is that the definition of the functor

G

and the natural transformations for unit and counit, etc. -- they all naturally come out of the wash in working with universal properties. Triangular prisms could be a distraction from this point.
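
A sketch of how this works, in standard notation (the symbol \varepsilon_d for the universal element is my labeling, not taken from page 115):

```latex
\begin{align*}
&D(F c, d) \;\cong\; C(c, G d) \quad \text{naturally in } c
  \qquad \text{(a choice of representing object } G d \text{ for each } d\text{)}, \\
&\varepsilon_d \in D(F G d, d) \text{ is the element corresponding to } 1_{G d} \in C(G d, G d), \\
&\text{for } h: d \to d', \quad G h : G d \to G d' \text{ is the unique morphism with }
  \varepsilon_{d'} \circ F(G h) = h \circ \varepsilon_d .
\end{align*}
```

Uniqueness in the last line is what makes G preserve identities and composition, and it also makes \varepsilon natural; this is the sense in which everything "comes out of the wash".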

David Egolf (Dec 20 2024 at 18:27):

Thanks for your comment @Todd Trimble ! I like the idea that everything we need can emerge from universal properties. The theme you mention regarding an adjunction

F \dashv G

is unfamiliar to me! That does look like a useful fact to know. At some point it would probably be good for me to spend time trying to prove (or understand a proof of) what you just said.

David Egolf (Dec 20 2024 at 18:28):

I did a first read of Mac Lane's proof. The bulk of the proof appears to be occupied with setting up the proposed limit and proposed universal cone. (Which I believe I already understand how to do). Then Mac Lane concludes in one sentence that the proposed universal cone is actually universal!

David Egolf (Dec 20 2024 at 18:31):

That one sentence might be pretty useful though. The idea appears to be this: to construct our unique morphism from the apex of an arbitrary cone to the apex of our limit cone, we view each apex as a collection of objects. And the collection of objects associated to our limit cone are all limit objects, and so we try to use that to build up an induced morphism.

David Egolf (Dec 20 2024 at 18:37):

In our case, we have a proposed colimit object for a diagram

:J \to [C^{\mathrm{op}}, \mathsf{Set}]

. This amounts to a functor

:C^{\mathrm{op}} \to \mathsf{Set}

. So this colimit object has associated to it one set for each object in

C^{\mathrm{op}}

. I think the idea is that each of these sets will be a colimit, and that this will help us build a morphism from our colimit object to the tip of an arbitrary cocone under our diagram.

David Egolf (Dec 20 2024 at 19:14):

Alright, let me try to write my own proof, now - using ideas from Mac Lane's proof. I think I have some strategies that will work to help keep it shorter and avoid the switcheroo isomorphism.

David Egolf (Dec 20 2024 at 19:42):

Well, I got all the way through except I got stuck at the very end: I found all the components of our induced unique natural transformation from our colimit object to the tip of any arbitrary cocone under our diagram. However, to show that these components assemble to form a natural transformation, I suspect I'd need to pull out my big triangular prism diagram again.

David Egolf (Dec 20 2024 at 19:43):

As far as I can see, Mac Lane doesn't elaborate on this tricky (at least for me) point at all. He just asserts that these components assemble to form a natural transformation.

So, as far as I can tell, Mac Lane's proof doesn't contain the idea I need to avoid that part of my proof. And I would like to avoid it if I can, because the triangular prism stuff feels like a big calculation that just ends up working somehow - but I don't have a lot of intuition for it.

David Egolf (Dec 20 2024 at 19:53):

I'll next take a look at the proof in Leinster's "Basic Category Theory" (p. 150). (But I'll stop here for today!)

Peva Blanchard (Dec 21 2024 at 09:52):

I'm not sure I understand your intention.
Do you want to avoid the big prism calculation because you already understand it but are not satisfied with the style?
Or do you want to build intuition for why the big prism calculation works?

David Egolf (Dec 21 2024 at 17:39):

David Egolf (Dec 21 2024 at 17:41):

On first glance, Awodey's proof (on page 202 of "Category Theory") looks like it will help me with these goals. It uses the Yoneda lemma a couple times, and at least on first glance it looks quite different from the argument I was developing above.

David Egolf (Dec 21 2024 at 17:42):

But I will return to this later. For now, I'm signing off for about a week for the holidays! Wishing everyone an excellent week!

David Egolf (Jan 02 2025 at 19:16):

I'm next going to work through Leinster's proof of how a functor category inherits limits (on page 149 of "Basic Category Theory").

David Egolf (Jan 02 2025 at 20:47):

After spending some time on it today, I think I understand Leinster's proof all the way to the last step, which is the step I've been stuck on. This step involves showing that our proposed unique morphism to our proposed limit object is indeed a morphism. This morphism is supposed to be a natural transformation and we've defined it componentwise, so it remains to show these components assemble to form a natural transformation.

David Egolf (Jan 02 2025 at 20:48):

Thankfully Leinster elaborates on this point, indicating that it follows from Lemma 6.1.3(b) on page 145. So, next time I will work to understand that lemma!

John Baez (Jan 02 2025 at 21:08):

If you get really desperate you can invoke the man himself with a suitable @ sign here. :wink:

David Egolf (Jan 05 2025 at 19:00):

Lemma 6.1.3 is interesting. Part (a) tells us that a morphism between diagrams yields a morphism between the limits of those diagrams (assuming the needed limits exist). So a natural transformation

\alpha:D \to D'

for diagrams

D,D':I \to A

induces a morphism

\lim \alpha:\lim D \to \lim D'
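
A sketch of how the induced morphism is characterized, writing p_i and p'_i for the legs of chosen limit cones on D and D' (these names are my own labels):

```latex
\begin{align*}
&p_i : \lim D \to D(i), \qquad p'_i : \lim D' \to D'(i), \\
&\lim \alpha \text{ is the unique morphism with }
  p'_i \circ \lim \alpha \;=\; \alpha_i \circ p_i \quad \text{for all } i \in I.
\end{align*}
```

The family (\alpha_i \circ p_i)_i is a cone on D' because, for u: i \to i' in I, naturality of \alpha gives D'(u) \circ \alpha_i \circ p_i = \alpha_{i'} \circ D(u) \circ p_i = \alpha_{i'} \circ p_{i'}.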

David Egolf (Jan 05 2025 at 19:02):

As a nice bonus, Leinster indicates that in this way we get a functor

:[I,A] \to A

(provided all the

I

-shaped diagrams have limits in

A

). And in fact this functor is right adjoint to the diagonal functor! Since right adjoints preserve limits (and I seem to recall they preserve monomorphisms as well), this sounds useful to know.

David Egolf (Jan 05 2025 at 19:04):

The idea behind Lemma 6.1.3(b) is less clear to me. It seems to start out by introducing a generalized notion of morphism between cones. It's a generalized notion because the cones involved aren't required to be over the same diagram, only over diagrams of the same shape.

David Egolf (Jan 05 2025 at 19:05):

Here,

s:a \to a'

is a morphism in

A

such that this diagram commutes for all

i

. For each

i

,

f_i:a \to D(i)

is the

i

-th leg of a cone over

D

. And similarly

f_i':a' \to D'(i)

is the

i

-th leg of a cone over

D'

. Finally,

\alpha_i:D(i) \to D'(i)

is the

i

-th component of a natural transformation

\alpha:D \to D'

, where

D,D':I \to A

are

I

-shaped diagrams in

A
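
In case the diagram does not display, the condition on s can presumably be written as the following equation. This is a sketch that assumes the diagram is the evident square with sides s, f_i, f'_i and \alpha_i:

```latex
\begin{align*}
&\alpha_i \circ f_i \;=\; f'_i \circ s \quad \text{for all } i \in I, \\
&\text{equivalently } \alpha \circ f = f' \circ \Delta_s \text{ in } [I, A],
  \text{ where } f: \Delta_a \to D \text{ and } f': \Delta_{a'} \to D'.
\end{align*}
```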

David Egolf (Jan 05 2025 at 19:08):

If

D = D'

and

\alpha = 1_D

is the identity natural transformation, then I think that

s

is just a morphism of cones over the same diagram.

I think this notion of morphism between cones over different diagrams can be obtained by considering the arrow category of

[I,A]

, and viewing an object

a \in A

as a constant diagram

\Delta_a:I \to A

David Egolf (Jan 05 2025 at 19:13):

Before pressing on, I'll pose this question: Can we think of some concrete examples of this generalized notion of morphism between cones?

David Egolf (Jan 06 2025 at 18:20):

I'm hoping to consider my question above a bit more, to try and get a better understanding of what Lemma 6.1.3b is about. I guess we need a situation where we have two functors

:I \to A

and some natural transformation between them. Since we've been learning about presheaves, perhaps we can consider two presheaves

D,D':\mathcal{O}^{\mathrm{op}}(X) \to \mathsf{Set}

, where

\mathcal{O}^{\mathrm{op}}(X)

is the poset of open subsets of a topological space

X

ordered under reverse inclusion.

David Egolf (Jan 06 2025 at 18:24):

Then a natural transformation

\alpha:D \to D'

intuitively has as components functions

\alpha_U:D(U) \to D'(U)

for each open set

U

of

X

. So we can "locally compute" some of the data attached to

U

by

D'

in terms of the data attached to

U

by

D

David Egolf (Jan 06 2025 at 18:26):

A cone

f:\Delta_a \to D

then has a bunch of legs of the form

f_U:a \to D(U)

, as

U

varies over the open sets of

X

. We might think of

a

as something that is being "locally observed" by each

f_U

. (

f_U(a)

is thought of as the observation of

a

"at"

U

.)

David Egolf (Jan 06 2025 at 18:28):

Finally a function

s:a \to a'

can be thought of as computing some information in

a'

from

a

. We might intuitively think of this as an "observation" of

a

in terms of

a'

David Egolf (Jan 06 2025 at 18:33):

Following the intuition-based discussion with presheaves above, we now might view an object

f:\Delta_a \to D

as a collection of compatible observations of

a

. Then a morphism

(s,\alpha):f \to f'

consists of two things: (1) a direct observation of

a

in terms of the data of

a'

, and (2) a sort of processing of local observations of

a

that yields partial local observations of

a'

. And we require the direct observation to be compatible with this processing of local observations.

David Egolf (Jan 06 2025 at 18:37):

That makes me feel a bit more comfortable with this kind of generalized notion of morphism between cones. Next time, I'll try to work through the proof of Lemma 6.1.3b!

David Egolf (Jan 09 2025 at 23:28):

I think I now understand all the proof of Lemma 6.1.3b, except for the last step, which invokes Exercise 5.1.36(a). Eventually I'll get to the bottom of this!

David Egolf (Jan 09 2025 at 23:29):

Exercise 5.1.36(a) is this, with some changing of notation:
Let

D:I \to \mathcal{A}

be a diagram and

p_i:L \to D(i)

(as

i

ranges over

I

) a limit cone on

D

. Prove that whenever

h,h':A \to L

are maps such that

p_i \circ h = p_i \circ h'

for all

i \in I

, then

h = h'

David Egolf (Jan 09 2025 at 23:31):

Intuitively, we are asked to show this: agreement upon post-composition by all the "legs" of a limit cone implies agreement between the original morphisms.

David Egolf (Jan 09 2025 at 23:33):

I will aim to work this out next time. And then hopefully I can put the pieces together to finally see how presheaf categories inherit limits/colimits!

Peva Blanchard (Jan 09 2025 at 23:53):

Maybe I'm missing something but isn't the uniqueness (and existence) part of the definition of a limit cone?

David Egolf (Jan 12 2025 at 00:51):

I would guess that the solution is probably something like that, @Peva Blanchard. However, I don't quite see how it works yet...

David Egolf (Jan 12 2025 at 00:54):

I think we have a bijection between morphisms from

A

to

L

and cones over

D

with apex

A

. Let's call this bijection

f:\mathcal{A}(A, L) \to \mathrm{Cone}(A,D)

. The way

f

works is this: it takes in a morphism

:A \to L

and it post-composes it with all the legs of the limit cone.

From the given information, we have

f(h) = f(h')

. But since

f

is a bijection it is in particular injective and indeed

h=h'
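
The same argument can be written directly in terms of the universal property. A sketch, where the name q_i for the legs of the auxiliary cone is my own:

```latex
\begin{align*}
&\text{Let } q_i := p_i \circ h = p_i \circ h' : A \to D(i) \quad (i \in I). \\
&(q_i)_{i \in I} \text{ is a cone on } D \text{ with apex } A,
  \text{ since } D(u) \circ q_i = D(u) \circ p_i \circ h = p_{i'} \circ h = q_{i'}
  \text{ for } u: i \to i'. \\
&\text{By the universal property of the limit cone, there is exactly one }
  k: A \to L \text{ with } p_i \circ k = q_i \text{ for all } i. \\
&\text{Both } h \text{ and } h' \text{ satisfy this, so } h = k = h'.
\end{align*}
```

Existence of k is surjectivity of f and uniqueness of k is injectivity of f, so the bijection is just a repackaging of the definition of a limit cone.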

David Egolf (Jan 12 2025 at 00:58):

I think that makes sense, so I believe I now understand the proof of Lemma 6.1.3b.

The next order of business is to see how Lemma 6.1.3b allows us to finish off the proof that limits in a functor category are computed pointwise.

David Egolf (Jan 12 2025 at 01:39):

After some work, I think I see how Lemma 6.1.3b is supposed to help us. There's a lot of tedious details that would need checking though, and I'm not sure I would learn much from checking them.

Overall, this approach doesn't feel very conceptual to me. It feels more like a long calculation where you do the "obvious" thing on each step, and then with sufficient cleverness you can justify that everything works as it should. But it's unclear to me why everything works out.

David Egolf (Jan 12 2025 at 01:40):

Consequently, next time I work on this, I'll plan to work on understanding Awodey's proof that functor categories inherit limits pointwise, which takes a different (hopefully more conceptual) approach.

Peva Blanchard (Jan 12 2025 at 17:15):

I don't know if it will be a more conceptual approach, but I found an interesting angle.
I will assume the following characterization of colimits.

Let

X, D

be categories. I write

[D,X]

for the category of functors

D \to X

, namely the

D

-shaped diagrams in

X

. Let

\Delta : X \to [D, X]

be the diagonal functor: it maps every object

x

to the constant functor

d \mapsto x

Then

X

has all colimits over

D

if and only if

\Delta

has a left adjoint

\text{colim} : [D, X] \to X

\text{colim} \dashv \Delta

So, we want to show that

X = [C^{op}, \text{Set}]

has all colimits over

D

. That is, we want to find an adjunction

\begin{gather*} \text{colim} \dashv \Delta \\ \text{colim} : [D, X] \to X \\ \Delta : X \to [D, X] \\ \end{gather*}

where

\Delta

is the diagonal functor. Substituting for

X = [C^{op}, \text{Set}]

and implicitly using the switcheroo,
this is equivalent to finding an adjunction

\begin{gather*} \text{colim} \dashv \Delta \\ \text{colim} : [C^{op}, [D, \text{Set}]] \to [C^{op}, \text{Set}] \\ \Delta : [C^{op}, \text{Set}] \to [C^{op}, [D, \text{Set}]] \\ \end{gather*}

We know that

\text{Set}

has all small colimits over

D

. Hence, we have an adjunction

\begin{gather*} \kappa \dashv \delta \\ \kappa : [D, \text{Set}] \to \text{Set} \\ \delta : \text{Set} \to [D, \text{Set}] \\ \end{gather*}

Now, if we apply the functor

[C^{op}, \_] : \text{Cat} \to \text{Cat}

to each part of the adjunction

\kappa \dashv \delta

, we obtain a pair of candidates

\begin{align*} \text{colim} = [C^{op}, \kappa] \\ \Delta = [C^{op}, \delta] \\ \end{align*}

It seems to me that

[C^{op}, \delta]

does in fact yield the diagonal functor

\Delta

Hence, our goal would follow from the fact that the endofunctor

[C^{op}, \_]

preserves adjunctions.
Unfortunately, I don't know a one-line argument for that. But I quickly tried to check it, using the hom-sets based definition of adjunctions, and it seems to hold true.
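
A sketch of what such a hom-set check might look like. The names P, Q, \theta and \hat\theta are introduced here only for illustration, and the details of the naturality check are not carried out:

```latex
\begin{align*}
&\text{For } P: C^{op} \to [D, \text{Set}] \text{ and } Q: C^{op} \to \text{Set},
  \text{ we want a bijection, natural in } P \text{ and } Q: \\
&\qquad [C^{op}, \text{Set}](\kappa \circ P,\; Q) \;\cong\; [C^{op}, [D, \text{Set}]](P,\; \delta \circ Q). \\
&\text{Componentwise, } \kappa \dashv \delta \text{ gives, for each object } c \text{ of } C^{op}: \\
&\qquad \theta_c : \kappa(P c) \to Q c
  \quad \longleftrightarrow \quad
  \hat\theta_c : P c \to \delta(Q c). \\
&\text{The remaining point is that } (\theta_c)_c \text{ is natural in } c
  \text{ iff } (\hat\theta_c)_c \text{ is,} \\
&\text{which should follow from naturality of the bijection for } \kappa \dashv \delta
  \text{ in both variables.}
\end{align*}
```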

David Egolf (Jan 12 2025 at 18:49):

There's an approach to showing that functor categories inherit limits pointwise in "Higher Dimensional Categories" by Grandis (on page 55). And the proof given there also approaches the problem in terms of adjunctions! So maybe this would be a good direction to explore next. I'll plan to carefully read what you wrote!

David Egolf (Jan 12 2025 at 19:54):

I took a look at what you wrote. I like the approach of trying to show that we have a left adjoint to the diagonal functor

\Delta:[C^{\mathrm{op}}, \mathsf{Set}] \to [D, [C^{\mathrm{op}}, \mathsf{Set}]]

. It looks like you are trying to build up a candidate left adjoint by starting with a related adjunction involving

\mathsf{Set}

. So it seems relevant to think about how we can build new adjunctions from old ones.

David Egolf (Jan 12 2025 at 20:00):

Grandis references a fact in this direction during his proof, which I paraphrase here:

Let

(\eta, \varepsilon):F \dashv G

be an adjunction with unit

\eta

and counit

\varepsilon

. Let

S

be a small category. Then a functor

F:C \to D

has a canonical "extension" to functor categories on

S

, so we get a

F^S:C^S \to D^S

. We can do a similar thing for a natural transformation

\varphi: F \to G

, getting

\varphi^S:F^S \to G^S

. Then, our adjunction has a canonical extension to an adjunction

(\eta^S, \varepsilon^S):F^S \dashv G^S

David Egolf (Jan 12 2025 at 20:06):

Let's assume this is true for the moment. Since

\mathsf{Set}

has colimits, the diagonal functor

\delta:\mathsf{Set} \to [D, \mathsf{Set}]

is right adjoint to the colimit-taking functor

\kappa:[D, \mathsf{Set}] \to \mathsf{Set}

. So we have

\kappa \dashv \delta
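
A sketch of how the fact quoted from Grandis would then apply, taking S to be C^op and reading Grandis's notation X^S as the functor category [S, X] (that reading is my assumption):

```latex
\begin{align*}
&\kappa^{C^{\mathrm{op}}} \dashv \delta^{C^{\mathrm{op}}}, \\
&\kappa^{C^{\mathrm{op}}} : [C^{\mathrm{op}}, [D, \mathsf{Set}]] \to [C^{\mathrm{op}}, \mathsf{Set}],
  \qquad P \mapsto \kappa \circ P, \\
&\delta^{C^{\mathrm{op}}} : [C^{\mathrm{op}}, \mathsf{Set}] \to [C^{\mathrm{op}}, [D, \mathsf{Set}]],
  \qquad Q \mapsto \delta \circ Q.
\end{align*}
```

So (\kappa \circ P)(c) is the colimit in Set of the D-shaped diagram P(c), which is the pointwise formula for colimits of presheaves, up to identifying [C^op, [D, Set]] with [D, [C^op, Set]] via the switcheroo.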

David Egolf (Jan 12 2025 at 20:16):

David Egolf (Jan 12 2025 at 20:39):

I am optimistic that this approach will provide a nice conceptual proof! And it's very interesting to learn that we can "exponentiate" an adjunction by a small category to get a new adjunction.

David Egolf (Jan 12 2025 at 20:44):

To show that we really can exponentiate an adjunction and get a new one, Grandis suggests making use of the characterization of an adjunction in terms of the triangular equations.

Peva Blanchard (Jan 12 2025 at 22:17):

Oh nice! Indeed, now I recall that the unit-counit based definition of adjunctions allows us to define them in "2-category theory" (where we don't really have hom-sets, if I understood correctly).

David Egolf (Jan 13 2025 at 20:21):

I want to understand how/why we can "exponentiate" an adjunction. As a first step, given a functor

F:C \to D

and a small category

S

, I want to show how to get a functor

F^S:C^S \to D^S

. (I'm following Grandis's "Higher Dimensional Categories", page 52).

David Egolf (Jan 13 2025 at 20:23):

David Egolf (Jan 13 2025 at 20:25):

where juxtaposition denotes whiskering along

F

. I believe that these conditions hold.
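
A sketch of the definition and the two functoriality conditions being referred to, reconstructed from how F^S is used in the later messages (so the exact form is an assumption):

```latex
\begin{align*}
&F^S(X) = F \circ X && \text{for each object } X: S \to C \text{ of } C^S, \\
&F^S(\alpha) = F\alpha, \quad (F\alpha)_s = F(\alpha_s)
  && \text{for each morphism } \alpha: X \to X' \text{ of } C^S, \\
&F 1_X = 1_{F \circ X}
  && \text{since } F(1_{X(s)}) = 1_{F(X(s))}, \\
&F(\beta \circ \alpha) = (F\beta) \circ (F\alpha)
  && \text{since } F(\beta_s \circ \alpha_s) = F(\beta_s) \circ F(\alpha_s).
\end{align*}
```

Both conditions reduce, component by component, to F itself preserving identities and composition.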

John Baez (Jan 14 2025 at 18:23):

David Egolf (Jan 14 2025 at 19:30):

Thanks for sharing that link! I do promise I'll come back to sheaves eventually :sweat_smile:. This concept - how functor/presheaf categories inherit limits/colimits - just seems really important, and so I want to give it proper attention.

John Baez (Jan 14 2025 at 21:48):

It is important. It just happens that this particular aspect of category theory - which I might unfairly describe as "fiddling around with limits and colimits" - is something I've never spent much time on. So I prefer to watch you talk about it with other people, and I'll wait patiently until you start talking about something I like better.

(I've never really felt like a true category theorist, and one reason is that I tend to dislike calculations involving limits or colimits, so I'm not good at them. Luckily there are plenty of other people here who are experts on these matters, and some of them are willing to help me now and then!)

David Egolf (Jan 15 2025 at 18:24):

I next want to see how exponentiating functors preserves composition. Let us imagine we have functors

F:C \to D

and

G:D \to E

, and a small category

S

. Then is

(G \circ F)^S = G^S \circ F^S:C^S \to E^S

?

David Egolf (Jan 15 2025 at 18:27):

David Egolf (Jan 15 2025 at 18:29):

For objects

X,X':S \to C

, a morphism

\alpha:X \to X'

is acted on as follows:

These two things yield the same answer: (1) whiskering along one functor and then along another; (2) whiskering along the composition of the two functors at once. So the two functors act in the same way on morphisms.
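
Spelled out, the comparison on objects and on morphisms can be sketched as follows, assuming F^S acts by post-composition on objects and by whiskering on morphisms:

```latex
\begin{align*}
(G \circ F)^S(X) &= (G \circ F) \circ X = G \circ (F \circ X) = G^S(F^S(X)), \\
(G \circ F)^S(\alpha) &= (G \circ F)\alpha = G(F\alpha) = G^S(F^S(\alpha)), \\
\big((G \circ F)\alpha\big)_s &= (G \circ F)(\alpha_s) = G(F(\alpha_s)) = \big(G(F\alpha)\big)_s
  \quad \text{for each object } s \text{ of } S.
\end{align*}
```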

David Egolf (Jan 15 2025 at 18:31):

We conclude that

(G \circ F)^S = G^S \circ F^S

, so that exponentiating functors preserves composition.

David Egolf (Jan 15 2025 at 18:33):

The next steps are: (1) figure out how to exponentiate natural transformations and (2) show that this process preserves composition of natural transformations.

David Egolf (Jan 15 2025 at 19:27):

Given functors

F,G:C \to D

and a natural transformation

\phi:F \to G

, we want to find some natural transformation

\phi^S:F^S \to G^S

. Since both

F^S

and

G^S

map from

C^S

,

\phi^S

has one component for each

X:S \to C

. The component for

X

should be a morphism

:F^S(X) \to G^S(X)

, which is thus a morphism

(F \circ X) \to (G \circ X)

. That is,

(\phi^S)_X

should be a natural transformation from

F \circ X

to

G \circ X

David Egolf (Jan 15 2025 at 19:27):

We can get such a natural transformation by whiskering

\phi

after

X

. So we propose

(\phi^S)_X = \phi X

David Egolf (Jan 15 2025 at 19:30):

It remains to show that

\phi^S

really is a natural transformation. Letting

\alpha:X \to X'

be a morphism in

C^S

, we get a corresponding naturality square:
naturality square

David Egolf (Jan 15 2025 at 19:31):

I believe this diagram commutes because of the interchange law that whiskering obeys. So,

\phi^S

really is a natural transformation!
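
As a check, the square can be verified one component at a time. A sketch, assuming the square has F\alpha and G\alpha as one pair of opposite sides and \phi X, \phi X' as the other:

```latex
\begin{align*}
&(\phi X') \circ (F\alpha) \;=\; (G\alpha) \circ (\phi X)
  && \text{(the naturality square for } \phi^S \text{ at } \alpha\text{)}, \\
&\phi_{X'(s)} \circ F(\alpha_s) \;=\; G(\alpha_s) \circ \phi_{X(s)}
  && \text{(its component at an object } s \text{ of } S\text{)}.
\end{align*}
```

The second line is exactly the naturality square of \phi at the morphism \alpha_s: X(s) \to X'(s) of C, so it holds for every s.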

David Egolf (Jan 15 2025 at 19:33):

The next order of business is to show that "exponentiating" natural transformations in this way preserves composition.

David Egolf (Jan 15 2025 at 19:40):

So let us have functors

F,G,H:C \to D

and natural transformations

\phi:F \to G

and

\psi:G \to H

. And again let

S

be a small category. We want to show that

(\psi \circ \phi)^S = \psi^S \circ \phi^S

David Egolf (Jan 15 2025 at 19:42):

This equation holds iff it holds at each component. Each side of this equation is a natural transformation

:F^S \to H^S

. Since

F^S, H^S:C^S \to D^S

, the natural transformations here each have a component for each

X:S \to C

David Egolf (Jan 15 2025 at 19:43):

So our desired equation holds if

(\psi \circ \phi)^S_X = \psi^S_X \circ \phi^S_X

for any

X

. Rewriting each side using our definition of the exponentiation of a natural transformation, this becomes

(\psi \circ \phi)X = (\psi X) \circ (\phi X)

. And this is true by the distributivity property of whiskering!
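
For completeness, here is a sketch of that distributivity property checked at each object s of S, using only the definitions of whiskering and of vertical composition:

```latex
\begin{align*}
\big((\psi \circ \phi)X\big)_s
  &= (\psi \circ \phi)_{X(s)} \\
  &= \psi_{X(s)} \circ \phi_{X(s)} \\
  &= (\psi X)_s \circ (\phi X)_s \\
  &= \big((\psi X) \circ (\phi X)\big)_s .
\end{align*}
```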

David Egolf (Jan 15 2025 at 19:44):

So we conclude that "exponentiating" natural transformations in this way preserves composition of natural transformations.

David Egolf (Jan 15 2025 at 19:45):

Next time, I hope to see how this helps us "exponentiate" an adjunction to get a new adjunction!

David Egolf (Jan 17 2025 at 17:24):

To exponentiate an adjunction, we will first need to remember that an adjunction can be defined in terms of certain equations of certain natural transformations. The idea is to then use the fact that exponentiating natural transformations preserves composition, so that these equations will still hold after we exponentiate everything.

David Egolf (Jan 17 2025 at 17:29):

Given functors

F:C \to D

and

G:D \to C

, we have that

F

is left adjoint to

G

exactly if:
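
The conditions here are presumably the standard unit-counit ones. A sketch in that formulation, where juxtaposition denotes whiskering:

```latex
\begin{align*}
&\text{there are natural transformations }
  \eta: 1_C \to G \circ F \text{ (the unit) and }
  \varepsilon: F \circ G \to 1_D \text{ (the counit)} \\
&\text{satisfying the triangle equations:} \\
&\qquad (\varepsilon F) \circ (F \eta) = 1_F, \qquad
  (G \varepsilon) \circ (\eta G) = 1_G .
\end{align*}
```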

David Egolf (Jan 17 2025 at 17:33):

Letting

S

be a small category, we would like to show that

F^S

is left adjoint to

G^S

, with unit

\eta^S

and counit

\varepsilon^S

David Egolf (Jan 17 2025 at 17:44):

If we are to have any hope of getting an adjunction, we need

\eta^S

to go from

1_{C^S}

to

G^S \circ F^S

. Thankfully, we do have this as

(G \circ F)^S = G^S \circ F^S

and

(1_C)^S=1_{C^S}

. (The fact

(1_C)^S=1_{C^S}

I believe follows from the fact that whiskering a natural transformation along an identity functor just gives you back that same natural transformation).

David Egolf (Jan 17 2025 at 17:45):

David Egolf (Jan 17 2025 at 17:52):

David Egolf (Jan 17 2025 at 17:53):

Exponentiating these last two equations, and using the fact that exponentiating preserves composition of natural transformations, we get:

David Egolf (Jan 17 2025 at 17:55):

So if we can show that exponentiating behaves nicely enough, then we'll be in business! Specifically, we want to show that exponentiating sends identity natural transformations to identity natural transformations, and that we can "distribute" exponentiation to the natural transformation and functor involved in whiskering.
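
A sketch of the plan in symbols, assuming the two equations being exponentiated are the triangle equations (\varepsilon F) \circ (F\eta) = 1_F and (G\varepsilon) \circ (\eta G) = 1_G:

```latex
\begin{align*}
&\text{After exponentiating:} \quad
  (\varepsilon F)^S \circ (F\eta)^S = (1_F)^S, \qquad
  (G\varepsilon)^S \circ (\eta G)^S = (1_G)^S. \\
&\text{What we want:} \quad
  (\varepsilon^S F^S) \circ (F^S \eta^S) = 1_{F^S}, \qquad
  (G^S \varepsilon^S) \circ (\eta^S G^S) = 1_{G^S}. \\
&\text{So it suffices to check, for example at each } X: S \to C: \\
&\qquad \big((1_F)^S\big)_X = 1_F X = 1_{F \circ X} = (1_{F^S})_X, \\
&\qquad \big((\varepsilon F)^S\big)_X = (\varepsilon F)X = \varepsilon (F \circ X)
   = (\varepsilon^S)_{F^S(X)} = (\varepsilon^S F^S)_X, \\
&\qquad \big((F\eta)^S\big)_X = (F\eta)X = F(\eta X)
   = F^S\big((\eta^S)_X\big) = (F^S \eta^S)_X .
\end{align*}
```

The checks for G\varepsilon and \eta G are the same with the roles of the functors swapped and X replaced by a functor Y: S \to D.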

John Baez (Jan 17 2025 at 17:57):

All this looks exactly right. But there's something a bit tiring about it. Let me see if I can compress the argument a bit using higher technology.

There's a 2-category

\mathbf{Adj}

called the 'walking adjunction' such that any 2-functor

F: \mathbf{Adj} \to \mathbf{Cat}

is an adjunction, and vice versa.

So, given any adjunction

F: \mathbf{Adj} \to \mathbf{Cat}

we can compose it with

-^{\mathsf{S}} : \mathbf{Cat} \to \mathbf{Cat}

and get a new adjunction.

Of course, I've moved the work over to 1) learning about 2-categories, 2) checking that there is a 2-category

\mathbf{Adj}

with the property described, 3) checking that

-^{\mathsf{S}}

is a strict 2-functor.

It looks like you're about to do part 3) - though perhaps not using the language of 2-categories. It looks like you're going to check that

-^{\mathsf{S}}

preserves composition of functors, vertical composition of natural transformations, and whiskering - or horizontal composition of natural transformations.

John Baez (Jan 17 2025 at 18:06):

Category theorists love to talk about adjoint functors, but 2-category
theorists know that these are just a special example of an "adjunction".
An adjunction is something that makes sense in any 2-category; if we
take the 2-category to be Cat we get adjoint functors.  There are lots
of other nice examples that make this generalization worthwhile.  For
example, in "week83" I explained how a pair of dual vector spaces is
also an example of an adjunction.

To study adjunctions, it suffices to study the "walking adjunction".
This is a little 2-category containing exactly the stuff any adjunction
in any 2-category must have: not a jot more, not a tiddle less!  It was
first studied by Schanuel and Street:

3) Stephen Schanuel and Ross Street, The free adjunction,
Cah. Top. Geom.  Diff. 27 (1986), 81-83.

In a bit more detail, the walking adjunction is the 2-category freely
generated by two objects:

a and b,

two morphisms:

L: a -> b  and  R: b -> a,

and two 2-morphisms, called the "unit" and "counit":

i: 1_a => LR  and  e: RL => 1_b

satisfying two relations, called the "triangle equations".

I wrote down these equations already last week, but let me do it again
using "string diagrams", as explained in "week79" and "week92".  In a
2-categorical string diagram, objects are denoted by 2d regions in the
plane, morphisms are denoted by 1d edges, and 2-morphisms are denoted by
0d points.  If the dimensions look sort of upside-down, you're right -
that's exactly the point!

Instead of explaining the whole theory, I'll just plunge in with
the example at hand.  The unit i looks like this:

                     i
                    / \
                   L   R
                  /     \
              a  /   b   \  a

while the counit e looks like this:

              b  \   a   /  b
                  R     L
                   \   /
                    \ /
                     e

Note that as you cross a line labelled "L" from left to right, you go
from region a to region b, which is our way of saying that L: a -> b.
Similarly, as you cross a line labelled "R" from left to right, you go
from region b to region a, since R: b -> a.

In terms of string diagrams, the triangle equations just say that we can
straighten out a zig-zag:

                     |                     |
           i         |                     |
          / \        L                     |
    a    /   \       |                     |
        /     \      |                     |
       |       R     /         =      a    L    b
       |        \   /                      |
       L         \ /    b                  |
       |          e                        |
       |                                   |

or a zag-zig:

         |                                  |
         |          i                       |
         R         / \                      |
         |        /   \   a                 |
         |       /     \                    |
          \     L       |       =      b    R    a
           \   /        |                   |
       b    \ /         R                   |
             e          |                   |
                        |                   |

We can build any 2-morphism in the walking adjunction by vertically
and horizontally composing units and counits, which corresponds to
sticking together string diagrams in a vertical or horizontal way.
Thus, a typical 2-morphism looks like this:

      \     \   a   /   \   a   /      /               |
       \     R     L     R     L      /       i        |
        \     \   /       \   /      /       / \       L
         \     \ /         \ /      /   a   /   R      |    b
          \     e           e      /       /     \     |
    a      L                      R        \      \   /
            \         b          /     i    \      \ /
             \                  /     / \    L      e
              \                /     L   R    \
               \              /     /  b  \    \

By the triangle equations, we could straighten out the zig-zag without
changing the 2-morphism.

As you may know, the word "anaranjado" means "orange" in Spanish - there
was no word in English for "orange" before people in England started
importing oranges from Spain.  And this is a nice mnemonic, because if
we take the above picture and paint the regions labelled "a" orange, and
paint the regions labelled "b" black, the above picture has a roughly
tiger-striped appearance.  In fact, these tiger stripes tell you
everything you need to know about the 2-morphism!  For example, starting
from just this:

      \     \   a   /   \   a   /      /               |
       \     \     /     \     /      /       _        |
        \     \   /       \   /      /       / \       |
         \     \_/         \_/      /   a   /   \      |    b
          \                        /       /     \     |
    a      \                      /        \      \   /
            \         b          /     _    \      \_/
             \                  /     / \    \
              \                /     /   \    \
               \              /     /  b  \    \

you can figure out where everything else should go.

By the way, note that orange stripes can disappear as we go down the
page, and they can split, but they can't appear or merge.  Black stripes
can appear or merge, but they can't disappear or split.  As a result,
there can never be any orange or black *spots*

I went on to explain that sitting inside the walking adjunction, the sub-2-category with just the object a is the walking monad, and the sub-2-category with just the object b is the walking comonad.

David Egolf (Jan 17 2025 at 18:25):

That makes a lot of sense! It did feel like I was checking a lot of "functor-like" things. So viewing all these checks as corresponding to a specific strict 2-functor

-^S

is conceptually nice!

David Egolf (Jan 17 2025 at 19:03):

The post was interesting, although I found anything involving the string diagrams difficult to follow. It is exciting to know that 2-functors from

\mathbf{Adj}

give us adjunctions! I've recently been impressed by how important adjunctions are, and this gives us a really powerful way to make new adjunctions from old ones - you can just post-compose with an endo 2-functor!

[Tangentially, I hope someone will (hopefully soon!) make a really nice and easy to use website for drawing string diagrams. Maybe there already is one - but I haven't found one I really like yet. I'd love to be able to drag the dots and "strings" around in a somewhat constrained way, to allow for intuitive manipulation of these diagrams.]

John Baez (Jan 17 2025 at 21:22):

It's interesting that you find string diagrams difficult to follow. Their fans tend to think they are extremely easy to understand, perhaps even more intuitive than communication with words. (See for example Bob Coecke's attempts to explain quantum mechanics to children using string diagrams.) But in fact I spent a lot of time studying the rules for how they work, e.g. what they mean and when two string diagrams count as "the same".

David Egolf (Jan 17 2025 at 21:53):

I think they would become easy to follow once I put a lot of work in. But until I do, I (1) get caught up worrying about how all the rules work and (2) quickly lose intuition as a diagram is modified. Someday I'll learn that stuff, hopefully!

Peva Blanchard (Jan 17 2025 at 22:44):

The 2-endofunctor

(\_)^S : \text{Cat} \to \text{Cat}

seems special.
If we write it as

\text{Cat}(S, \_)

, it looks like the representable

Hom(S, \_)

in the

1

-categorical setting.

John Baez (Jan 17 2025 at 22:53):

David Egolf (Jan 20 2025 at 01:18):

I guess that this perspective of adjunctions in

\mathbf{Cat}

as 2-functors

F:\mathbf{Adj} \to \mathbf{Cat}

means that we get some kind of category of adjunctions in

\mathbf{Cat}

and "natural transformations" of some kind between them. I wonder if this "functor category" inherits limits/colimits from

\mathbf{Cat}

so that we get ways to make new adjunctions from old ones.

David Egolf (Jan 20 2025 at 01:19):

Since adjunctions can yield "nice objects/concepts" from their fixed points, then if we can combine adjunctions to make new ones via some limit/colimit construction, this suggests a way to view certain nice objects/concepts as being derived from others.

David Egolf (Jan 20 2025 at 01:23):

This looks relevant: 2-category of adjunctions. I think the "functor category" I mentioned above corresponds to the second bullet point in that article. However, that article doesn't seem to discuss if limits/colimits are inherited pointwise by a functor 2-category. (This article also doesn't seem to discuss that: functor 2-category).

David Egolf (Jan 20 2025 at 01:33):

If I'm not careful, I'm going to end up wanting to learn some 2-category theory :sweat_smile:.

John Baez (Jan 20 2025 at 02:38):

You need a bit of 2-category theory to do category theory well. Unfortunately you also need a bit of 3-category theory to do 2-category theory well, and so on. Luckily for most purposes you can truncate this infinite series at any point and make do with that, knowing that some things you're doing could be done more efficiently if you knew more....

I find it amusing, though, that while working on whether you can compute limits pointwise in functor categories, you have become interested in whether you can compute limits pointwise in functor 2-categories. I can predict what you'll become interested in as soon as you know the answer to that! That way madness lies. :smiling_devil:

But anyway, higher categories tend to work like lower ones except for important subtleties involving strictness. Since you can compute co/limits in functor categories pointwise, you should expect that you can compute limits pointwise in functor 2-categories, so you should guess that the functor 2-category

\mathbf{Cat}^{\mathbf{Adj}}

will inherit co/limits from

\mathbf{Cat}

. But you should worry about whether it's best to define this functor 2-category using strict 2-functors from

\mathbf{Adj}

to

\mathbf{Cat}

and strict natural transformations between them (as we've implicitly been assuming), or some kind of 'weak' or 'pseudo' 2-functors that preserve composition only up to isomorphism, and 'pseudonatural' transformations between those... and also whether you'll want to use strict limits or some kind of 'weak' or 'pseudo' limits, and so on. I would guess that if you do everything strictly, it will all work - but someone may find an example showing it's not optimal.

Peva Blanchard (Jan 20 2025 at 09:10):

I have to confess that higher categories look cool, but very scary to me. I feel like a casual hiker, happily walking on his favourite

1

-categorical track, who would suddenly encounter a big steep for-alpinists-only mountain.

David Egolf (Jan 20 2025 at 18:20):

Do you have any recommendations for resources suitable for learning some 2-category theory? I occasionally reference "2-Dimensional Categories" by Johnson and Yau, which is pretty good. I see the nLab also lists "Foundations for Almost Ring Theory" by Gabber and Ramero. But I always like to discover new learning resources!

Well, in this case I became interested in a functor 2-category because I learned we can view an adjunction as an object in that setting. I don't yet know of any cool constructions that can be viewed as an object in a functor 3-category, so I may be safe for a while. :sweat_smile:

I see! For now I think I'll just remember that there are probably ways to combine adjunctions using limits and colimits.

David Egolf (Jan 20 2025 at 18:22):

I sometimes feel similarly. What's most scary for me in this context is when a book introduces a definition that is really long and specific, without motivating in detail where it all came from. So I can follow the long definition and maybe even use it, but it feels like there's another layer of understanding I'm missing.

John Baez (Jan 20 2025 at 18:32):

Well, if you ever encounter such a definition just ask us here, and we can have fun trying to understand it and/or explain it. This is just poor math pedagogy, very common, not special to higher categories. Or maybe the author has a specific audience in mind and you're not in it.

If you're trying to get the 'feel' for higher categories you might try my old series The Tale of n-Categories, spread over many issues of This Week's Finds, but with each episode containing a link to the next.

David Egolf (Jan 20 2025 at 18:49):

Well, this lengthy digression on the topic of limits/colimits in a functor category has been interesting and educational. But I am feeling the desire to return to the topos theory blog posts! So, to summarize and conclude for now:

David Egolf (Jan 20 2025 at 18:52):

David Egolf (Jan 22 2025 at 17:43):

To show that there are exactly two different morphisms

:G_V \to G_E

, I am tempted by this line of argument:

David Egolf (Jan 22 2025 at 17:44):

(I should note that we have a category

G

with two objects called

v

and

e

, with two morphisms called

s

and

t

from

v

to

e

. Then we are viewing graphs as presheaves on this category.)

David Egolf (Jan 22 2025 at 17:45):

Let's see what

G(-,v):G^{\mathrm{op}} \to \mathsf{Set}

is. I expect it will be

G_V

John Baez (Jan 22 2025 at 17:48):

It seems like overkill to use the Yoneda embedding to see how many graph morphisms there are from

John Baez (Jan 22 2025 at 17:49):

Maybe I'm assuming the reader has already worked out what graph morphisms are like? I guess I just thought this was obvious or something.

David Egolf (Jan 22 2025 at 17:51):

I guess I didn't want to assume that the usual notion of morphism between graphs was the same as a natural transformation between presheaves on

G

. And because checking that sounded like a bunch of work, I was trying to look for a shortcut using the Yoneda embedding :sweat_smile:.

David Egolf (Jan 22 2025 at 17:52):

But perhaps it would be good to just bite the bullet and show what a morphism of presheaves in this context amounts to in terms of the corresponding picture with vertices and edges.

David Egolf (Jan 22 2025 at 17:54):

Let's see. If we have graphs

H, H':G^{\mathrm{op}} \to \mathsf{Set}

, what does a natural transformation

\alpha:H \to H'

amount to?

John Baez (Jan 22 2025 at 17:54):

It seems both instructive and extremely easy to work out what a functor from

G^{\text{op}}

to

\mathsf{Set}

is, and what a natural transformation between such functors is. The category

\mathsf{G}

has just two objects and two non-identity morphisms, so there's not a lot to check.

David Egolf (Jan 22 2025 at 17:55):

I think you are right. This is probably one of the times where I overestimated how much work something would be!

David Egolf (Jan 22 2025 at 17:57):

A natural transformation

\alpha:H \to H'

amounts to a function

\alpha_e

from the edges of

H

to the edges of

H'

, and a function

\alpha_v

from the vertices of

H

to the vertices of

H'

, such that these naturality squares commute:
source square

David Egolf (Jan 22 2025 at 17:59):

Basically, this means that if we have an edge

e

in

H

, it must be mapped to an edge in

H'

that goes from the image of the source of

e

in

H'

to the image of the target of

e

in

H'.

David Egolf (Jan 22 2025 at 18:01):

So then it is immediate that we have two morphisms from the walking vertex graph to the walking edge graph! We can choose to send the vertex of the walking vertex to either vertex in the walking edge.
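As a sanity check on the count, here is a minimal Python sketch that enumerates graph morphisms by brute force. The dictionary encoding of a graph (keys `"V"`, `"E"`, `"src"`, `"tgt"`) and the vertex/edge labels are assumptions made only for this illustration; the check inside the loop is just the two naturality squares.

```python
from itertools import product

def graph_morphisms(H, K):
    """List all pairs (alpha_v, alpha_e) from H to K that make the
    source and target naturality squares commute."""
    Hv, He = sorted(H["V"]), sorted(H["E"])
    Kv, Ke = sorted(K["V"]), sorted(K["E"])
    found = []
    for v_img in product(Kv, repeat=len(Hv)):
        alpha_v = dict(zip(Hv, v_img))
        for e_img in product(Ke, repeat=len(He)):
            alpha_e = dict(zip(He, e_img))
            # Naturality: the source/target of the image of an edge
            # must be the image of its source/target.
            if all(K["src"][alpha_e[e]] == alpha_v[H["src"][e]]
                   and K["tgt"][alpha_e[e]] == alpha_v[H["tgt"][e]]
                   for e in He):
                found.append((alpha_v, alpha_e))
    return found

# Hypothetical encodings of the walking vertex and the walking edge.
walking_vertex = {"V": {"*"}, "E": set(), "src": {}, "tgt": {}}
walking_edge = {"V": {1, 2}, "E": {"a"}, "src": {"a": 1}, "tgt": {"a": 2}}

morphisms = graph_morphisms(walking_vertex, walking_edge)
print(len(morphisms))  # 2: the vertex goes to either endpoint of the edge
```

Going the other way, `graph_morphisms(walking_edge, walking_vertex)` comes back empty, since the walking vertex has no edge to receive the edge.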

David Egolf (Jan 22 2025 at 18:03):

Calling these two morphisms

f,g:G_V \to G_E

, the next goal is to find the equalizer and co-equalizer of

f

and

g

in

[G^{\mathrm{op}}, \mathsf{Set}]

David Egolf (Jan 22 2025 at 18:06):

David Egolf (Jan 22 2025 at 18:09):

Given an object of

G

, we can get a version of this diagram that lives in

\mathsf{Set}

. Let's start with

v \in G

. If I remember how this works, we should get this:
diagram at v

David Egolf (Jan 22 2025 at 18:11):

We can spell this out more explicitly, because we know

G_V(v)

is a singleton set,

G_E(v)

is a set with two elements, and

f_v, g_v

describe what happens to vertices in our two graph morphisms:
diagram at v, version 2

David Egolf (Jan 22 2025 at 18:13):

David Egolf (Jan 22 2025 at 18:14):

Consulting the nLab, I learn that the equalizer of two functions

m,n:X \to Y

is the subset of

X

where

m

and

n

agree.
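For finite sets this description is easy to try out. The sketch below is only an illustration: the function name `equalizer` and the element labels are made up here, and the example reuses the vertex-level maps of the two graph morphisms discussed above.

```python
def equalizer(X, m, n):
    """Return the subset of X on which the functions m and n agree."""
    return {x for x in X if m(x) == n(x)}

# Vertex sets: the walking vertex has one vertex, the walking edge has two.
G_V_vertices = {"*"}

def f_v(x):
    return 1  # sends the lone vertex to vertex 1 of the walking edge

def g_v(x):
    return 2  # sends the lone vertex to vertex 2 of the walking edge

E = equalizer(G_V_vertices, f_v, g_v)
print(E)  # set(): the two maps never agree
```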

David Egolf (Jan 22 2025 at 18:14):

David Egolf (Jan 22 2025 at 18:15):

And if a graph doesn't have vertices it can't have edges! So I guess the equalizer of

G_V

and

G_E

[correction: the equalizer of

f

and

g

] is the empty graph... That's not really what I would have expected, so I'm worried I made a mistake somewhere.

John Baez (Jan 22 2025 at 18:16):

If I were really your teacher, I would make you stay after class until you'd proved that equalizers in Set work like you just said. :smiling_devil:

John Baez (Jan 22 2025 at 18:20):

You meant to say the equalizer of

f

and

g

is the empty graph. That's correct - no mistake! The intuition is that

f

and

g

have empty equalizer because the equalizer is "where two morphisms are equal", and these morphisms are "completely different": they're not equal on any vertices or edges of the domain of these morphisms. (The domain has no edges, and just one vertex, but I'm giving the general criterion for when two morphisms between graphs have empty equalizer.)

David Egolf (Jan 22 2025 at 18:22):

David Egolf (Jan 22 2025 at 18:29):

I'll quickly check that equalizers work as described in

\mathsf{Set}

. We contemplate this diagram:

Is there a unique function

h

that induces a morphism of cones here? We need

i \circ h =j

David Egolf (Jan 22 2025 at 18:30):

We need

i(h(a)) = j(a)

for each

a \in A

. Thus

h(a)

needs to be in the preimage with respect to

i

of

j(a)

for each

a

. But since

i

is injective, each preimage is a singleton set. So,

h(a)

should be for each

a

the unique element of

W

that maps to

j(a)

under

i

David Egolf (Jan 22 2025 at 18:31):

David Egolf (Jan 22 2025 at 18:33):

If

j(a)

is in the image of

j

, then

f(j(a)) = g(j(a))

. Thus

j(a)

is in

W

, as

W

is the subset of

X

where

f

and

g

agree. Thus, such an

h

always exists.

David Egolf (Jan 22 2025 at 18:34):

John Baez (Jan 22 2025 at 18:35):

David Egolf (Jan 22 2025 at 18:36):

David Egolf (Jan 22 2025 at 18:37):

David Egolf (Jan 22 2025 at 18:38):

David Egolf (Jan 22 2025 at 18:39):

David Egolf (Jan 22 2025 at 18:49):

I'm currently trying to remember / re-figure-out how coequalizers work in

\mathsf{Set}

. Since the equalizer of two functions was a subobject of the source set, maybe the coequalizer of two functions is a co-subobject of the target set. By "co-subobject" I just mean epimorphism from an object. And an epimorphism from a set relates closely to an equivalence relation on that set.

So I'm guessing that the coequalizer of functions

f,g:X \to Y

is

Y/\sim

where

\sim

is some equivalence relation on

Y

induced by

f

and

g

The first idea for such an equivalence relation that comes to mind is this: we say

y \sim y'

iff there is some

x

such that

f(x)=y

and

g(x)=y'

, or

f(x)=y'

and

g(x)=y

John Baez (Jan 22 2025 at 18:50):

John Baez (Jan 22 2025 at 18:53):

The phrase "co-subobject" reminds me of James Dolan's term "co-bigger". He says X is bigger than Y if Y is a sub-object of X, and X is "co-bigger" than Y if Y is a co-subobject of X.

Given what you've already said, it's probably not giving away too much to admit that what you're calling a "co-subobject" is often called a "quotient object".

David Egolf (Jan 22 2025 at 18:56):

Somewhat tangentially, I do love how we have this kind of conceptual duality between subobject and quotient object. That's the sort of analogy/duality I wouldn't have expected!

John Baez (Jan 22 2025 at 18:56):

At an intuitive level, X is bigger than Y if Y fits inside X, while X is cobigger than Y if you can squash down X to get Y.

John Baez (Jan 22 2025 at 18:59):

In the category of finite-dimensional vector spaces, X is bigger than Y iff it's cobigger than Y. But in the category of graphs the concepts are radically different!

David Egolf (Jan 22 2025 at 19:00):

David Egolf (Jan 22 2025 at 19:02):

Before pressing onward, I suppose for completeness it would be good to check the construction I described above satisfies the universal property of coequalizers in

\mathsf{Set}

John Baez (Jan 22 2025 at 19:03):

David Egolf (Jan 22 2025 at 19:08):

I think the equivalence relation I proposed on

Y

can be more simply described like this: it is the one generated by decreeing

f(x) \sim g(x)

for all

x \in X

. Then we have the projection map

\pi:Y \to Y/\sim

which sends each element to its equivalence class.
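For finite sets, the quotient by the generated equivalence relation can be computed with a small union-find. This is a sketch under assumptions: `f` and `g` are passed as dictionaries, the name `coequalizer` is made up, and the example sets are hypothetical (two elements in X, five in Y).

```python
def coequalizer(X, Y, f, g):
    """Return (classes, pi), where classes is Y/~ and pi maps each y to its
    class, for the equivalence relation generated by f(x) ~ g(x)."""
    parent = {y: y for y in Y}

    def find(y):
        # Follow parent links up to the representative of y's class.
        while parent[y] != y:
            y = parent[y]
        return y

    # Merge the classes of f(x) and g(x) for every x in X.
    for x in X:
        parent[find(f[x])] = find(g[x])

    classes = {}
    for y in Y:
        classes.setdefault(find(y), set()).add(y)
    pi = {y: frozenset(classes[find(y)]) for y in Y}
    return set(pi.values()), pi

# Hypothetical example: f(0)=1 ~ g(0)=2 and f(1)=2 ~ g(1)=3,
# so transitivity forces 1, 2 and 3 into one class.
X = {0, 1}
Y = {1, 2, 3, 4, 5}
f = {0: 1, 1: 2}
g = {0: 2, 1: 3}
classes, pi = coequalizer(X, Y, f, g)
```

Elements of Y that neither function hits (4 and 5 here) end up in singleton classes.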

David Egolf (Jan 22 2025 at 19:09):

And we want to show that there is a unique function

h

in the below situation corresponding to a morphism of cocones:
cocone morphism

David Egolf (Jan 22 2025 at 19:14):

We need

h \circ \pi = \rho

. So,

h[y] = \rho(y)

for each

y \in Y

. (Here

[y]

is the equivalence class of

y

under

\sim

). So if

h

exists, it is unique.

However, for this definition for

h

to make sense,

\rho(y)

needs to be constant as

y

varies across elements of

[y]

David Egolf (Jan 22 2025 at 19:19):

We know that

(\rho \circ f)(x) = (\rho \circ g)(x)

for all

x \in X

. Thus if we have

y = f(x)

and

y' = g(x)

then

\rho(y) = \rho(y')

David Egolf (Jan 22 2025 at 19:40):

I'm having trouble thinking clearly about this equivalence relationship, and how it interacts with

\rho

. It's probably time for me to take a break and come back when I have a bit more energy!

David Egolf (Jan 23 2025 at 00:49):

I drew a picture to help visualize what the induced equivalence relationship on

Y

can look like, for two functions

f,g:X \to Y

:
induced equivalence relationship

David Egolf (Jan 23 2025 at 00:50):

The idea is that

X

here is the set of two dots on the left, and

Y

is the set of five dots on the right. And we have two functions

f,g:X \to Y

, where

f:X \to Y

corresponds to the black arrows and

g:X \to Y

corresponds to the red arrow.

David Egolf (Jan 23 2025 at 00:50):

We are interested in the equivalence relationship

\sim

on

Y

induced by requiring

f(x) \sim g(x)

for all

x \in X

. We see that all three dots connected by dashed lines are equivalent in this case. (That is because if

a \sim b

and

b \sim c

, we require

a \sim c

.)

David Egolf (Jan 23 2025 at 00:52):

Notice that the dots not in the image of either

f

or

g

are in equivalence classes having just one element.

David Egolf (Jan 23 2025 at 01:11):

I think that if

y_1\sim y_n

for two elements

y_1

and

y_n

in

Y

, then there is a finite sequence

y_1 \sim y_2 \sim \dots \sim y_n

, where, for each

i

,

y_i \sim y_{i+1}

can be written in the form

f(x_i) \sim g(x_i)

for some

x_i\in X

. This is because our equivalence relationship is being generated by things of the form

f(x) \sim g(x)

for

x \in X

David Egolf (Jan 23 2025 at 01:13):

For any

x_i \in X

, we have

\rho(f(x_i)) = \rho(g(x_i))

, because

A

is at the tip of a cocone involving

\rho

. Thus

\rho(y_i) = \rho(y_{i+1})

for all

i

. By transitivity of

=

, we conclude that

\rho(y_1) = \rho(y_n)

David Egolf (Jan 23 2025 at 01:20):

So, for any two equivalent elements

y \sim y'

in

Y

, we have

\rho(y) = \rho(y')

. Thus it makes sense to define

h[y] = \rho(y)

. This

h

is the only function that can possibly make the above diagram a morphism of cocones.

We conclude that

Y/\sim

(together with the projection function

\pi:Y \to Y/\sim

) indeed satisfies the universal property of a coequalizer for

f,g:X\to Y

David Egolf (Jan 23 2025 at 01:24):

Now we can use this to compute the coequalizer of two graph morphisms. First, we compute the coequalizer in

\mathsf{Set}

of this diagram to figure out the vertex set of our coequalizer graph:
coequalizer diagram in Set for vertices

David Egolf (Jan 23 2025 at 01:25):

We see that

1 \sim 2

, as the two functions output those values for the same input

1

. Hence

F(v)=\{1,2\}/\sim

is a singleton set and our coequalizer graph should have a single vertex.

David Egolf (Jan 23 2025 at 01:29):

David Egolf (Jan 23 2025 at 01:30):

More explicitly, using the fact that the walking vertex has no edges and the walking edge has one edge:

David Egolf (Jan 23 2025 at 01:32):

There is only one equivalence class in the one possible equivalence relation on a singleton set. Thus, our coequalizer graph should have a single edge.

David Egolf (Jan 23 2025 at 01:33):

Our coequalizer graph has one edge and one vertex. So that edge has got to go from that single vertex back to that vertex! The coequalizer of

f

and

g

I think is a "loop" graph.
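The pointwise computation can be mimicked in a few lines of Python. This is only a sketch: the graph encoding (keys `"V"`, `"E"`, `"src"`, `"tgt"`), the helper names, and the labels are assumptions for illustration, and the induced source/target maps are well defined only because `f` and `g` are assumed to be graph morphisms.

```python
def quotient(Y, pairs):
    """Map each y in Y to its class under the equivalence relation
    generated by the given pairs."""
    parent = {y: y for y in Y}

    def find(y):
        while parent[y] != y:
            y = parent[y]
        return y

    for a, b in pairs:
        parent[find(a)] = find(b)
    classes = {}
    for y in Y:
        classes.setdefault(find(y), set()).add(y)
    return {y: frozenset(classes[find(y)]) for y in Y}

def graph_coequalizer(H, K, f, g):
    """Coequalizer of graph morphisms f, g: H -> K, each given as a pair
    (vertex_map, edge_map), computed separately on vertices and edges."""
    pv = quotient(K["V"], [(f[0][v], g[0][v]) for v in H["V"]])
    pe = quotient(K["E"], [(f[1][e], g[1][e]) for e in H["E"]])
    return {
        "V": set(pv.values()),
        "E": set(pe.values()),
        # Source and target of an edge class come from any representative.
        "src": {pe[e]: pv[K["src"][e]] for e in K["E"]},
        "tgt": {pe[e]: pv[K["tgt"][e]] for e in K["E"]},
    }

walking_vertex = {"V": {"*"}, "E": set(), "src": {}, "tgt": {}}
walking_edge = {"V": {1, 2}, "E": {"a"}, "src": {"a": 1}, "tgt": {"a": 2}}
f = ({"*": 1}, {})  # vertex goes to the source of the edge
g = ({"*": 2}, {})  # vertex goes to the target of the edge

Q = graph_coequalizer(walking_vertex, walking_edge, f, g)
(edge,) = Q["E"]
print(len(Q["V"]), len(Q["E"]), Q["src"][edge] == Q["tgt"][edge])  # 1 1 True
```

One vertex, one edge, and that edge starts and ends at the same vertex, which matches the "loop" graph.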

John Baez (Jan 23 2025 at 02:17):

My moral from this and other calculations is that taking the equalizer of

f, g: x \to y

picks out the largest piece of

x

on which

f

and

g

are equal, while taking their coequalizer squashes down

y

just enough to make

f

and

g

become equal.

John Baez (Jan 23 2025 at 02:18):

David Egolf (Jan 23 2025 at 02:38):

David Egolf (Jan 23 2025 at 02:52):

Because I always like imaging metaphors, we could imagine trying to apply this idea in that setting. If we have two ways of imaging, we could consider the situations in which they agree. Or, we could "squash" the outputs of these different imaging strategies to make them always equal. Each perspective might lead to an approach for measuring how "compatible" two imaging methods are.

David Egolf (Jan 23 2025 at 02:55):

David Egolf (Jan 23 2025 at 02:57):

I suspect that

G_V

and

G_E

are the objects in

\mathsf{Graph}

in the image of the Yoneda embedding. And I seem to recall there is some general result about how arbitrary objects in a presheaf category can be built up as a colimit of objects in the image of the Yoneda embedding.

David Egolf (Jan 23 2025 at 03:01):

Yes, Theorem 6.5.7 in "Category Theory in Context" by Riehl appears to be the result I was hoping for. But I'm not sure that using such a powerful theorem is in the spirit of this puzzle.

David Egolf (Jan 23 2025 at 03:03):

Intuitively, if we think of vertices and edges as little building blocks, it's clear we can build any graph out of those. I'm not sure if taking a colimit corresponds to the intuition of "sticking building blocks together", though.

David Egolf (Jan 23 2025 at 03:04):

It might be good to start by trying to do something simple, like "attaching" two walking edges at a single vertex:

John Baez (Jan 23 2025 at 19:56):

That's an excellent place to start if you want to stay in the spirit of the puzzle. Indeed you can crush this puzzle using the fact that any object in any presheaf category is canonically a colimit of representables. But it would be a shame to crush the puzzle that way without seeing directly how colimits let you build the graph

In general building tinker toys out of a fixed collection of different kinds of tinker-toy parts is a lot like taking colimits of representables, and the category of graphs is an excellent way to explore this idea!

David Egolf (Jan 24 2025 at 17:28):

By the way, this example has made this point to me: even if something is built out of certain building blocks, it can be very different compared to the building blocks. I think for a while I was unfairly dismissing the move to the presheaf category facilitated by the Yoneda embedding, on the idea that "you don't really add anything new". But it is certainly interesting to study graphs, and they are obtained in that way!

David Egolf (Jan 24 2025 at 17:29):

Now I am more excited by the Yoneda embedding, with the idea that it lets us add many more related objects of study. We can get a "wild and wonderful" setting - that can be a lot richer than the setting we started with!

David Egolf (Jan 24 2025 at 17:51):

To take an example related to recent discussion in another thread: let's start with the opposite of the category of commutative rings

\mathsf{CRing}^{\mathrm{op}}

. If you are convinced that this is a category of geometric things, then the category

[(\mathsf{CRing}^{\mathrm{op}})^{\mathrm{op}},\mathsf{Set}] = [\mathsf{CRing},\mathsf{Set}]

intuitively consists of geometric things built up from the building-block geometric objects in

\mathsf{CRing}^{\mathrm{op}}

David Egolf (Jan 24 2025 at 17:56):

Anyways, back to this particular example. We want to figure out what colimit in

\mathsf{Graph}

sticks these two walking edges together as shown:

David Egolf (Jan 24 2025 at 17:58):

A problem with this proposed approach is that I'm unsure if doing one colimit after the other can be "compressed" down to taking a single colimit.

David Egolf (Jan 24 2025 at 18:01):

My next idea is to use a pushout, inspired by how a pushout is used in the context of adjunction spaces - which are topological spaces formed by gluing topological spaces together.

John Baez (Jan 24 2025 at 18:03):

What is it called when you first do a coproduct and then a coequalizer in this way?

David Egolf (Jan 24 2025 at 18:10):

I don't know by memory, but I could try to figure it out in

\mathsf{Set}

, for example.

David Egolf (Jan 24 2025 at 18:13):

So let's imagine we start with two sets

X

and

Y

. Their coproduct

X \coprod Y

is just their disjoint union, which we can think of as two collections of dots "sitting side by side". Then, let's say I want to glue together two of these dots. So I might use functions

f,g:1 \to X \coprod Y

where

1

is a set with a single element. Taking the coequalizer of

f

and

g

should result in a version of

X \coprod Y

where we've glued together the two points indicated by

f

and

g

David Egolf (Jan 24 2025 at 18:15):

Now, let's compare this to the pushout of

f

and

g

. Referencing the nLab discussion on pushouts in

\mathsf{Set}

(not wanting to get too distracted by working out what pushouts are in

\mathsf{Set}

) I think we'll get the same thing in this way!

David Egolf (Jan 24 2025 at 18:16):

So I strongly suspect that a coproduct followed by a coequalizer is the same thing as a pushout. That sounds pretty useful, so it might be good to try to prove that.

John Baez (Jan 24 2025 at 18:32):

Good!!! It's good to get a lot of "hands-on" experience in colimits like initial objects, coproducts, coequalizers, pushouts, and directed colimits. Attempting to prove that you can get a pushout from a coproduct followed by a coequalizer is a very good use of time.

David Egolf (Jan 27 2025 at 18:17):

Ok, time to figure this out! Last time I worked on this, I was trying to prove this in the general case and getting confused. So let me see exactly how all the diagrams work in

\mathsf{Set}

. Hopefully that should motivate the general argument.

David Egolf (Jan 27 2025 at 18:29):

Let

f:S \to X

and

g:S \to Y

be functions. Using the inclusions

i_X:X \to X+Y

(where

X+Y

denotes disjoint union) and

i_Y:Y \to X+Y

, we can form

i_X \circ f:S \to X \to X+Y

and

i_Y \circ g:S \to Y \to X+Y

Taking the coequalizer of these two functions gives us this diagram:
coequalizer diagram

David Egolf (Jan 27 2025 at 18:30):

David Egolf (Jan 27 2025 at 18:35):

I think

C

together with

p_X:X \to C

and

p_Y:Y \to C

should form a colimit cocone under the span

X \leftarrow_f S \to_g Y

. Here

p_X:X \to C

and

p_Y:Y \to C

are the morphisms induced by

p:X + Y \to C

using the universal property of coproducts.

David Egolf (Jan 27 2025 at 18:54):

To prove this, I'll aim to show that

C

together with

p_X:X \to C

and

p_Y:Y \to C

satisfies the universal property for a pushout of our span

X \leftarrow_f S \to_g Y

David Egolf (Jan 27 2025 at 18:54):

So, given any other cocone (say with tip

Z

) under this span, there should be a unique morphism

h

that induces a morphism of cocones:
universal property of pushout

Note, by the way, that this really forms a cocone. We need

p_X \circ f= p_Y \circ g

. But we already know that

p \circ i_X \circ f= p \circ i_Y \circ g

. Since

p_X = p \circ i_X

and

p_Y = p \circ i_Y

, we indeed have a cocone.

David Egolf (Jan 27 2025 at 18:59):

We require

h

to satisfy

h \circ p_X = \rho_X

and

h \circ p_Y = \rho_Y

. Since

p_X = p \circ i_X

, we need

h \circ p \circ i_X = \rho_X

. Similarly, we need

h \circ p \circ i_Y = \rho_Y

. This situation relates to this diagram:
coproduct diagram

David Egolf (Jan 27 2025 at 19:01):

If such an

h

meets these conditions, then

h \circ p

provides a morphism of cocones in the coproduct diagram above. There is a unique such morphism

:X+Y \to Z

. Thus if two different morphisms

h

and

h'

satisfy these conditions, then we must have

h \circ p = h' \circ p

. But since

p

is an epimorphism,

h=h'

. So, if an

h:C \to Z

inducing a morphism of cocones under our span exists, it is unique.

David Egolf (Jan 27 2025 at 19:02):

It remains to show that an

h:C \to Z

meeting this condition exists. To find a candidate for

h

, we consider an analogous situation involving a coequalizer diagram:
coequalizer diagram 2

David Egolf (Jan 27 2025 at 19:05):

Here

\rho:X+Y \to Z

is the morphism induced by

\rho_X:X \to Z

and

\rho_Y:Y \to Z

using the universal property of coproducts. This defines a cocone if

\rho \circ i_X \circ f = \rho\circ i_Y \circ g

. We can rewrite this equation as

\rho_X \circ f = \rho_Y\circ g

, which is true because

\rho_X

and

\rho_Y

form a cocone under our span.

David Egolf (Jan 27 2025 at 19:06):

We propose setting

h=k

, where

k:C \to Z

is the unique morphism induced by the universal property of coequalizers. We need to show that

k \circ p_X = \rho_X

and

k \circ p_Y = \rho_Y

By definition of

k

, we have

k \circ p = \rho

. Precomposing with

i_X

or

i_Y

lets us conclude that

k \circ p_X = \rho_X

and

k \circ p_Y = \rho_Y

David Egolf (Jan 27 2025 at 19:07):

We conclude that

p_X:X \to C

and

p_Y:Y \to C

form a cocone that satisfies the universal property of pushouts! So you can indeed make a pushout using a coproduct followed by a coequalizer.

David Egolf (Jan 27 2025 at 19:11):

This lets me think of a pushout using the "side by side" and "squashing until two morphisms agree" intuition I have for the coproduct and coequalizer respectively!
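
Here is a minimal sketch of that intuition for finite sets, with functions stored as dictionaries. The name `pushout` and the tags `"X"` and `"Y"` are made up for this illustration; it is one possible way to compute the construction, not a canonical implementation.

```python
def pushout(A, X, Y, f, g):
    """Pushout of X <-f- A -g-> Y for finite sets: coproduct, then coequalizer."""
    # Coproduct: tag the elements so that X and Y sit "side by side".
    coproduct = [("X", x) for x in X] + [("Y", y) for y in Y]
    parent = {elem: elem for elem in coproduct}

    def find(elem):
        # Follow parent links up to the representative of the class.
        while parent[elem] != elem:
            elem = parent[elem]
        return elem

    # Coequalizer: squash i_X(f(a)) together with i_Y(g(a)) for every a in A.
    for a in A:
        parent[find(("X", f[a]))] = find(("Y", g[a]))

    # Collect the equivalence classes; these are the elements of C.
    classes = {}
    for elem in coproduct:
        classes.setdefault(find(elem), set()).add(elem)
    C = [frozenset(c) for c in classes.values()]
    cls = {elem: c for c in C for elem in c}

    # The legs of the cocone: p_X = p . i_X and p_Y = p . i_Y.
    p_X = {x: cls[("X", x)] for x in X}
    p_Y = {y: cls[("Y", y)] for y in Y}
    return C, p_X, p_Y


f = {"a": "x2"}
g = {"a": "y1"}
C, p_X, p_Y = pushout(["a"], ["x1", "x2"], ["y1", "y2"], f, g)
```

In this example the four elements of the disjoint union collapse to three classes, and `p_X[f["a"]]` equals `p_Y[g["a"]]`, which is the cocone condition.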

John Baez (Jan 27 2025 at 19:16):

and then you give a completely general argument that uses nothing about

\mathsf{Set}

! You use a few decorative words here and there, like calling the coproduct "disjoint union", but those can be removed.

But I understand this very well! When I have a concrete example like

\mathsf{Set}

in mind, I can carry out general category-theoretic arguments more easily. Without the example in mind I more easily get lost.

David Egolf (Jan 27 2025 at 19:18):

I mostly was using intuition in

\mathsf{Set}

to help me set up the starting diagrams. From there I felt comfortable switching to a more general argument.

John Baez (Jan 27 2025 at 19:18):

John Baez (Jan 27 2025 at 19:20):

By the way, you just proved "any category with binary coproducts and coequalizers has pushouts". There's another theorem that I think of as a companion: "any category with pushouts and an initial object has binary coproducts".

John Baez (Jan 27 2025 at 19:24):

From binary coproducts, coequalizers, pushouts and an initial object we can get all finite colimits - but these two theorems show those four are redundant.

David Egolf (Jan 31 2025 at 17:45):

Given the above, I think we want to take the pushout of our two "walking arrow" graphs to stick them together. Today, I'd like to see exactly how this works!

David Egolf (Jan 31 2025 at 18:17):

David Egolf (Jan 31 2025 at 18:19):

David Egolf (Jan 31 2025 at 18:23):

First, let's figure out how many vertices the pushout graph

H

will have. To do this, we can work with this diagram in

\mathsf{Set}

:
pushout of vertex sets

David Egolf (Jan 31 2025 at 18:25):

We learned above that to compute a pushout, we can first take a coproduct and then an appropriate coequalizer. So, we take the disjoint union of our sets of vertices, and then squash together parts of this disjoint union using

r_v

and

l_v

. The disjoint union has four vertices, and then

r_v

and

l_v

specify that we squash together the vertex picked out by

r_v

with the vertex picked out by

l_v

. So our pushout graph has three vertices.

David Egolf (Jan 31 2025 at 18:27):

We can do a similar thing with edges. In this case,

r_e

and

l_e

are both empty functions, and so our pushout graph has two edges.
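
As a sanity check, both counts can be written out explicitly. The labels $a, b, e$ for the first walking edge and $a', b', e'$ for the second are made up for this sketch, and it assumes that $r_v$ picks out $b$ and $l_v$ picks out $a'$ (the thread does not say which vertices are hit):

```latex
\begin{align*}
H(v) &= \bigl(\{a, b\} \sqcup \{a', b'\}\bigr) / (b \sim a')
      = \{[a],\ [b] = [a'],\ [b']\}, & |H(v)| &= 3, \\
H(e) &= \{e\} \sqcup \{e'\}, & |H(e)| &= 2.
\end{align*}
```

No identifications happen on edges because the walking vertex has no edges, so there is nothing for the coequalizer to squash.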

David Egolf (Jan 31 2025 at 19:20):

I think I was able to figure out how one gets the source and target functions of the pushout graph! The process to do so feels a bit lengthy at the moment, and explaining it here in this thread in detail sounds tiring...

To give the rough idea, we evaluate our span of graphs at

v

and at

e

to get two spans in

\mathsf{Set}

. Then by using the fact that each graph is a functor, we get some natural transformations between the spans in

\mathsf{Set}

. (One of these natural transformations corresponds to the source morphism, and one corresponds to the target morphism). We can then use this together with the universal property of colimits to work out the source and target functions for our pushout graph.

David Egolf (Jan 31 2025 at 19:22):

I'm tired, so I'll stop here for today. Maybe next time I'll finish up this explicit calculation. Or I might move on, if doing the calculation sounds too tedious! :sweat_smile:

John Baez (Jan 31 2025 at 23:24):

If this seems tiresome, you are probably wanting to put in more detail than a typical practitioner of category theory wants to see. You are, after all, only showing that sticking together these two graphs gives the expected result.

So, maybe you should take this exercise primarily as practice in mathematical communication. In almost any mathematical argument it's possible to become completely bogged down if one tries to fill in every last detail. Filling in every last detail is what computer-based proof verification systems are for. Human communication requires a "light touch".

David Egolf (Feb 02 2025 at 02:56):

Hmm, I see. I think I sometimes find it difficult to feel that I've actually proven something if I don't check every last detail. But I suppose my goal at the moment is not really to produce a fully detailed proof. Instead my goals are to:

David Egolf (Feb 02 2025 at 02:58):

With those goals in mind, I might try to explain how one can work out the source function for the pushout graph in some detail, and then stop.

John Baez (Feb 02 2025 at 03:05):

I suppose every good math student goes through a stage of feeling they need to "check every detail", and the most extreme ones study mathematical logic and set theory so that they know exactly how every rule of common-sense reasoning can be justified from some axioms.

But later, one gets used to taking lots of steps where one easily could fill in details as required, and treats a proof as an argument that is enough to convince another mathematician (since they too could fill in those steps).

It becomes clear that filling in all the details creates an argument that's almost impossible to understand, because the really important points - the nontrivial insights - are surrounded by a thicket of details that makes them impossible to spot. It's like a forest that has some beautiful trees, but so much underbrush that you can't see them.

David Egolf (Feb 02 2025 at 03:24):

That makes sense! However, I tend to worry that any given detail might secretly be non-trivial, until I check for myself. This does tend to cloud the main idea with endless details, so I sometimes try to put some detail-checking in a different section of my notes (analogous to how one can refactor code to hide low level detail).

David Egolf (Feb 02 2025 at 03:24):

I suppose it takes practice to develop a sense of confidence regarding unchecked details.

John Baez (Feb 02 2025 at 06:12):

A lot of it is just experience. If you keep doing math you keep seeing similar things over and over and over, and then many of the small things to check become similar to things you've checked before. When I write a sentence in a proof I mentally run through the things to check to verify that sentence, and if they're sufficiently routine I move on, but if something seems fishy, or if I feel I'm using knowledge that isn't known to my audience, then I expand on them - at least when I'm being serious, like writing a paper.

John Baez (Feb 02 2025 at 06:15):

Also: if your argument uses lots of small facts that you want to verify, you can break them out as Lemmas with their own proofs, letting readers read the lemmas and skip the proofs if they want. This can be clearer than a long string of paragraphs, because it makes the structure more visible.

David Egolf (Feb 03 2025 at 19:23):

In this case, the pattern of argument involved is one I'm not too familiar with. So I think it will be good to spell out some of the detail involved, but I'll try to do it in a way that will be interesting/intuitive to read. (If I just type up a bunch of details that's probably not so much fun to read!)

David Egolf (Feb 03 2025 at 19:24):

As discussed above, I believe we can do this using a pushout, as pictured below:
pushout

David Egolf (Feb 03 2025 at 19:25):

We've already seen that the resulting pushout graph has three vertices and two edges. That matches what we're hoping to get, so far. It remains to figure out how the edges are attached to the vertices. So, we want to figure out the source and target of each edge in our pushout graph.

David Egolf (Feb 03 2025 at 19:28):

David Egolf (Feb 03 2025 at 19:33):

I think the process of converting our diagram of graphs to a diagram of edges or vertices is closely related to this "switcheroo" isomorphism of categories:

[S, [G^\mathrm{op}, \mathsf{Set}]] \cong [G^{\mathrm{op}}, [S, \mathsf{Set}]]

. Here

S

is a category that looks like this

1 \leftarrow 2 \to 3

, and

G^\mathrm{op}

is the category such that a functor from

G^\mathrm{op}

to

\mathsf{Set}

is a graph.

David Egolf (Feb 03 2025 at 19:37):

Our span of graphs

G_E \leftarrow_r G_V \to_l G_E

can be viewed as a functor

D:S \to [G^{\mathrm{op}},\mathsf{Set}]

. The switcheroo isomorphism then tells us there is a corresponding functor

D^t:G^{\mathrm{op}} \to [S, \mathsf{Set}]

. I believe that

D^t(v)

will be our diagram of vertices, and

D^t(e)

will be our diagram of edges.
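
Assuming the switcheroo isomorphism is the usual "swap the two arguments" one, this can be written out concretely. Here $i$ ranges over the objects $1, 2, 3$ of $S$ and $x$ over the objects $v, e$ of $G^{\mathrm{op}}$:

```latex
\begin{align*}
D^t(x)(i) &= D(i)(x), \\
D^t(v) &= \bigl( G_E(v) \xleftarrow{\; r_v \;} G_V(v) \xrightarrow{\; l_v \;} G_E(v) \bigr), \\
D^t(e) &= \bigl( G_E(e) \xleftarrow{\; r_e \;} G_V(e) \xrightarrow{\; l_e \;} G_E(e) \bigr).
\end{align*}
```

So each of the two spans in $\mathsf{Set}$ is obtained by evaluating every graph and every graph homomorphism in the original span at $v$ or at $e$.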

David Egolf (Feb 03 2025 at 19:39):

As mentioned above, these

D^t(v)

and

D^t(e)

were used to figure out what vertices and edges our pushout graph has. So, to figure out the source function of our pushout graph (which assigns each edge to its source vertex), we may want to consider

D^t(s):D^t(e) \to D^t(v)

, where

s

is the morphism in

G^{\mathrm{op}}

that has to do with the source of edges.

David Egolf (Feb 03 2025 at 19:42):

In fact, we can use

D^t(s)

to figure out the source function

H(s):H(e) \to H(v)

of our pushout graph

H

using the following diagram:
diagram

David Egolf (Feb 03 2025 at 19:44):

Here

\lambda_e

is a universal cocone with tip

H(e)

. It is under our diagram of edges. Similarly

\lambda_v

is a universal cocone under our diagram of vertices, with tip

H(v)

. There is a unique function

H(s):H(e) \to H(v)

that makes this diagram commute, by the universal property of colimits.

David Egolf (Feb 03 2025 at 19:45):

To figure out the source function for our pushout graph

H

, we can ask what is required of

H(s)

to make the above diagram commute. The above diagram commutes iff it commutes at all three components (one corresponding to each graph in our original diagram of graphs). So we'll get three conditions on

H(s)

that should determine it uniquely.

David Egolf (Feb 03 2025 at 19:48):

From here, it's just a matter of working out all the details. Since I don't know how to explain all those details in an interesting or conceptual way, I won't type them up here right now.

I will note this though: it's useful to know that the components of

D^t(s)

are given by each of the source functions of our original three graphs (in our diagram of shape

S

).

A similar procedure should work to find the target function of our pushout graph.
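
Here is a rough sketch of the whole calculation for the two walking edges, done with finite sets. The tags `"L"` and `"R"`, the helper `quotient`, and the choice of which vertices the two legs of the span hit are all assumptions made for the illustration. The source function of the pushout is read off from the commuting condition: the class of an edge is sent to the class of that edge's source.

```python
def quotient(elements, pairs):
    """Map each element to a representative of its class under the
    equivalence relation generated by `pairs`."""
    parent = {x: x for x in elements}

    def find(x):
        while parent[x] != x:
            x = parent[x]
        return x

    for a, b in pairs:
        parent[find(a)] = find(b)
    return {x: find(x) for x in elements}


# Two walking edges, tagged "L" and "R", each with vertices 0 -> 1.
vertices = [("L", 0), ("L", 1), ("R", 0), ("R", 1)]
edges = [("L", "e"), ("R", "e")]
source = {("L", "e"): ("L", 0), ("R", "e"): ("R", 0)}
target = {("L", "e"): ("L", 1), ("R", "e"): ("R", 1)}

# Assumed legs of the span out of the walking vertex "*":
# one hits the target of the left edge, the other the source of the right edge.
r_v = {"*": ("L", 1)}
l_v = {"*": ("R", 0)}

vclass = quotient(vertices, [(r_v["*"], l_v["*"])])
eclass = quotient(edges, [])  # the walking vertex has no edges to glue along

H_v = set(vclass.values())
H_e = set(eclass.values())
# Source and target of the pushout graph, forced by commutativity of the diagram.
H_s = {eclass[e]: vclass[source[e]] for e in edges}
H_t = {eclass[e]: vclass[target[e]] for e in edges}
```

With these choices the target of the left edge and the source of the right edge land in the same class, so the pushout is the path with three vertices and two edges.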

David Egolf (Feb 03 2025 at 19:53):

David Egolf (Feb 03 2025 at 19:56):

We've seen one way that we can stick together two walking edges, and that feels like a good first step. Maybe as a next step I could show how to build a graph that has a single vertex and a single edge - a "loop" graph.

John Baez (Feb 03 2025 at 21:51):

That's nice! For your new self-imposed puzzle, what sort of colimit leaps to mind? (Various basic kinds of colimits have their own names, and in the previous puzzle the relevant colimit was a pushout.)

David Egolf (Feb 03 2025 at 21:59):

I suspect we're going to end up using a coequalizer on the walking edge to "squash together" the two vertices.

John Baez (Feb 03 2025 at 22:07):

David Egolf (Feb 05 2025 at 16:22):

This example pointed something out to me: when we stick together our "building blocks" (in this case the walking edge and the walking vertex) using colimits, we also have access to "squashed" versions of our building blocks.

David Egolf (Feb 05 2025 at 16:24):

At this point it seems intuitively reasonable that we can build up any graph using colimits of walking edges and vertices. I'm not sure how to prove this though. (I could use the fact that any object in a presheaf category is canonically a colimit of representables, but I might miss out on learning something from the puzzle if I just invoke that theorem and move on.)

David Egolf (Feb 05 2025 at 16:27):

Trying to set up some kind of induction seems like it could maybe work, at least for graphs that only require a finite number of steps to piece together from our edges and vertices.

David Egolf (Feb 05 2025 at 16:39):

There's probably a systematic procedure to build up a given graph by coproducts:

Morgan Rogers (he/him) (Feb 05 2025 at 17:30):

I think it's easier to think about gluing edges together rather than vertices. If you can figure out at least one way to do it, it's worth comparing that to the "canonical way". I'll explain what that is once you've had some time to think about it.

John Baez (Feb 05 2025 at 17:39):

When you're trying to prove this fact about graphs in general, it's probably good to use the general result about presheaf categories that you just mentioned. But since this general result actually gives a formula for how to write any graph as a colimit of representables (the walking edge and walking vertex), you might (or might not) want to see what that formula says.

John Baez (Feb 05 2025 at 17:44):

This is not a coproduct, since a graph is not a coproduct of graphs that have only vertices and graphs that have only edges. Edges need to have vertices!

But you have a good idea here. You can first take a coproduct of a bunch of "walking vertex" graphs, and then glue on a bunch of "walking edge" graphs.

And this pattern continues to higher dimensions when we are building [[simplicial sets]]: we start with a bunch of "walking vertices", then glue on "walking edges", and then glue on a bunch of "walking triangles", a bunch of "walking tetrahedra", and so on. We start at the lowest dimension and work our way up.

The same pattern is commonly used in topology to build spaces out of balls of various dimensions; these spaces are called [[CW complexes]].

David Egolf (Feb 06 2025 at 21:33):

Ah, yes, I don't know why I said "coproducts". I think I meant "colimits", with the idea that we can glue in our edges to what we've assembled already (as you describe).

David Egolf (Feb 06 2025 at 21:34):

I guess I'm unsure exactly what I'm looking for here. I suppose ideally I'd like to figure out a procedure that can take in an arbitrary graph and then outputs a way to assemble that graph as a colimit of the walking edges and vertices.

David Egolf (Feb 06 2025 at 21:37):

I'm not sure what this kind of procedure would be like. Maybe I'll look at the general result mentioned above to try and get some inspiration.

David Egolf (Feb 06 2025 at 21:38):

Peeking at Theorem 6.5.7 in "Category theory in context", I see that a functor

F

there is expressed as a colimit of a particular diagram, where that diagram is constructed in a way that depends on

F

John Baez (Feb 06 2025 at 21:41):

If that's what you want, you can use an explicit formula that writes any object in any presheaf category as a colimit of representables, and then see what that formula gives in this very simple case, where there are only two representables.

But if not, maybe this will help. In one of the answers @fosco gives an argument using coends and weighted colimits, but the final answer is a colimit, which can be understood without those concepts.

David Egolf (Feb 06 2025 at 21:44):

I don't know that formula yet. I just know it exists! I'm somewhat hopeful I can work out a special case here without using the general formula. Maybe doing that can help provide intuition for the more general formula.

John Baez (Feb 06 2025 at 21:45):

John Baez (Feb 06 2025 at 21:47):

It's probably not giving much away to say it amounts to "Take a coproduct of walking vertices, one for each vertex of your graph, and walking edges, one for each edge of your graph. Then do some coequalizers that wind up gluing these together to form your graph."

David Egolf (Feb 06 2025 at 21:50):

David Egolf (Feb 06 2025 at 21:52):

I started thinking about this. First, I tried to consider all the different ways that individual edges could be connected. My idea was that we'd need a different coequalizer to glue together edges differently in each case. But I didn't like how there seemed to be a large number of different cases to consider.

Instead, I think it will be clearer to try to glue all the edges at once, using a single coequalizer.

John Baez (Feb 07 2025 at 00:07):

David Egolf (Feb 07 2025 at 18:04):

I made some pictures to help visualize this process. Let's imagine we wish to build up this graph as a colimit of walking edges and vertices:
graph

David Egolf (Feb 07 2025 at 18:05):

We start by just taking the coproduct of the correct number of walking edges:
coproduct of edges

If there were any "unattached" vertices with no edges going to or from them, we'd want to also make sure we included some copies of our walking vertex in our coproduct.

David Egolf (Feb 07 2025 at 18:06):

Then we'll want to glue together some vertices. Each dashed oval here indicates a collection of vertices that we want to glue together:
vertices to be glued together

David Egolf (Feb 07 2025 at 18:07):

My plan is to use a coequalizer to glue all these together. The coequalizer we want will be of two graph homomorphisms from a discrete graph (one having no edges). There will be one vertex in this discrete graph for each pair of vertices we wish to glue together.

David Egolf (Feb 07 2025 at 18:14):

The blue vertices I've added in are the vertices in the discrete graph involved in our coequalizer. The dashed lines indicate the two graph homomorphisms in our coequalizer.

David Egolf (Feb 07 2025 at 18:17):

It remains to spell out exactly how we detect when two vertices need to be glued together. We do this by working with edges sitting inside the graph we wish to create. There are three cases in which we need to glue vertices together:

David Egolf (Feb 07 2025 at 18:21):

By considering each edge in turn, I think we'll figure out in this way all the vertices that need gluing.
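
A small sketch of this gluing procedure, for graphs without isolated vertices. The function name `glue_walking_edges` and the encoding of a graph as a list of `(source, target)` pairs are choices made for this illustration. Rather than going through the cases, it glues two endpoints whenever they are meant to be the same vertex of the graph we want.

```python
def glue_walking_edges(edge_list):
    """Rebuild a graph from one walking edge per edge, glued along shared vertices.

    `edge_list` is a list of (source, target) pairs naming the vertices of the
    graph we want to build.
    """
    # Coproduct of walking edges: edge number i contributes two fresh vertices.
    ends = [(i, end) for i in range(len(edge_list)) for end in ("s", "t")]
    wanted = {(i, end): edge_list[i][0 if end == "s" else 1] for (i, end) in ends}

    # Discrete graph: one vertex for each pair of endpoints that should be glued.
    # The two homomorphisms send the pair (p, q) to p and to q respectively.
    pairs = [(p, q) for p in ends for q in ends if p < q and wanted[p] == wanted[q]]

    # Coequalizer on vertices: quotient by the relation generated by p ~ q.
    parent = {p: p for p in ends}

    def find(p):
        while parent[p] != p:
            p = parent[p]
        return p

    for p, q in pairs:
        parent[find(p)] = find(q)

    glued_vertices = {find(p) for p in ends}
    glued_edges = [(find((i, "s")), find((i, "t"))) for i in range(len(edge_list))]
    return glued_vertices, glued_edges, wanted


example = [("a", "b"), ("b", "c"), ("a", "c"), ("c", "c")]
glued_vertices, glued_edges, wanted = glue_walking_edges(example)
```

For the example, the eight endpoints collapse to three glued vertices, and relabelling each glued vertex by the vertex it stands for recovers the original edge list. Isolated vertices would need extra walking vertices in the coproduct, as noted earlier.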

David Egolf (Feb 07 2025 at 18:23):

I think that makes sense to me! As a next step, I may try to compare this proposed coequalizer with the standard formula for writing an object in a presheaf category as a colimit of representables.

John Baez (Feb 08 2025 at 00:10):

Yes, you're doing it right. It's impressive how tersely the standard formula gets the job done, with no discussion of cases.

David Egolf (Feb 11 2025 at 17:19):

I'm hoping to next think about the standard formula. But in the meantime, here's something related which I thought was pretty cool.

We know that every object in a presheaf category can be written as a colimit of representable presheaves. And we also know that left adjoints preserve colimits. Given these things, consider the left adjoint functor

\Lambda: \widehat{\mathcal{O}(X)} \to \mathsf{Top}/X

, which sends presheaves on

X

to bundles over

X

. We see that:

John Baez (Feb 11 2025 at 17:36):

We get a representable presheaf

F

on the poset

\mathcal{O}(X)

of open subsets of

X

by fixing an open set

U_0

and defining

F(U)

for any open

U

to have one element if

U \subseteq U_0

and none otherwise.

John Baez (Feb 11 2025 at 17:38):

If we take this presheaf

F

and turn it into an etale space over

X

, it's an etale space that has one point over each

x \in U_0

and none over each

x \notin U_0
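
A toy check of this on a finite space, using the convention that the presheaf represented by $U_0$ has one element on $U$ exactly when there is a morphism from $U$ to $U_0$ in the poset of open sets, that is, when $U$ is contained in $U_0$. The space, its topology and the names `F` and `fibre_size` are all made up for the sketch.

```python
# A small finite space X = {1, 2, 3}; this topology is chosen only for illustration.
opens = [frozenset(s) for s in ([], [1], [1, 2], [1, 2, 3])]
U0 = frozenset([1, 2])


def F(U):
    """Presheaf represented by U0: one element when U is contained in U0, else empty."""
    return {"*"} if U <= U0 else set()


def fibre_size(x):
    """Number of points of the etale space over x, i.e. the size of the stalk of F at x."""
    # The stalk is a colimit over open neighbourhoods of x. Every F(U) has at most
    # one element, so the stalk has one point exactly when some open neighbourhood
    # of x lies inside U0, and is empty otherwise.
    return 1 if any(x in U and F(U) for U in opens) else 0


fibres = {x: fibre_size(x) for x in [1, 2, 3]}
```

Here the points 1 and 2 of $U_0$ each get one point in their fibre and the point 3 gets none.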

John Baez (Feb 11 2025 at 17:42):

So you're right: this gives a great way to see that all etale spaces are made by gluing together ones of this simple sort! We drew some pictures a long time ago:

David Egolf (Feb 11 2025 at 18:41):

If I understand him correctly, @Morgan Rogers (he/him) talks about this result in this video, about 17 minutes and 18 seconds in: https://youtu.be/QURq2a1ezns?t=1038. (And then it occurred to me that we could probably use things we've discussed in this thread to prove it!)

David Egolf (Feb 11 2025 at 18:44):

I suppose this line of thinking can be applied every time we have a left adjoint functor from a category of presheaves. That seems useful to know: we can transport our ability to "build up things" in a presheaf category (where the "building blocks" are the representable presheaves) along a left adjoint.

John Baez (Feb 11 2025 at 20:06):

Yes. A good simple example is the 'geometric realization' functor from Gph to Top, which takes any graph and turns it into a topological space made of points for vertices and closed intervals for edges.

Josselin Poiret (Feb 12 2025 at 10:01):

David Egolf (Feb 13 2025 at 18:30):

The general procedure for building up a presheaf

F:C^{\mathrm{op}} \to \mathsf{Set}

as a colimit of representable presheaves is as follows:

David Egolf (Feb 13 2025 at 18:32):

Let's see what this gives us for a graph

F:G^{\mathrm{op}} \to \mathsf{Set}

. Its category of elements

\int F

has as objects specific vertices and specific edges, and we put a "source" morphism from a specific edge to a specific vertex if that vertex is the source of that edge. And similarly we put a "target" morphism from a specific edge to a specific vertex if that vertex is the target of that edge. The opposite category is obtained by reversing all these arrows.

David Egolf (Feb 13 2025 at 18:33):

Given a specific vertex, applying

\pi

gives us

v

- the object of vertices in

G

. Then applying the yoneda embedding gives us

y(v) = G(-,v):G^{\mathrm{op}} \to \mathsf{Set}

. A similar thing will happen with any specific edge.

David Egolf (Feb 13 2025 at 18:35):

Any source morphism in

(\int F)^{\mathrm{op}}

gets mapped to the morphism

s

in

G

, and then to

y(s):y(v)\to y(e)

David Egolf (Feb 13 2025 at 18:37):

So it seems to me that we basically take our category of elements for our graph of interest, reverse each arrow, and then replace each specific thing (e.g. a specific edge) with the yoneda embedding of the corresponding general thing (e.g.

y(e)

).

David Egolf (Feb 13 2025 at 18:39):

Let me see how this works for the graph

\bullet \to \bullet \to \bullet

. First, we visualize its category of elements. From left to right, let's name the vertices

v_l

v_m

, and

v_r

. And let's name the edges

e_l

and

e_r

from left to right.

David Egolf (Feb 13 2025 at 18:43):

David Egolf (Feb 13 2025 at 18:44):

Here

s_l

tells us that the source of

e_l

is

v_l

, and similarly

t_l

tells us that the target of

e_l

is

v_m

. (

s_r

and

t_r

tell us similar things for

e_r

.)

David Egolf (Feb 13 2025 at 18:47):

David Egolf (Feb 13 2025 at 18:50):

Finally, we apply the yoneda embedding to get the following, where

G_V =y(v)

and

G_E = y(e)

are respectively the walking vertex and walking edge:
final diagram

David Egolf (Feb 13 2025 at 18:50):

If I did this correctly, taking the colimit of this diagram in

[G^\mathrm{op},\mathsf{Set}]

should give us the graph

\bullet \to \bullet \to \bullet

that we started with.

Probably this works out? At least, I'm encouraged to see the span in the middle of this diagram, which reminds me of the pushout square we were working with earlier.
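
One way to check this by machine, as a sketch: colimits of presheaves are computed pointwise, so we can take the disjoint union of the vertex sets (and edge sets) of all the representables in the diagram and glue along the maps induced by the source and target morphisms. The encoding below (pairs like `("e_l", "s")` for the source end of the walking edge sitting at `e_l`) is invented for this illustration.

```python
# The path graph  v_l --e_l--> v_m --e_r--> v_r  as a presheaf F.
F_v = ["v_l", "v_m", "v_r"]
F_e = ["e_l", "e_r"]
F_s = {"e_l": "v_l", "e_r": "v_m"}
F_t = {"e_l": "v_m", "e_r": "v_r"}

# One representable per element of F. The walking vertex at x has the single
# vertex (x, "id"); the walking edge at a has vertices (a, "s") and (a, "t")
# and the single edge (a, "id").
verts = [(x, "id") for x in F_v] + [(a, end) for a in F_e for end in ("s", "t")]
edges = [(a, "id") for a in F_e]

# Each source (target) morphism in the category of elements gives a map from a
# walking vertex into a walking edge; the colimit identifies a vertex with its image.
glue = [((F_s[a], "id"), (a, "s")) for a in F_e]
glue += [((F_t[a], "id"), (a, "t")) for a in F_e]

parent = {p: p for p in verts}


def find(p):
    while parent[p] != p:
        p = parent[p]
    return p


for p, q in glue:
    parent[find(q)] = find(p)  # keep the walking-vertex copy as representative

# Vertices, source and target of the colimit graph (edges are not glued at all).
colim_v = {find(p) for p in verts}
colim_s = {a: find((a, "s"))[0] for a in F_e}
colim_t = {a: find((a, "t"))[0] for a in F_e}
```

The colimit has one vertex per original vertex, and `colim_s` and `colim_t` agree with `F_s` and `F_t`, so in this example the diagram does recover the path graph.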

David Egolf (Feb 13 2025 at 19:15):

I really like the idea that we can determine a graph up to isomorphism by how it relates to ("is built up from") the representable presheaves available, in the context of how other objects in the category relate to the representable presheaves. (Specifically, note that any cocone under such a diagram made out of representable presheaves "factors through" the universal cocone.)

John Baez (Feb 13 2025 at 20:10):

Great stuff! I agree with it all. It's nice how much general abstract nonsense about presheaf categories becomes intuitive if you examine how it works in the case of graphs.

David Egolf (Feb 13 2025 at 22:10):

David Egolf (Feb 13 2025 at 22:11):

Here we are defining an "element" of an object

X

to be a morphism

:1 \to X

from the terminal object

1

to that object

X

David Egolf (Feb 13 2025 at 22:20):

It's not so clear to me exactly how a morphism from a terminal object to

X

behaves like an "element" of

X

. This definition works out nicely in the case of

\mathsf{Set}

, but not always. For example, every group has exactly one element using this definition.

David Egolf (Feb 13 2025 at 22:23):

One good feature of this definition is that a morphism

f:X \to Y

does indeed send elements of

X

to elements of

Y

. Also, this definition interacts nicely with limits. For example, the elements of a product are in bijection with pairs of elements, one from each of the two objects we are taking the product of.

David Egolf (Feb 13 2025 at 22:26):

More generally, we could consider generalized elements of kind

A

. Such an element of an object

X

is a morphism

:A \to X

. We can expect this to give a nice notion of element (in the spirit of an element of a set) if (1)

A

is "pretty small" (it's dot-like; you can't fit a lot of things inside it) and (2)

A

fits inside other objects in a variety of ways (e.g. it's not like the trivial group).

David Egolf (Feb 13 2025 at 22:28):

With regards to the puzzle, I suspect this functor

\mathrm{elt}

is exactly

\mathsf{C}(1, -):\mathsf{C} \to \mathsf{Set}

. (I think we call this the "global section" functor when

\mathsf{C}

is a topos!)

John Baez (Feb 14 2025 at 00:23):

It works better for some categories than others. In some interesting cases the functor

\mathsf{elt} : \mathsf{C} \to \mathsf{Set}

is faithful. Then it gives a way of thinking of objects of

\mathsf{C}

as sets with extra structure. For example this works for the category of topological spaces or posets, but not for the category of groups (as you observed) or vector spaces. This is one reason topological spaces and posets feel more "geometric" and less "algebraic" than groups or vector spaces.

John Baez (Feb 14 2025 at 00:29):

A category

\mathsf{C}

equipped with a faithful functor to

\mathsf{Set}

is called a [[concrete category]]. We can make

\mathsf{Grp}

or

\mathsf{Vect}

into a concrete category using the ordinary common-sense notion of 'element', but these elements are not morphisms out of

1

David Egolf (Feb 14 2025 at 01:35):

In the case of

\mathsf{Grp}

or

\mathsf{Vect}

we might make use of our free/forgetful adjunctions. For example,

\mathsf{Grp}(F(S), G) \cong \mathsf{Set}(S, U(G))

where

F:\mathsf{Set} \to \mathsf{Grp}

is the free group functor and

U:\mathsf{Grp} \to \mathsf{Set}

sends any group to its underlying set. Here

S

is some set and

G

is some group.

Then we see that

\mathsf{Grp}(F(1), G) \cong \mathsf{Set}(1, U(G))

where

1

is a terminal object of

\mathsf{Set}

. So if we want to get our "usual" elements of a group we should consider

F(1)

-shaped elements, where

F(1)

is the free group on one element. That is, we should consider

\mathbb{Z}

-shaped generalized elements.
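
Spelled out, and writing the group operation of $G$ multiplicatively (a notational choice made here, not in the discussion above), the bijection sends a homomorphism to the image of the generator $1 \in \mathbb{Z}$:

```latex
\begin{align*}
\mathsf{Grp}(\mathbb{Z}, G) &\longrightarrow U(G), & \varphi &\longmapsto \varphi(1), \\
U(G) &\longrightarrow \mathsf{Grp}(\mathbb{Z}, G), & g &\longmapsto (n \mapsto g^{n}).
\end{align*}
```

These two maps are mutually inverse because a homomorphism out of $\mathbb{Z}$ is determined by where it sends $1$.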

John Baez (Feb 14 2025 at 01:54):

Right! Here's a fun fact I learned from @Todd Trimble, which is easy to prove once you hear it's true. Suppose you have a category

\mathsf{C}

and a functor

U: \mathsf{C} \to \mathsf{Set}

that has a left adjoint. Show that

U

is representable, i.e. there's a natural isomorphism

David Egolf (Feb 14 2025 at 19:27):

Hmm, let me see. So we have

U:C \to \mathsf{Set}

that has a left adjoint. Calling the left adjoint

F

, we get a bijection

C(F(s),c) \cong \mathsf{Set}(s, U(c))

for any set

s

and object

c

of

C

.

As above, let

s = 1

, a terminal object of

\mathsf{Set}

. So then

C(F(1),c) \cong \mathsf{Set}(1, U(c))

. Now, in the category of sets, two sets are isomorphic iff there is a bijection between their elements. So,

\mathsf{Set}(1, U(c)) \cong U(c)

. Thus we have

C(F(1),c) \cong U(c)

for any

c \in C

David Egolf (Feb 14 2025 at 19:28):

It remains to show that this bijection upgrades to a natural isomorphism between

U

and

C(F(1),-)

. We know that

C(F(1),-) \cong \mathsf{Set}(1, U(-))

is a natural isomorphism. So if we can show there is a natural isomorphism between

\mathsf{Set}(1, U(-))

and

U

then we are done.

We check naturality for a morphism

f:a \to b

in

C

. We propose a natural isomorphism

\alpha:\mathsf{Set}(1, U(-)) \to U

with

c

-th component given as

\alpha_c:\mathsf{Set}(1, U(c)) \to U(c)

, which acts by

(\ast \mapsto m) \mapsto m

for any

m \in U(c)

. (Here

\ast

is the single element of the set

1

). Note that each component is a bijection. Our naturality square becomes:
naturality square

David Egolf (Feb 14 2025 at 19:28):

This arbitrary naturality square commutes, and thus we indeed have a natural isomorphism between

\mathsf{Set}(1, U(-))

and

U

Combining this with the fact that

C(F(1),-) \cong \mathsf{Set}(1, U(-))

, we conclude that we have a natural isomorphism

U \cong C(F(1),-)

. So, indeed,

U

is representable!
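
For the record, the naturality check can be written in a couple of lines. For $f: a \to b$ in $C$ and $m \in U(a)$, write $\ulcorner m \urcorner$ for the function $\ast \mapsto m$ (this bracket notation is introduced only for this sketch):

```latex
\begin{align*}
\alpha_b\bigl(\mathsf{Set}(1, U(f))(\ulcorner m \urcorner)\bigr)
  &= \alpha_b\bigl(U(f) \circ \ulcorner m \urcorner\bigr)
   = \alpha_b\bigl(\ulcorner U(f)(m) \urcorner\bigr)
   = U(f)(m), \\
U(f)\bigl(\alpha_a(\ulcorner m \urcorner)\bigr)
  &= U(f)(m).
\end{align*}
```

Both ways around the square send $\ulcorner m \urcorner$ to $U(f)(m)$, so the square commutes.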

David Egolf (Feb 14 2025 at 19:31):

As a consequence, I suppose that the "forward direction" functor for any geometric morphism to

\mathsf{Set}

is representable. That is, if we have a geometric morphism

:\mathcal{E} \to \mathsf{Set}

with functors

f_\ast:\mathcal{E} \to\mathsf{Set}

and

f^\ast:\mathsf{Set} \to \mathcal{E}

, then

f_\ast

is representable.

David Egolf (Feb 14 2025 at 19:40):

As another consequence, we can expect "forgetful" functors to

\mathsf{Set}

to always be representable - any such functor will be representable provided that it has a left adjoint.

John Baez (Feb 14 2025 at 21:49):

Very nice! I like how you mined this idea for all it's worth (or at least some more of what it's worth :upside_down:) rather than merely solving the puzzle and then quitting.

I think these facts really clarify why, when we have a category

\mathsf{C}

with a functor

U: \mathsf{C} \to \mathsf{Set}

with a left adjoint

F : \mathsf{Set} \to \mathsf{C}

, we so often like to define 'elements' of an object

c \in \mathsf{C}

to be elements of the set

\mathsf{C}(F1, c) \cong U(c)

. This natural isomorphism gives two useful but equivalent ways to define elements of

c

. In one, we work largely within

\mathsf{C}

and use generalized elements. In the other, we use ordinary elements of the underlying set

U(c)

Morgan Rogers (he/him) (Feb 15 2025 at 15:45):

This is true! Unfortunately there aren't many of these. Considering that the inverse image functor

f^\ast

has to preserve finite limits, can you see why there aren't many?

David Egolf (Feb 15 2025 at 19:32):

f^\ast:\mathsf{Set} \to \mathcal{E}

will preserve terminal objects and also coproducts - and every object in

\mathsf{Set}

is a coproduct of terminal objects, as every set is the disjoint union of singleton sets. It preserves coproducts because it preserves colimits (as it is a left adjoint). It preserves terminal objects because it preserves finite limits.

So, at least on objects,

f^\ast

has very little flexibility: it has to send any singleton set to a terminal object of

\mathcal{E}

, and it has to send a set with

n

elements to the coproduct of

n

copies of a terminal object of

\mathcal{E}

. A similar restriction should apply to infinite sets too, as they are built up as an (infinite) coproduct of singleton sets.
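
In symbols, for any set $S$, and writing $1_{\mathcal{E}}$ for a terminal object of $\mathcal{E}$ (notation chosen for this sketch), the argument reads:

```latex
\begin{align*}
f^\ast(S)
  \;\cong\; f^\ast\Bigl(\coprod_{s \in S} 1\Bigr)
  \;\cong\; \coprod_{s \in S} f^\ast(1)
  \;\cong\; \coprod_{s \in S} 1_{\mathcal{E}}.
\end{align*}
```

The first isomorphism uses that every set is a coproduct of singletons, the second that $f^\ast$ preserves coproducts, and the third that $f^\ast$ preserves terminal objects.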

Morgan Rogers (he/him) (Feb 17 2025 at 08:56):

Exactly! So there can be at most one up to isomorphism, and since the natural endomorphisms of that functor are determined by their component at the terminal object, there can't even be any non-identity endomorphisms. The conclusion is that Set is terminal amongst cocomplete toposes (where we need cocompleteness for the left adjoint of the geometric morphism to exist).

David Egolf (Feb 17 2025 at 19:10):

My argument above showed that the inverse image functor part of any geometric morphism

f:\mathcal{E} \to \mathsf{Set}

, namely

f^\ast:\mathsf{Set} \to \mathcal{E}

, is required to act in a certain way on objects. I've been trying to understand why

f^\ast

is also required to act in a certain way on morphisms.

After some thought (and consulting "Sheaves in Geometry and Logic") I realized that the discussion above is very helpful for this question. If we have a geometric morphism

f:\mathcal{E} \to \mathsf{Set}

then the direct image functor

f_\ast

has a left adjoint, and by the discussion above is thus representable. Specifically, again by the discussion above,

f_\ast

is represented by

f^\ast(1)

, where

1

is a singleton set. Since

f^\ast(1)

has to be a terminal object, as

f^\ast

preserves finite limits, this determines

f_\ast

up to natural isomorphism. This then determines

f^\ast

up to natural isomorphism (as a left adjoint, if it exists, is unique up to natural isomorphism).

David Egolf (Feb 17 2025 at 19:23):

David Egolf (Feb 17 2025 at 19:27):

Doesn't every topos have all small colimits?
[edit: as pointed out below, an elementary topos does not necessarily have all small colimits]

John Baez (Feb 17 2025 at 19:43):

And beware that right now Morgan and I are using topos to mean [[elementary topos]]. A [[Grothendieck topos]] is a topos of sheaves on a site, and these topoi have all small colimits.

David Egolf (Feb 17 2025 at 19:52):

Ah, thanks for pointing that out! My downfall was checking my (faulty) memory against an nLab article that used the word "topos" presumably to refer to a Grothendieck topos. Probably it would be good for me to more carefully check which notion of topos is being used in the future.

David Egolf (Feb 17 2025 at 19:55):

It says this before discussing different notions of topos, which perhaps could lead some readers astray.

John Baez (Feb 17 2025 at 20:30):

That's exactly the sort of thing that made me complain (in a conversation which @Mike Shulman happened to partake in) about how I found the distinction between two uses of "topos" quite confusing at first.

John Baez (Feb 17 2025 at 22:28):

David Egolf (Feb 20 2025 at 17:12):

David Egolf (Feb 20 2025 at 17:14):

Since

\mathrm{elt}

is

\mathsf{C}(1,-)

, and representable functors preserve limits,

\mathrm{elt}

in particular preserves finite products.

John Baez (Feb 20 2025 at 17:15):

Nice - you didn't even use the hypothesis that

\mathsf{C}

is cartesian! Do you want to comment on that?

David Egolf (Feb 20 2025 at 17:23):

Reviewing the definition of a cartesian category, I see that a category is cartesian if it has finite products (and thus binary products and a terminal object). I assumed that

\mathsf{C}

has a terminal object

1

. And more generally I also assumed that there are finite products in

\mathsf{C}

to be preserved.

So I think I was effectively assuming that

\mathsf{C}

is cartesian, even though I didn't say that explicitly.

David Egolf (Feb 20 2025 at 17:27):

More generally,

\mathrm{elt}

will preserve any limits that exist in

\mathsf{C}

. (And we can define

\mathrm{elt}

as long as

\mathsf{C}

has a terminal object.)

John Baez (Feb 20 2025 at 17:32):

Yes. If I were writing a detailed answer in a textbook I might say something like this: representable functors preserve all limits that exist in

\mathsf{C}

, but when we say a specific functor preserves a certain class of limits, e.g. finite products, we typically imply that the domain of this functor has those limits. A cartesian category is one where all finite products exist. We need the nullary product

1

to exist for

\mathrm{elt}

to even be well-defined, but then if

\mathsf{C}

has finite products

\mathrm{elt}

will preserve them.

John Baez (Feb 20 2025 at 17:34):

I was expecting the student to just prove by hand that

\mathrm{elt}

preserves finite products, but you did a lot better: you instantaneously showed that it preserves all limits (or at least all that exist). You also showed that we're not using anything special about

1

in the definition

\mathrm{elt} = \mathsf{C}(1, -)

David Egolf (Feb 20 2025 at 17:53):

I do like it when I can instantaneously prove things, but sometimes I worry about missing some intuition by using such a powerful result.

However, the preservation of products by

\mathrm{elt}

intuitively makes sense in the context of the universal property of products. A morphism to a product corresponds to a morphism to each of the objects we are taking the product of. So, an element of a product corresponds to choosing an element from each of the objects we are taking the product of.
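Written out, the intuition in the previous paragraph is just the universal property of the product applied to morphisms out of the terminal object. The objects X and Y below are placeholder names for any two objects whose product exists.

```latex
\mathrm{elt}(X \times Y) = \mathsf{C}(1, X \times Y) \;\cong\; \mathsf{C}(1, X) \times \mathsf{C}(1, Y) = \mathrm{elt}(X) \times \mathrm{elt}(Y)
```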

John Baez (Feb 20 2025 at 17:57):

Yes, I don't think I even thought about "representables preserve limits" when writing that exercise! :sweat_smile: I thought about it in the hands-on, down-to-earth way you are now.

John Baez (Feb 20 2025 at 18:00):

I think you'll see further exercises about how

\mathrm{elt}

works for cartesian closed categories... or at least, there are a lot of fun facts about this, which are good ways to start developing a feel for cartesian closed categories and how the "internal hom"

x^y

, which is an object in the category, relates to the usual "external hom"

\text{hom}(y,x)

, which is a set.

John Baez (Feb 20 2025 at 18:03):

Oh, I guess in Part 6 I described some of those facts in my exposition rather than making them exercises. I only covered the most basic of facts.

David Egolf (Feb 24 2025 at 18:02):

(I've been pretty focused working on other things. I still want to keep working on this, but perhaps a bit more slowly. I'm hoping to start on the next puzzle around the 28th or the 1st)

Peva Blanchard (Feb 26 2025 at 14:10):

@David Egolf It's quite fortunate, because I am busy with other things too, but I still want to follow the thread.

John Baez (Feb 26 2025 at 17:12):

David Egolf (Feb 28 2025 at 18:37):

David Egolf (Feb 28 2025 at 18:40):

A product of two graphs

F \times G

has a morphism to

F

and a morphism to

G

. So, if we're at a vertex in

F \times G

, we can think of that as being at a vertex in

F

and a vertex in

G

at the same time. This might be helpful intuition if we view

F

and

G

as describing ways in which two system properties can evolve over time: the product graph I'm guessing tells us how our system properties can evolve over time if we keep track of both properties at once.

David Egolf (Feb 28 2025 at 18:41):

Motivated by this intuition, I'm going to let

F

be a "cycle" graph

:a \to b \to c \to a

. (I'm thinking of this as modelling a property that can rotate through different values but only in a specific order). And I'll let

G

just be a single arrow

:x \to y

, modelling an irreversible change to a property.

David Egolf (Feb 28 2025 at 18:47):

David Egolf (Feb 28 2025 at 18:54):

We have

(F \times G)(v) = F(v) \times G(v)

and similarly

(F \times G)(e)= F(e) \times G(e)

. So

F \times G

will have 6 vertices and 3 edges.

David Egolf (Feb 28 2025 at 18:55):

It remains to figure out how these vertices and edges are connected, which amounts to figuring out the source and target functions for

F \times G

David Egolf (Feb 28 2025 at 19:10):

There is a general procedure (which was discussed above) for figuring out the limit or colimit of a diagram in a category of functors. In this case,

(F \times G)(s)

is the unique morphism such that this diagram of natural transformations commutes:
diagram

David Egolf (Feb 28 2025 at 19:13):

That is,

(F \times G)(s)

is the function

:(F \times G)(e) \to (F \times G)(v)

induced by

F(s) \circ \pi_{F(s)}:(F \times G)(e) \to F(e) \to F(v)

and

G(s) \circ \pi_{G(s)}:(F \times G)(e) \to G(e) \to G(v)

David Egolf (Feb 28 2025 at 19:17):

(F \times G)(e)

is just the pairs of edges

(u,v)

where

u

is an edge in

F

and

v

is an edge in

G

. So our function takes in

(u,v)

and outputs in the first component the source of

u

. Similarly, it will output in the second component the source of

v

In short,

(F \times G)(s)

sends each pair of edges to the pair of their corresponding sources. We can expect

(F \times G)(t)

to work in an analogous way.
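The componentwise description above can be checked mechanically. The following is a minimal sketch, assuming a graph is stored as a dict with a vertex set and a dict sending each edge name to its (source, target) pair; the function name `graph_product` and the edge labels are made up for this illustration.

```python
from itertools import product

def graph_product(F, G):
    """Product of two directed multigraphs, computed componentwise."""
    vertices = set(product(F["V"], G["V"]))
    edges = {}
    for u, v in product(F["E"], G["E"]):
        (su, tu), (sv, tv) = F["E"][u], G["E"][v]
        # The source of (u, v) is the pair of sources; likewise for targets.
        edges[(u, v)] = ((su, sv), (tu, tv))
    return {"V": vertices, "E": edges}

# The "cycle" graph a -> b -> c -> a and the single arrow x -> y.
cycle = {"V": {"a", "b", "c"},
         "E": {"ab": ("a", "b"), "bc": ("b", "c"), "ca": ("c", "a")}}
arrow = {"V": {"x", "y"}, "E": {"xy": ("x", "y")}}

P = graph_product(cycle, arrow)
print(len(P["V"]), len(P["E"]))
for name, (src, tgt) in sorted(P["E"].items()):
    print(name, ":", src, "->", tgt)
```

Every edge of the product moves one step in both factors at once, which is why the three edges all run from a vertex over x to a vertex over y.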

David Egolf (Feb 28 2025 at 19:17):

David Egolf (Feb 28 2025 at 19:36):

This is not what I was expecting! I was expecting the "cycle" shape to have been preserved in some way. Interestingly, we don't include in this product any edge corresponding to staying at the same place in one graph while moving in the other - that also comes as a bit of a surprise, compared to my initial intuitive guess.

David Egolf (Feb 28 2025 at 19:37):

Instead, this product graph seems to display ways to move "one step" in both graphs at once.

David Egolf (Feb 28 2025 at 19:48):

I think if I wanted to retain some of the cycle structure in the product, I could do this by making it possible to "stay at

x

" even while moving along edges. So I could add a loop

1_x:x \to x

to try and do this.

If I didn't make a mistake, this is the new product graph, after adding in

1_x:x \to x

:
new product graph

Notice how we now have a cycle which involves moving through

a

,

b

, and

c

as we stay at

x
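Here is a small sketch that recomputes this product and follows the edges that "stay at x". It assumes the same dict-based encoding of graphs as a vertex set plus an edge-to-(source, target) map; `graph_product` and the label `1x` for the added loop are names chosen for this illustration.

```python
from itertools import product

def graph_product(F, G):
    """Product of two directed multigraphs, computed componentwise."""
    vertices = set(product(F["V"], G["V"]))
    edges = {}
    for u, v in product(F["E"], G["E"]):
        (su, tu), (sv, tv) = F["E"][u], G["E"][v]
        edges[(u, v)] = ((su, sv), (tu, tv))
    return {"V": vertices, "E": edges}

cycle = {"V": {"a", "b", "c"},
         "E": {"ab": ("a", "b"), "bc": ("b", "c"), "ca": ("c", "a")}}
# The arrow x -> y with an extra loop 1x at x.
arrow_with_loop = {"V": {"x", "y"},
                   "E": {"xy": ("x", "y"), "1x": ("x", "x")}}

P = graph_product(cycle, arrow_with_loop)

# Keep only the product edges whose second component is the loop at x.
stay = {src: tgt for (u, v), (src, tgt) in P["E"].items() if v == "1x"}

# Walk three steps starting at (a, x).
walk = [("a", "x")]
for _ in range(3):
    walk.append(stay[walk[-1]])
print(walk)
```

The walk returns to its starting vertex after three steps, which is the cycle described above.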

John Baez (Feb 28 2025 at 21:02):

Excellent! You're noticing that the product of graphs is not geometrically intuitive unless we work with 'reflexive' graphs: graphs where each vertex has a distinguished edge from it to itself, commonly called an 'identity edge'. You gave

x

an identity edge and the product looked more like the product of a triangle and an interval. If you gave

y

an identity edge the product would look even nicer: it would become a wire-frame model of a triangular prism!

David Egolf (Mar 04 2025 at 01:57):

A friend of mine once asked me why we insist on having identity morphisms in categories. Now I know that, at least for graphs, products become more intuitive when we have identity edges!

David Egolf (Mar 04 2025 at 01:59):

Leading up to the next puzzle, I'd next like to prove this: if

F

is the walking vertex, then

F \times G

is a graph with vertices corresponding to those of

G

but having no edges.

David Egolf (Mar 04 2025 at 02:01):

The number of edges in the product graph is the product of the number of edges in the two graphs we are taking the product of. Since

F

has no edges,

F \times G

also has no edges.

And similarly the number of vertices in the product graph is the product of the number of vertices in the two graphs we are taking the product of. Since

F

has only one vertex, the vertices of

F \times G

will be in bijection with the vertices of

G

David Egolf (Mar 04 2025 at 02:04):

As described in the blog post, combining the above with the fact that we are working in a cartesian closed category lets us conclude this: a vertex of

H^G

is a function sending vertices of

G

to vertices of

H

John Baez (Mar 04 2025 at 03:04):

John Baez (Mar 04 2025 at 03:09):

A small stylistic point: while your arguments are correct, they become simpler and the second gives a slightly stronger result if you avoid bringing numbers into the game (which after all requires the theory of products of infinite cardinals, etc.). You could have said this:

The set of edges in the product graph is the product of the set of edges in the two graphs we are taking the product of. Since

F

has an empty set of edges, so does

F \times G

Similarly the set of vertices in the product graph is the product of the set of vertices in the two graphs we are taking the product of. Since

F

has only one vertex, the set of vertices of

F \times G

is in natural bijection with the set of vertices of

G

(In the second one I'm using a general fact: in any cartesian category we have a natural isomorphism

1 \times X \stackrel{\sim}{\longrightarrow} X

.)

John Baez (Mar 04 2025 at 03:10):

David Egolf (Mar 04 2025 at 04:29):

Those are interesting answers! I don't remember what my answer was... I don't think I had a very confident one. I might have mentioned that every group is required to have an identity element.

I think I've also seen the idea somewhere to view objects (and their identity morphisms?) as sort of "degenerate" morphisms that just haven't been "stretched out". So I might have said something vague in that direction as well.

David Egolf (Mar 04 2025 at 04:32):

David Egolf (Mar 04 2025 at 04:37):

John Baez (Mar 04 2025 at 06:53):

David Egolf (Mar 05 2025 at 18:03):

David Egolf (Mar 05 2025 at 18:05):

Each edge of

H^G

corresponds to a graph morphism

:F \to H^G

. Since we are working in a cartesian closed category, which I'll call

\mathsf{Graph}

, we have

\mathsf{Graph}(F \times G, H) \cong \mathsf{Graph}(F, H^G)

David Egolf (Mar 05 2025 at 18:06):

So, to understand edges of

H^G

, it will help to understand the graph morphisms

:F \times G \to H

, where

F

is the walking edge.

David Egolf (Mar 05 2025 at 18:09):

A graph morphism preserves the source and target of edges. So it will be helpful to understand what the edges of

F \times G

are. An edge of

F \times G

amounts to a graph morphism

:F \to F \times G

, which amounts to a graph morphism

:F \to F

and a graph morphism

:F \to G

. Since

F

only has one edge, there is a unique graph morphism

:F \to F

. Thus, the edges of

F \times G

correspond to graph morphisms

:F \to G

, and hence to edges of

G

David Egolf (Mar 05 2025 at 18:11):

... I suppose it would have been simpler to just note that

(F \times G)(e) \cong F(e) \times G(e) \cong G(e)

when

F(e)

is a singleton. At any rate, the edges of

F \times G

are in bijection with those of

G

David Egolf (Mar 05 2025 at 18:14):

We also have

(F \times G)(v) \cong F(v) \times G(v)

. Since

F(v)

has two elements, we have two vertices in

F \times G

for each vertex in

G

. So, calling the vertices of

F

by the names

1

and

2

(for the source and target of the single edge), all our vertices of

F \times G

are of the form

(1,v_G)

or

(2,v_G)

where

v_G

is some vertex in

G

David Egolf (Mar 05 2025 at 18:16):

We'll put in a single edge in

F \times G

from

(1,v_G)

to

(2,v_G')

iff there is an edge going from

v_G

to

v_G'

in

G

David Egolf (Mar 05 2025 at 18:20):

Now that we know what

F \times G

looks like, we can say what a graph morphism

:F \times G \to H

amounts to.
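As a sanity check on this picture of the product with the walking edge, here is a minimal sketch using a dict-based encoding of graphs (vertex set plus edge-to-(source, target) map). The example graph and the helper name `graph_product` are hypothetical and chosen only for illustration.

```python
from itertools import product

def graph_product(F, G):
    """Product of two directed multigraphs, computed componentwise."""
    vertices = set(product(F["V"], G["V"]))
    edges = {}
    for u, v in product(F["E"], G["E"]):
        (su, tu), (sv, tv) = F["E"][u], G["E"][v]
        edges[(u, v)] = ((su, sv), (tu, tv))
    return {"V": vertices, "E": edges}

# The walking edge: two vertices 1 and 2 and a single edge "!" from 1 to 2.
walking_edge = {"V": {1, 2}, "E": {"!": (1, 2)}}

# A hypothetical example graph with a loop at q.
G = {"V": {"p", "q", "r"},
     "E": {"f": ("p", "q"), "g": ("q", "r"), "h": ("q", "q")}}

P = graph_product(walking_edge, G)

print(len(P["V"]), "vertices,", len(P["E"]), "edges")
for name, (src, tgt) in sorted(P["E"].items()):
    print(name, ":", src, "->", tgt)
```

Each edge of the product runs from a vertex tagged 1 to a vertex tagged 2, and there is exactly one such edge per edge of the example graph.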

David Egolf (Mar 05 2025 at 21:02):

First of all, this involves a function from the edges of

G

(which correspond to the edges of

F \times G

) to the edges of

H

David Egolf (Mar 05 2025 at 21:05):

We'll also need a function from the vertices of

F \times G

to the vertices of

H

. We'll map each vertex of the form

(1,v_G)

to some vertex in

H

, and each vertex of the form

(2, v_G)

to some vertex in

H

. So, we can think of this function as corresponding to two functions from the vertices of

G

to the vertices of

H

David Egolf (Mar 05 2025 at 21:06):

All of this needs to define a graph homomorphism

:F \times G \to H

. So this implies we need some compatibility conditions between our function on edges and our two functions on vertices.

David Egolf (Mar 05 2025 at 21:16):

Let's consider some arbitrary edge

(!,x):(1,v_G) \to (2, v_G')

in

F \times G

. Here

!

refers to the unique edge from

1

to

2

in our "walking edge" graph

F

, and

x:v_G \to v_G'

is some edge in

G

from

v_G

to

v_G'

.

The source of this edge is

(1,v_G)

, and its target is

(2, v_G')

. Let

f_1:\{1\} \times G(v) \to H(v)

and

f_2:\{2\} \times G(v) \to H(v)

be our two functions on vertices, and let

f_e:G(e) \to H(e)

be our function on edges.

David Egolf (Mar 05 2025 at 21:16):

Then

f_1(s(!,x)) = f_1(1, v_G)

needs to be equal to

s(f_e(x))

, so that the source of this edge is preserved. Similarly,

f_2(t(!,x)) = f_2(2, v_G')

needs to be equal to

t(f_e(x))

, to preserve the target of this edge.

David Egolf (Mar 05 2025 at 21:19):

Perhaps more cleanly, if we let

\lambda

denote an arbitrary edge in

F \times G

, I think these are the compatibility conditions we require between

f_1

,

f_2

, and

f_e

:

f_1(s(\lambda)) = s(f_e(\lambda)), \qquad f_2(t(\lambda)) = t(f_e(\lambda))

where

f_e(\lambda)

is shorthand for applying

f_e

to the underlying edge of

G

corresponding to

\lambda

. (Here each

s

refers to a function that returns the source of an edge, and each

t

similarly refers to a function that returns the target of an edge.)
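The two compatibility conditions can be turned into a small checker. This is a sketch under simplifying assumptions: the two vertex functions are stored as dicts keyed directly by the vertices of the first graph (dropping the 1 and 2 tags), the edge function is a dict from edge names to edge names, and the function name `is_edge_of_exponential` is invented for this illustration.

```python
def is_edge_of_exponential(G, H, f1, f2, fe):
    """Check that (f1, f2, fe) respects sources and targets.

    For every edge x of G, the source of fe[x] must be f1[source of x]
    and the target of fe[x] must be f2[target of x].
    """
    for x, (src, tgt) in G["E"].items():
        h_src, h_tgt = H["E"][fe[x]]
        if h_src != f1[src] or h_tgt != f2[tgt]:
            return False
    return True

# G is the single arrow x -> y and H is the cycle a -> b -> c -> a.
G = {"V": {"x", "y"}, "E": {"xy": ("x", "y")}}
H = {"V": {"a", "b", "c"},
     "E": {"ab": ("a", "b"), "bc": ("b", "c"), "ca": ("c", "a")}}

f1 = {"x": "a", "y": "c"}   # where the vertices tagged 1 go
f2 = {"x": "b", "y": "b"}   # where the vertices tagged 2 go

good = is_edge_of_exponential(G, H, f1, f2, {"xy": "ab"})
bad = is_edge_of_exponential(G, H, f1, f2, {"xy": "bc"})
print(good, bad)
```

Note that only the value of the first function at sources of edges, and of the second at targets of edges, is constrained; the other values are free.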

David Egolf (Mar 05 2025 at 21:24):

In summary, an edge of

H^G

amounts to a graph morphism

:F \times G \to H

(where

F

is the walking edge graph), which amounts to:

John Baez (Mar 05 2025 at 21:40):

I'm not sure I agree with your answer, although it sounds very close to what I'm getting. Here's what I'm getting:

An edge of

H^G

amounts to a pair of graph morphisms

f_1 : G \to H

and

f_2 : G \to H

, together with for each vertex

v

of

G

a chosen edge of

H

going from

f_1(v)

to

f_2(v)

John Baez (Mar 05 2025 at 21:42):

I could be mixed up, or have misread you. But I believe

f_1

and

f_2

need to be graph morphisms, not just maps sending vertices of

G

to vertices of

H

. That's because given a graph morphism

F \times G \to H

, each vertex of the walking edge

F

will determine a graph morphism

G \to H

John Baez (Mar 05 2025 at 21:44):

(That's supposed to be a general thing: given a graph morphism

A \times B \to C

, each vertex of

A

will determine a graph morphism

B \to C

.)

David Egolf (Mar 05 2025 at 21:50):

I'm trying to understand this. My first thought was that, given some graph morphism

\phi:A \times B \to C

, we could try to form

B \xrightarrow{(\Delta_{v_A},1_B)} A \times B \xrightarrow{\phi} C

. Here

\Delta_{v_A}:B \to A

is supposed to be a graph morphism that "collapses" all of

B

's vertices to a fixed vertex

v_A

in

A

.

However, a problem with this idea is that we can't always collapse all of

B

to some vertex

v_A

in

A

- in particular we can have a problem if

v_A

has no loop edges (that go from

v_A

to

v_A

).

David Egolf (Mar 05 2025 at 21:51):

So I don't yet see how to get a graph morphism

:B \to C

given any vertex of

A

John Baez (Mar 05 2025 at 21:53):

I bet I was wrong! Maybe my intuition only works for reflexive graphs. A vertex of

A

determines a morphism

V \to A

where

V

is the walking vertex. Given a morphism

A \times B \to C

we thus get a morphism

V \times B \to C

. Now if

V

is terminal, this is the same as a morphism

B \to C

. But while the walking vertex is terminal in the category of reflexive graphs, it's not in the category of graphs!

John Baez (Mar 05 2025 at 21:55):

So when I said "given a graph morphism

A \times B \to C

and a vertex of

A

we get a graph morphism

B \to C

", it turns out I should have said "we get a graph morphism

V \times B \to C

", which is quite similar but interestingly different!

John Baez (Mar 05 2025 at 21:57):

It's very interesting to think about

V \times A

and how this compares to

A

John Baez (Mar 05 2025 at 21:58):

In the category of graphs the walking vertex

V

is what's called a 'subterminal object': a subobject of the terminal object. Taking the product with a terminal object does nothing, but taking the product with a subterminal object does 'less than nothing': more precisely it has a destructive effect.

David Egolf (Mar 05 2025 at 22:06):

(That's interesting! It will take me some time and energy to absorb what you just said. Once I do that, I'll write a longer response.)

John Baez (Mar 05 2025 at 23:06):

Okay. Some of what I said may not be quite correct since I was trying to quickly capture some brand-new thoughts, so be careful.

David Egolf (Mar 06 2025 at 22:41):

Each vertex of

A

corresponds to a morphism

:V \to A

. Using this together with the identity morphism on

B

, we get a morphism

V \times B \to A \times B

. And if

V \times B \cong B

we could get a morphism

B \to A \times B

However,

V \times B

is usually different from

B

. Indeed

V

has zero edges and one vertex, so

V \times B

has zero edges and its vertices are in bijection with those of

B

. That is,

V \times B

is like a version of

B

where we've deleted all the edges.
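A quick computational check of this "edge deletion" effect, as a sketch assuming the dict-based encoding of graphs used for illustration (a vertex set plus an edge-to-(source, target) map). The example graph and the name `graph_product` are hypothetical.

```python
from itertools import product

def graph_product(F, G):
    """Product of two directed multigraphs, computed componentwise."""
    vertices = set(product(F["V"], G["V"]))
    edges = {}
    for u, v in product(F["E"], G["E"]):
        (su, tu), (sv, tv) = F["E"][u], G["E"][v]
        edges[(u, v)] = ((su, sv), (tu, tv))
    return {"V": vertices, "E": edges}

# The walking vertex: one vertex, no edges.
walking_vertex = {"V": {"*"}, "E": {}}

# A hypothetical graph B with three vertices and three edges.
B = {"V": {"p", "q", "r"},
     "E": {"f": ("p", "q"), "g": ("q", "r"), "h": ("r", "p")}}

VB = graph_product(walking_vertex, B)
print(sorted(VB["V"]))
print(VB["E"])
```

The product of an empty edge set with anything is empty, so no edge of the example graph survives, while each of its vertices appears once.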

David Egolf (Mar 06 2025 at 22:44):

If we have a subterminal object

m:U \to 1

, then we can consider

U \times A

. We get an induced morphism

m \times 1_A:U \times A \to 1 \times A \cong A

. I think that

(m,1_A):(U,A) \to (1,A)

is a monomorphism (in a product category), because both

m

and

1_A

are monomorphisms. Then, since right adjoints preserve monomorphisms (and taking the product is a right adjoint functor), I think our induced morphism

:U \times A \to A

will always be a monomorphism.

David Egolf (Mar 06 2025 at 22:49):

So we can expect that taking a product against a subterminal object will produce a "part" of what we started with.

John Baez (Mar 06 2025 at 22:58):

All this sounds correct! This is really cute. I love how taking the product of a graph with the 'walking vertex' graph

\bullet

just kills off all the edges of that graph.

This should generalize in some fairly simple and vivid way whenever we have a presheaf category (like the category of graphs, but lots of others). Someone in this room must know a nice characterization of all subterminal objects in a presheaf category. Taking the product with any of these gives a nice way to "simplify" a presheaf by discarding the features the subterminal object doesn't have.

Heck, I can guess how it works. The terminal object in the category of presheaves on

C

has "one feature of each kind" - it maps each object

c \in C^{\text{op}}

to the 1-element set. So I believe a subterminal object has "at most one feature of each kind" - it maps each object

c \in C^{\text{op}}

to either the 1-element set or the empty set. And we should be able to see what choices are allowed by turning

C

into a preorder. We should get one subterminal object in the category of presheaves on

C

for each [[down set]] of its corresponding preorder. (Or maybe [[up set]], since the

\text{op}

confuses me, and there's a convention involved in turning a category into a preorder. If it works out to be up sets, I'll be upset.)

David Egolf (Mar 07 2025 at 16:59):

I like the idea that each object of

C

corresponds to one "kind of feature"! In our case, we have two objects: the vertex object and the edge object.

I agree that if a presheaf is subterminal, then it can have at most one of each kind of feature (and a terminal object has exactly one of each kind of feature). That's because we need an injective function from its set of features of any given kind to a singleton set.

A graph is a presheaf

:\mathsf{G}^{\mathrm{op}} \to \mathsf{Set}

. Here,

\mathsf{G}

has morphisms from the vertex object

v

to the edge object

e

. So if we created a preorder from

\mathsf{G}

by letting

x \leq y

iff there is a morphism from

x

to

y

(for any objects

x

and

y

in

\mathsf{G}

), we'd have that

v \leq e

.

If we have just a vertex, that's fine. But if we have an edge in a graph - we need a vertex too! Since

v \leq e

, it seems like our subterminal objects potentially correspond to down sets in the preorder we made from

\mathsf{G}

David Egolf (Mar 07 2025 at 17:03):

More generally, there is no function from a non-empty set to an empty set. So if there is a morphism

f:b \to a

in

C

(which corresponds to a morphism from

a

to

b

in

C^{\mathrm{op}}

), a presheaf

F

on

C

can never have

F(a)

non-empty if

F(b)

is empty. So, if

F(b)

is empty, then

F(a)

must be empty too.
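For the category of graphs, this constraint is small enough to enumerate by brute force. The sketch below records only which objects of the indexing category receive the one-element set (the "support"), and keeps the supports allowed by the argument above. The names `objects`, `morphisms`, and `is_subterminal_support` are made up for this illustration.

```python
from itertools import combinations

objects = ["v", "e"]
# Non-identity morphisms of the indexing category, as (domain, codomain):
# the source and target maps both go from v to e.
morphisms = [("v", "e"), ("v", "e")]

def is_subterminal_support(support):
    # A morphism b -> a induces a function F(a) -> F(b), so if F(a) is
    # non-empty then F(b) must be non-empty as well.
    return all(b in support for (b, a) in morphisms if a in support)

supports = [set(s)
            for r in range(len(objects) + 1)
            for s in combinations(objects, r)
            if is_subterminal_support(set(s))]

for s in supports:
    print(sorted(s))
```

The three surviving supports correspond to the empty graph, the walking vertex, and the terminal graph (one vertex with one loop). The support containing only the edge object is rejected because an edge needs a vertex.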

David Egolf (Mar 07 2025 at 17:41):

Taking this a bit further, if we have no morphism from

b

to

a

in

C

, then we have no morphism from

y(b)

to

y(a)

in

[C^{\mathrm{op}}, \mathsf{Set}]

, because the Yoneda embedding

y:C \to [C^{\mathrm{op}}, \mathsf{Set}]

is full and faithful. Thus, in this case there are no "walking

b

's" in

y(a) \times A

, where

A

is some presheaf on

C

If we did have some "walking

b

" in

y(a) \times A

, this would correspond to a morphism

:y(b) \to y(a) \times A

. And this corresponds to a morphism

:y(b) \to y(a)

together with a morphism

:y(b) \to A

. But since there is no morphism from

b

to

a

, there is no morphism from

y(b)

to

y(a)

, and thus

y(a) \times A

indeed admits no "walking

b

" generalized elements (there is no morphism from

y(b)

to it).

David Egolf (Mar 07 2025 at 17:44):

In our case, there is no morphism from the edge object to the vertex object in

\mathsf{G}

, and thus there are no edges in

V \times A

(where

A

is any graph and

V

is the walking vertex).

John Baez (Mar 07 2025 at 17:54):

When you put it this way, it's clear that just as objects of a category are 'features' that a presheaf can have, the order

\leq

could be called the order of 'feature dependency'. You can imagine buying a fancy kitchen gadget, where there are lots of models with various features... but only if it has one feature

x

can it have some other feature

y

that relies on

x

David Egolf (Mar 12 2025 at 17:33):

David Egolf (Mar 12 2025 at 17:41):

It's helpful to recall that a vertex of

H^G

is a function from the vertices of

G

to the vertices of

H

. In the previous puzzle, we saw that every edge of

H^G

comes with two functions from the vertices of

G

to the vertices of

H

. So, we might guess that these two functions are in fact the source and target of the edge in question!

(I don't know if that's correct, but it helps to have a guess to get started. I'll pause here for now.)

David Egolf (Mar 13 2025 at 17:29):

Using the fact that

\mathsf{Graph}

is cartesian closed, the data of a specific edge of

G^H

corresponds to a morphism

:E \times H \to G

in

\mathsf{Graph}

, where

E

is the walking edge. Similarly, the data of a specific vertex of this edge corresponds to a morphism

:V \times H \to G

, where

V

is the walking vertex. I suspect that these two morphisms are related if the vertex in question is the source or target of the edge in question.

David Egolf (Mar 13 2025 at 17:43):

Because

\mathsf{Graph}

is cartesian closed, we have a bijection

\mathsf{Graph}(Y \times H, G) \cong \mathsf{Graph}(Y,G^H)

for any graphs

G,H,Y

. This is natural in each variable, so in particular we have a natural isomorphism

\mathsf{Graph}(- \times H, G) \cong \mathsf{Graph}(-,G^H)

Let's consider the functor

\mathsf{Graph}(- \times H, G):\mathsf{Graph}^{\mathrm{op}} \to \mathsf{Set}

. Note that if we supply to this functor a morphism

:V \to E

in

\mathsf{Graph}

(which corresponds to a morphism

:E \to V

in

\mathsf{Graph}^{\mathrm{op}}

), we'll get a function that sends edges of

G^H

to vertices of

G^H

. So, it seems likely helpful to understand how this functor acts on morphisms.

David Egolf (Mar 13 2025 at 17:51):

David Egolf (Mar 18 2025 at 02:23):

Zooming out for a moment: Imagine we have an adjunction, so we have a bijection

C(F(x), y) \cong D(x,G(y))

natural in

x

and

y

. Then we know that

C(F(-),y)

is naturally isomorphic to

D(-,G(y))

. Given

C(F(-),y)

, we may wish to try and determine a representing object for this functor, which will be isomorphic to

G(y)

In general, it seems like it could be quite useful/interesting to be able to figure out a representing object for a given representable functor!

David Egolf (Mar 18 2025 at 02:40):

To start to get an idea how this might work, let's think about some natural isomorphism

\alpha:D(-, G(y)) \to C(F(-),y)

. So this is a natural isomorphism

\alpha:D^{\mathrm{op}}(G(y),-) \to C(F(-),y)

I believe that the Yoneda lemma tells us that such a natural isomorphism is totally determined by where

\alpha_{G(y)}:D(G(y),G(y)) \to C(F(G(y)), y)

maps the identity morphism

1_{G(y)}

. So, if we want to understand what

G(y)

needs to be like so that we get a natural isomorphism

\alpha

, it may make sense to focus on a naturality square for

\alpha

involving

\alpha_{G(y)}

David Egolf (Mar 18 2025 at 02:53):

Hmm. Maybe this is helpful. Let

F:C \to \mathsf{Set}

be a representable functor, so there is some

x \in C

such that

C(x,-) \cong F

. Then we need

C(x,c) \cong F(c)

for all

c \in C

. This seems like it might help us narrow down what

x

is like!

David Egolf (Mar 18 2025 at 02:58):

Given a natural isomorphism

\alpha:C(x,-) \to F

, we also need any arbitrary naturality square to commute, such as this one for a morphism

f:c \to c'

in

C

:
image.png

David Egolf (Mar 18 2025 at 03:00):

I don't yet see how to find

x

(up to isomorphism) using this information. But this is telling us a lot about how our object

x

relates to other objects, which we can probably use to figure out specific things about

x

- as long as we can phrase those things in terms of the compositional structure of morphisms from

x

David Egolf (Mar 18 2025 at 03:02):

Returning to our specific case of interest, there is some natural isomorphism

\alpha:\mathsf{Graph}(- \times H, G) \cong \mathsf{Graph}(-,G^H)

. Maybe it will be helpful to write out a naturality square for this.

David Egolf (Mar 18 2025 at 03:05):

These are both functors

:\mathsf{Graph}^{\mathrm{op}} \to \mathsf{Set}

. So we'll have one component of our natural isomorphism for each graph. Let's consider the naturality square for the morphism

s:V \to E

in

\mathsf{Graph}

, which maps the walking vertex to the source of the walking edge.

David Egolf (Mar 18 2025 at 03:08):

Here's our naturality square, although I haven't labelled the morphism on the upper edge yet:
naturality square
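For readers without the image, the square can be sketched as follows. This is a reconstruction from the surrounding discussion rather than a copy of the original picture; the layout, the question mark on the not-yet-labelled top edge, and the component names are assumptions.

```latex
\begin{array}{ccc}
\mathsf{Graph}(E \times H, G) & \xrightarrow{\;\;?\;\;} & \mathsf{Graph}(V \times H, G) \\
\downarrow{\scriptstyle \alpha_E} & & \downarrow{\scriptstyle \alpha_V} \\
\mathsf{Graph}(E, G^H) & \xrightarrow{\;-\,\circ\, s\;} & \mathsf{Graph}(V, G^H)
\end{array}
```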

David Egolf (Mar 18 2025 at 03:11):

Okay! This feels like we are getting somewhere. I'm guessing that the morphism on the bottom edge of our square corresponds to the source function for

G^H

, which sends a given edge to its source vertex.

If that is true, then the morphism on the top edge of our square will give us another way to think about the source function, since it's the corresponding function between some sets isomorphic to those of interest. This will, I think, give us a more "concrete" description of our source function, because we have worked out some more concrete descriptions for

\mathsf{Graph}(E \times H,G)

and

\mathsf{Graph}(V \times H,G)

above.

David Egolf (Mar 18 2025 at 03:16):

David Egolf (Mar 18 2025 at 03:24):

If that's true, this feels pretty clarifying to me: we can get generalized elements ("parts", like edges or vertices) of a given object by considering morphisms to our object of interest from various objects (of different "shapes"); but we can also (maybe) understand how these generalized elements relate to one another (e.g. like how a specific edge has a specific source vertex) by studying how these generalized element morphisms relate via composition!

John Baez (Mar 18 2025 at 16:31):

Those bullet points sound true, and I bet they generalize to presheaves on an arbitrary category

\mathsf{C}

. For any object

x \in \mathsf{C}

there should be a presheaf on

\mathsf{C}

called the 'walking

x

', which is none other than the representable

\mathrm{hom}(-,x)

. And for any morphism

f: x \to y

there should be a morphism of presheaves from the walking

x

to the walking

y

, which is none other than postcomposition with

f

, which gives a function

Hmm, this sounds awfully like the Yoneda embedding of

\mathsf{C}

into

\mathsf{Set}^{\mathsf{C}^{\text{op}}}

David Egolf (Mar 18 2025 at 17:25):

The first two bullet points do generalize. We consider a presheaf

F

on some category

\mathsf{C}

, which is a functor

F:\mathsf{C}^{\mathrm{op}} \to \mathsf{Set}

. By the Yoneda lemma,

F(c) \cong \mathsf{PSh}_C (C(-,c),F)

, where

\mathsf{PSh}_C = \mathsf{Set}^{\mathsf{C}^{\mathrm{op}}}

is the category of presheaves on

C

. This generalizes how we can get the vertices or edges of a graph by considering morphisms to it from the walking vertex or edge.
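For graphs, this instance of the Yoneda lemma can be verified by brute force on a small example. The sketch below enumerates all graph morphisms out of the walking vertex and the walking edge into the cycle graph, and also checks that precomposing with the inclusion of the walking vertex as the source of the walking edge recovers the source function. The encoding and the helper name `graph_morphisms` are assumptions made for this illustration.

```python
from itertools import product

def graph_morphisms(A, B):
    """Brute-force all graph morphisms A -> B as (vertex_map, edge_map) pairs."""
    a_vs, a_es = sorted(A["V"]), sorted(A["E"])
    found = []
    for v_img in product(sorted(B["V"]), repeat=len(a_vs)):
        vmap = dict(zip(a_vs, v_img))
        for e_img in product(sorted(B["E"]), repeat=len(a_es)):
            emap = dict(zip(a_es, e_img))
            # A morphism must send each edge to an edge with matching
            # source and target.
            if all(B["E"][emap[e]] == (vmap[A["E"][e][0]], vmap[A["E"][e][1]])
                   for e in a_es):
                found.append((vmap, emap))
    return found

walking_vertex = {"V": {"*"}, "E": {}}
walking_edge = {"V": {1, 2}, "E": {"!": (1, 2)}}
F = {"V": {"a", "b", "c"},
     "E": {"ab": ("a", "b"), "bc": ("b", "c"), "ca": ("c", "a")}}

from_vertex = graph_morphisms(walking_vertex, F)
from_edge = graph_morphisms(walking_edge, F)
print(len(from_vertex), len(from_edge))

# Precomposing with s : V -> E (sending * to 1) picks out vmap[1].
sources = {emap["!"]: vmap[1] for vmap, emap in from_edge}
print(sources)
```

The brute-force search is exponential in the size of the domain graph, so it is only meant for tiny examples like this one.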

David Egolf (Mar 18 2025 at 17:25):

The last bullet point I think is also just the Yoneda lemma in disguise. The Yoneda lemma tells us that we have a natural isomorphism

\alpha:F \cong \mathsf{PSh}_C (y(-),F)

where

\mathsf{PSh}_C

is the category of presheaves on

\mathsf{C}

and

y

is the Yoneda embedding (so

y(c) = C(-,c)

).

David Egolf (Mar 18 2025 at 17:28):

Given some

f:c \to c'

in

C

, we get this naturality square:
naturality square

where

y(f):y(c) \to y(c')

is the Yoneda embedding of

f:c \to c'

, and where each component of

\alpha

is an isomorphism (a bijection).

David Egolf (Mar 18 2025 at 17:30):

Assuming I've remembered correctly how

\mathsf{PSh}_C(y(-),F)

acts on morphisms, this immediately tells us that

F(f)

can be understood (via a corresponding function between isomorphic sets) by precomposing with

y(f)

David Egolf (Mar 18 2025 at 17:33):

Now let

F

be a graph of interest and

s:v \to e

be the source morphism so that

F(s):F(e) \to F(v)

is our source function for

F

. We can conclude that indeed:

David Egolf (Mar 18 2025 at 17:40):

With this in mind, we recall from above that we have a natural isomorphism

\beta:\mathsf{Graph}(- \times H, G) \cong \mathsf{Graph}(-,G^H)

. Let's consider the naturality square for this for the morphism

y(s):y(v) \to y(e)

David Egolf (Mar 18 2025 at 17:42):

We recognize the bottom edge as the kind of function we were just considering, involving precomposition with the Yoneda embedding of

s:v \to e

. By stacking this naturality square with one of the form we considered above, we conclude that whatever

\mathsf{Graph}(- \times H, G)

does to

y(s):y(v) \to y(e)

corresponds to our source function for

G^H

John Baez (Mar 18 2025 at 18:01):

Sounds good! In my comment I mentioned "postcomposing". You're saying "precomposition" a lot. If we're really disagreeing you are probably right, because the "op"s in this business tend to trip me up. But maybe we're not even disagreeing.

David Egolf (Mar 18 2025 at 18:09):

I think you are saying above that given a morphism

f:x \to y

, its Yoneda embedding

y(f):C(-,x) \to C(-,y)

involves postcomposition with

f

. That makes sense to me.

David Egolf (Mar 18 2025 at 18:27):

I'm realizing this: the Yoneda lemma lets us think more visually/concretely about any presheaf:

John Baez (Mar 18 2025 at 18:28):

David Egolf (Mar 20 2025 at 16:53):

I think it now just remains to figure out what the functor

\mathsf{Graph}(- \times H, G):\mathsf{Graph}^{\mathrm{op}} \to \mathsf{Set}

does on the Yoneda embedding of the source morphism,

y(s)

. This will tell us how the source function for our graph

G^H

works.

David Egolf (Mar 20 2025 at 17:01):

Given a graph

A

, we have

F(A) = \mathsf{Graph}(A \times H, G)

. So, given a morphism of graphs

f:A \to B

the functor

F

should give us some function

F(f):\mathsf{Graph}(B \times H, G) \to \mathsf{Graph}(A \times H, G)

If we have a graph morphism

:B \times H \to G

, we can precompose with the morphism

f \times 1_H:A \times H \to B \times H

to get a morphism

:A \times H \to G

So I'm guessing that

F

acts on morphisms like this:

F(f):\mathsf{Graph}(B \times H, G) \to \mathsf{Graph}(A \times H, G)

is the function

k \mapsto k \circ (f \times 1_H)

David Egolf (Mar 20 2025 at 17:09):

If that is true, then what is

F(y(s))

? Since

y(s):y(v) \to y(e)

, we should get a function

F(y(s)):\mathsf{Graph}(y(e) \times H, G) \to \mathsf{Graph}(y(v) \times H, G)

David Egolf (Mar 20 2025 at 17:11):

I think this is the function that sends edges to their source for the graph

G^H

David Egolf (Mar 20 2025 at 17:15):

I'm unsure if this is a good point to declare this puzzle complete and move on - or whether it would be better to try and more explicitly understand the source function (and the target function) for

G^H

in an example.

John Baez (Mar 20 2025 at 17:15):

If you can write down a 'formula' for the function that sends edges to their source for

G^H

, you can try to compare it to the formula

and keep fiddling around with both formulas to make them look more similar, until finally you show they're equal.

This is a common approach to proving equations when you're not quite sure what to do: "chew away at both ends of the candy bar", making the difference between the two things shrink until you see they're equal.

John Baez (Mar 20 2025 at 17:17):

At the very least, writing down the equation that you're trying to prove is psychologically useful. Right now one side of your hoped-for equation is given by a formula and the other by a bunch of words, so it has a somewhat mysterious quality.

David Egolf (Mar 20 2025 at 17:20):

I'm not sure what you mean by a "formula" for a function in this context. I think I've already worked out what the function is that sends edges to sources for

G^H

. Namely it is

k \mapsto k \circ (y(s) \times 1_H)

. But it sounds like you have something different in mind.

John Baez (Mar 20 2025 at 17:22):

John Baez (Mar 20 2025 at 17:23):

When you said "I think this is the function that sends edges to their source for the graph

G^H

", I thought you meant you had not proved this yet. Maybe you use "think" in a different way than I do.

John Baez (Mar 20 2025 at 17:24):

David Egolf (Mar 20 2025 at 17:25):

I'm hesitating a bit for two reasons:
(1) I'm struggling to picture how the function I got actually works
(2) I was guessing from memory how the functor

\mathsf{Graph}(- \times H, G)

works. I'm fairly certain I remembered correctly, but not 100% sure.

John Baez (Mar 20 2025 at 17:26):

David Egolf (Mar 20 2025 at 17:27):

Yes, that's right. What you wrote in your most recent post looks to me like just different notation for what I wrote above.

John Baez (Mar 20 2025 at 17:27):

Okay, well, I haven't looked at the post in god knows how long, but I knew what I was doing back then so that's probably good.

David Egolf (Mar 20 2025 at 17:29):

Ah, sorry for the confusion - by "your most recent post" I meant your most recent zulip post :sweat_smile:. The one that starts with "Okay, I thought...".

John Baez (Mar 20 2025 at 17:29):

Okay. The point of my equation is that each side is a well-defined thing in its own right, not defined to equal the other, so you're trying to prove they're equal. And maybe you did... in a way that exuded lack of confidence.

act on morphisms when

\mathsf{C}

is a category with products and

X,Y

are two objects in

\mathsf{C}

John Baez (Mar 20 2025 at 17:31):

John Baez (Mar 20 2025 at 17:32):

David Egolf (Mar 20 2025 at 17:33):

Technically speaking, then, I believe my function I described above is not equal to the source function for

G^H

. It is instead a corresponding function between isomorphic sets. So intuitively I want to say the function I described above is "isomorphic" to the source function for

G^H

John Baez (Mar 20 2025 at 17:34):

Oh, okay. Then that equation I wrote down doesn't parse (I didn't check); it would need to be padded out with some isomorphisms to parse, i.e. to have a chance of being true.

John Baez (Mar 20 2025 at 17:34):

When the padding gets complicated enough it can pay to switch to commutative diagrams.

David Egolf (Mar 20 2025 at 17:36):

John Baez (Mar 20 2025 at 17:38):

Okay, great! At least I think it's good to write down the equation you're claiming to have proved, either in traditional equation form or as a commutative diagram.

David Egolf (Mar 20 2025 at 17:48):

The Yoneda lemma specifies a natural isomorphism

\alpha:\mathsf{Graph}(y(-), G^H) \cong G^H

, where

y

is the Yoneda embedding. In particular, it specifies bijections

\alpha_e:\mathsf{Graph}(y(e), G^H) \to G^H(e)

and

\alpha_v:\mathsf{Graph}(y(v), G^H) \to G^H(v)

. (Here

y

is the Yoneda embedding functor so

y(v)

is the walking vertex and

y(e)

is the walking edge).

Further, because our category

\mathsf{Graph}

is cartesian closed, we have a natural isomorphism

\beta:\mathsf{Graph}(- \times H, G) \cong \mathsf{Graph}(-, G^H)

. In particular, we have a bijection

\beta_e:\mathsf{Graph}(y(e) \times H, G) \cong \mathsf{Graph}(y(e), G^H)

and a bijection

\beta_v:\mathsf{Graph}(y(v) \times H, G)\cong \mathsf{Graph}(y(v), G^H)

David Egolf (Mar 20 2025 at 17:50):

We then get two naturality squares for

s:v \to e

, and I claim these share a common (unlabelled) morphism:
diagram

Do we really have a common morphism in these two naturality squares? That is, do we have

\mathsf{Graph}(y(-), G^H)(s) = \mathsf{Graph}(-, G^H)(y(s)^{\mathrm{op}})

? This should be true, because I assume that

\mathsf{Graph}(y(-), G^H)

is

\mathsf{Graph}(-, G^H) \circ y^{\mathrm{op}}

by definition.

David Egolf (Mar 20 2025 at 17:52):

Both of the individual naturality squares commute, so the outer rectangle commutes as well.

Letting

f

be the morphism on the top edge, we have

\alpha_v \circ \beta_v \circ f = G^H(s) \circ \alpha_e \circ \beta_e

. Thus

G^H(s) = \alpha_v \circ \beta_v \circ f \circ \beta_e^{-1} \circ \alpha_e^{-1}

David Egolf (Mar 20 2025 at 17:53):

It remains to describe

f:\mathsf{Graph}(y(e) \times H, G) \to \mathsf{Graph}(y(v) \times H, G)

. I claimed above that it acts by

k \mapsto k \circ (y(s) \times 1_H)

To check this, I'll review how

- \times H

and

\mathsf{Graph}(-, G)

act on morphisms.

David Egolf (Mar 20 2025 at 17:58):

I remember that

\mathsf{Graph}(-, G):\mathsf{Graph}^{\mathrm{op}} \to \mathsf{Set}

acts on a morphism

g^{\mathrm{op}}:B\to A

in

\mathsf{Graph}^{\mathrm{op}}

as follows. We get a function

:\mathsf{Graph}(B,G) \to \mathsf{Graph}(A,G)

. And this function acts by

k \mapsto k\circ g

, where

g:A \to B

is the morphism corresponding to

g^{\mathrm{op}}

in

\mathsf{Graph}

David Egolf (Mar 20 2025 at 18:00):

Next, I'd like to know how

- \times H:\mathsf{Graph} \to \mathsf{Graph}

acts on morphisms. I think it sends

f:A \to B

to the morphism

f \times 1_H:A\times H\to B \times H

. However, I'm not sure if this is correct.

David Egolf (Mar 20 2025 at 18:06):

I know there is a functor

P:\mathsf{Graph} \times \mathsf{Graph}\to \mathsf{Graph}

that sends each pair of graphs to some chosen product. And we can induce a morphism to

\mathsf{Graph} \times \mathsf{Graph}

using the functor

\Delta_H

constant at

H

and the identity functor of

\mathsf{Graph}

David Egolf (Mar 20 2025 at 18:07):

So we'll get a functor

\mathsf{Graph} \to _{(1_{\mathsf{Graph}},\Delta_H)} \mathsf{Graph} \times\mathsf{Graph} \to_P \mathsf{Graph}

. This acts on a graph

A

by

A \mapsto (A,H) \mapsto A \times H

. So it seems like this is a plausible functor to denote by

- \times H

David Egolf (Mar 20 2025 at 18:09):

Taking this as the definition of

- \times H

(and hopefully this matches the usual usage of

- \times H

), we can figure out what

- \times H

does on a morphism

f:A \to B

. It acts on it by

f \mapsto (f,1_H) \mapsto f \times 1_H

. So the functor

- \times H

acts on morphisms as I guessed above.
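
As a sanity check on this description, here is a small sketch. The dict encoding of graphs, the example graphs, and the helper names `times`, `times_id`, `compose`, and `is_hom` are all assumptions made for illustration. It builds the product with H, forms f × 1_H componentwise, and tests that the result is a graph morphism and that composition is preserved in one example.

```python
def times(A, H):
    """Product graph A x H: pairs of vertices and pairs of edges."""
    E = [(a, h) for a in A["E"] for h in H["E"]]
    return {"V": [(a, h) for a in A["V"] for h in H["V"]],
            "E": E,
            "s": {(a, h): (A["s"][a], H["s"][h]) for (a, h) in E},
            "t": {(a, h): (A["t"][a], H["t"][h]) for (a, h) in E}}

def times_id(f, H):
    """f x 1_H : A x H -> B x H for a graph morphism f = (fV, fE) : A -> B."""
    fV, fE = f
    return ({(a, h): (fV[a], h) for a in fV for h in H["V"]},
            {(a, h): (fE[a], h) for a in fE for h in H["E"]})

def compose(g, f):
    """The composite 'g after f' of two graph morphisms."""
    gV, gE = g
    fV, fE = f
    return ({x: gV[fV[x]] for x in fV}, {a: gE[fE[a]] for a in fE})

def is_hom(f, A, B):
    """Check that f commutes with the source and target functions."""
    fV, fE = f
    return all(B["s"][fE[a]] == fV[A["s"][a]] and
               B["t"][fE[a]] == fV[A["t"][a]] for a in A["E"])

walking_vertex = {"V": ["*"], "E": [], "s": {}, "t": {}}
walking_edge = {"V": ["s", "t"], "E": ["id"], "s": {"id": "s"}, "t": {"id": "t"}}

# Hypothetical example graphs: each is a single edge between two vertices.
G = {"V": [0, 1], "E": ["a"], "s": {"a": 0}, "t": {"a": 1}}
H = {"V": ["p", "q"], "E": ["h"], "s": {"h": "p"}, "t": {"h": "q"}}

y_s = ({"*": "s"}, {})                     # y(s) : y(v) -> y(e)
k = ({"s": 0, "t": 1}, {"id": "a"})        # a morphism y(e) -> G

k_times_id = times_id(k, H)
print(is_hom(k_times_id, times(walking_edge, H), times(G, H)))
# Functoriality in one instance: (k . y(s)) x 1_H == (k x 1_H) . (y(s) x 1_H)
print(times_id(compose(k, y_s), H) == compose(k_times_id, times_id(y_s, H)))
```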

David Egolf (Mar 20 2025 at 18:13):

We can now try to build up our functor of interest as

\mathsf{Graph}(-,G) \circ (- \times H)^{\mathrm{op}}

David Egolf (Mar 20 2025 at 18:21):

We start with some

f^{\mathrm{op}}:B \to A

in

\mathsf{Graph}^{\mathrm{op}}

, which corresponds to the morphism

f:A \to B

in

\mathsf{Graph}

. If I remember how opposite functors work, the output should then be

(f \times 1_H)^{\mathrm{op}}

, which is the morphism in

\mathsf{Graph}^{\mathrm{op}}

corresponding to the morphism

f \times 1_H

in

\mathsf{Graph}

David Egolf (Mar 20 2025 at 18:22):

We then supply

(f \times 1_H)^{\mathrm{op}}

to

\mathsf{Graph}(-, G)

. We get a function that acts by precomposing with

f \times 1_H

In particular, when we start with

y(s)^{\mathrm{op}}

, we'll get out a function that acts by precomposing with

y(s) \times 1_H
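
To make this concrete, here is a brute-force sketch under stated assumptions: the dict encoding of graphs, the two small example graphs, and the helpers `homs`, `times`, `times_id`, and `compose` are all hypothetical choices for illustration. It uses the sets Graph(y(v) × H, G) and Graph(y(e) × H, G) as stand-ins for the vertices and edges of G^H, and computes the source function as precomposition with y(s) × 1_H.

```python
from itertools import product

def homs(A, B):
    """Brute-force list of graph morphisms A -> B as (vertex_map, edge_map) pairs."""
    result = []
    for v_img in product(B["V"], repeat=len(A["V"])):
        fV = dict(zip(A["V"], v_img))
        for e_img in product(B["E"], repeat=len(A["E"])):
            fE = dict(zip(A["E"], e_img))
            if all(B["s"][fE[a]] == fV[A["s"][a]] and
                   B["t"][fE[a]] == fV[A["t"][a]] for a in A["E"]):
                result.append((fV, fE))
    return result

def times(A, H):
    """Product graph A x H: pairs of vertices and pairs of edges."""
    E = [(a, h) for a in A["E"] for h in H["E"]]
    return {"V": [(a, h) for a in A["V"] for h in H["V"]],
            "E": E,
            "s": {(a, h): (A["s"][a], H["s"][h]) for (a, h) in E},
            "t": {(a, h): (A["t"][a], H["t"][h]) for (a, h) in E}}

def times_id(f, H):
    """f x 1_H for a graph morphism f = (fV, fE)."""
    fV, fE = f
    return ({(a, h): (fV[a], h) for a in fV for h in H["V"]},
            {(a, h): (fE[a], h) for a in fE for h in H["E"]})

def compose(g, f):
    """The composite 'g after f' of two graph morphisms."""
    gV, gE = g
    fV, fE = f
    return ({x: gV[fV[x]] for x in fV}, {a: gE[fE[a]] for a in fE})

walking_vertex = {"V": ["*"], "E": [], "s": {}, "t": {}}
walking_edge = {"V": ["s", "t"], "E": ["id"], "s": {"id": "s"}, "t": {"id": "t"}}
y_s = ({"*": "s"}, {})   # y(s) : y(v) -> y(e)

# Hypothetical example graphs: each is a single edge between two vertices.
G = {"V": [0, 1], "E": ["a"], "s": {"a": 0}, "t": {"a": 1}}
H = {"V": ["p", "q"], "E": ["h"], "s": {"h": "p"}, "t": {"h": "q"}}

vertices = homs(times(walking_vertex, H), G)   # stands in for G^H(v)
edges = homs(times(walking_edge, H), G)        # stands in for G^H(e)

def source(k):
    """k |-> k . (y(s) x 1_H), the source function up to the isomorphisms alpha, beta."""
    return compose(k, times_id(y_s, H))

print(len(vertices), len(edges))
print(all(source(k) in vertices for k in edges))
```

Replacing `y_s` with the analogous morphism sending `*` to `t` would give the target function in the same way.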

David Egolf (Mar 20 2025 at 18:46):

David Egolf (Mar 20 2025 at 19:06):

Whew, that was a lot more work than I expected :sweat_smile:! But it was good to do - I think it really helped clarify the details of the argument for me.

David Egolf (Mar 21 2025 at 15:39):

I think we've now worked out the edges, vertices, and source function for the graph

G^H

. Since the target function should work similarly, we should be able to figure out what

G^H

is in some example.

As a next step (which I probably won't work on today!), I'd like to pick some small graphs

G

and

H

and try to draw the graph

G^H

John Baez (Mar 21 2025 at 16:17):

I'll read your longish series of posts more carefully when I have time - I'm very interested, but I'm going on a long car trip today and I don't want to crash.
