Chris Grossack's Blog

Finiteness in Sheaf Topoi

Mon, 19 Aug 2024 00:00:00 +0000

The notion of “finiteness” is constructively subtle in ways that can be tricky for people new to the subject to understand. For a while now I’ve wanted to figure out what’s going on with the different versions of “finite” in a way that felt concrete and obvious (I mentioned this in a few older blog posts here and here), and for me that means I want to understand them inside a sheaf topos $\mathsf{Sh}(X)$. I’ve thought about this a few times, but I wasn’t able to really see what was happening until a few days ago when I realized I had a serious misconception about picturing bundles and etale spaces! In this post, we’ll talk about that misconception, and spend some time discussing constructive finiteness in its most important forms.

As a short life update before we get started, I’m currently in Denmark with my advisor to hang out with Fabian Haiden. We’re going to be talking about all sorts of fun things related to my thesis work, mainly about Fukaya Categories. These can be really scary when you first start reading about them, but in the special case of surfaces they’re really not that bad! In the process of prepping for meeting Fabian, I’ve been writing a blog post that explicitly details what fukaya categories are, why you should care, and (for surfaces) how to compute them. I know I would have loved an article like this, and I’m excited to share one soon! I’m also finally going to finish my qual prep series from three years ago! Way back then I promised a post on fourier theory that I never got around to, but I recently found a draft of that post! So it might not be 100% true to what I was studying back then, but it’ll be as true to that as I can make it¹.

With all that out of the way, let’s get to it!

First, let’s just recall the notions of finiteness you’re likely to find in the literature². Keep in mind that, depending on the reference you’re reading, each of these is liable to be called just “finite” without disambiguation. Sometimes you even get more confusing conventions (such as writing “subfinite” to mean what we call “kuratowski finite”!), so make sure you read carefully to know exactly what each particular author means! For this post, though, we’ll write:

If $n : \mathbb{N}$, the Finite Cardinal of size $n$ is the set $[n] = \{ x : \mathbb{N} \mid x \lt n \}$.

A set $X$ is called Bishop Finite if it’s (locally) isomorphic to a cardinal. That is, if $\exists n : \mathbb{N} . \exists f : X \cong [n]$.

A set $X$ is called Kuratowski Finite if it’s (locally) the image of a cardinal. That is, if $\exists n : \mathbb{N} . \exists f : [n] \twoheadrightarrow X$.

A set $X$ is called Subfinite if it’s (locally) a subobject of a cardinal. That is, if $\exists n : \mathbb{N} . \exists f : X \hookrightarrow [n]$.

A set $X$ is called Dedekind Finite if every mono $X \hookrightarrow X$ is actually an iso. That is, if we can prove $\forall f : X \hookrightarrow X . \exists g : X \to X . fg = 1_X \land gf = 1_X$.

It’s pretty obvious from these definitions that dedekind finiteness is a bit different from the others. This comes with pros and cons, but in my experience it’s rarely the thing to consider. I’m including its definition more for cultural growth than anything, and I won’t say anything more about it in this post³.

Also, notice we’re putting an existential quantifier in front of everything. Externally, this means that we’re allowed to pass to an open cover of our base space and use a different $f$ (or indeed, a different $n$!) on each open. If you prefer type theory⁴, you can replace every instance of $\exists$ by the propositional truncation of a $\Sigma$-type.

It’s interesting to ask what the “untruncated” versions of these will be. I think that untruncated bishop finite types are exactly the cardinals, untruncated subfinite types are disjoint unions of finitely many propositions… But it’s not clear to me what the untruncated kuratowski finite types should be. Something like “finitely many copies of $B$, glued together along open sets”… But I don’t see a snappy characterization of these⁵.

Introducing Finiteness

Now, how should we go about visualizing these things? Every sheaf on $B$ is also an etale space over $B$ (that is, a local homeomorphism⁶ $E \to B$). And by thinking of our favorite sheaf as some space over $B$ we can draw pictures of it!

Let’s start simple and let $B = [-1,1]$ be an interval. Then a cardinal⁷ is a sheaf $[n] = \{x : \mathbb{N} \mid x \lt n \}$. In every sheaf topos $\mathsf{Sh}(B)$ the natural number object is (the sheaf of sections of) $B \times \mathbb{N}$ where $\mathbb{N}$ gets the discrete topology.

So if $n$ is a global section, it must be $B \times \{ n \}$ for some fixed $n$ in “the real world”. This also tells us that $[n] = \{ x : \mathbb{N} \mid x \lt n \}$ is given by $B \times \{0, 1, \ldots, n-1\}$, that is, by $n$ disjoint copies of $B$.

The situation is only slightly more complicated if $B$ has multiple components⁸ (say $B = [-1,1] \sqcup S^1$). In this case, $\mathbb{N}$ is still $B \times \mathbb{N}$, but now global sections can choose a different natural over each component:

Because of this, we end up with more sets $[n]$! Indeed, now a global section $n$ might be something like $(2,3)$ (shown in pink above) so that we get a cardinal $[(2,3)]$, shown below:

So cardinals are really really easy to work with! This is what we would expect, since they’re subsheaves of a particularly simple sheaf ($\mathbb{N}$). For instance, we can see the fact that cardinals always have decidable equality by noticing that any two sections over $U$ are either equal on all of $U$, or none of $U$! Contrast this with a doodle of what other sheaves might look like, where the pink and blue sections are different, but nonetheless intersect. Then the truth value of $(\text{pink} = \text{blue}) \lor (\text{pink} \neq \text{blue})$ as computed in $\mathsf{Sh}([-1,1]) \big / U$ will not be all of $U$⁹!

We also see why a cardinal is either empty or inhabited! As soon as you have a piece of a section over $U$, it must extend to a section on all of $U$.
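If you like to see this kind of thing computed, here's a small sketch of sections of a cardinal over a base with several components. The component names `"interval"` and `"circle"` and the function `sections` are made up for illustration. The only thing the code uses is that a section of $[n]$ is locally constant, so it picks one number below $n$ on each connected component.

```python
from itertools import product

def sections(cardinal, components):
    """All sections of the cardinal [n] over the union of the given components.

    `cardinal` maps each connected component of B to a natural number
    (that is, it is a global section n of N). A section of [n] is a locally
    constant choice, so it picks one k < n(c) on each component c.
    """
    ranges = [range(cardinal[c]) for c in components]
    return [dict(zip(components, choice)) for choice in product(*ranges)]

# The global section n = (2, 3) from the text: 2 on the interval, 3 on the circle.
n = {"interval": 2, "circle": 3}
all_global = sections(n, ["interval", "circle"])
print(len(all_global))  # 6

# A cardinal that is empty over one component and inhabited over the other.
m = {"interval": 0, "circle": 3}
print(sections(m, ["interval"]))     # []
print(len(sections(m, ["circle"])))  # 3
```

The last two lines are the "empty or inhabited" dichotomy in miniature: over each connected piece of the base, the cardinal either has no sections at all or has a section defined on the whole piece.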

Next up are the bishop finite sets¹⁰! These satisfy $\exists f : X \cong [n]$, so that we can find an open cover $\{U_\alpha \}$ of $B$ and functions $f_\alpha : X \! \upharpoonright_\alpha \cong [n] \! \upharpoonright_\alpha$.

Since $[n]$ is a constant family over $B$, this means we want $X$ to be locally constant. So bishop finite sets have to basically look like cardinals, but now we’re allowed to “twist” the fibres around. See how on each element of our cover we have something that looks like $[2]$, but globally we get something that is not isomorphic to $[2]$!

In fact, the bishop finite sets are exactly the covering spaces with finite fibres. These inherit lots of nice properties from the cardinals, since as far as the internal logic is concerned they’re isomorphic to cardinals!

For instance, here’s an internal proof that bishop finite sets have decidable equality:

$\ulcorner$ We know $\exists f : X \cong [n]$. Fix such an $f$, and let $x,y : X$. Since $f$ is an isomorphism we know $x=y$ if and only if $fx = fy$, but we can decide this using decidable equality on $[n]$ (which comes from decidable equality on $\mathbb{N}$, which we prove by induction). $\lrcorner$

We can also see this externally. As before, this is saying that two sections over $U$ either agree everywhere or nowhere (and it’s a cute exercise to see this for yourself). But we can even see this in a third way! It says that the subsheaf $\{(x,y) : X \times X \mid x=y \}$ is a clopen subset of $X \times X$! But we can compute $X \times X$ (the pullback of $X \to B$ along $X \to B$) and check this.

Note that the subsheaf (really sub-etale space) where the fibres are the same really is a clopen subset (read: a connected component) of the whole space $X \times X$ over $B$. If it’s not obvious that this really is the pullback, it makes a fun exercise! Actually, it’s a fun exercise to check this is true in general!

As a last remark, let’s also notice that bishop finite sets can be inhabited without being pointed. That is, a bishop finite set $X$ can satisfy $\exists x : X$ ($X$ is inhabited¹¹) without actually having $x:X$ for any $x$ ($X$ is not pointed)! This is because of the locality of the existential quantifier again! If we pass to an open cover of $B$, we can find a local section over each open. Unfortunately, these might not be compatible, so won’t glue to a global section (a point $x:X$)!
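Here's a minimal sketch of that gluing failure, assuming the base is a circle covered by two arcs $U$ and $V$ whose overlap has two components (I'm calling them `"A"` and `"B"`, and all the names in the code are hypothetical). Over each arc the cover looks like $[2]$, and a transition function on each overlap component says how the labels over $U$ match the labels over $V$.

```python
from itertools import product

# Circle covered by two arcs U and V; U ∩ V has two components, "A" and "B".
# Over each arc the cover looks like [2] = {0, 1}. A transition function on
# each overlap component says how the U-labels match up with the V-labels.
IDENTITY = {0: 0, 1: 1}
SWAP = {0: 1, 1: 0}

def global_sections(transition):
    """Pairs (s_U, s_V) of local sections that agree on both overlap components."""
    return [
        (s_u, s_v)
        for s_u, s_v in product(range(2), repeat=2)
        if all(transition[c][s_u] == s_v for c in ("A", "B"))
    ]

trivial_cover = {"A": IDENTITY, "B": IDENTITY}
twisted_cover = {"A": IDENTITY, "B": SWAP}

print(global_sections(trivial_cover))  # [(0, 0), (1, 1)]
print(global_sections(twisted_cover))  # []
```

In both cases there are two sections over each arc, so both covers are inhabited. Only the twisted one fails to be pointed, because no pair of local sections is compatible on both overlap components at once.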

Ok, now let’s get to the example that led me to write this post! How should we visualize kuratowski finite sets? These are (locally) quotients of cardinals, so we start with a cardinal (which we know how to visualize) and then we want to start gluing stuff together.

Let’s start with a trivial double cover of $B = [-1,1]$ (that is, with two copies of $B$), and glue them together along their common open subset $[-1,0) \cup (0,1]$. This space is the line with two origins, and it’s famously nonhausdorff!

You can see that this space still deserves to be called “finite” over $B$ (for instance, all the fibres are finite), but it’s more complicated than the bishop case. For instance, it doesn’t have a well defined “cardinality”. In some places it has one element in the fibre, and in others it has two. However, it’s still possible to list or enumerate all the sections. In the above picture, there’s the gold section and the blue section. It just happens that sometimes this list has duplicates, since away from $0$ they’re the same section!

The failure of hausdorffness and the failure of decidable equality are closely related. For example, consider the two sections from the previous picture. Decidable equality says that they should be either everywhere equal or everywhere unequal¹². But of course they aren’t! They’re equal in some places (over $1$, say) but unequal in others (over $0$)!
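We can compute this truth value in a toy model. The sketch below replaces $B = [-1,1]$ by a three-point space (an assumption made purely so the topology is finite): `"L"` and `"R"` stand in for the open halves $[-1,0)$ and $(0,1]$, and `"0"` is the closed point between them. The truth value of $(f = g) \lor (f \neq g)$ is the interior of the agreement set, union the interior of the disagreement set.

```python
# A three-point stand-in for B = [-1, 1]: "L" and "R" play the role of the
# open halves [-1, 0) and (0, 1], and "0" is the closed point between them.
POINTS = frozenset({"L", "0", "R"})
OPENS = [frozenset(s) for s in ([], ["L"], ["R"], ["L", "R"], ["L", "0", "R"])]

def interior(subset):
    """Largest open set contained in `subset` (the union of all opens inside it)."""
    result = frozenset()
    for u in OPENS:
        if u <= subset:
            result |= u
    return result

# On the line with two origins, the gold and blue sections agree away from 0.
agree = frozenset({"L", "R"})
differ = POINTS - agree

truth_value = interior(agree) | interior(differ)
print(sorted(truth_value))    # ['L', 'R']
print(truth_value == POINTS)  # False
```

The truth value misses the point `"0"`, and no open neighborhood of `"0"` fits inside it, which is the finite shadow of the problem we have in any neighborhood of $0$.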

Following Richard Borcherds, hausdorffness is also closely related to a kind of “analytic continuation”. Indeed, say your etale space $X$ is hausdorff and you pick a point $x_b$ over $b \in B$ (in this context we think of $x_b$ as the germ of a function at $b$). Then for any small enough open $U \ni b$, there is a unique extension of $x_b$ to a function on the whole neighborhood $U$! We are able to analytically continue $x_b$ from data at just a point to data defined in some neighborhood!

Here’s an easy exercise to see this for yourself and check your knowledge of the topology on the etale space of a sheaf:

Let $\mathcal{F}$ be a sheaf on $B$ whose etale space $X$ is hausdorff, and let $U$ be a connected open of $B$. Let $f,g \in \mathcal{F}(U)$ be local sections with $f(b_0) = g(b_0)$ for some $b_0 \in U$. Then show $f=g$ identically on $U$.

As a hint, consider the set $\{ b \in U \mid f(b) = g(b) \}$. Why is it open? Why is it closed?

In fact, if $B$ is locally connected and hausdorff, then the converse is also true, and our sheaf of sections $\mathcal{F}$ has this analytic continuation property if and only if its etale space is hausdorff¹³! This also makes a nice exercise (though it’s less easy), and I’ll include a solution below a fold.

solution

$\ulcorner$

First assume $\mathcal{F}$ has a hausdorff etale space $X$. Then for sections $f,g : U \to X$ we see that the set $\{b \in U \mid f(b) = g(b) \}$ is closed, as the preimage of the closed diagonal in $X \times X$ under $(f,g) : U \to X \times X$. This set is also open, since if $f(b) = g(b)$ in the stalk at $b$, the definition of stalk says that $f=g$ on a neighborhood of $b$. Since $U$ is connected (and the set is nonempty by assumption), we learn this subset is the whole of $U$ and $\mathcal{F}$ has analytic continuation.

Conversely, assume $\mathcal{F}$ has analytic continuation, and let $x,y \in X$ be two points. If $x$ and $y$ are in different fibres (say $x$ is over $b_x$ and $y$ is over $b_y$) then we can separate $b_x$ and $b_y$ using hausdorffness of $B$, and any representatives of $x$ and $y$ defined on these separating sets will separate $x$ and $y$ in $X$. So we're left with the case of $x$ and $y$ both living in the same fibre over $b$.

Say that every neighborhood of $x$ intersects every neighborhood of $y$. Using local connectedness of $B$, we can find a connected neighborhood $U$ of $b$, and representatives $(f,U)$ and $(g,U)$ of $x$ and $y$. These define open sets in the topology of $X$, and since neighborhoods of $x$ and $y$ always intersect, we know there's a $b' \in U$ with $f(b') = g(b')$. Then analytic continuation says $f=g$ on $U$ so that $x = f(b) = g(b) = y$, and $X$ is hausdorff, as desired.

$\lrcorner$

When I was trying to picture kuratowski finite objects inside $\mathsf{Sh}(B)$, I’d somehow convinced myself that a local homeomorphism over a hausdorff space has to itself be hausdorff. This is, obviously, not true! So I was trying to picture kuratowski finite sets and struggling, because I was only ever picturing hausdorff covering spaces. And we’ll see later that all hausdorff kuratowski finite sheaves (over spaces I was picturing) are actually bishop finite! So it’s no wonder I was confused! I didn’t realize the complexity (especially the nonhausdorff complexity) that etale spaces are allowed to have, even though in hindsight I’d been warned about this before¹⁴.

Going back to examples, we should ask if we recognize the sheaf of sections for our favorite “line with two origins” picture from earlier. Away from $0$, we know there’s exactly one section, but in any neighborhood of $0$ there are two sections. So we see this is the etale space for the skyscraper sheaf with two points over $0$!

Since our epi $[n] \twoheadrightarrow X$ only has to exist locally, kuratowski finite objects in $\mathsf{Sh}(B)$ can have the same twisting behavior as bishop finite sets too!

If you look up an example of a kuratowski finite set that isn’t bishop finite (for example, in Section 2.2.2 of Graham Manuell’s excellent note), you’re likely to find the example $\{ \top, p \}$ where $p$ is some truth value other than $\top$ or $\bot$. That is, where $p$ is some open set of the base space $B$.

Away from $p$, this set has two elements, $\top$ and $p$ (since away from $p$ we know $p = \bot$). But inside of $p$, this set has a single element (since in $p$ we know $p = \top$). We find its etale space is nonhausdorff as before, but now the nonhausdorffness is “spread out” over a larger set. It’s a good exercise to figure out what’s happening at the boundary of $p$. What are the basic opens of this space? Does the fibre at $p$ have one point, or two?

Kuratowski finite sets can be shockingly complicated! Especially in contrast to the relatively tame bishop finite sets. For example, consider $B = \mathbb{R}$ with a skyscraper sheaf at every integer¹⁵:

This space is kuratowski finite in $\mathsf{Sh}(\mathbb{R})$ (since it’s locally a quotient of $\mathbb{R} \sqcup \mathbb{R}$), but it has continuum many global sections! Indeed, we can choose either the top or the bottom point at every integer.
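To get a feel for how fast the sections pile up, here's a sketch that enumerates the sections over a bounded open interval $(a,b)$. The function name `sections_over` and the labels `"top"` and `"bottom"` are made up for illustration. The only choice a section has is which of the two points to pass through at each integer strictly inside the interval.

```python
from itertools import product
import math

def sections_over(a, b):
    """Sections over the open interval (a, b) of R with a doubled point at each integer.

    A section must pick "top" or "bottom" at every integer strictly
    between a and b; away from the integers there is nothing to choose.
    """
    integers = range(math.floor(a) + 1, math.ceil(b))
    return [dict(zip(integers, choice))
            for choice in product(("top", "bottom"), repeat=len(integers))]

print(len(sections_over(-0.5, 2.5)))  # 8, one choice at each of 0, 1, 2
print(len(sections_over(0.2, 0.8)))   # 1, no integers in the interval
```

An interval containing $k$ integers has $2^k$ sections, and letting the interval grow to all of $\mathbb{R}$ gives $2^{\aleph_0}$ of them.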

Since even the choice of $[n]$ is local, we can change the size of the fibres as long as they’re all finite!

In fact, on mastodon (here and here), Antoine Chambert-Loir and Oscar Cunningham pointed out that you can do even weirder things. For instance, the set $\{\frac{1}{n} \mid n \in \mathbb{N} \} \cup \{ 0 \}$ is closed in $[0,1]$. So its complement is open, and we can glue two copies of $[0,1]$ together along it! This gives us a kuratowski finite object which nonetheless has infinitely many sections in every neighborhood of $0$¹⁶!

Finally, let’s talk about subfinite sets. These are locally subobjects of cardinals, so are much easier to visualize again!

A subobject of a cardinal is just a disjoint union of open subsets of $B$:

So a subfinite set has to locally look like these. Keep in mind, though, that like kuratowski finite sets, the choice of $[n]$ is made locally. So we can have something like this:

In particular, we see that the fibres of a subfinite set don’t need to be globally bounded! So, perhaps surprisingly, a subfinite set does not need to be a subobject of a bishop finite set!

As we’ve come to expect, the existential quantifier gives us the ability to twist.

As subobjects of decidable objects, we see these inherit decidable equality. However, while bishop and kuratowski finite objects are all either empty or inhabited, subfinite sets don’t need to be! For an easy example, consider

Here the truth value of $\exists x : X$ is $U \cup V$, and the truth value of $X = \emptyset$ is $\text{int}(U^c \cap V^c)$. (Do you see why?)

An aside on decidable equality

This isn’t technically about visualizing etale spaces, but I was thinking about it while writing this post, and I think it fits well enough to include. If nothing else, it’s definitely interesting enough to include!

If you’re really paying attention, you’ll notice the “easy exercise” about analytic continuation didn’t actually use any aspects of finiteness. Indeed, we can prove the following (very general) theorem:

Let $B$ be locally connected and hausdorff. Then the following are equivalent for a sheaf $\mathcal{F}$ on $B$:

The etale space of $\mathcal{F}$ is hausdorff
Sections of $\mathcal{F}$ satisfy “analytic continuation”, in the sense that two sections agreeing on a stalk¹⁷ must agree everywhere they’re defined
$\mathcal{F}$ has decidable equality in the internal logic of $\mathsf{Sh}(B)$

This comes from a mathoverflow answer by Elías Guisado Villalgordo.

$\ulcorner$ The equivalence of $1$ and $2$ was the earlier exercise, and you can find a proof below it under a fold.

To check the equivalence of $2$ and $3$, we look at “decidable equality”, that $\forall f,g : \mathcal{F} . (f=g) \lor (f \neq g)$.

Externally, this says that for every $U$, for every pair of local sections $f,g \in \mathcal{F}(U)$, there’s an open cover $\{U_\alpha\}$ so that on each member of the cover $f \! \upharpoonright_{U_\alpha}$ and $g \! \upharpoonright_{U_\alpha}$ are either everywhere equal or everywhere unequal.

Clearly if $\mathcal{F}$ has analytic continuation then $\mathcal{F}$ has decidable equality. Indeed using local connectedness of $B$ we can cover $U$ by connected opens. By analytic continuation we know that on each connected piece either $f=g$ everywhere or nowhere, as desired.

Conversely, say $\mathcal{F}$ has decidable equality. Fix a connected $U$ and sections $f,g \in \mathcal{F}(U)$ which have the same stalk at some point $b \in U$. By decidable equality, there’s an open cover $\{ U_\alpha \}$ of $U$ where on each member of the cover $f=g$ holds everywhere or nowhere. Recall that since $U$ is connected, for any two opens in the cover $U_\alpha$ and $U_\beta$ we can find a finite chain $U_\alpha = U_0, U_1, U_2, \ldots, U_n = U_\beta$ of opens in our cover where each $U_i$ intersects $U_{i+1}$.

Then for any $U_\beta$ in the cover, fix such a chain from $U_0 \ni b$ to $U_\beta$. Since $f$ and $g$ have the same stalk at $b \in U_0$, decidable equality tells us $f=g$ on $U_0$. So at any point of $U_0 \cap U_1$ we know $f=g$, and by decidable equality again we learn $f=g$ on $U_1$. Proceeding in this way we learn $f=g$ on the whole of $U_\beta$.

Thus $f=g$ on every member of the cover, so on the whole of $U$, and so $\mathcal{F}$ satisfies analytic continuation. $\lrcorner$
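The chain argument in the last step is really a propagation procedure, and it can be sketched in code. Here the points of $U$ are replaced by a finite set of labels and each member of the cover by a Python set (both assumptions made only to have something runnable). The rule is the one from the proof: as soon as a member of the cover contains a point where $f=g$, the "everywhere or nowhere" dichotomy forces $f=g$ on that whole member.

```python
def propagate_equality(cover, seed):
    """Points where f = g is forced, given that f = g at `seed` and that on each
    member of `cover` the two sections agree everywhere or nowhere."""
    equal = {seed}
    changed = True
    while changed:
        changed = False
        for member in cover:
            # A member touching a known point of agreement must agree everywhere.
            if member & equal and not member <= equal:
                equal |= member
                changed = True
    return equal

connected_cover = [{1, 2}, {2, 3}, {3, 4}]
print(propagate_equality(connected_cover, 1))     # {1, 2, 3, 4}

disconnected_cover = [{1, 2}, {3, 4}]
print(propagate_equality(disconnected_cover, 1))  # {1, 2}
```

When the cover members can be chained together, equality spreads to everything. When the domain falls into two pieces, it stops at the piece containing the seed, which is why the proof needs $U$ to be connected.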

If you’ve internalized this, it makes it geometrically believable that a decidable kuratowski finite set is actually bishop finite! I think it’s fun to see this fact both syntactically (reasoning in the internal logic) and semantically (reasoning about bundles), so let’s see two proofs!

First, a syntactic proof:

$\ulcorner$ Say $f : [n] \twoheadrightarrow X$. Then we note for each $x:X$ the predicate $\varphi(k) \triangleq f(k) = x$ on $[n]$ is decidable since $X$ has decidable equality. Thus it gives an inhabited decidable subset of $[n]$, which has a least element (Lemma D5.2.9(iii) in the elephant). Call such an element $g(x)$.

Note the image of $g$ is a complemented subset of $[n]$, since we can decide $k \in \text{im}(g)$ by checking if $k = g(f(k))$ using decidability of $[n]$. Then $\text{im}(g)$ is isomorphic to a cardinal (D5.2.3), $[m]$, and it’s easy to see that composing $g$ and $f$ with this isomorphism gives a bijection between $X$ and $[m]$. $\lrcorner$
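If you run this proof as a program in the classical world, it's exactly a deduplication routine. Here's a sketch, assuming the enumeration $f : [n] \twoheadrightarrow X$ is given as a Python list and that decidable equality on $X$ is Python's `==` (the elements also need to be hashable, but only because I store the answer in a dict). The function name is made up.

```python
def bishop_from_kuratowski(enumeration):
    """Turn a surjection [n] -> X (a list, possibly with repeats) into a bijection X <-> [m].

    Assumes X has decidable equality, which here is just Python's `==`.
    """
    n = len(enumeration)

    def g(x):
        # Least k in [n] with f(k) = x; it exists because f is surjective.
        return next(k for k in range(n) if enumeration[k] == x)

    # im(g) is a complemented subset of [n]: k is in it iff k = g(f(k)).
    image = [k for k in range(n) if g(enumeration[k]) == k]

    # im(g) is isomorphic to the cardinal [m] by listing it in increasing order.
    to_cardinal = {enumeration[k]: i for i, k in enumerate(image)}
    from_cardinal = [enumeration[k] for k in image]
    return to_cardinal, from_cardinal

to_cardinal, from_cardinal = bishop_from_kuratowski(["a", "b", "a", "c", "b"])
print(from_cardinal)  # ['a', 'b', 'c']
print(to_cardinal)    # {'a': 0, 'b': 1, 'c': 2}
```

Every step that compares two elements of $X$ is a use of decidable equality. Without it, neither `g` nor the membership test for `image` can be carried out.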

And now a semantic proof:

$\ulcorner$ Say $\pi : E \twoheadrightarrow B$ is the etale space of a kuratowski finite object in $\mathsf{Sh}(B)$. So, locally, $E$ is the quotient of some $\coprod_n B$. Say also that $E$ has decidable equality so that, locally, two sections of $E$ that agree somewhere must agree everywhere.

By refining our covers, then, we may fix an open cover $\{U_\alpha\}$ of $B$ and epimorphisms $\coprod_{n_\alpha} U_\alpha \to E \! \upharpoonright_{U_\alpha}$ so that two local sections on $U_\alpha$ either agree everywhere or nowhere. In particular, the $n_\alpha$ many components $f_i : U_\alpha \overset{i\text{th inclusion}}{\longrightarrow} \coprod_{n_\alpha} U_\alpha \to E$ pairwise have either the same image or disjoint images. Choosing exactly one $f_i$ for each possible image (the least $i$ that works, say) we see that actually $\pi^{-1}(U_\alpha)$ is homeomorphic to a disjoint union of copies of $U_\alpha$, so that $E$ is actually a covering space, and thus represents a bishop finite object. $\lrcorner$

It’s worth meditating on why these are actually the same proof! In both cases we use decidability to “remove duplicates” from our enumeration.

Which Finite to Use?

Constructively, bishop and kuratowski finiteness encode two aspects of finiteness which are conflated in the classical finite world. Bishop finite sets are those which admit a cardinality. They’re in bijection with some $[n]$, and so have a well defined notion of size. Kuratowski finite sets, in contrast, are those equipped with an enumeration. You can list all the elements of a kuratowski finite set.

Of course, knowing how to biject a set $X$ with a set $[n]$ always tells you how to enumerate $X$. But constructively knowing how to enumerate $X$ doesn’t tell you how to biject it with some $[n]$! As we saw earlier, the problem lies with removing duplicates. It’s worth taking a second to visualize a kuratowski finite set that isn’t bishop finite, and convince yourself that the question “is this a duplicate element” can have more subtle answers than just “yes” or “no”. This makes it impossible to remove the duplicates.

This bifurcation might feel strange at first, but in some squishy sense it happens classically too once you start working with infinite sets. Indeed in set theory the notion of finiteness bifurcates into cardinals, which have a defined notion of “size”, and ordinals, which have a defined notion of “order” (kind of like an enumeration… if you squint).

So when you’re working constructively, ask why you’re using finiteness. Do you really need to know that something literally has $n$ elements for some $n$? Or is it enough to know that you can put the elements in a finite list? Does it matter if your list has duplicates? These questions will tell you whether bishop finiteness or kuratowski finiteness is right for your purposes.

Subfiniteness, which amounts to being contained in some $[n]$, I find to be less useful “in its own right”. Instead, if you’re able to prove a result for bishop finite objects, it’s worth asking if you can strengthen that to a proof for subfinite ones!

In any case, as long as we work with decidable things, these notions all coincide. As we saw earlier, a decidable quotient of a bishop finite set (that is, a decidable kuratowski finite set) is again bishop finite. Similarly, a complemented subset of a bishop finite set is again bishop finite (this is lemma D5.2.3 in the elephant, but it’s not hard to see yourself). So if you’re working in a context where everything is decidable, finite sets work exactly as you would expect! This is part of why constructive math doesn’t have much to say about combinatorics and algebra. Most arguments are already constructive for decidable finite sets (which is what you’re picturing when you picture a finite set and do combinatorics to it). Sometimes you can push things a bit farther though, and this can be interesting. See, for instance, Andreas Blass’s paper on the Constructive Pigeonhole Principle.

Another blog post done! I’m writing this on a train to Odense right now, and once I get to my room I’ll draw all the pictures and post it.

Hopefully this helps demystify the ways in which constructive math might look strange at first. Once again the seemingly bizarre behavior is explained by its vastly greater generality! When you first learn that a subset of a (bishop) finite set need not be (bishop) finite, it sounds so strange as to be unusable! But once you learn that this is an aspect of “everywhere definedness” in a space of parameters (read: the base space) it becomes much more palatable.

Thanks, as always, for hanging out. Try to keep cool during this summer heat (though it’s actually really pleasant in Denmark right now), and I’ll see you all soon ^_^.

Here’s a picture of me and Peter right after we got off the train:

And, of course, I still really want to finish the blog post on 2-categories and why you should care… There’s just so many other things to think about and to write! It also feels like a big topic because I have a lot to say, and that makes me a bit scared to actually start. Especially since I’ve been writing a lot of longer form posts lately (like the three part series on the topological topos, and I can tell the fukaya post is going to be long too…) ↩
Here we’re using the expected abbreviations.

We write $f : X \cong Y$ to mean $f : X \to Y$ and $\exists g : Y \to X . gf = 1_X \land fg = 1_Y$.

We write $f : X \hookrightarrow Y$ to mean $f : X \to Y$ and $\forall x_1, x_2 : X . fx_1 = fx_2 \to x_1 = x_2$

We write $f : X \twoheadrightarrow Y$ to mean $f : X \to Y$ and $\forall y : Y . \exists x : X . fx = y$. ↩
If you want to learn more about it, I highly recommend Stout’s Dedekind Finiteness in Topoi. I don’t know of a good way to visualize the dedekind finite objects in $\mathsf{Sh}(X)$ (though I haven’t tried at all – I want this to be a quick post) which was another reason to exclude them. ↩
Which I often do, but not today. ↩
And I want this to be a quick post, finished within a day or two of starting. So I don’t have time to think if there’s something slicker (and intuitively I don’t expect there to be).

Edit: I got about as close to “within a day or two” as is possible for me, haha. I started writing this on August 12, and it looks like I’ll get it posted on August 18. Given I went a few days without working on it, that’s not too bad! ↩
That is, a space built by gluing together opens of $B$ along common intersections ↩
Johnstone’s Elephant defines a cardinal to be a pullback of the “universal cardinal”. In $\mathsf{Sh}(B)$ this is an object in $\mathsf{Sh}(B) \big / \mathbb{N} \simeq \mathsf{Sh}(B \times \mathbb{N})$ and is shown in the following picture:

↩
We’ll still assume that $B$ is locally connected, though, since that makes them much easier to draw.

It’s important to remember that our geometric intuition is likely to fail us if we start considering, say, sheaves on the cantor space! ↩
Remember truth values are open sets in the base space, so the truth value of $(\text{pink} = \text{blue}) \lor (\text{pink} \neq \text{blue})$ will be $\text{int}(\{ \text{pink} = \text{blue} \}) \cup \text{int}( \{ \text{pink} \neq \text{blue} \} )$ ↩
I’m like… at least 90% sure it’s an accident these are both named after positions of authority in catholicism. ↩
Here we use the common abuse of notation of abbreviating “$\exists x:X . \top$” by just “$\exists x:X$”. It measures “to what extent” there exists an element of $X$. That is, its truth value is the support of $X$ over $B$ – $\text{int} \big ( \{b \in B \mid \exists x \in X_b \} \big )$ ↩
Well, it says this should be true locally. But it’s easy to see we’ll have a problem in any neighborhood of $0$. ↩
I stole this whole exercise from this mathoverflow answer by Elías Guisado Villalgordo ↩
In that same Richard Borcherds video, for instance, which I know I watched when it came out. In fact, lots of references on etale spaces, in hindsight, emphasize the fact that etale spaces can be highly nonhausdorff. My guess is that I’m not the first person to have made this mistake, haha, and I know that when I teach etale spaces going forwards I’m going to be sure to emphasize this too!

In his talk Nonetheless One Should Learn the Language of Topos, Colin McLarty compares etale spaces to a piece of mica (around the 1h04m mark), and I think I’m starting to see what he means. ↩
Concretely you can build this space by gluing two copies of $\mathbb{R}$ together along their common open subset $\mathbb{R} \setminus \mathbb{Z}$. ↩
This is a fun example to think about. Why does it have only countably many sections at $0$, rather than uncountably many? Can you picture its topology at all? See the linked mastodon posts for more discussion. ↩
Saying that $f$ and $g$ agree on the stalk at $b$ is saying that there’s an open neighborhood of $b$ where $f=g$ on that neighborhood.

So this is another way of saying that as soon as two sections agree on an arbitrarily small neighborhood, they must agree on their whole (connected) domain! ↩

Life in Johnstone's Topological Topos 3 -- Bonus Axioms

Wed, 03 Jul 2024 00:00:00 +0000

In the first post of the series, we talked about what the topological topos is, and how we can think about its objects (and, importantly, how we can relate computations in the topos $\mathcal{T}$ to computations with topological spaces in “the real world”). In part two, we talked about algebraic structures, and how (for example) groups in $\mathcal{T}$ are related to topological groups.

In that post we alluded to the presence of ~bonus axioms~ that allow us to reason in $\mathcal{T}$ more freely than in many other topoi. For instance, we have access to a certain amount of choice. We also have access to a powerful principle saying that every function between metric spaces is $\delta$-$\epsilon$ continuous!

In this post we’ll spend some time talking about these bonus axioms, and proving that they’re true (since a lot of these facts are basically folklore).

Let’s get to it!

First, let’s take a second to recall the definition of the grothendieck topology $J$ for $\mathcal{T}$. We’re going to be externalizing a lot of theorems, and it’ll be good to have the open cover condition on hand. Here’s what we wrote back in the first post, but you can of course read more in Section 3 of Johnstone’s original paper.

For the site with two objects, $\{1, \mathbb{N}_\infty\}$, every (nonempty) family of arrows $\{X_\alpha \to 1 \}$ is covering. So the interesting question is what a covering family of $\mathbb{N}_\infty$ looks like.

If $S$ is an infinite subset of $\mathbb{N}$, we write $f_S$ for the unique monotone map $\mathbb{N}_\infty \to \mathbb{N}_\infty$ whose image is $S \cup \{ \infty \}$.

A family $\{X_\alpha \to \mathbb{N}_\infty\}$ is covering if and only if both

It contains every constant map $1 \to \mathbb{N}_\infty$

For every infinite $T \subseteq \mathbb{N}$, there is a further infinite subset $S \subseteq T$ with $f_S : \mathbb{N}_\infty \to \mathbb{N}_\infty$ in the family

In particular, if a family contains every constant map $1 \to \mathbb{N}_\infty$ and a “tail of an infinite sequence” $f_{\{x \geq N\}}$ for some $N$, then that family is covering.

So, roughly, to prove that something “merely exists” in $\mathcal{T}$, we have to provide a witness for every finite $n$, and these witnesses should converge to the witness for $\infty$.

If we want to use the site with one object $\{ \mathbb{N}_\infty \}$, the condition is almost exactly the same. A family of maps is covering if and only if both

every constant map $\mathbb{N}_\infty \to \mathbb{N}_\infty$ is in the family

For each infinite $T \subseteq \mathbb{N}$, there’s a further infinite $S \subseteq T$ so that $f_S$ is in the family.

This, unsurprisingly, doesn’t make too much difference.

Dependent Choice

To start, $\mathcal{T}$ models Dependent Choice. You can find a proof in Shulman and Simpson’s aptly named note Dependent Choice in Johnstone’s Topological Topos.

Say you have a relation $R \subseteq X \times X$ which is total in the sense that $\forall x . \exists y . R(x,y)$.

Then DC says for each $x_0 : X$, there’s a function $f : \mathbb{N} \to X$ so that $f(0) = x_0$ and for each $n$, $R(f(n), f(n+1))$.

This is intuitively obvious. After all, we start with $x_0$, then by totality the set $\{ y \mid R(x_0,y) \}$ is inhabited. So we can choose an element $x_1$ from this set. Similarly we can choose an $x_2$ from $\{y \mid R(x_1,y) \}$, and so on. Notice that the choices we make are allowed to depend on the choices that came before. After all, it’s possible that for two different choices $x_1,x_1' \in \{ y \mid R(x_0,y) \}$ the sets $\{ y \mid R(x_1,y) \}$ and $\{ y \mid R(x_1',y) \}$ might be different, leading to different allowable choices of $x_2$!

Thus, DC basically tells us that recursive definitions work, even if we don’t have a canonical way to choose one of many options at each recursive stage. Indeed, most recursive definitions work by first choosing an $x_0$, and then arguing that the set of “allowable” next steps is always inhabited¹. For more information about DC and how to think about it, I recommend Karagila’s excellent paper on the subject.
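Here's a minimal Python sketch of the recursion described above. It's only an illustration: a program has to be handed an explicit `choose` function, which is exactly the thing DC lets us do without. The names `dependent_sequence`, `successors` and `choose`, and the toy relation, are all assumptions of mine.

```python
def dependent_sequence(successors, choose, x0, length):
    """First `length` terms (length >= 1) of a sequence f with f(0) = x0
    and R(f(n), f(n+1)) for each n.

    `successors(x)` lists the y with R(x, y); totality of R means it's never empty.
    `choose` picks one of the options. DC promises that a sequence merely exists
    without any such chooser, so this is a sketch of the shape of the recursion only.
    """
    xs = [x0]
    while len(xs) < length:
        options = successors(xs[-1])
        assert options, "R is not total at this point"
        xs.append(choose(options))
    return xs

# Toy relation on the naturals: R(x, y) holds when y is x+1 or 2x+1.
successors = lambda x: [x + 1, 2 * x + 1]

print(dependent_sequence(successors, min, 0, 5))  # [0, 1, 2, 3, 4]
print(dependent_sequence(successors, max, 0, 5))  # [0, 1, 3, 7, 15]
```

Notice how the two choosers diverge after the second step: the options available at stage $n+1$ really do depend on what was chosen at stage $n$.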

For those who like type theory, DC says that for every binary relation $R$ on a type $X$,

\[\displaystyle \mathtt{dc} : \left ( \prod_{x:X} \left \lVert \sum_{y:X} R(x,y) \right \rVert \right ) \to \prod_{x_0 : X} \left \lVert \sum_{f : \mathbb{N} \to X} (f(0) = x_0) \times \left ( \prod_{n:\mathbb{N}} R(f(n), f(n+1)) \right ) \right \rVert\]

or, cashing out these $\lVert \Sigma \rVert$s for $\exists$s,

$\left ( \forall (x:X) . \exists (y:X) . R(x,y) \right ) \to \forall (x_0 : X) . \exists (f : \mathbb{N} \to X) . f(0) = x_0 \land \forall n . R(f(n), f(n+1))$

Here’s an idiomatic example of dependent choice in action: The Baire Category Theorem for complete metric spaces.

Let $(X,d)$ be a (cauchy) complete metric space² with inhabited (strongly³) dense open sets $U_1, U_2, U_3, \ldots$.

Then the countable intersection $\bigcap_n U_n$ is still (strongly) dense.

The usual proof (say, from Karagila’s notes) doesn’t use LEM, so it goes through unchanged. We’ll present it here paying special attention to the use of DC.

$\ulcorner$ Let $V$ be open in $X$. We need to show that $V \cap \bigcap_n U_n$ is inhabited.

Since $U_1$ is strongly dense, we know that $V \cap U_1$ is inhabited, say by $x_1$. Now since $U_1 \cap V$ is open, we can find a radius $r_1$ so that

$0 \lt r_1 \lt 2^{-1}$
the (strongly closed) ball $\overline{B(x_1,r_1)} = \{ y \mid d(x_1,y) \leq r_1 \}$ is contained in $V \cap U_1$

By dependent choice (and strong density of each $U_k$), we may recursively build a sequence of pairs $(x_n,r_n)$ so that

$0 \lt r_{n+1} \lt 2^{-(n+1)}$
the (strongly closed) ball $\overline{B(x_{n+1},r_{n+1})}$ is contained in $B(x_n,r_n) \cap U_{n+1}$.

By construction, then, the $x_n$s assemble into a cauchy sequence whose limit $x_\infty$ lies in each $B(x_n,r_n)$. Therefore $x_\infty \in \bigcap_n B(x_n,r_n) \subseteq V \cap \bigcap_n U_n$, so that $\bigcap_n U_n$ is dense, as desired. $\lrcorner$

Can you build a relation $R$ on $(X \times \mathbb{R}_{\gt 0} \times \mathbb{N})$ so that $R \big ( (x,r,n), \ (y,s,m) \big )$ holds exactly when $m=n+1$ and $(y,s)$ satisfy the above conditions compared to $(x,r)$?

This will let you see precisely how dependent choice is used above.
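If it helps to see the shape of the recursion, here's a Python sketch in a very special case: $X = \mathbb{R}$ and $U_n = \mathbb{R} \setminus \{q_n\}$ for some rationals $q_n$. In this case there's a canonical way to pick the next ball, so DC isn't actually needed, and the sketch says nothing about the general proof. The function names and the particular formula for the next ball are my own assumptions.

```python
from fractions import Fraction

def shrink(x, r, q, n):
    """One recursive step in the real line.

    Given the open ball B(x, r), return (y, s) so that the closed ball of
    radius s around y sits inside B(x, r), misses the point q, and 0 < s < 2^-n.
    """
    y = x + r / 2 if q <= x else x - r / 2      # step away from q
    s = min(r / 4, Fraction(1, 2 ** (n + 1)))   # small enough on both counts
    return y, s

def nested_balls(x0, r0, avoid):
    """Iterate `shrink`, avoiding avoid[0], avoid[1], ... in turn."""
    balls = [(x0, r0)]
    for n, q in enumerate(avoid, start=1):
        balls.append(shrink(*balls[-1], q, n))
    return balls

avoid = [Fraction(1, 2), Fraction(3, 4), Fraction(5, 8)]
balls = nested_balls(Fraction(1, 2), Fraction(1, 2), avoid)

for n, ((x, r), (y, s)) in enumerate(zip(balls, balls[1:]), start=1):
    assert x - r < y - s and y + s < x + r   # closed ball inside the previous open ball
    assert abs(y - avoid[n - 1]) > s         # ...and it misses q_n
    assert 0 < s < Fraction(1, 2 ** n)       # ...with a small radius
```

In the general theorem there's no formula like the one in `shrink`. We only know that a suitable next ball merely exists, and that's where dependent choice earns its keep.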

Dependent Choice implies Countable Choice, which itself implies Weak Countable Choice. But WCC⁴ implies that the dedekind reals and the cauchy reals agree. And indeed one can show directly that in $\mathcal{T}$ both the dedekind and cauchy reals are given by $よ\mathbb{R}$.

Brouwer’s Principle

The next ~bonus axiom~ we’ll talk about is Brouwer’s Continuity Principle that “Every function $f : \mathbb{R} \to \mathbb{R}$ is continuous”!

Precisely⁵:

$\mathcal{T} \vDash \forall f : \mathbb{R}^\mathbb{R} . \ \forall \epsilon : \mathbb{R}_{\gt 0} . \ \forall a : \mathbb{R} . \ \exists \delta : \mathbb{R}_{\gt 0} . \ \forall b : \mathbb{R} . \ d(a,b) \lt \delta \to d(fa,fb) \lt \epsilon$

In fact, this is true for any (external) metric spaces $X$ and $Y$ where $X$ is (externally) locally compact! In particular, it’s also true that every function $\mathbb{N}^\mathbb{N} \to \mathbb{N}$ is $\epsilon$-$\delta$ continuous.

Note that it’s crucial here that we use the truncated $\exists$ rather than the untruncated $\Sigma$ in the statement of this theorem. Martín Escardó and Chuangjie Xu have shown that the untruncated version of this theorem isn’t just false in $\mathcal{T}$, it’s provably false in every topos!

Regardless, it’s shockingly hard to find this continuity principle written down anywhere, but it’s cited in lots of places! It’s definitely part of the folklore, and I’m happy to share a full proof! It’s a bit long, though, so I’m leaving it as an “appendix” at the bottom of this post. If you know of a reference, or of a slicker proof than the one I found, I would REALLY love to hear about it⁶!

Regardless, the truth of Brouwer’s principle tells us that $\mathcal{T}$ is a rather stronger version of Solovay’s Model. Solovay’s model validates $\mathsf{LEM}+\mathsf{DC}+$“every function $\mathbb{R} \to \mathbb{R}$ is measurable”.

In $\mathcal{T}$, we have $\mathsf{DC}$ and the stronger “every function $\mathbb{R} \to \mathbb{R}$ is continuous”. But the price we pay is LEM.

Omniscience Principles

We can ask about other nonconstructive principles too. For instance, Bishop’s Principles of Omniscience!

First, let’s look at the Limited Principle of Omniscience (LPO):

The Limited Principle of Omniscience says that every sequence of bits is either $0^\omega$ or contains a $1$. That is:

\[\forall s : 2^\mathbb{N} . (\forall n . s(n) = 0) \lor (\exists n . s(n) = 1)\]

In the presence of countable choice (which we have in $\mathcal{T}$), this is equivalent to the analytic LPO, which says:

\[\forall x : \mathbb{R} . (x \lt 0) \lor (x = 0) \lor (x \gt 0)\]

and is further equivalent to the type-theoretic condition that the obvious embedding

\[\mathbb{N} + \{ \infty \} \hookrightarrow \mathbb{N}_\infty\]

is an isomorphism.

It’s clear from the last condition that LPO is false in $\mathcal{T}$. Since both $\mathbb{N} + \{ \infty \}$ and $\mathbb{N}_\infty$ are sequential, these internal types correspond to the expected spaces externally, where this is obviously not an isomorphism. After all, one space is discrete and the other isn’t.

That said, it can be hard to find example computations where people externalize statements internal to a topos to statements in the real world, so just for fun let’s prove this again in a more complicated way:

$\ulcorner$ If it were true, then we would know $1 \Vdash \text{LPO}$. Now cashing out the universal quantifier, we would know that for every $s \in 2^\mathbb{N}(\mathbb{N}_\infty)$

\[\mathbb{N}_\infty \Vdash (\forall n . s_k(n) = 0_k) \lor (\exists n . s_k(n) = 1_k)\]

Here $s_k : \mathbb{N}_\infty \to 2^\mathbb{N}$ is allowed to be any convergent sequence in cantor space, and we interpret $0_k$ and $1_k$ as constant sequences.

Let’s take $s_k$ to be the sequence $0^k 1^\omega$. That is, the $k$th element of this sequence, $s_k$, should be the point in cantor space with $k$ many $0$s followed by an infinite tail of $1$s. Note this sequence converges to $0^\omega$.

Now what would it mean to have $(\forall n . s(n) = 0) \lor (\exists n . s(n) = 1)$? We would have an open cover of $\mathbb{N}_\infty$ where each element of that cover thinks that one of these disjuncts is true. But every covering sieve of $\mathbb{N}_\infty$ contains a function $f_U$ for $U$ an infinite subset of $\mathbb{N}$.

Now restricting $s$ to this member of the cover amounts to restricting $s$ to a subsequence, $s_{r_k}$.

Is it possible that $\mathbb{N}_\infty \Vdash \forall n . s_{r_k}(n) = 0_{r_k}$? This says for every convergent sequence $n_k \in \mathbb{N}(\mathbb{N}_\infty)$, we must have $s_{r_k}(n_k) = 0_{r_k}$ for all $k$. Of course, it’s easy to find a convergent sequence where this is false! We can just choose $n_1 \gt r_1$ to make $s_{r_1}(n_1) = 1 \neq 0$.

Is it possible for $\mathbb{N}_\infty \Vdash \exists n . s_{r_k}(n) = 1_{r_k}$? This says we can pass to a further subsequence $s_{\ell_{r_k}}$ so that for some convergent sequence $n_k \in \mathbb{N}(\mathbb{N}_\infty)$ we have $s_{\ell_{r_k}}(n_k) = 1_k$ for all $k$. But of course this is false too! Every convergent sequence of naturals is eventually constant, say $n_k = N$ for $k \gg 1$. Then for $k$ large enough, we’ll have both $n_k = N$ and $\ell_{r_k} \gt N$, in which case $s_{\ell_{r_k}}(n_k) = 0 \neq 1$.

So we see that LPO externalizes to a false claim, and thus is not validated by $\mathcal{T}$. $\lrcorner$
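The counterexample sequence $s_k = 0^k 1^\omega$ from the proof above is easy to play with in code. Here's a tiny Python sketch, where `math.inf` stands in for $\infty$ and the function name `s` is just my choice:

```python
import math

def s(k, n):
    """The n-th bit of s_k = 0^k 1^omega, with s_inf = 0^omega."""
    return 0 if n < k else 1

# Each finite s_k contains a 1 (first at position k), so "all bits are 0" fails...
print([s(k, k) for k in range(5)])   # [1, 1, 1, 1, 1]

# ...but any fixed position N eventually reads 0 as k grows, so an eventually
# constant n_k can't witness "some bit is 1" along a subsequence.
N = 3
print([s(k, N) for k in range(8)])   # [1, 1, 1, 1, 0, 0, 0, 0]
print(s(math.inf, N))                # 0
```

This is of course only a finite sanity check of the two observations the proof uses, not a proof in itself.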

As an easy exercise, can you use LPO to build a function $\mathbb{R} \to \mathbb{R}$ which is not $\epsilon$-$\delta$ continuous? This provides another proof that $\mathcal{T} \not \models \text{LPO}$, since it contradicts Brouwer’s principle.

In fact, we don’t even need Brouwer’s principle to get a contradiction! By yoneda, $\text{Hom}(\mathbb{R}, \mathbb{R})$ is the set of continuous functions on $\mathbb{R}$, but if you build a function in $\mathcal{T}$ you know what that function does externally on points! In particular, you can contradict with continuity “in the real world”.

Next, let’s look at the Lesser Limited Principle of Omniscience (LLPO):

The Lesser Limited Principle of Omniscience says that

\[\forall s : 2^\mathbb{N} . \big ( \forall m, n . (s(m) = 1 \land s(n) = 1) \to m = n \big ) \to (\forall k . s(2k) = 0) \lor (\forall k . s(2k+1) = 0)\]

This is equivalent to a kind of de Morgan’s law

\[\forall s, t : 2^\mathbb{N} . \lnot (\exists n . s(n) = 1 \land \exists m . t(m) = 1) \to (\forall n . s(n) = 0 \lor \forall m . t(m) = 0)\]

and, as before, under countable choice this is equivalent to the analytic LLPO,

$\forall x : \mathbb{R}. (x \geq 0) \lor (x \leq 0)$

This turns out to be true⁷! Just to show more ways to reason about the internal logic of topoi, we’ll prove this one by working with the category $\mathcal{T}$ directly. Since type theoretic functions externalize to arrows in $\mathcal{T}$, though, we’ll use type theory to label our arrows.

$\ulcorner$ Proposition 6.2 in Johnstone’s original paper implies that $\mathbb{R}$ is the pushout of the closed cover by $\mathbb{R}_{\leq 0}$ and $\mathbb{R}_{\geq 0}$, glued along their intersection $\{0\}$.

Now showing $\forall x : \mathbb{R} . (x \geq 0) \lor (x \leq 0)$ amounts to building a section of the projection $\pi : \sum_{x : \mathbb{R}} \lVert (x \geq 0) + (x \leq 0) \rVert \to \mathbb{R}$. Here I’ve also cashed out the $\lor$ for a propositional truncation of a coproduct.

But by the universal property of the pushout, we get a map $s : \mathbb{R} \to \sum_{x : \mathbb{R}} \lVert (x \geq 0) + (x \leq 0) \rVert$ as below. Note the truncation $\lVert (x \geq 0) + (x \leq 0) \rVert$ is crucial for making the middle square commute!

Moreover, since both $\pi s$ and $\text{id}_\mathbb{R}$ make the outer square commute, they must be equal by uniqueness in the universal property. So $s$ is the desired section of $\pi$, and $\mathcal{T}$ models LLPO. $\lrcorner$

As a nice exercise, check that internal to $\mathcal{T}$ the type $\mathbb{R}$ is equivalent to the quotient type⁸

\[\mathbb{R}_{\geq 0} + \mathbb{R}_{\leq 0} \big / \mathtt{inl}(0) \sim \mathtt{inr}(0)\]

and use this fact to give a purely type theoretic proof that LLPO holds in $\mathcal{T}$.

As another nice (but quite tricky!) exercise, try externalizing the statement of LLPO and proving it “directly” by seeing that it externalizes to something true!

Next on this list is Markov’s Principle (MP):

Markov’s Principle says that

\[\forall s : 2^\mathbb{N} . (\lnot \forall n . s(n) = 0) \to (\exists n . s(n) = 1)\]

Again this is equivalent (under CC) to an analytic version⁹

\[\forall x : \mathbb{R}. \lnot (x=0) \to x \# 0\]

Here $\#$ means that $x$ is apart from $0$. That is, $\exists q : \mathbb{Q} . (x \lt q \lt 0) \lor (0 \lt q \lt x)$.

MP is true in $\mathcal{T}$, and this is not so hard to show. We’ll write this proof in a slightly more conversational style, but we encourage the reader interested in learning topos theory to check all the details with the forcing language.

$\ulcorner$ We want to show

\[1 \Vdash \forall s : 2^\mathbb{N} . (\lnot \forall n . s(n) = 0) \to (\exists n . s(n) = 1)\]

so we fix a convergent sequence $s_k : \mathbb{N}_\infty \to 2^\mathbb{N}$. We want to show that if, for all $f : \mathbb{N}_\infty \to \mathbb{N}_\infty$

\[\mathbb{N}_\infty \not \Vdash \forall n . s_{fk}(n) = 0 \quad \quad (\star)\]

then we must have

\[\mathbb{N}_\infty \Vdash \exists n . s_k(n) = 1.\]

To show this existential claim, we need to provide a convergent sequence $n_k$ so that each $s_k(n_k) = 1$. But taking $f$ to be a constant function in $(\star)$, we learn that such an $n_k$ exists for each $k$. Moreover, since $s_k \to s_\infty$, there is a $k \gg 1$ so that $s_k$ and $s_\infty$ agree on the first $n_\infty + 1$ many bits, so that we may take $n_k = n_\infty$ when $k$ is large enough. Thus the sequence converges, as desired. $\lrcorner$
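As an aside, the usual computational reading of Markov’s Principle (the one from realizability, not the sheaf semantics used in the proof above) is unbounded search: if a bit sequence can’t be all zeros, then looking at bits one at a time will eventually find a $1$. Here’s a minimal Python sketch, with `markov_search` a name I made up:

```python
from itertools import count

def markov_search(s):
    """Return the least n with s(n) == 1.

    This terminates exactly when s is not identically 0.
    On the all-zero sequence it loops forever.
    """
    for n in count():
        if s(n) == 1:
            return n

s = lambda n: 1 if n == 41 else 0
print(markov_search(s))  # 41
```

There’s no bound on how long the search takes in terms of anything we know ahead of time, which is why MP isn’t provable in plain constructive logic even though it’s computationally harmless.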

Now WLPO is easy:

The Weak Limited Principle of Omniscience says that

\[\forall s : 2^\mathbb{N}. (\forall n . s(n) = 0) \lor (\lnot \forall n . s(n) = 0)\]

This is equivalent to a kind of excluded middle, and as expected there’s an analytic version (equivalent in the presence of CC):

$\forall x : \mathbb{R} . (x = 0) \lor \lnot (x=0)$

Now, it’s easy to see that MP + WLPO $\implies$ LPO (look at the analytic versions). So we learn indirectly that $\mathcal{T}$ cannot satisfy WLPO. Of course, here’s a more direct proof that the analytic version can’t work:

$\ulcorner$ Consider the sequence $x_n = \frac{1}{n}$, converging to $0$, as an element of $\mathbb{R}(\mathbb{N}_\infty)$.

If $\mathcal{T}$ were to model WLPO, it would mean (among other things) that some subsequence of $x_n$ would be either everywhere $0$ or everywhere nonzero. But no subsequence has this property, since $x_n$ is nonzero for each finite $n$, but zero at $n=\infty$.

So $\mathcal{T}$ does not model WLPO. $\lrcorner$

As an aside, in all of these statements we’re using the “truncated” $\lor$ and $\exists$, and it’s natural to ask what happens if we change these to their “untruncated” versions $+$ and $\Sigma$.

It’s a theorem of Martín Escardó that

truncated and untruncated LPO are equivalent
truncated and untruncated WLPO are equivalent, and are equivalent to untruncated LLPO
truncated LLPO is weaker than untruncated LLPO

In fact, in his agda files, Martín wonders if there’s a place in the literature where untruncated LLPO and WLPO are shown to be inequivalent.

He mentions that $\mathcal{T}$ should be an example, and here we’ve shown that in fact it is! After all, $\mathcal{T} \models \text{LLPO}$ but $\mathcal{T} \not \models \text{WLPO}$!

Bar and Fan Theorems

The Bar and Fan theorems are closely related to the nice properties of baire space and cantor space (respectively) as spaces rather than as locales. The locales are always well behaved, but in some topoi these locales fail to have enough points, so that the baire space and cantor space as sets of points may be lacking.

Since we showed in part 1 that spatial sequential regular locales always have enough points in $\mathcal{T}$, we see that baire space and cantor space both have enough points! By Propositions 3.12 and 3.13 in van den Berg and Moerdijk’s Derived rules for predicative set theory: An application of sheaves we see that the Monotone Bar Theorem and the Full Fan Theorem hold in $\mathcal{T}$.

This is also the best we can do. Since the Full Bar Theorem is known to imply LPO, and $\mathcal{T} \not \models \mathsf{LPO}$ we know that we can’t improve the monotone bar theorem to the full bar theorem in $\mathcal{T}$.

As a not-so-hard exercise, verify the decidable fan theorem by hand! That is, prove

\[\mathcal{T} \models \forall (B : 2^{\lt \mathbb{N}} \to 2) . \ulcorner B \text{ is a monotone bar} \urcorner \to \ulcorner B \text{ is uniform} \urcorner\]

Or, entirely in symbols, prove

\[\mathcal{T} \models \forall (B : 2^{\lt \mathbb{N}} \to 2) . \left ( \begin{array}{c} \underbrace { \forall s : 2^{\lt \mathbb{N}} . s \in B \to (s0 \in B \land s1 \in B) }_{\text{$B$ is monotone}} \\ \land \quad \underbrace { \forall \alpha : 2^\mathbb{N} . \exists n : \mathbb{N} . \alpha \! \upharpoonright_n \in B }_{\text{$B$ is a bar}} \end{array} \right ) \to \Big ( \underbrace { \exists N : \mathbb{N} . \forall \alpha : 2^\mathbb{N} . \alpha \! \upharpoonright_N \in B }_{\text{$B$ is a uniform bar}} \Big )\]

Note that $B : 2^{\lt \mathbb{N}} \to 2$ is a decidable subset of $2^{\lt \mathbb{N}}$. To prove the full fan theorem you would want to prove the same theorem for all subsets of $2^{\lt \mathbb{N}}$. That is, for $B : 2^{\lt \mathbb{N}} \to \Omega$.
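To get a feel for what “uniform” means, here’s a Python sketch that searches for the bound $N$ given a decidable monotone bar. Finite binary strings are modelled as Python strings of `'0'`s and `'1'`s, and `uniform_bound`, `max_depth` and the toy bar `B` are all hypothetical names of mine. The `max_depth` cap is only there so the search stops if we accidentally pass in something that isn’t a bar.

```python
from itertools import product

def uniform_bound(B, max_depth=20):
    """Least N such that every binary string of length N lies in B.

    Searches lengths 0, 1, ..., max_depth and returns None if nothing is found.
    """
    for N in range(max_depth + 1):
        if all(B(''.join(bits)) for bits in product('01', repeat=N)):
            return N
    return None

# A toy decidable monotone bar: strings with a 1 among the first two bits,
# or of length at least 4.
B = lambda s: '1' in s[:2] or len(s) >= 4

print(uniform_bound(B))  # 4
```

The fan theorem is the statement that this search always succeeds for a bar. The code can’t prove that, it can only find the bound when one exists below the cap.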

As a ~bonus exercise~, use what you know about discrete topological spaces and subobjects in $\mathcal{T}$ to argue that a semantic proof of this theorem is easily modified to give a proof of the full fan theorem.

discussion of the ~bonus exercise~

The idea here is that a general subobject of a sequential space $X$ is a subset of the points of $X$ equipped with some topology making the inclusion continuous. A decidable subobject is a clopen subset of the points of $X$ equipped with the induced topology (do you see why?). Of course, since $2^{\lt \mathbb{N}}$ is discrete, it’s not hard to see that every subobject is decidable! This means in proving the theorem for decidable subobjects you’ve actually proven the theorem for all subobjects!

De Morgan and LEM

Obviously LEM fails, since $\Omega \not \cong 1+1$. But what about De Morgan’s Laws?

It turns out that $\mathcal{T}$ is not de Morgan. In another paper (the aptly named Conditions Related to de Morgan’s Law) Johnstone gives a slew of conditions equivalent to a topos being de Morgan. In particular, de Morgan-ness is equivalent to

$[\top,\bot] : 1+1 \to \Omega_{\lnot \lnot}$ being an isomorphism
$1+1$ being injective

But we can show both of these are false (thus giving two proofs that $\mathcal{T}$ is not de Morgan).

For $1$, we can compute that $\mathcal{T}_{\lnot \lnot}$ is equivalent to $\mathsf{Set}$, where the equivalence sends a set $X$ to the space $X$ equipped with the indiscrete topology. This is stated at the end of Section 3 of Johnstone’s original paper. In particular, $1+1$, which has the discrete topology, is not isomorphic to $\Omega_{\lnot \lnot}$ (which is indiscrete).

For $2$, we know that in $\mathsf{Seq}$ there’s a map $(0,1) + (2,3) \to 1+1$ which doesn’t extend to a map $(0,3) \to 1+1$. Since monos in $\mathsf{Seq}$ are still monos in $\mathcal{T}$ (since the inclusion is a right adjoint), we’re done.

This was a long one! Multiple months in the making, and easily the most research I’ve ever done for a blog post (both in terms of reading done and original proofs). It was super rewarding, though, and I feel way better about the topological topos and its internal logic, as well as about topos theory more broadly ^_^.

Hopefully I was able to explain it clearly enough to be useful to all of you too! I know it’s a TON of information, and in the process of revising this I really struggled to tell if it’s well exposited or not since it kind of feels like this

But even though it’s a ton of information, hopefully each section is digestible!

Thanks again for reading, all. I have other posts planned, about my thesis work for a change, but I think I’m going to take a break from writing after this, haha.

Stay safe, and talk soon! 💖

Appendix: A Proof that Johnstone’s Topos Models Brouwer’s Continuity Principle

If you’re not super familiar with externalizing formulas, you might want to read my old blog post with a bunch of simpler examples before trying to tackle this one!

We’ll be doing this computation using the site with one object $\mathbb{N}_\infty$.

We’ll prove a kind of “theorem schema”. Every metric space is sequential, so for any metric spaces $X$ and $Y$ in “the real world” we can think of $X$ and $Y$ as objects of $\mathcal{T}$. Now if $X$ is moreover locally compact, we’ll prove that $\mathcal{T}$ models

\[\ulcorner \text{every function $X \to Y$ is $\epsilon$-$\delta$ continuous} \urcorner\]

Precisely:

$\mathcal{T} \vDash \forall f : Y^X . \ \forall \epsilon : \mathbb{R}_{\gt 0} . \ \forall a : X . \ \exists \delta : \mathbb{R}_{\gt 0} . \ \forall b : X . \ d(a,b) \lt \delta \to d(fa,fb) \lt \epsilon$

$\ulcorner$ We want to show that

\[\mathbb{N}_\infty \Vdash \ulcorner \text{every function $X \to Y$ is $\epsilon$-$\delta$ continuous} \urcorner\]

Cashing out the universal quantifiers, we want to show that for any continuous functions $f : \mathbb{N}_\infty \times X \to Y$, $\epsilon : \mathbb{N}_\infty \to \mathbb{R}_{\gt 0}$, and $a : \mathbb{N}_\infty \to X$ in “the real world” we have

\[\mathbb{N}_\infty \Vdash \exists \delta : \mathbb{R}_{\gt 0} . \ \forall b : X . \ d(a,b) \lt \delta \to d(fa,fb) \lt \epsilon\]

To witness the existential quantifier, we need to find a cover of $\mathbb{N}_\infty$, and produce a real-world $\delta$ defined on each member of the cover.

Finding a cover basically amounts to finding a cofinite subset of $\mathbb{N}$, and passing to a tail of all the sequences in sight. With this in mind, we choose a compact neighborhood $K$ of $a_\infty$ (using local compactness of $X$), and choose $\delta^*$ so that $B(a_\infty, \delta^*)$ is contained in $K$ and for every $x \in B(a_\infty, \delta^*)$ we have $d(f_\infty a_\infty, f_\infty x) \lt \frac{\epsilon_\infty}{6}$ (using continuity of $f_\infty$).

Then, let $N$ be large enough that for all $n \gt N$:

$\epsilon_n \gt \frac{\epsilon_\infty}{2}$
$d(f_n a_n, f_\infty a_\infty) \lt \frac{\epsilon_\infty}{6}$
For all $x \in K$, we have $d(f_\infty x, f_n x) \lt \frac{\epsilon_\infty}{6}$
$d(a_n, a_\infty) \lt \delta^*$

In condition (3) we’ve used the fact that $f_n$ converges to $f_\infty$ uniformly on the compact set $K$.

Now we take as our covering family

the constant functions $k : \mathbb{N}_\infty \to \mathbb{N}_\infty$
$i_S : \mathbb{N}_\infty \to \mathbb{N}_\infty$ for any infinite subset $S \subseteq \{ n \gt N \} \subseteq \mathbb{N}$.

As a reminder, the function $i_S$ is the unique monotone function $\mathbb{N}_\infty \to \mathbb{N}_\infty$ whose image is $S \cup \{\infty\}$.

Now on each member $g$ of this family, we need to produce a function $\delta : \mathbb{N}_\infty \to \mathbb{R}_{\gt 0}$ which witnesses

\[\mathbb{N}_\infty \Vdash \forall b : X . \ d(a_{g(-)}, b) \lt \delta \to d(f_{g(-)}(a_{g(-)}), f_{g(-)}(b)) \lt \epsilon_{g(-)}\]

Cashing out the last universal quantifier, we need to know that for all $h : \mathbb{N}_\infty \to \mathbb{N}_\infty$ and for all continuous $b : \mathbb{N}_\infty \to X$,

If $\mathbb{N}_\infty \Vdash d(a_{gh(-)},b) \lt \delta_{h(-)}$ then we must have $\mathbb{N}_\infty \Vdash d(f_{gh(-)}(a_{gh(-)}), f_{gh(-)}(b)) \lt \epsilon_{gh(-)}.$

So let’s build such a $\delta$ for the two cases in our cover!

First, suppose $g$ is the constant $k$ function. Then we need a convergent sequence $\delta_n$ so that for any function $h : \mathbb{N}_\infty \to \mathbb{N}_\infty$ and any convergent sequence $b_n$ in $X$,

If $d(a_k,b_n) \lt \delta_{hn}$ for all $n$, then $d(f_k(a_k), f_k(b_n)) \lt \epsilon_k$ for all $n$ too.

Of course, this is easy to arrange by taking $\delta_n$ to be the constant sequence witnessing continuity of $f_k$ at $a_k$.

Second, suppose $g = i_S$ is the unique monotone function whose image is $S \cup \{ \infty \}$. Recall also that every member of $S$ is greater than $N$.

Then we define $\delta_n$ to be $\delta^* - d(a_{i_S n}, a_\infty)$. Note this is convergent, with limit $\delta^*$. Now we must show for any function $h : \mathbb{N}_\infty \to \mathbb{N}_\infty$ and for any convergent sequence $b_n$ in $X$ that

If $d(a_{i_S h n}, b_n) \lt \delta_{hn}$ for all $n$, then $d(f_{i_S h n}(a_{i_S h n}), f_{i_S h n}(b_n)) \lt \epsilon_{i_S h n}$.

But since $i_S h n \gt N$ and $d(b_n, a_\infty) \leq d(b_n, a_{i_S h n}) + d(a_{i_S h n}, a_\infty) \lt \delta^*$ we see that

$\epsilon_{i_S h n} \gt \frac{\epsilon_\infty}{2}$
$d(f_{i_S h n}(a_{i_S h n}), f_{\infty}(a_\infty)) \lt \frac{\epsilon_\infty}{6}$
$d(f_\infty b_n, f_{i_S h n} b_n) \lt \frac{\epsilon_\infty}{6}$

so that we can compute

\[\begin{aligned} d(f_{i_S h n}(a_{i_S h n}), f_{i_S h n}(b_n)) &\leq d(f_{i_S h n}(a_{i_S h n}), f_\infty(a_\infty)) + d(f_\infty(a_\infty), f_\infty(b_n)) + d(f_\infty(b_n), f_{i_S h n}(b_n)) \\ &\leq \frac{\epsilon_\infty}{6} + \frac{\epsilon_\infty}{6} + \frac{\epsilon_\infty}{6} \\ &= \frac{\epsilon_\infty}{2} \\ &\lt \epsilon_{i_S h n} \end{aligned}\]

As desired. $\lrcorner$

You might wonder why we need a “nonconstructive” axiom to do this. Why can’t we use induction on $\mathbb{N}$?

After all, we can prove (constructively, in type theory) that
\[\left ( \prod_{x:X} \sum_{y:X} R(x,y) \right ) \to \left ( \prod_{x_0 : X } \sum_{f : \mathbb{N} \to X} f(0) = x_0 \land \prod_{n:\mathbb{N}} R(f(n), f(n+1)) \right )\]
(and this makes a nice exercise!)

The difference lies in $\sum$ vs $\exists$! To build a term of type $\prod_x \sum_y R(x,y)$ is to build a function that eats an $x$ and returns a $y$ alongside a proof that $R(x,y)$. This gives us a canonical choice in $\{ y \mid R(x,y) \}$ – just use the one this function gave us!

Dependent choice works with something much weaker. It says we can build such a function even when there merely exists such a $y$, without being handed a witness! (Of course, the function we’re given only merely exists too)

Think about the semantics in $\mathsf{Sh}(B)$ for a moment. Here, to say that $\exists y . R(x,y)$ is to say that there’s an open cover of $B$ and a local witness $y$ on each element of the cover. But it’s entirely possible for these witnesses to not glue into a global witness! ↩
I don’t know if there are constructive subtleties with the notion of cauchy completeness which might be relevant here, and since I really want to get this blog post out I don’t want to read a bunch of literature on constructive metric spaces to try and figure it out…

If anyone happens to know some facts about constructive metric spaces, though, I would love to hear about them! But for now, treat this example as being more to showcase how dependent choice works than to say anything profound about the topological topos. ↩
A set $D \subseteq X$ is called Strongly Dense if for any inhabited open set $V$ we know that $D \cap V$ is inhabited. ↩
Thanks to Madeleine Birchfield for pointing out that I originally linked the wrong “Weak Countable Choice” here. I didn’t realize that there are two things, both called “weak countable choice”, and both implied by either CC or LEM! The one that proves the dedekind and cauchy reals agree is unambiguously called $\mathsf{AC}_{\mathbb{N},2}$. At time of writing it’s listed as “another weak countable choice” on nlab, and it means that every $\mathbb{N}$-indexed sequence of inhabited subsets of $\{0,1\}$ has a choice function. ↩
This is the first time in a while I’ve actually written down the definition of (metric space) continuity in full! No wonder students struggle with this, haha. I’ve forgotten what a mouthful it is! ↩
Here we have to refer to external metric spaces, and we prove a kind of “theorem schema”. I suspect something like this is true purely internally, if we can find the right definition of a “metric space” in $\mathcal{T}$. Obviously we want a distance function $d : X \times X \to \mathbb{R}_{\geq 0}$ satisfying the usual axioms. But we also need to know that the topology $d$ puts on $X$ agrees with the intrinsic topology on $X$!

Since $d$ is externally continuous we know that metric-open balls will always be open in $X$, so I think the condition is something like “every open of $X$ contains an open ball” or maybe “every open of $X$ is the union of the open balls inside it”.

This should be expressible in the internal logic since the opens of $X$ are exactly the inhabitants of $\Sigma^X$ where $\Sigma$ is the sierpinski space.

I know that Davorin Lešnik has thought about this, but I really want to get this post out (and ideally turn it into a paper) so unfortunately I won’t be pursuing this any further… for now! ↩
This realization has probably been made by many people, but it was added to the nlab by Mike Shulman. ↩
This is the quotient type in the sense of Li’s PhD thesis, not the (higher!) quotient type in the sense of HoTT… Though the quotient type I mean is almost certainly the $0$-truncation of the higher inductive quotient type.

I’ve been meaning to spend some time thinking about how you can prove theorems about a 1-topos by working in HoTT and truncating everything at the end, but I haven’t had the time. ↩
Actually, I think I remember reading somewhere that the analytic omniscience principles are always statements about the cauchy reals. The reason countable choice makes them properties of the dedekind reals is because under CC the dedekind and cauchy reals agree.

If an expert sees this and happens to know offhand if that’s true, I would love to know for sure!

Edit (July 10, 2024): Thanks again to Madeleine Birchfield for clarifying on zulip that the omniscience principles for $\mathbb{N}$ are probably equivalent to the analytic omniscience principles for the cauchy reals. I would still love a proper reference if someone has one, but at least there’s now a mathoverflow question with a proof in the LPO case. ↩

Life in Johnstone's Topological Topos 2 -- Topological Algebras

Wed, 03 Jul 2024 00:00:00 +0000

In the first post, we introduced Johnstone’s topological topos $\mathcal{T}$ and talked about what its objects look like. We showed how the interpretation of type theory in $\mathcal{T}$ gives us an “intrinsic topology” on any type we construct. We also alluded to the fact that, by working in $\mathcal{T}$ as a universe of sets, we’re able to interact with topological gadgets by forgetting about the topology entirely and just manipulating them naively as we would sets!

In this post, we’ll talk about how that works in the special case of algebraic gadgets, like groups, rings, etc., and use this to prove some interesting theorems about topological groups!

Recall Lawvere’s notion of Functorial Semantics. An Algebraic Theory is presented by some function symbols and equational axioms (we allow constant symbols as 0-ary functions), and this is probably best given through a “definition by examples”.

The usual presentation of the theory of groups is

A set $G$ equipped with function symbols

$e : G^0 \to G$
$(-)^{-1} : G^1 \to G$
$\cdot : G^2 \to G$

satisfying the equational axioms

$(x \cdot y) \cdot z = x \cdot (y \cdot z)$
$x \cdot e = x = e \cdot x$
$x \cdot x^{-1} = e = x^{-1} \cdot x$

Notice that the theory of posets is not algebraic¹ and indeed the usual presentation involves a relation symbol $\leq$ (which is not allowed) rather than only function symbols. Similarly, the theory of fields is not algebraic² and the usual presentation requires an axiom that’s much more complicated than just an equation: $(x = 0) \lor \exists y . xy = 1$.

However, these presentations by functions and equational axioms should really be thought of as presentations. There are superficially quite different presentations which still present the same theory. For instance, here is another presentation of the theory of groups³:

A set $G$ equipped with a function symbol

$/ : G^2 \to G$

satisfying the equational axiom

$x / \big ( (((x/x)/y)/z) / (((x/x)/x)/z) \big ) = y$
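As a quick sanity check, here’s a Python sketch verifying that division in an honest group satisfies this axiom. I’m reading the axiom as $x / \big( (((x/x)/y)/z) / (((x/x)/x)/z) \big) = y$ with $x / y = x \cdot y^{-1}$, and testing it on the symmetric group $S_3$ (chosen only because it’s small and nonabelian). The helper names are mine.

```python
from itertools import permutations, product

def compose(p, q):
    """(p ∘ q)(i) = p[q[i]] for permutations stored as tuples."""
    return tuple(p[i] for i in q)

def inverse(p):
    """The inverse permutation of p."""
    inv = [0] * len(p)
    for i, j in enumerate(p):
        inv[j] = i
    return tuple(inv)

def div(x, y):
    """Division in a group: x / y = x * y^{-1}."""
    return compose(x, inverse(y))

S3 = list(permutations(range(3)))

def axiom_holds(x, y, z):
    e = div(x, x)  # x / x is the identity
    return div(x, div(div(div(e, y), z), div(div(e, x), z))) == y

print(all(axiom_holds(x, y, z) for x, y, z in product(S3, repeat=3)))  # True
```

This only checks the easy direction, that groups satisfy the axiom. The interesting direction, that anything satisfying this one equation is a group under $x \cdot y = x / ((x/x)/y)$, isn’t something a finite check like this can show.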

With this in mind it’s natural to want an abstract characterization of an algebraic theory, that is independent of the choice of presentation. In his PhD thesis, Lawvere set this in motion by showing that for any algebraic theory $\mathbb{T}$, there’s a Classifying Category $\mathcal{C}_\mathbb{T}$ so that

\[\{ \mathbb{T}\text{-algebras} \} \simeq \{ \text{finite product functors } \mathcal{C}_\mathbb{T} \to \mathsf{Set} \}\]

If we have a good understanding of $\mathbb{T}$, then we can get our hands on $\mathcal{C}_\mathbb{T}$ concretely since it’s (opposite) the full subcategory of finitely generated free models!

Important for us is the related result that models of $\mathbb{T}$ in some other finite product category are given exactly by finite product functors from $\mathcal{C}_\mathbb{T}$ into that category!

So, for example, a topological group is the same data as a finite product functor $\mathcal{C}_\mathsf{Grp} \to \mathsf{Top}$, while a Lie group is the data of a finite product functor $\mathcal{C}_\mathsf{Grp} \to \mathsf{Diff}$.

This is what’s going to give us the ability to relate algebras in $\mathcal{T}$ to topological algebras! Let’s see how it works!

First, say we have a group object in $\mathcal{T}$. This is the data of a finite product preserving functor $\mathcal{C}_\mathsf{Grp} \to \mathcal{T}$. But we know from part 1 that the reflector $r : \mathcal{T} \to \mathsf{Seq}$ preserves finite products too! So composing these gives a finite product functor

\[\mathcal{C}_\mathsf{Grp} \to \mathcal{T} \to \mathsf{Seq}\]

which is a group object in $\mathsf{Seq}$. That is, a (sequential) topological group⁴!

Conversely, say we have a sequential topological group. Then the embedding $e : \mathsf{Seq} \to \mathcal{T}$ is a right adjoint, and in particular preserves finite products. So again we get a group object in $\mathcal{T}$!

In fact, the adjunction $r \dashv e$ gives us an adjunction $r_* \dashv e_*$ between the functor categories, and since $r \circ e \cong \text{id}_\mathsf{Seq}$, this is true at the level of functor categories too!

So the category of sequential topological groups is a reflective subcategory of the category of groups in $\mathcal{T}$, and the reflector is exactly what you expect: Just take the topological reflection of the underlying object of $G$!

There’s nothing special about groups here, and so we learn that for any algebraic theory, the category of sequential models is a reflective subcategory of the category of models in $\mathcal{T}$. Thus, any question we have about (sequential) topological models can be answered in $\mathcal{T}$ without losing information, and anything we prove about models in $\mathcal{T}$ immediately gives us results about topological models by reflecting (though in this direction we possibly lose information about the proofs of convergence).

This is all kind of abstract right now, so let’s do a very down-to-earth example:

In $\mathcal{T}$, a subset of $G$ is any monic $X \hookrightarrow G$ (that is, any continuous injection). In particular, $X$ does not need to have the subspace topology!

With this in mind, a subgroup of $G$ is just a continuous injection $H \hookrightarrow G$ whose image is a subgroup in the usual sense. Of course, this pulls back (by injectivity) to a unique group structure on $H$ rendering the inclusion a homomorphism.

Now here’s a typical (very easy!) theorem/construction:

Let $X$ be any subset of $G$, a group. Then there’s a smallest subgroup $\langle X \rangle \leq G$ containing $X$.

$\ulcorner$ If $X$ is any subset, we define

\[\langle X \rangle = \bigcap \{ H \leq G \mid X \subseteq H \}.\]

This is a subgroup containing $X$, and any other subgroup containing $X$ is part of the intersection, rendering this the smallest such subgroup. $\lrcorner$
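To see the "intersect everything" definition in action, here is a small brute-force sketch in ordinary $\mathsf{Set}$ (so there's no topology in sight). It works in $\mathbb{Z}/12$, enumerates every subgroup, and then intersects the ones containing $X$. The group, the names `is_subgroup` and `generated`, and the brute-force strategy are all my own choices for illustration, and this would be hopeless for large groups.

```python
from itertools import combinations

N = 12
G = range(N)  # the cyclic group Z/12 under addition mod 12

def is_subgroup(H):
    """In a finite group, a subset containing 0 and closed under + is a subgroup."""
    return 0 in H and all((a + b) % N in H for a in H for b in H)

# Every subgroup of G, found by brute force over all nonempty subsets.
subgroups = [
    frozenset(c)
    for r in range(1, N + 1)
    for c in combinations(G, r)
    if is_subgroup(frozenset(c))
]

def generated(X):
    """<X> computed as the intersection of all subgroups containing X."""
    X = frozenset(X)
    result = frozenset(G)
    for H in subgroups:
        if X <= H:
            result &= H
    return result

print(sorted(generated({8, 6})))
```

Here `generated({8, 6})` comes out as the even residues, and `generated(set())` is the trivial subgroup, since every subgroup contains the empty set.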

Notice that this proof is constructive in the sense that it doesn’t use LEM or Choice⁵. In particular, this proof works in every topos, and thus in $\mathcal{T}$.

But what does this exceptionally simple proof tell us about topological groups? Well subsets and subgroups are continuous injections, so this tells us that⁶

Let $X \hookrightarrow G$ be any continuous injection into a topological group $G$. Then there’s a topological group $\langle X \rangle$ with a continuous injection $\langle X \rangle \hookrightarrow G$ so that

$X \hookrightarrow G$ factors through $\langle X \rangle$
$\langle X \rangle$ is initial with this property

We can actually build such a $\langle X \rangle$ by externalizing the proof of this theorem too! Subsets are interpreted as general monics into $G$, and the “intersection” of two monics externalizes to their pullback. So the desired $\langle X \rangle$ is exactly the pullback of the family of all continuous injections $H \hookrightarrow G$ factoring the inclusion from $X$⁷.

In case $G$ is sequentially hausdorff, this is on-the-nose correct. In case $G$ isn’t, then $\langle X \rangle$ might live in $\mathsf{Kur}$ instead of $\mathsf{Seq}$. But that’s ok! We can just hit it with the reflector to get an “honest” topological group with the same universal property (among the continuous injections whose domain is also “honest”).

Now, it’s entirely possible that you would have come up with such a theorem yourself. After all, a moment’s thought shows that $\langle X \rangle$ is “just” the usual subgroup generated by $X$, equipped with the finest topology rendering $X \hookrightarrow \langle X \rangle$ continuous.

The utility of the topos theoretic language is in doing more complicated constructions, where we’re still allowed to manipulate everything as though they’re sets, and we can be safe in the knowledge that, at the end of the day, we can cash out our theorem for one about topological spaces! It frees us from the burden of carrying around topologies all the time.

For a more complicated example, one can show that the category of abelian groups in a (grothendieck) topos is always AB5. In particular, the category of abelian groups in $\mathcal{T}$ is abelian and has enough injectives, so we can do homological algebra to it! Contrast this with the category of abelian groups in $\mathsf{Top}$, which is famously not abelian!

This is one of the big motivations for Condensed Mathematics. Indeed, in $\mathsf{Top}$, the continuous bijection of abelian groups $(\mathbb{R},\text{discrete}) \to (\mathbb{R},\text{euclidean})$ is not an isomorphism. Yet the kernel and cokernel are both trivial! In both condensed mathematics and the topological topos, this is remedied by a more complicated cokernel. Remember that the colimits preserved by the embedding $\mathsf{Seq} \to \mathcal{T}$ are only those that look like covers.

In fact, we can compute the cokernel as the coequalizer of the inclusion map and the constant $0$ map. In the topos, this is the sheafification of the colimit of presheaves (which is computed pointwise). So the underlying set of the cokernel is the colimit of the underlying sets, which is $\{ 0 \}$. But the set of convergent sequences is the colimit of

\[\{ \text{eventually constant sequences} \} \rightrightarrows \{ \text{convergent sequences} \}\]

where one map is just the inclusion, and the other sends every eventually constant sequence to the constant $0$ sequence.

So the cokernel has a single point $\{ 0 \}$, but there’s a proof that the constant $0$ sequence converges for every equivalence class of convergent sequences differing by an eventually constant sequence!

Keeping track of these proofs (which themselves form an abelian group) is exactly what we need to do to algebraically detect that $(\mathbb{R}, \text{discrete}) \to (\mathbb{R}, \text{euclidean})$ isn’t an isomorphism!

As an aside, I don’t understand condensed mathematics well enough to know how it differs from math in the topological topos. Just looking at definitions, I know it’s based on test maps from all compact hausdorff spaces instead of test maps from only $\mathbb{N}_\infty$. This probably means it’s closely related to compactly generated spaces in much the way that $\mathcal{T}$ is related to sequential spaces. I’m sure there’s a reason to prefer this, but I don’t know what it is⁸. The moral to keep in mind is that the power of doing algebra in a topos that handles the topology for you is currently being used to great effect in applying homological algebra to analytic situations where it previously couldn’t go!

Alright, I told you this one was going to be more leisurely than the last one! Now that we’ve seen some applications of $\mathcal{T}$ to topological algebra, and we’ve seen some basic externalization, let’s move on to part 3 and really get familiar with the internal logic and how it relates to the real world!

You can show this with categorical techniques. For instance, the category of models of any algebraic theory is always regular, while the category of posets isn’t ↩
The category of models for any algebraic theory always has an initial object, yet the category of fields doesn’t! ↩
See McCune’s Single Axioms for Groups and Abelian Groups with Various Operations. This operation is related to the “usual” operations by $x / y = x \cdot y^{-1}$. ↩
Remember, though, that the product on $\mathsf{Seq}$ is different from the product on $\mathsf{Top}$. This never matters in practice, and the $\mathsf{Seq}$ product agrees with the product in the “convenient category” of compactly generated spaces, but if you want an honest group object in $\mathsf{Top}$, you’ll want $G$ to be locally compact. ↩
It’s not predicative, but that’s fine for a topos. And regardless, if you know enough to complain about predicativity, you know enough to give a predicative version of this proof :P. ↩
Indeed, it says something slightly stronger than this! In the case of non (sequentially) hausdorff spaces, there might be extra subsets that are merely kuratowski limit spaces! The theorem says we’re actually allowed to take $X$ to be such a subspace as well! ↩
The diligent reader will note there are a proper class of such arrows, so this pullback as written isn’t defined. Of course, the domain of any such arrow has at most $|G|$ many elements, and there’s only a set worth of topologies we can put on one of these domains. So up to isomorphism there’s only a set worth of arrows, and we’re good to go! ↩
Peter Scholze actually says a few words about why condensed sets are easier to work with than objects of $\mathcal{T}$ in a comment to his answer to this MO question. I still don’t really see it, but that’s probably because I haven’t spent a lot of time (or any time) working with condensed sets. ↩

Life in Johnstone's Topological Topos 1 -- Fundamentals

Wed, 03 Jul 2024 00:00:00 +0000

I’ve been thinking a lot about the internal logic of topoi again, and I want to have more examples of topoi that I understand well enough to externalize some statements. There’s more to life than just a localic $\mathsf{Sh}(B)$, and since I’m starting to feel like I understand that example pretty well, it’s time to push myself to understand other important examples too!

In particular, it would be nice to throw some gros topoi into the mix, and where better to start than Johnstone’s topological topos? This topos is fairly small (which makes explicit computation easy) and is very well studied (which makes finding references and examples merely annoying instead of totally impossible). Eventually I’ll want to learn about the effective topos¹ (and other realizability topoi more generally), various smooth topoi, etc. but let’s take them on one-at-a-time!

The topological topos $\mathcal{T}$ is a world where every set is intrinsically a space. What does this mean?

Well, if we’re working in $\mathsf{Set}$, then a space is a set $X$ equipped with some ~bonus structure~. This structure can take a lot of forms, but one ubiquitous example is that of a topology $\tau \subseteq \mathcal{P}(X)$.

Then if you want to work with spaces, you have to constantly keep track of what topology you’re working with. For example, there’s lots of topologies you can put on $X \times Y$, and we need to make sure we choose the right one to act like the product of the spaces $(X,\tau)$ and $(Y,\sigma)$.

Nowadays the “right” topology is usually “obvious²”, but this is only because we’re able to stand on the shoulders of countless 20th century mathematicians! I think most people would be hard-pressed to come up with, say, the compact-open topology if they hadn’t seen it before!

Of course, carrying around this ~bonus structure~ becomes most pronounced when working with continuous maps! Now, instead of just defining a function and moving on with your life, we’re constantly burdened to check that our function $X \to Y$ actually respects the topologies involved! Otherwise it isn’t a continuous function $(X,\tau) \to (Y, \sigma)$! There are some friends of mine who are constantly complaining that algebraic topologists never check that things are continuous, and honestly I’m sympathetic. But it can be a real hassle to check these things all the time…

Thankfully there’s a better way!

Every set in $\mathcal{T}$ is, by its very nature, a space! There’s no need to choose the “right” topology, or to check that your function is “continuous”. Inside $\mathcal{T}$, it’s literally impossible to write down a function that isn’t continuous, because there’s no ~bonus structure~ to respect! This is what we mean when we say the topology is intrinsic.

This is great for a couple reasons. First, say you build a type $X$ in Martin-Löf Type Theory (MLTT). We know how to interpret MLTT in any topos, so by interpreting $X$ in $\mathcal{T}$ we learn that our type $X$ is automatically a space! Understanding this relationship between types and topology has been a staple in many people’s careers, but I want to single out Martín Escardó as someone whose papers I’ve been reading lately (and who I talk to fairly often on mastodon). These conversations were a big part of the reason I decided to spend some time trying to understand $\mathcal{T}$.

Second, by the same logic, any theorem we’re able to prove constructively is automatically true in $\mathcal{T}$. That is, any constructive theorem is automatically true “continuously”, giving us a theorem for topological structures! Of course, in order to use these theorems, we need to understand how objects inside the topological topos $\mathcal{T}$ relate to honest topological spaces in “the real world”³.

In the process of learning about $\mathcal{T}$, I had to work out a bunch of examples, which I’d love to share! Even though all of this is probably “well known to experts”, I found a lot of it pretty hard to find, so hopefully this blog post is still useful for people ^_^.

Let’s get started!

What Is $\mathcal{T}$?

First, what even is the topological topos? It’s sheaves on some site, of course, but which one?

Write $\mathbb{N}_\infty$ for the one point compactification of $\mathbb{N}$. This is the space $\{0,1,2,3,\ldots,\infty\}$ topologized so that a convergent sequence $(x_n) \to x_\infty$ in $X$ is exactly a continuous map $\mathbb{N}_\infty \to X$⁴.

Then let’s look at the full subcategory of $\mathsf{Top}$ spanned by $\{1, \mathbb{N}_\infty \}$. This becomes a site if we give it the canonical topology $J$, and we define the topological topos $\mathcal{T}$ to be sheaves on this site.

We’ll say more about this definition in a minute, but first let’s see how we can picture objects of $\mathcal{T}$.

A presheaf on this site is a pair of sets $X(1)$ and $X(\mathbb{N}_\infty)$ with a bunch of maps connecting them:

There’s a map $n^* : X(\mathbb{N}_\infty) \to X(1)$ for each $n$, plus a map $\infty^* : X(\mathbb{N}_\infty) \to X(1)$
There’s a unique map $!^* : X(1) \to X(\mathbb{N}_\infty)$
For every continuous function $f : \mathbb{N}_\infty \to \mathbb{N}_\infty$ there’s a map $f^* : X(\mathbb{N}_\infty) \to X(\mathbb{N}_\infty)$.

You should think of elements of $X(1)$ as the points of $X$, and the elements of $X(\mathbb{N}_\infty)$ as (witnesses to) convergent sequences in $X(1)$. Indeed, if $p \in X(\mathbb{N}_\infty)$, then we’ll write $p_n$ for $n^*(p)$ (resp. $p_\infty$ for $\infty^* p$) and $p$ should be thought of as a witness or proof that the sequence $p_n$ converges to $p_\infty$ in $X(1)$⁵.

The unique map $!^*$ sends a point $x \in X(1)$ to a distinguished proof that the constant $x$ sequence converges to $x$, and the functions $f^*$ “reindex” a convergent sequence. So if $p$ is a proof that $x_n \to x_\infty$, then $f^* p$ is a proof that $x_{fn} \to x_{f \infty}$ too.
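If it helps to see reindexing concretely, here is a tiny sketch for a representable object, where a "proof of convergence" is nothing more than the sequence itself, viewed as a function on $\mathbb{N}_\infty$. I'm modelling $\infty$ by `math.inf` and using the sequence $\frac{1}{n+1} \to 0$ in $\mathbb{Q}$. All of these names (`p`, `reindex`, `evens`, `const`) are made up for the example. For a general object of $\mathcal{T}$ the witnesses carry more information than this.

```python
import math
from fractions import Fraction

INF = math.inf  # stands in for the point at infinity of N_infty

def p(n):
    """A witness that 1/(n+1) converges to 0, as a function on N_infty."""
    return Fraction(0) if n == INF else Fraction(1, n + 1)

def reindex(f, witness):
    """f^* witness = witness composed with f, for a map f from N_infty to itself."""
    return lambda n: witness(f(n))

def evens(n):
    """The monotone map with image {0, 2, 4, ...} together with infinity."""
    return INF if n == INF else 2 * n

def const(k):
    """The constant map at k (note it does not send infinity to infinity)."""
    return lambda n: k

q = reindex(evens, p)     # witnesses that 1/(2n+1) converges to 0
c = reindex(const(3), p)  # the constant sequence at p_3 = 1/4, converging to 1/4

print(q(2), q(INF), c(0), c(INF))
```

The two cases here (whether or not the reindexing map fixes $\infty$) are the two shapes a reindexed sequence can take: a subsequence-like thing converging to the old limit, or an eventually constant sequence converging to one of the old terms.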

The sheaf condition for the canonical topology guarantees that if every subsequence of $x_n$ converges to $x_\infty$, the whole sequence $x_n$ converges to $x_\infty$ too⁶.

Can you prove that this is reasonable?

If $x_n \to x$ in some topological space $X$, and $f : \mathbb{N}_\infty \to \mathbb{N}_\infty$ is continuous, why must $x_{fn} \to x_{f \infty}$?

You might find it helpful to case on whether $f(\infty) = \infty$ or not.

As a cute exercise, can you find a simple description of the arrows in $\mathcal{T}$? That is, of the natural transformations between two sheaves?

solution

A natural transformation between $X$ and $Y$ is a pair of functions $$f_1 : X(1) \to Y(1)$$ $$f_{\mathbb{N}_\infty} : X(\mathbb{N}_\infty) \to Y(\mathbb{N}_\infty)$$ So that whenever $p$ is a proof that $x_n \to x_\infty$, $f_{\mathbb{N}_\infty}(p)$ is a proof that $f_1(x_n) \to f_1(x_\infty)$. Moreover, $f_{\mathbb{N}_\infty}$ should respect the distinguished proofs that constant sequences converge (so $f_{\mathbb{N}_\infty}(!^* x) = !^* f_1(x)$) as well as reindexing (so $f_{\mathbb{N}_\infty}(g^* p) = g^* (f_{\mathbb{N}_\infty} p)$)

Now every topological space $X$ gives an object $よX = \mathsf{Top}(-,X)$ in the topos⁷, where $よX(1) = X$ and $よX(\mathbb{N}_\infty) = \{ \text{continuous functions } f : \mathbb{N}_\infty \to X \}$. That is, the underlying set of $よX$ is exactly the underlying set of $X$, and for every convergent sequence in $X$ there is a unique proof that that sequence converges (represented by the sequence itself).

If we restrict attention to the full subcategory of sequential spaces, then $よ$ is a fully faithful embedding into $\mathcal{T}$. This shouldn’t be too surprising, since the sequential spaces are exactly those spaces whose topologies are determined by a knowledge of which sequences converge!

Importantly, you should think of this as a super mild condition, since lots of natural spaces of interest are sequential. Just to name a few:

all metric spaces
more generally, all first countable spaces
every CW-complex
every noetherian ring spectrum

This tells us that a huge subcategory of topological spaces embeds fully faithfully into $\mathcal{T}$! Later we’ll say more about how computations in $\mathcal{T}$ translate to computations in the real world, but this is a good first indication that they should be closely related!

There’s another definition of $\mathcal{T}$ which you’re also likely to see.

Having $X(1)$ around explicitly as a set of points is helpful for exposition and intuition, but it turns out to not change the topos if we work without it! Intuitively, we can recover the points from the constant maps $n : \mathbb{N}_\infty \to \mathbb{N}_\infty$.

With this in mind, some authors define $\mathcal{T}$ to be the sheaves on just the full subcategory of $\mathsf{Top}$ spanned by $\{ \mathbb{N}_\infty \}$. That is, they define it to be sheaves on the monoid of continuous endomorphisms of $\mathbb{N}_\infty$. See, for instance, The Elephant (A2.1.11(j)).

This gives a different informal justification for the close connection between $\mathcal{T}$ and sequential spaces. Indeed, objects of a sheaf topos can be thought of as being glued together from objects of the underlying site. In case you’re working with a presheaf topos, we take all the ways to glue things together, but in general a grothendieck topology forces us to restrict attention to those gluings which are “nice” in some sense.

So, with this smaller site in hand, one way to think about objects in $\mathcal{T}$ is as copies of $\mathbb{N}_\infty$ that are “glued together nicely”. And one can show that the sequential spaces are exactly the quotients of disjoint unions of copies of $\mathbb{N}_\infty$! This also tells us that, in some sense, the other objects of $\mathcal{T}$ are just copies of $\mathbb{N}_\infty$ glued together in more exotic ways, for instance by gluing two copies of $\mathbb{N}_\infty$ literally on top of each other to get multiple witnesses to the convergence of the same sequence!

But how do we know that these two definitions agree? I wasn’t able to find this written down anywhere, but it’s easy to check for ourselves!

The key observation is that $\{ \mathbb{N}_\infty \}$ is a dense subsite of $\{ \mathbb{N}_\infty, 1 \}$. Here I’m using set-builder notation to mean a full subcategory of $\mathsf{Top}$ equipped with the canonical topology.

$\ulcorner$ Indeed, to check this, we only need to show that every object in $\{ \mathbb{N}_\infty, 1 \}$ is covered by maps with domain in $\{ \mathbb{N}_\infty \}$. But the identity function $\mathbb{N}_\infty \to \mathbb{N}_\infty$ covers, and the unique map $\mathbb{N}_\infty \to 1$ covers too.

Since $\{ \mathbb{N}_\infty \}$ is a full subcategory of $\{ \mathbb{N}_\infty, 1 \}$, the second condition of the comparison lemma is trivial, and we learn that the geometric map induced by the inclusion is an equivalence.

In particular, the two common definitions really do give equivalent topoi! $\lrcorner$

So, finally, what is the Canonical Topology?

For the site with two objects, $\{1, \mathbb{N}_\infty\}$, every (nonempty) family of arrows $\{X_\alpha \to 1 \}$ is covering. So the interesting question is what a covering family of $\mathbb{N}_\infty$ looks like.

If $S$ is an infinite subset of $\mathbb{N}$, we write $f_S$ for the unique monotone map $\mathbb{N}_\infty \to \mathbb{N}_\infty$ whose image is $S \cup \{ \infty \}$.

A family $\{X_\alpha \to \mathbb{N}_\infty\}$ is covering if and only if both

It contains every constant map $1 \to \mathbb{N}_\infty$
For every infinite $T \subseteq \mathbb{N}$, there is a further infinite subset $S \subseteq T$ with $f_S : \mathbb{N}_\infty \to \mathbb{N}_\infty$ in the family

In particular, if a family contains every constant map $1 \to \mathbb{N}_\infty$ and a “tail of an infinite sequence” $f_{\{x \geq N\}}$ for some $N$, then that family is covering.
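The maps $f_S$ are easy to compute with, which can help when checking whether a family is covering. Here is a small sketch, where $S$ is given by a membership test and $f_S(n)$ is the $n$-th smallest element of $S$ (counting from $0$). As before, `math.inf` plays the role of $\infty$, and the name `f_S` and the membership-predicate encoding are just my choices for the example. It assumes $S$ really is infinite, since otherwise the search never stops.

```python
import math
from itertools import count, islice

INF = math.inf  # stands in for the point at infinity of N_infty

def f_S(in_S):
    """The monotone map on N_infty whose image is S plus infinity.

    S is the set of naturals k with in_S(k) true, and is assumed infinite.
    """
    def f(n):
        if n == INF:
            return INF
        # the n-th smallest element of S, counting from 0
        return next(islice((k for k in count() if in_S(k)), n, None))
    return f

squares = f_S(lambda k: math.isqrt(k) ** 2 == k)
tail = f_S(lambda k: k >= 5)  # the "tail" map for N = 5, which is n -> n + 5

print([squares(n) for n in range(5)], tail(2))
```

In this encoding the tail map for a given $N$ is just a shift by $N$, which matches the intuition that a tail of a sequence together with all the individual points already determines convergence.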

So, roughly, to prove that something “merely exists” in $\mathcal{T}$, we have to provide a witness for every finite $n$, and these witnesses should converge to the witness for $\infty$.

If we want to use the site with one object $\{ \mathbb{N}_\infty \}$, the condition is almost exactly the same. A family of maps is covering if and only if both

every constant map $\mathbb{N}_\infty \to \mathbb{N}_\infty$ is in the family
For each infinite $T \subseteq \mathbb{N}$, there’s a further infinite $S \subseteq T$ so that $f_S$ is in the family.

This, unsurprisingly, doesn’t make too much difference. But note that the site with two objects is obviously local in the sense of The Elephant (C3.6.3(d)). So we learn that the global sections functor $\Gamma : \mathcal{T} \to \mathsf{Set}$ which takes an object $X$ to its set of points $X(1)$ admits the usual left adjoint characteristic of geometric morphisms (giving a set $X$ the discrete topology) but also a further right adjoint (giving a set $X$ the indiscrete topology).

In the original paper, Johnstone moreover shows that the essential point $\mathsf{Set} \to \mathcal{T}$ given by this indiscrete arrow is the unique global point of $\mathcal{T}$.

How Does $\mathcal{T}$ Relate to $\mathsf{Top}$?

Here’s the tl;dr for this section, for ease of reference.

We have a sequence of fully-faithful embeddings of bicartesian closed categories, each of which admits a left adjoint, as shown below:

The embeddings preserve all limits (as right adjoints) but moreover preserve the cartesian closed structure, as well as certain “nice” colimits (in particular, all colimits involved in the creation of CW-complexes). The exact definition of “nice” here is explained in Johnstone’s original paper, but includes the coproduct. Additionally, the image of a map $X \to Y$ of sequential spaces (with $Y$ sequentially hausdorff) as computed in $\mathcal{T}$ is just the set theoretic image equipped with the quotient topology (Corollary 6.4 in the original paper).

The left adjoints preserve all colimits, and moreover preserve finite products (and thus, in particular, models of algebraic theories).

Lastly, in case we restrict to the full subcategory of “sequentially hausdorff spaces”, in the sense that every convergent sequence has a unique limit, then the adjunction $\text{Seq} \leftrightarrows \text{Kur}$ is an adjoint equivalence!

Here $\text{Seq}$ is bicartesian closed, $\text{Kur}$ is locally cartesian closed, $\mathcal{T}$ is a topos, and the embeddings preserve all of this structure. Thus one can say that at each level we add new “type constructors”, as shown in the following diagram (stolen from Martín Escardó):

Colimits in $\text{Seq}$ are computed as in $\mathsf{Top}$, so in particular the “nice colimits” that get preserved between $\text{Seq}$ and $\mathcal{T}$ agree with colimits in $\mathsf{Top}$.

The relationship of limits in $\text{Seq}$ to limits in $\mathsf{Top}$ is more subtle. If you only care about (quotients of) second countable spaces, then the bicartesian closed structure on $\text{Seq}$ (and thus in $\mathcal{T}$) agrees with the usual bicartesian closed structure on the “convenient category” of compactly generated spaces. In particular, function spaces get the compact-open topology.

If your spaces are locally compact, then the (finite) product in $\text{Seq}$ (and thus in $\mathcal{T}$) agrees with the product in $\mathsf{Top}$.

That’s a lot, so let’s go more in depth into what all of this means, haha.

We’ll start with the definition of a Kuratowski Limit Space (also called a subsequential space):

A Kuratowski Limit Space is a set $X$ equipped with a set of Convergent Sequences in $X$ subject to the following axioms:

For every $x \in X$, the constant sequence $(x)$ converges to $x$
If $(x_n)$ converges to $x$, then every subsequence of $(x_n)$ converges to $x$ too
If, for some $x$, every subsequence of $(x_n)$ contains a further subsequence converging to $x$, then the whole sequence $(x_n)$ already converges to $x$.

We moreover call $X$ Sequentially Hausdorff if it satisfies the bonus axiom

If $(x_n)$ converges to $x$ and $(x_n)$ converges to $y$, then $x=y$.

A function $f : X \to Y$ between limit spaces is called continuous if whenever $x_n \to x$, we have $fx_n \to fx$.

Every sequential topological space is automatically a limit space, where we just let the convergent sequences be the (topologically) convergent sequences. Moreover, there’s a fully faithful embedding of limit spaces into $\mathcal{T}$ where we let $X(1) = X$ and $X(\mathbb{N}_\infty)$ be the set of convergent sequences in $X$.

As a (not so tricky) exercise, you might want to verify that this functor from $\mathsf{Kur}$ to presheaves actually always lands in sheaves.

This basically amounts to comparing axiom (3) for limit spaces to the definition of a cover of $\mathbb{N}_\infty$.

Taken together, we have fully faithful embeddings

\[\mathsf{Seq} \hookrightarrow \mathsf{Kur} \hookrightarrow \mathcal{T}\]

In fact, Johnstone’s original paper shows that $\mathsf{Kur}$ is the quasitopos of $\lnot \lnot$-separated sheaves in $\mathcal{T}$! Thus the embedding $\mathsf{Kur} \hookrightarrow \mathcal{T}$ admits a finite product preserving left adjoint, and the locally cartesian closed structure of $\mathsf{Kur}$ agrees with that of $\mathcal{T}$ (see A1.5.9 in the elephant).

Concretely, this left adjoint takes an object of $\mathcal{T}$ and forgets how a sequence converges, only remembering that it converges! Said another way, it identifies any proofs $p$ and $q$ with $p_n = q_n$ for all $n \in \mathbb{N}_\infty$.

Moreover, the embedding $\mathsf{Seq} \hookrightarrow \mathsf{Kur}$ also admits a finite product preserving left adjoint!

We say a subset $U \subseteq X$ is Sequentially Open if whenever $x_n \to a \in U$, some tail of $x_n$ is entirely contained in $U$. It’s easy to see that the sequentially open subsets form a topology on $X$, and indeed our reflector sends a limit space $(X,\{\text{convergent sequences}\})$ to the sequential space $(X,\{\text{sequential opens}\})$. This functor moreover preserves finite products⁸, which is Proposition 3.1 in Menni and Simpson’s Topological and limit-space subcategories of countably-based equilogical spaces⁹.

Also, notice that this reflection possibly adds new convergent sequences. Maybe our limit space $X$ knows about some convergent sequences, but once we actually build a topology to make these sequences converge in the usual sense, there might accidentally be more convergent sequences than we started with!

Conversely, the subobjects of a space $X \in \mathcal{T}$ come from taking a subset of the points and a subset of the convergent sequences. So we see this is exactly what the kuratowski limit spaces are! They’re the subobjects of sequential spaces, where we may have forgotten about certain sequences that “would converge” if we still had our open sets.

In general, we can think of objects in $\mathsf{Seq}$ as honest spaces, with points and all the convergent sequences that should exist. Objects in $\mathsf{Kur}$ are almost honest spaces, we just might have forgotten about a few convergent sequences that “should” be there if we remembered the whole topology. But there’s still only “one way” for any given sequence to converge. Objects in $\mathcal{T}$ are like spaces which might have forgotten some convergent sequences, and which have ~bonus data~ attached to them giving multiple inequivalent proofs that these sequences converge!

But this intuitive picture tells us how to get an honest space from your favorite object $X \in \mathcal{T}$! We take $X(1)$ as our set of points, and a subset of $X(1)$ is open exactly when it’s sequentially open, forgetting the data of the multiple proofs of convergence.

The fact that the reflectors preserve finite products tells us that $\mathsf{Seq}$ and $\mathsf{Kur}$ are exponential ideals in $\mathcal{T}$. Thus they’re both cartesian closed, and the embeddings preserve the cartesian closed structure! It’s not hard to see the embeddings $\mathsf{Seq} \hookrightarrow \mathsf{Kur}$ and $\mathsf{Kur} \hookrightarrow \mathcal{T}$ preserve coproducts, so that we get the promised embeddings of bicartesian closed categories.

Lastly, the cartesian closed structure on $\mathsf{Seq}$ is the one you would expect from viewing it as a “convenient category of spaces”. The exponential is (usually) the compact-open topology! You can read more about the subtleties in Escardó, Lawson, and Simpson’s Comparing Cartesian closed categories of (core) compactly generated spaces, but the gist is that you get the compact-open topology whenever you’re working with (quotients of) second countable spaces!

From this information, there’s a few simple corollaries that I want to mention explicitly, since they give more relationships between $\mathsf{Top}$, $\mathsf{Seq}$, and $\mathcal{T}$.

First, fully faithful functors reflect isomorphisms, so if we can prove in $\mathcal{T}$ that two spaces are isomorphic, it means they must be isomorphic in $\mathsf{Seq}$ too. But then all functors preserve isomorphisms, so that we get an isomorphism in $\mathsf{Top}$ too! Thus, we can show two sequential spaces are homeomorphic by working entirely in $\mathcal{T}$! The converse argument (using the fully faithful embedding $\mathsf{Seq} \hookrightarrow \mathsf{Top}$) shows that two homeomorphic sequential spaces are also isomorphic in $\mathcal{T}$, so that we can detect every homeomorphism just by working in $\mathcal{T}$.

Similarly, if $A$ and $B$ are both sequential, then a map $A \to B$ is monic in $\mathsf{Top}$ if and only if it’s monic in $\mathsf{Seq}$ if and only if it’s monic in $\mathcal{T}$! In all cases, the monics are exactly the continuous injections¹⁰. This tells us that anything we “expect” to be a subobject in $\mathcal{T}$ actually is. But note that in $\mathsf{Top}$ we might have nonsequential subspaces of a sequential space (any non-Fréchet sequential space will do; see here) and similarly in $\mathcal{T}$ we might have nonsequential subspaces of a sequential space (indeed, every kuratowski limit space is a subobject in $\mathcal{T}$ of a sequential space). Nonetheless, open/closed subspaces of a sequential space will be sequential in both $\mathsf{Top}$ and in $\mathcal{T}$.

Dually, if $A \to B$ is an epi in $\mathcal{T}$, then it’s an epi in $\mathsf{Seq}$, and thus an epi in $\mathsf{Top}$ (since the inclusion $\mathsf{Seq} \to \mathsf{Top}$ is a left adjoint, and preserves epis). But there’s no reason to suspect an epi in $\mathsf{Top}$ to remain an epi in $\mathcal{T}$.

This is great and all, but the only way to really get some intuition for how computations in $\mathcal{T}$ relate to computations in $\mathsf{Top}$ is to actually do some computation and check! So let’s do that! Let’s start with a few important types representing various kinds of proposition. These will be important for building new types later, and for understanding how to externalize them.

$2$

This is the discrete space with two points $\{\top,\bot\}$. This is sequential (every finite space is, and so is every discrete space), so behaves exactly as you would expect. Note that the convergent sequences are all eventually constant!

We think of $2$ as the space of Decidable Propositions, so that maps $X \to 2$ classify decidable properties of $X$. These are the same as clopen subsets of $X$, and thus might be quite rare. Notice that, in $\mathcal{T}$, $2$ doesn’t form a complete lattice! It has finite joins and meets, of course, and we can build the continuous functions $\land, \lor : 2 \times 2 \to 2$ quite easily. Thinking of $2$ as classifying clopen subobjects, this corresponds to the fact that a finite union/intersection of clopen sets is clopen. Indeed, if $A,B \subset X$ are clopen, classified by maps $\chi_A, \chi_B : X \to 2$, then the map $\lambda (x:X). \chi_A(x) \land \chi_B(x) \ : X \to 2$ classifies exactly $A \cap B$.

This tells us immediately that $2$ cannot have countable joins/meets. We can see this via continuity, since the map $\bigwedge : 2^\mathbb{N} \to 2$ with $\bigwedge \alpha = \begin{cases} \top & \forall n . \alpha(n) = \top \\ \bot & \text{otherwise} \end{cases}$ is not continuous! (and neither is $\bigvee$. In both cases, do you see why?)
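If you'd like to poke at this yourself, here's a minimal sketch of one witness for each failure. It only models the eventually constant points of $2^\mathbb{N}$ (so that $\bigwedge$ and $\bigvee$ are computable), and the names `EvConst`, `alpha`, `beta` and `settles` are made up for this illustration. It uses the fact that convergence in cantor space is pointwise convergence, and it only checks a finite window of each sequence, so it's a sanity check rather than a proof.

```python
from dataclasses import dataclass
from typing import Callable, Tuple


@dataclass(frozen=True)
class EvConst:
    """An eventually constant point of 2^N: a finite prefix, then `tail` forever."""
    prefix: Tuple[bool, ...]
    tail: bool

    def __call__(self, n: int) -> bool:
        return self.prefix[n] if n < len(self.prefix) else self.tail


def big_and(a: EvConst) -> bool:
    """The infinite meet: True exactly when every entry is True."""
    return all(a.prefix) and a.tail


def big_or(a: EvConst) -> bool:
    """The infinite join: True exactly when some entry is True."""
    return any(a.prefix) or a.tail


def alpha(k: int) -> EvConst:
    """k copies of True, then False forever."""
    return EvConst((True,) * k, False)


def beta(k: int) -> EvConst:
    """True at index k only."""
    return EvConst((False,) * k + (True,), False)


ALL_TRUE = EvConst((), True)
ALL_FALSE = EvConst((), False)


def settles(seq: Callable[[int], EvConst], limit: EvConst, n: int, window: int = 50) -> bool:
    """Check that coordinate n of seq(k) equals limit(n) for `window` values of k past n."""
    return all(seq(k)(n) == limit(n) for k in range(n + 1, n + 1 + window))
```

Here `alpha(k)` converges pointwise to `ALL_TRUE`, yet `big_and(alpha(k))` is `False` for every `k` while `big_and(ALL_TRUE)` is `True`. Dually, `beta(k)` converges pointwise to `ALL_FALSE`, yet `big_or(beta(k))` is always `True`. In both cases a convergent sequence is sent to a sequence that doesn't converge to the image of the limit in the discrete space $2$.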

We can instead see this by thinking of $2$ as classifying clopen subsets, since if $\bigwedge$ or $\bigvee$ existed, we could use them with classifying maps to show the countable intersection/union of clopen sets is clopen. But of course we know this is false!

As a last aside, it’s not hard to see that $2$ being countably complete corresponds exactly to the Weak Limited Principle of Omniscience (WLPO), so that this shows $\mathcal{T} \not \models \text{WLPO}$. We’ll talk more about omniscience principles in part 3, but it’s nice to mention.

$\Sigma$

This is the sierpinski space, which also has two points $\{\top,\bot\}$, but with the topology $\{\emptyset, \{\top\}, \Sigma \}$. This is sequential (every finite space is), where every sequence converges to $\bot$ and the sequences converging to $\top$ are only the eventually constantly $\top$ ones.
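Convergence in a finite space is easy to check by machine, at least for eventually periodic sequences: such a sequence converges to a point exactly when every open set containing that point contains the whole repeating part. Here's a small sketch of that test applied to $\Sigma$. The function `converges` and the labels `"T"` and `"B"` (for $\top$ and $\bot$) are names chosen just for this example.

```python
from typing import FrozenSet, Iterable, List

Open = FrozenSet[str]


def converges(opens: Iterable[Open], cycle: Iterable[str], point: str) -> bool:
    """Does an eventually periodic sequence with repeating part `cycle` converge to `point`?

    Every value in `cycle` occurs infinitely often, so the sequence is
    eventually inside an open set U exactly when `cycle` is a subset of U.
    """
    tail = frozenset(cycle)
    return all(tail <= U for U in opens if point in U)


# The topology {∅, {⊤}, Σ} on the sierpinski space.
SIERPINSKI: List[Open] = [
    frozenset(),
    frozenset({"T"}),
    frozenset({"T", "B"}),
]
```

For instance `converges(SIERPINSKI, ["T", "B"], "B")` holds for the alternating sequence, while `converges(SIERPINSKI, ["T", "B"], "T")` fails, matching the description above. Swapping in a different list of opens lets you run the same check on any other finite space.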

We think of $\Sigma$ as the space of Open Propositions, since maps $X \to \Sigma$ classify the open subspaces of $X$. By the same logic as before, it’s easy to show that $\Sigma$ is a lattice with finite meets and joins (indeed, we can build the maps $\lor$ and $\land$ using the yoneda lemma in $\mathsf{Seq}$ knowing that open subspaces are closed under finite unions and intersections).

A similar yoneda argument shows that $\Sigma$ is a complete lattice, since homs into $\Sigma$ are open subspaces, and arbitrary unions of opens are open, so these operations must be represented by joins $\bigvee$ on $\Sigma$. But it’s kind of fun to show directly that $\bigvee : \Sigma^\mathbb{N} \to \Sigma$ is continuous (and more generally so is $\bigvee : \Sigma^\kappa \to \Sigma$ for any cardinal $\kappa$).

As a cute aside, this tells us that $\Sigma$ must be closed under arbitrary meets as well… But of course, the arbitrary intersection of open subspaces isn’t open. Do you see what’s going on there?

$\Sigma^c$

This is the “co-sierpinski space”. It has two points $\{\top, \bot\}$ but the topology is $\{\emptyset, \{\bot\}, \Sigma^c\}$. Again it’s sequential, but now we see that every sequence converges to $\top$, while only the eventually constant $\bot$ sequences converge to $\bot$.

Unsurprisingly, this is the space of Closed Propositions, and maps into $\Sigma^c$ classify closed subspaces of $X$. The same logic as before shows $\Sigma^c$ is a complete lattice, but now we have direct access to $\bigwedge$, since closed subspaces are closed under arbitrary intersection!

$\nabla 2 = \Omega_{\lnot \lnot}$

At this point you know what’s going on. This space has two points, $\{\top, \bot\}$ with the indiscrete topology (the only opens are $\emptyset$ and $\nabla 2$), so here every sequence converges to both $\top$ and $\bot$. We think of this as the space of all “classical” propositions, since a map $X \to \nabla 2$ is exactly the same thing as a “classical” subspace of $X$. That is, a subset of the points equipped with the induced topology.

This interpretation also makes it more believable that it should correspond to $\Omega_{\lnot \lnot}$, the double-negation closed propositions, as this does represent the “classical” propositions, where we really care about subsets of points (which are complemented, so satisfy LEM), and then take whatever topology makes the universal property work (which happens to be the subspace topology).

Notice that this is again a complete lattice. We can see this either because $\nabla 2$ is indiscrete, so every function into it is continuous (in particular, the arbitrary join/meet functions are continuous), or by using yoneda again (since arbitrary unions/intersections of subsets of points are again subsets of points).

$\Omega$

This is the subobject classifier, or the space of all propositions! Maps $X \to \Omega$ classify all subspaces, even the possibly nonclassical ones. That is, this is the first time we’re able to build a limit space, rather than an “honest” sequential space!

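Notice that, for $2$ (resp. for $\Sigma$, $\Sigma^c$, and $\nabla 2$) if we start with a sequential space $X$ and a decidable (resp. open or closed or classical) proposition $\varphi : X \to 2$ (resp. $\Sigma$, $\Sigma^c$, $\nabla 2$), then the pullback $A_\varphi$ of the point $\top : 1 \to 2$ (resp. $\Sigma$, $\Sigma^c$, $\nabla 2$) along $\varphi$

\[\begin{array}{ccc} A_\varphi & \longrightarrow & 1 \\ \downarrow & & \downarrow {\scriptstyle \top} \\ X & \xrightarrow{\ \varphi\ } & 2 \end{array}\]

exists in $\mathsf{Seq}$, and is an honest clopen (resp. open, closed, classical) subspace of $X$.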

Note that this $A_\varphi$ is an object above $X$, so if we want to access it in the type theory we have to compose with the map $X \to 1$. That is, in the type theory $A_\varphi$ (the subspace classified by $\varphi$) is represented by $\sum_{x:X} \varphi(x)$, as we would expect.

Now, what does the space of all propositions look like in $\mathcal{T}$? Well, we know that $2$, $\Sigma$, $\Sigma^c$, and $\nabla 2$ are all sequential spaces. So there’s a unique convergence proof for each sequence. It turns out the ability to “remember” only some convergent sequences (which puts us into the world of kuratowski limit spaces) can be coded up by more interesting proofs of convergence! Let’s see how!

Johnstone shows that $\Omega$ has two points $\top$ and $\bot$, and for any sequence $x_n$ of $\top$s and $\bot$s

there’s exactly one proof that $x_n \to \bot$
there’s one proof that $x_n \to \top$ for each “closed ideal of subsets of $\mathbb{N}$” whose “extent” is $\{n \mid x_n = \top \}$.

Here a “closed ideal of subsets of $\mathbb{N}$” is a pair $(E,I)$ where $E \subseteq \mathbb{N}$ is called the extent of the closed ideal of subsets.

In the above, $I$ is a family of infinite subsets of $E$, so that

if $S$ is an infinite subset of $T$ and $T \in I$, then $S \in I$ too
if every infinite subset of $T$ has a further infinite subset in $I$, then $T \in I$ too.

These correspond, basically, to axioms (2) and (3) for limit spaces. This will make sense once we understand how this subobject classifier actually classifies subobjects!

Say that we have a subobject $A \hookrightarrow X$. That is, we remember only some of the points of $X$, and we only remember some of the (proofs of) convergent sequences. This had better be classified by a unique map (read: natural transformation) $\ulcorner A \urcorner : X \to \Omega$.

On points, we have $\ulcorner A \urcorner(x) = \begin{cases} \top & x \in A \\ \bot & x \not \in A \end{cases}$, but what do we do for a proof $p$ that $x_n \to x_\infty$?

Well, to what extent is $(x_n) \in A$? Each point of the sequence is either in $A$ or isn’t, so a sequence $x_n$ in $X$ produces a sequence of $\top$s and $\bot$s given by $\omega_n = \ulcorner A \urcorner(x_n)$.

If $x_\infty \not \in A$ (that is, if $\omega_\infty = \bot$), then life is easy. We send $p$ to the unique proof that $\omega_n \to \bot$. If instead $x_\infty \in A$, then there are lots of proofs that $\omega_n \to \top$, indexed by these closed ideals of subsets. So to decide where $p$ should go, we need a natural choice of $(E,I)$ associated to our sequence $x_n$.

We’ll let $E = \{ n \mid x_n \in A \}$ be the extent to which $x_n \in A$. Next, we have to say which infinite subsets of $E$ live in $I$. Given any infinite subset $T \subseteq E$, we can restrict $x_n$ to a subsequence of indices in $T$. Since $E$ is exactly the indices where $x_n \in A$, this restriction $x_n \upharpoonright T$ is a sequence in $A$. Now we say that $T \in I$ if and only if the restriction $p \upharpoonright T$ proving that $x_n \upharpoonright T \to x_\infty$ in $X$ is one of the proofs we kept in $A$.

This shows where these conditions on “closed ideals of subsets” come from! If $T \in I$, that means that $p \upharpoonright T : x_n \upharpoonright T \to x_\infty$ is a convergence proof in $A$. So every subsequence of this (read: every infinite subset of $T$) must also converge in $A$. Also, if every subsequence has a further convergent subsequence (if every infinite subset of $T$ has a further infinite subset in $I$), then $p \upharpoonright T$ must also be a convergence proof in $A$!

For example, say $X = \mathbb{R}$ with the usual topology, and $A = \mathbb{R}$ with the discrete topology. Then the identity $A \hookrightarrow X$ is monic, so should be a subobject, but in order to classify this we need to only remember some convergent sequences (the eventually constant ones). Note that every sequence in $X$ gets sent to the constant $\top$ sequence in $\Omega$, since $A$ and $X$ agree on points. But thankfully now there are multiple proofs available that the constant $\top$ sequence converges to $\top$! We send a convergent sequence $(x_n)$ to the proof indexed by $(\mathbb{N},I)$ where an infinite subset $S$ is in $I$ if and only if $S$ is the set of indices of an eventually constant subsequence of $(x_n)$.
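To make this concrete, here are three sequences in $X = \mathbb{R}$ converging to $0$, together with the ideal $I$ that this recipe assigns to each (in every case $E = \mathbb{N}$). This is a sketch I worked out from the description above, not an example taken from Johnstone’s paper:

\[\begin{array}{lcl} x_n = 0 \text{ for all } n & \mapsto & I = \{ S \subseteq \mathbb{N} \mid S \text{ infinite} \} \\ x_n = \frac{1}{n+1} & \mapsto & I = \emptyset \\ x_{2n} = 0, \ x_{2n+1} = \frac{1}{n+1} & \mapsto & I = \{ S \subseteq \mathbb{N} \mid S \text{ infinite, with only finitely many odd elements} \} \end{array}\]

In the first row the whole sequence still converges in $A$, so $\mathbb{N} \in I$ and we get the maximal ideal. In the second row the terms are pairwise distinct, so no subsequence is eventually constant and nothing survives. The third row sits in between: a subsequence is eventually constant exactly when it eventually avoids the odd indices. It’s a nice check that all three families really do satisfy the two closure conditions for a “closed ideal of subsets”.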

Lastly, note that if we reflect $\Omega$ into $\mathsf{Seq}$, it becomes the humble indiscrete space $\nabla 2$. All of its power comes from the ability to carry extremely detailed information inside its multiple proofs of convergence. Storing “extra information” in the convergence proofs will come up again in part 2 when we look at quotients of topological algebras.

$\mathbb{R}$

There’s an object $よ\mathbb{R} = \text{Hom}_\mathsf{Top}(-,\mathbb{R})$ in $\mathcal{T}$ since $\mathbb{R}$ is a sequential space. But usually when people talk about a “real numbers object” in a topos, they mean the object of dedekind reals (points of a certain locale).

It turns out that in $\mathcal{T}$ this distinction doesn’t matter! Johnstone’s original paper computes that the object of points of the theory of dedekind reals is exactly the object $よ\mathbb{R}$!

We won’t show this here because we’ll show something more general in just a few bullets!

$2^\mathbb{N}$

Since $2$ and $\mathbb{N}$ are both sequential, their exponential in $\mathcal{T}$ agrees with their exponential in $\mathsf{Seq}$. Since $2$ and $\mathbb{N}$ are moreover second countable, $2^\mathbb{N}$ gets the compact open topology.

Now, in the realm of classical topology, since $\mathbb{N}$ is discrete the compact open topology on $2^\mathbb{N}$ is just the product topology, and we get cantor space (as expected!)

Again, one can also ask about the points of the locale object “cantor space”. And again, we find that the points of this locale are represented by $よ2^\mathbb{N}$, which is the same thing as the internal $2^\mathbb{N}$ we just computed!

$\sum_{\alpha : 2^\mathbb{N}} \forall {n : \mathbb{N}} \ \alpha(n+1) \leq \alpha(n)$

For each $n$, the proposition $\alpha(n+1) \leq \alpha(n)$ is decidable (said another way, the subset $A_n = \{ \alpha \mid \alpha(n+1) \leq \alpha(n) \} \subseteq 2^\mathbb{N}$ is clopen for each $n$), but once we ask for this quantifier (which we interpret as an infinite meet), we’re forced to work in $\Sigma^c$.

So “$\forall n : \mathbb{N} . \alpha(n+1) \leq \alpha(n)$” is a closed proposition, and $\sum_{\alpha : 2^\mathbb{N}} \forall {n : \mathbb{N}} \ \alpha(n+1) \leq \alpha(n)$ is just the closed subspace it classifies.

So this space, externally, is the closed subspace of cantor space corresponding to the decreasing binary sequences. This space is homeomorphic to $\mathbb{N}_\infty$, so we see these spaces are also isomorphic in $\mathcal{T}$.
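The homeomorphism is easy to write down: $n$ goes to the sequence of $n$ ones followed by zeros, and $\infty$ goes to the constant $1$ sequence. Here’s a rough sketch of both directions. The names `to_sequence` and `from_sequence` are invented for this example, and the inverse takes a search `bound` because no finite amount of searching can confirm that a sequence is constantly $1$, so it’s only an approximation of the real inverse.

```python
import math
from typing import Callable, Union

NInf = Union[int, float]  # a natural number, or math.inf standing in for ∞


def to_sequence(n: NInf) -> Callable[[int], int]:
    """Send n in N ∪ {∞} to the decreasing binary sequence with n leading ones."""
    return lambda k: 1 if k < n else 0


def from_sequence(alpha: Callable[[int], int], bound: int) -> NInf:
    """Return the first index below `bound` where alpha is 0, or math.inf if there is none.

    This only inspects finitely many entries, so an answer of math.inf means
    "no zero found yet", not a proof that alpha is constantly 1.
    """
    for k in range(bound):
        if alpha(k) == 0:
            return k
    return math.inf


def is_decreasing(alpha: Callable[[int], int], bound: int) -> bool:
    """Check alpha(k+1) <= alpha(k) for all k below `bound`."""
    return all(alpha(k + 1) <= alpha(k) for k in range(bound))
```

Notice that for a fixed index $k$, the value `to_sequence(n)(k)` is $1$ as soon as $n > k$, which is exactly the statement that the sequences for $n = 0, 1, 2, \ldots$ converge pointwise to the sequence for $\infty$.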

Regular Locales

Let $X$ be a regular locale. Then the object of $X$-models in $\mathcal{T}$ is represented by $\text{pt}(X)$, the topological space of models of $X$ in $\mathsf{Set}$.

In particular, if $X$ is a regular topological space, then the object of $X$-models in $\mathcal{T}$ is just $よX$, as we might hope!

In particular again, the dedekind reals object is $よ\mathbb{R}$, the cantor space object is $よ2^\mathbb{N}$, and any regular locale with enough points classically has enough points in $\mathcal{T}$.

$\ulcorner$ Write $X^\mathcal{E}$ for the object of $X$-models in some topos $\mathcal{E}$. The points of $X^\mathcal{E}$ are in bijection with the geometric maps $\mathcal{E} \to \mathsf{Sh}(X)$, and from here it’s a nice exercise to check that the $A$ valued points of $X^\mathcal{E}$ are in bijection with geometric maps $\mathcal{E} \big / A \to \mathsf{Sh}(X)$ for any object $A \in \mathcal{E}$. Moreover, since $X$ is a locale, the geometric maps $\mathcal{E} \to \mathsf{Sh}(X)$ are in bijection with continuous maps from the localic reflection of $\mathcal{E}$ to $X$.

Putting these facts together (in the special case of $\mathcal{T}$), we see that $X^\mathcal{T}(1) = \mathsf{Locale}(\Omega(1), X)$ and $X^\mathcal{T}(\mathbb{N}_\infty) = \mathsf{Locale}(\Omega(\mathbb{N}_\infty), X)$.

We know that $\Omega(1) = \{ \top, \bot \}$, so locale maps from $\Omega(1)$ to $X$ are just the ordinary set valued points of $X$. That is, $X^\mathcal{T}(1) = X^\mathsf{Set} = \text{pt}(X)$.

Now let’s look at $\mathsf{Locale}(\Omega(\mathbb{N}_\infty), X)$, that is, frame homomorphisms from $X$ to $\Omega(\mathbb{N}_\infty)$. Since $X$ is regular, we know that every $a \in X$ satisfies¹¹

\[a = \bigvee \{x \mid x \prec a \}\]

where $x \prec a$ (read as “$x$ is rather below $a$”) means that $\lnot x \lor a = \top$.

Then if $f : X \to \Omega(\mathbb{N}_\infty)$ is a frame hom we see that each $fa$ satisfies this same property. Indeed

\[\begin{align} fa &= f \left ( \bigvee \{ x \mid x \prec a \} \right ) \\ &= \bigvee \{ fx \mid x \prec a \} \\ &\leq \bigvee \{ y \mid y \prec fa \} \\ &\leq fa \end{align}\]
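This computation quietly relies on the fact that frame homomorphisms preserve the “rather below” relation, so that $x \prec a$ implies $fx \prec fa$. Here’s a short derivation, assuming only that $f$ preserves finite meets, arbitrary joins, $\top$ and $\bot$, and that $\lnot$ is the pseudocomplement:

\[\begin{align} f(\lnot x) \land f(x) &= f(\lnot x \land x) = f(\bot) = \bot, \quad \text{so } f(\lnot x) \leq \lnot f(x) \\ \lnot f(x) \lor f(a) &\geq f(\lnot x) \lor f(a) = f(\lnot x \lor a) = f(\top) = \top \end{align}\]

The last equality in the second line is where we use $x \prec a$. So $\lnot f(x) \lor f(a) = \top$, which is exactly $fx \prec fa$, and every $fx$ with $x \prec a$ shows up among the $y \prec fa$.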

But it’s not hard to see the only elements in $\Omega(\mathbb{N}_\infty)$ that satisfy this property are the proofs $(\omega_n) \to \bot$ and the proofs $(\omega_n) \to \top$ indexed by $(E,I)$ where $E$ is cofinite and $I$ is the maximal ideal of all infinite subsets of $E$¹². Since these are precisely the open subspaces of $\mathbb{N}_\infty$ (respectively, they’re any subset of $\mathbb{N}$ and any cofinite subset of $\mathbb{N}$ including $\{\infty\}$), we see that every frame map $X \to \Omega(\mathbb{N}_\infty)$ factors through the inclusion of the frame of opens of $\mathbb{N}_\infty$. Of course, frame maps $X$ to the opens of $\mathbb{N}_\infty$ are exactly locale maps $\mathbb{N}_\infty \to X$. Since $\mathbb{N}_\infty$ is spatial, these are exactly the same thing as maps of topological spaces from $\mathbb{N}_\infty$ to $\text{pt}(X)$.

So where did we start, and where did we end? We see that $X^\mathcal{T}(1) \cong \text{pt}(X) \cong \mathsf{Top}(1,\text{pt}(X))$ and $X^\mathcal{T}(\mathbb{N}_\infty) \cong \mathsf{Top}(\mathbb{N}_\infty, \text{pt}(X))$ so that $X^\mathcal{T} \cong よ\text{pt}(X)$, as claimed. $\lrcorner$

Note that in this proof regularity was only used to check that the convergent sequences in $X^\mathcal{T}$ are what we expected. Showing that global points of $X^\mathcal{T}$ agree with the points of $X$ in $\mathsf{Set}$ is easy, and true for every locale $X$! In particular, if $X$ has enough points in $\mathsf{Set}$, it also has enough points in $\mathcal{T}$!

This result tells us that coherent locales (as defined in $\mathsf{Set}$) have enough points in $\mathcal{T}$, which I’m pretty sure is equivalent to the prime ideal theorem… But I would expect this to be false in $\mathcal{T}$ since it’s quite close to full AC, so we should be able to use it to build a discontinuous function (but I don’t know how).

Is there a flaw in this proof? Maybe we get out of this problem if the prime ideal theorem in $\mathcal{T}$ is actually equivalent to all coherent locales internal to $\mathcal{T}$ having enough points? There are locales internal to $\mathcal{T}$ which don’t exist in $\mathsf{Set}$ at all, so maybe that’s a stronger statement than what this theorem gets us…

I would be SUPER grateful if some experts chimed in! Feel free to leave a comment on this post, email me, or say something on mastodon.

How does $\mathcal{T}$ relate to sheaf topoi $\mathsf{Sh}(X)$?

Like we said in the introduction, the topological topos $\mathcal{T}$ is a gros topos in the sense that its objects are productively thought of as spaces. However, to any individual topological space, we can associate a “petit topos” $\mathsf{Sh}(X)$ which should be thought of as a (generalized) space in its own right. Oftentimes, if $\mathcal{B}$ is a gros topos and $X$ is a topological space which lives in $\mathcal{B}$, there is a close connection between $\mathsf{Sh}(X)$ and the slice topos $\mathcal{B} \big / X$.

For instance some authors¹³ say that “a” topological topos is a category of sheaves on a subcategory $\mathcal{C}$ of $\mathsf{Top}$ closed under finite limits and open subspaces. The grothendieck topology is the natural one where a covering family is an open cover in the usual sense. Then if $X \in \mathcal{C}$, the usual sheaf topos $\mathsf{Sh}(X)$ is “homotopy equivalent” to the slice topos $\mathsf{Sh}(\mathcal{C},J) \big / X$. This is made precise at the end of Mac Lane and Moerdijk, Chapter VI.10.

It’s natural to ask for a similar relationship between $\mathsf{Sh}(X)$ and $\mathcal{T} \big / X$ for a sequential space $X$. This is the subject of Section 9 in Johnstone’s original paper, where it’s shown that there is a geometric morphism $\mathcal{T} \big / X \to \mathsf{Sh}(X)$, but this relationship is somewhat less compelling than in the case of a more traditional gros topos. For instance, the direct image half of this morphism isn’t exact, so that cohomology of $\mathcal{T} \big / X$ does not agree with the cohomology of $\mathsf{Sh}(X)$. In particular, even for the (closed) unit interval $I$ and a finite abelian group $A$ Johnstone shows that $H^1(\mathcal{T} \big / I; A)$ is not trivial!

Johnstone decides to not say anything more about the geometric morphisms $\mathsf{Sh}(X) \to \mathcal{T}$, and we’ll follow suit.

⚠ From this discussion, though, we learn that “the” topological topos $\mathcal{T}$ is not “a” topological topos in the sense of Moerdijk and Reyes (and other papers). In particular, we have to be careful when reading the literature which version of “topological topos” the author is talking about.

Characterizing “Honest Spaces” Internally

We saw earlier how every type we construct represents some space by interpreting that type in $\mathcal{T}$ and then reflecting back into $\mathsf{Seq}$. This remains true even if we use certain principles that aren’t always true in type theory, but happen to be true in $\mathcal{T}$ (we’ll discuss this in Part 3).

From this point of view it’s natural to ask when a type we’ve constructed is already a space, without needing to do any reflection. Is there a way to internally characterize the “honest spaces” amongst all the objects in $\mathcal{T}$?

The first step in this process is to recognize $\mathsf{Kur}$ as the $\lnot \lnot$-separated objects. That is, an object $X \in \mathcal{T}$ is actually in $\mathsf{Kur}$ if and only if the internal logic thinks

\[\mathtt{is-}\lnot\lnot\mathtt{-Separated}(X) \triangleq \prod_{x,y : X} \lnot \lnot(x=y) \to (x=y)\]

This is implicit in Propositions 3.6 and 4.3 of Johnstone’s original paper.

Then, we use the fact that the full subcategory of sequentially hausdorff sequential spaces is equivalent to the full subcategory of sequentially hausdorff limit spaces! With this in mind, we define

\[\mathtt{isSeqHaus}(X) \triangleq \prod_{f,g : \mathbb{N}_\infty \to X} \left ( \Big ( \prod_{n : \mathbb{N}} f(n) = g(n) \Big ) \to f(\infty) = g(\infty) \right )\]

and it’s easy to show that $\mathcal{T}$ thinks, internally, that $\mathtt{isSeqHaus}(X)$ if and only if $X$ is sequentially hausdorff in the real world.

In particular, this means that we can show an object of $\mathcal{T}$ represents an honest topological space by internally showing that it’s $\lnot\lnot$-separated and sequentially hausdorff¹⁴!

Ok! That’s plenty for this post, where we’ve learned a lot of fundamentals about what $\mathcal{T}$ is, how we can think about its objects, how we can build new objects using type theory and various proposition classifiers, and most importantly we’ve learned how the things we build in $\mathcal{T}$ relate to topological spaces in “the real world”!

Next up we’ll talk about topological algebras, and learn how we can use the topological topos to reason smoothly about these things. This is a shorter palate cleanser between this post (which is quite long) and the third post on ~bonus axioms~ validated by $\mathcal{T}$ (which is slightly less long than this one, but with much heavier math).

Thanks for hanging in there, and for all the encouragement while I was writing this! It’s been really exciting to know how many people are interested in reading this series ^_^.

As a last request, I’ll be turning this series into a paper in the very near future. If you have any suggestions for other examples to add, or axioms to check, or if you notice any typos or outright mistakes, definitely let me know! Also, experts, if you have any additional context you think would fit well in what’s shaping up to be quite a long survey of the topological topos that you want to get into the literature, please let me know that too! It’ll be nice for future mathematicians to have this all in one place!

As always, thanks for reading, all! Stay safe, and talk soon 💖

I spent some time a few years ago (Feb of 2022, according to my Zotero) thinking hard about the effective topos. I think I was going to write a blog post about it, but I never got around to it.

I think I should be able to remind myself what was going on, and in a perfect world I would understand it much better now that I know more things, so hopefully I’ll finally write that post. This is particularly relevant now that Andrej Bauer and James Hanson have posted their preprint constructing a realizability topos where the reals are countable. ↩
And even then, it might not be obvious when you’re learning! I remember when I first learned pointset topology the idea of a “cylinder set” and the product topology made no sense to me! Honestly without the framework of topological categories or something similar, I could see people still being surprised that the box topology isn’t the “right” topology on an infinite product space! See, for instance, this old and highly upvoted mse question. ↩
After my last post on a constructive extreme value theorem, I wanted to see how it externalizes in topoi other than $\mathsf{Sh}(B)$. I’m pretty sure in the effective topos we’ll get something that looks like an algorithm eating a function on a compact space and returning its max… But it’s super unclear what we should get interpreting this in $\mathcal{T}$! After all, a frame in $\mathcal{T}$ is a topological lattice $L$. So a locale in $\mathcal{T}$ is a space whose frame of opens is itself a space…

I still haven’t totally figured out this story (though I’m much less weirded out by the idea since I remembered that scott topologies exist as a natural topology on the frame of opens), so I won’t say anything more in this post, but trying to understand $\mathcal{T}$ well enough to externalize the constructive extreme value theorem was the second big motivator for this post. Of course, understanding $\mathcal{T}$ was so fun and interesting that I got distracted from my original goal, but that’s how these things tend to go for me, haha. ↩
If you want a super concrete description, it’s equivalent to
\[\left \{ 1, \frac{1}{2}, \frac{1}{3}, \frac{1}{4}, \ldots, \frac{1}{n}, \ldots, 0 \right \} \subseteq \mathbb{R}\]
with the subspace topology. ↩
Note that it’s entirely possible for two different elements $p \neq q \in X(\mathbb{N}_\infty)$ to witness convergence of the same sequence (so $p_n = q_n$ for all $n$ and $\infty$)!

Indeed, this will be crucial later, for instance in our discussion of the subobject classifier $\Omega$. ↩
This is spelled out quite clearly in Johnstone’s original paper On a Topological Topos. Indeed, Johnstone computes the covering sieves very explicitly, and I highly recommend reading about it there. Of course, I’ll say a few words about it in this post too! ↩
It’s not immediately obvious that this presheaf is actually a sheaf, but it turns out to be. This is a nice exercise. ↩
But we must remember that products in $\mathsf{Seq}$ are not products in $\mathsf{Top}$ in general! Indeed, for most “convenient categories of spaces” the product is not the product in $\mathsf{Top}$. We can get the product in $\mathsf{Seq}$ by taking the product in $\mathsf{Top}$ and coreflecting back into $\mathsf{Seq}$, but there’s also a convenient description:

A sequence of pairs $(a_n,b_n)$ in $A \times B$ converges to $(a_\infty, b_\infty)$ if and only if separately $a_n \to a_\infty$ in $A$ and $b_n \to b_\infty$ in $B$. ↩
If you want to understand how to compute (co)limits in $\mathsf{Seq}$ or $\mathsf{Kur}$, I highly recommend reading Section 3 of this paper. It’s really great, and has a lot of information!

Particularly helpful is the observation that $\mathsf{Kur}$ is Topologically Concrete. See my old blog post here, and especially Adámek, Herrlich, and Strecker’s The Joy of Cats. This book writes $\mathbf{Conv}$ where we write $\mathsf{Kur}$, and shows that it’s topologically concrete and concretely cartesian closed! This tells us very explicitly how we can compute (co)limits and exponentials in $\mathsf{Kur}$, on top of the great description in Section 3 of Menni and Simpson. ↩
Thanks to Morgan Rogers for letting me know that monics actually agree in all cases, which simplifies the exposition of this section quite a lot. ↩
Thanks to Graham Manuell for letting me know that there’s standard notation for this (which the post now uses) besides what’s shown in Johnstone’s “Stone Spaces”. ↩
First, say we have a proof $(\omega_n) \to \bot$. That is a subset $A$ of $\mathbb{N}$. Then to have $x \prec A$ means that $\lnot x \lor A = \top$, which is the proof $(\omega_n) \to \top$ indexed by $(\mathbb{N}, \text{max}_\mathbb{N})$, the ideal indexed by all infinite subsets of $\mathbb{N}$. Since $x \leq A$, it’s also just a subset of $\mathbb{N}$ (indeed, a subset of $A$), and its pseudocomplement $\lnot x$ is indexed by $(x^c, \text{max}_{x^c})$, the ideal of all infinite subsets of $x^c$. So asking $\lnot x \lor A = \top$ means asking for the closure of $(\mathbb{N},\text{max}_{x^c})$ to equal $(\mathbb{N}, \text{max}_\mathbb{N})$. Expanding this out via the axioms for one of these “closed ideals of subsets” means that every subset of $\mathbb{N}$ should have a further subset inside $\text{max}_{x^c}$. But this happens exactly when $x^c$ is cofinite! So asking for $A = \bigvee \{ x \prec A \}$ means asking for $A$ to be the union of the finite subsets of $A$, which is always true.

Now let’s look at the more complicated case of a proof $(\omega_n) \to \top$. We see $x \prec (E,I)$ if and only if $\lnot x \lor (E,I) = \top = (\mathbb{N}, \text{max}_\mathbb{N})$, so that if $x = (F,J)$ we must have $F \subseteq E$ and the closure of $I$ should be $\text{max}_\mathbb{N}$ so that $E$ must be cofinite! Since it’s not possible for a join of proofs $(\omega_n) \to \bot$ to equal a proof $(E,I)$ that $(\omega_n) \to \top$, this tells us that the only proofs with this property are those indexed by $(E, \text{max}_E)$ with $E$ cofinite! And, of course, it’s easy to see that $(E, \text{max}_E) \prec (E, \text{max}_E)$ when $E$ is cofinite, so that moreover every maximal cofinite ideal is possible. ↩
eg, Moerdijk and Reyes in their Smooth spaces versus continuous spaces in models for synthetic differential geometry ↩
You might ask if there’s a way to characterize the whole of $\mathsf{Seq}$ inside $\mathcal{T}$, rather than only the sequentially hausdorff spaces.

I suspect there’s a way to do this, by defining the sequentially open subsets of $X$ and demanding that every sequence that converges with respect to this topology already converges…

But this sounds rather complicated, and I’m really looking to get this blog post done, haha. Especially since I still have to convert it into a paper! ↩

Talk -- What is Factorization Homology?

Wed, 27 Mar 2024 00:00:00 +0000

I was recently invited to speak at the AMS Sectional in Tallahassee, Florida. In particular, at the special session on Homotopy Theory and Category Theory in Interaction. The conference was this weekend, and I’m typing this up on my plane ride home. I had a great time, and met a lot of great people! The special session was pretty small, so we all got to talk to each other, and I got surprisingly close with the people I went out for lunch (and later, drinks) with!

I was nervous at first, since Florida… isn’t politically friendly towards trans people. Thankfully everybody that I interacted with was lovely, and I didn’t have any issues at all. What’s more, the campus was beautiful, and the surrounding area was surprisingly walkable. We had a 4 hour lunch break on the first day (which is part of how I got so close with my lunch group), and after we ate some great barbecue we all hung out under some fantastic trees.

I also felt safer because I was hanging out with Kaya Arro, a postdoc at UCR, for basically the whole time. They were great company, and also helped a ton with silly homotopy questions while I was making my slides (on the day before the talk…).

Anyways here are some of the trees right outside the conference building¹!

It was like $400 cheaper to fly out on Monday evening, even though the conference ended on Sunday, so Kaya and I had almost a whole day to ourselves. We went to the FSU Museum of Fine Arts and walked around until it was airport time.

It was the perfect size museum for the amount of time we had, with some amazing installation pieces, and a whole section dedicated to book making! That was particularly exciting for me, since one of my best friends, Ashley Chan, is a super talented book maker!

These first few paragraphs were a bit bloggier than usual, but I had a great time and wanted to talk about it! Now that we’ve had a chance to catch up, though, let’s get to the math ^_^.

My talk was on Factorization Homology, and I split it up into three sections:

Why?
What?
How?

In the first section, I wanted to provide some motivation for the technology, and to explain why it deserves to be called “homology”.

The second section was all about what factorization homology is, at least in broad strokes. I left out a bunch of details about orientation and framings since the talk was only 20 minutes and I wanted to stay light on my feet. This was also the section I got to express myself the most, since I view factorization homology through the lens of categorical logic², and I was able to spend some time on that perspective.

The third section was on actually doing things with factorization homology. That is, on how we can really compute it, and how we can interpret what these computations mean. I wasn’t able to spend too much time on the interpretation, but I said a few words, and it was worth it for the extra time spent on a (sketchy version of a) simple example computation.

There were some great questions after the talk, including one from Philip Hackney (one of the organizers, and the person who invited me) which gave me the chance to talk about what factorization homology has to do with my research.

All in all, I think the talk went super well. I don’t think I came off as particularly manic, which can happen when I give short talks, and it seems like people were able to take some intuition away, which is everything I wanted! It helps that everyone at the conference was super nice, so I didn’t have to worry at all about being judged or grilled. I think people were in a particularly good mood, since I was the first talk after lunch, so everyone was full and happy, haha.

So then, in more detail, what all did we talk about? In what follows, I’ll always mean $\infty$-category, $\infty$-functor, etc. whenever I write “category”, “functor”, etc.

First, the why.

We recall that “classical” cohomology $H^n(X,A)$ can be seen as $\pi_0 \text{Map}(X, K(A,n))$. Unraveling these hieroglyphics says that the cohomology of $X$ with coefficients in $A$ can be computed as the connected components of the space of maps from $X$ to the eilenberg-mac lane space $K(A,n)$³. That is, as the set of maps $X \to K(A,n)$ up to homotopy.

This encourages a rather broad perspective, which says that we can think of $\pi_0 \text{Map}(X,Y)$ as the “cohomology of the space $X$ with coefficients in the space $Y$”. This is sometimes called nonabelian cohomology, and many cohomology theories can be subsumed by this one (basically because lots of algebraic and topological things can be seen as spaces! See here, for instance).

If we’re calling this thing cohomology, it’s natural to ask which “classical” theorems in cohomology are true in this setting. And one natural theorem to want is Poincaré Duality.

One motivation for factorization homology is that it’s the right homology theory to make this “Nonabelian Poincaré Duality” true!

In the talk I said some words about this, but online there’s no need for me to say anything. If you’re interested, you should just go read Lurie’s notes on the topic instead. After all, I basically just paraphrased them in the talk!

If you insist on sticking around here, though, the idea is this:

To prove the “classical” poincaré duality $H_i(X,A) \cong H^{n - i}_c(X,A)$, one can show

The functors $C_\bullet(-,A)$ and $C_c^\bullet(-,A)$ from the category of $n$-manifolds and smooth embeddings to the category of chain complexes are both cosheaves.
Just check that $C_\bullet(\mathbb{R}^n,A) \simeq C_c^\bullet(\mathbb{R}^n,A)$
Conclude that, since every manifold is a bunch of $\mathbb{R}^n$s in a trenchcoat, this equivalence is actually true for all manifolds.
Take homology of both sides to get the claim.

Basically, a cosheaf is something we can compute for a big thing by gluing together the computations for small things (you should have something like Mayer-Vietoris in mind). In particular, since every manifold is glued from copies of $\mathbb{R}^n$, a cosheaf is totally determined by what it does to $\mathbb{R}^n$! So once we check that $C_\bullet(-,A)$ and $C_c^\bullet(-,A)$ agree on $\mathbb{R}^n$, we get for free that they must agree for every manifold⁴!
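To make the gluing slogan a bit more concrete, here is a sketch of the simplest instance of the cosheaf condition. Say $M = U \cup V$ is covered by two open sets, and $F$ is a cosheaf valued in chain complexes (the names $F$, $U$, $V$ are just notation for this sketch). Then we should have

\[F(U \cup V) \simeq F(U) \coprod_{F(U \cap V)} F(V)\]

where the pushout on the right is a homotopy pushout. For $F = C_\bullet(-,A)$, taking homology of this square gives the usual Mayer-Vietoris long exact sequence. The full cosheaf condition asks for the analogous statement for more general open covers, not just covers by two sets.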

Now, if we wanted to cook up a homology theory making some kind of “nonabelian poincaré duality” true, we might try something like this:

Show $\text{Map}_c(-,Y) : \mathcal{M}\text{an} \to \mathcal{S}$ is a cosheaf
Define the “nonabelian homology” of $\mathbb{R}^n$ to be $\text{Map}_c(\mathbb{R}^n,Y)$
Use the cosheaf-ness to extend this definition to all manifolds by “doing what you have to do”.

This turns out to not quite work, but if we define “nonabelian homology” for all disjoint unions

\[\emptyset, \quad \mathbb{R}^n, \quad \mathbb{R}^n \coprod \mathbb{R}^n, \quad \mathbb{R}^n \coprod \mathbb{R}^n \coprod \mathbb{R}^n, \quad \ldots\]

we’re able to get the job done⁵!

So then, we’ve found ourselves with a functor defined on $\mathcal{D}_n$ – the category of disjoint unions of $\mathbb{R}^n$s with smooth embeddings. Then we freely extend this functor to the whole category of manifolds $\mathcal{M}\text{an}_n$ by defining the functor to be “compatible with gluing”. The keyword here is left kan extension.

If you’re me, this feels a whole lot like a familiar story from categorical logic!

This, of course, is all part of the what!

At this point in the talk I took a quick diversion to remind people about Functorial Semantics. This was extra easy since in the morning session Valentina Zapata Castro spent some time talking about the lawvere theory for monoids!

In (1-categorical) functorial semantics, we have classifying categories associated to algebraic theories. Then a model of that theory is “just” a functor from the classifying category to $\mathsf{Set}$. For instance, the classifying category $\mathcal{A}$ for abelian groups looks like this⁶:

and the classifying category for rings $\mathcal{R}$ is:

Indeed, a finite product functor $\mathcal{A} \to \mathsf{Set}$ has to send $1$ to some set $A$. Then, since it’s product preserving, it has to send $2$ to $A^2$, and $0$ to $A^0 = \{ \star \}$. Then the indicated maps get sent to honest operations on $A$, which give us the desired abelian group structure on $A$.

Note that we have an embedding $j : \mathcal{A} \hookrightarrow \mathcal{R}$.

Now any ring $R$ is a (finite product) functor $\mathcal{R} \to \mathsf{Set}$, and if we restrict along $j$ we get a new finite product functor $\mathcal{A} \to \mathsf{Set}$. That is, we get an abelian group! It should be believable that this is the underlying abelian group of the ring we started with!
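If it helps to see this in code, here is a minimal Python sketch of "restricting along $j$". It only records where a functor sends the generating maps of each theory, and it does not check any axioms. The class names, `restrict_along_j`, and the example ring $\mathbb{Z}/6$ are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Generic, TypeVar

T = TypeVar("T")


@dataclass(frozen=True)
class AbGroupModel(Generic[T]):
    """Where a functor out of the abelian group theory sends the generating maps."""
    zero: T                    # image of the map 0 -> 1
    add: Callable[[T, T], T]   # image of the map 2 -> 1
    neg: Callable[[T], T]      # image of the map 1 -> 1


@dataclass(frozen=True)
class RingModel(Generic[T]):
    """Same idea for the theory of rings, which has extra generating maps."""
    zero: T
    add: Callable[[T, T], T]
    neg: Callable[[T], T]
    one: T
    mul: Callable[[T, T], T]


def restrict_along_j(ring: RingModel[T]) -> AbGroupModel[T]:
    """Keep only the operations that come from the abelian group theory."""
    return AbGroupModel(zero=ring.zero, add=ring.add, neg=ring.neg)


# Hypothetical example model: the ring Z/6.
Z6 = RingModel(
    zero=0,
    add=lambda a, b: (a + b) % 6,
    neg=lambda a: (-a) % 6,
    one=1,
    mul=lambda a, b: (a * b) % 6,
)

underlying_group = restrict_along_j(Z6)
```

The restricted model has forgotten `one` and `mul` entirely, which is exactly what "underlying abelian group" should mean.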

More excitingly, can we go the other way? Given a functor $\mathcal{A} \to \mathsf{Set}$ (that is, an abelian group) is there a way to freely extend this functor to one defined on all of $\mathcal{R}$?

The answer, of course, is yes and it’s given by left kan extension! This recovers the usual free ring on an abelian group $A$.

Primed with this context, let’s look at our definition of factorization homology again:

We have a (symmetric monoidal) functor $\mathcal{D}_n \to \mathcal{C}$. This data is actually quite well studied, since it’s a model for $E_n$-algebras in $\mathcal{C}$. These are monoids in $\mathcal{C}$ that are “commutative up-to-homotopy living in dimension $n$”⁷.

So we have something that looks like some kind of algebra, which is given by a functor out of $\mathcal{D}_n$. Then we want to freely extend this data to a new kind of algebra, given by a functor out of $\mathcal{M}\text{an}_n$…

Doesn’t this setup sound familiar?

If $A : \mathcal{D}_n \to \mathcal{C}$ is our $E_n$-algebra, we define $\int_{(-)} A : \mathcal{M}\text{an}_n \to \mathcal{C}$, the Factorization Homology, to be the left kan extension of $A$ along the inclusion $j : \mathcal{D}_n \hookrightarrow \mathcal{M}\text{an}_n$. We can think of this as a kind of “algebra” which is graded by connected manifolds. Moreover, we have operations between these graded pieces corresponding to smooth embeddings!
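Unwinding the kan extension with the usual pointwise colimit formula gives a more hands-on description. This is only a sketch: it assumes the relevant colimits exist in $\mathcal{C}$, and it writes $\mathcal{D}_{n/M}$ for the category of disjoint unions of $\mathbb{R}^n$s equipped with an embedding into $M$.

\[\int_M A \simeq \underset{(U \hookrightarrow M) \in \mathcal{D}_{n/M}}{\text{colim}} A(U)\]

In words, $\int_M A$ is built by gluing together one copy of $A(U)$ for each way of embedding a bunch of disks $U$ into $M$.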

For instance, if $i : M_1 \coprod M_2 \to N$ is a smooth embedding, then we get an operation $\int_{M_1} A \otimes \int_{M_2} A \to \int_N A$.

In particular, for a Collared Manifold (that is, a manifold of the form $M = M_0 \times \mathbb{R}$) we get an algebra structure $\int_M A \otimes \int_M A \to \int_M A$ from the embedding

Similarly, if $M$ embeds into $N$, then we get a $\int_M A$-module structure on $\int_N A$:

This brings us to the most important part of the How:

We can actually compute factorization homology by using “collared excision”:

If you want to read more about all of this, I highly recommend the relevant section in Juliet Cooke’s Thesis. If you want to really understand it well, there’s also Ayala and Francis’s Factorization Homology Primer and Hiro Lee Tanaka’s excellent lectures on the subject, which are available on youtube.

Thanks again for reading, all! I’m really pleased with how this talk went, and I’m happy to have made as many friends as I did at the conference.

It was great to be invited to speak in person somewhere for the first time, and to feel super welcomed in the homotopy theory world (which I’m only tangentially a part of… at least for now).

Stay safe, all, and we’ll talk soon!

What is Factorization Homology?

Ayala, Francis, and Rozenblyum introduced Factorization Homology in order to compare category theory and (topological quantum) field theory. It has since grown into a useful invariant of surfaces and algebras which, through excision, is often computable in practice. In this survey talk we will discuss the basics of factorization homology, emphasizing its connections to both homotopy theory and category theory (via infinity categories). Given time, we will also discuss some applications to the speaker’s ongoing dissertation work in skein theory.

The talk wasn’t recorded, but you can find the slides here.

The hilariously named “HCB” (Huge Classroom Building). ↩
Though honestly, I view most things through the lens of either categories or logic. So this should come as no surprise, haha. ↩
If you haven’t seen this before, it’s a super useful perspective to have on hand! For example, $\mathbb{C}P^\infty$ is a model for both $K(\mathbb{Z},2)$ and $BU(1)$.

Now (homotopy classes of) maps $X \to K(\mathbb{Z},2)$ are the same thing as elements of $H^2(X,\mathbb{Z})$ by the fact from the main post.

But homotopy classes of maps $X \to BU(1)$ are the same thing as complex line bundles on $X$!

So, since these spaces are the same, we learn that complex line bundles on $X$ are classified by $H^2(X,\mathbb{Z})$!

You can read more about this here, for instance. ↩
This is one killer example of why people care about this kind of abstract nonsense. Not only do we need to know stuff about (co)sheaves, we also have to care about $\infty$-categories!

This is basically because $\infty$-categorical things tend to glue more nicely than $1$-categorical things. This, in turn, is related to the idea that “Chain Complex: Good. Homology: Bad”, as talked about in many places (for instance, here and here). ↩
This is basically because checking the cosheaf condition requires some version of Mayer-Vietoris. But the “usual” version of Mayer-Vietoris crucially uses the ability to add functions (since $A$ is an abelian group). Now that $Y$ is any old space, we lose addition.

But by working with disjoint unions of $\mathbb{R}^n$s, we can get a notion of “addition” back by taking the disjoint union of the functions!

You can read more about precisely what’s going on in Lurie’s notes. ↩
Precisely, the classifying category for the models of any algebraic theory is always the opposite of the category of the finitely generated free models.

So, for instance, $\mathcal{A}^\text{op}$ is the full subcategory of abelian groups spanned by
\[\{1, \mathbb{Z}, \mathbb{Z}^2, \mathbb{Z}^3, \ldots \}\]
and $\mathcal{R}^\text{op}$ is the full subcategory of (let’s say commutative) rings spanned by
\[\{1, \mathbb{Z}[x], \mathbb{Z}[x,y], \mathbb{Z}[x,y,z], \ldots \}\]
There’s a similar version of this story that works for essentially algebraic theories where we use the finitely presented models instead.

We’ve talked a bit about this on the blog before, and I have plans to talk quite a bit about it sometime in the future. ↩
I’m not going to tell you what that means, but here’s an illustrative example:

An $E_1$-algebra in $\mathfrak{Cat}$ is a monoidal category.

An $E_2$-algebra in $\mathfrak{Cat}$ is a braided monoidal category. That is, the multiplicative structure is commutative… but only up to a homotopy (the braiding).

An $E_3$-algebra in $\mathfrak{Cat}$ is a symmetric monoidal category. That is, a category whose multiplicative structure is genuinely commutative (at least, up to isomorphism, which is the best we ever want to do in a category).

All $E_n$-algebras for $n \geq 3$ are the same, since $\mathfrak{Cat}$ is a $2$-category. There aren’t any interesting (read: nonidentity) homotopies in dimension $\geq 3$, so if you’re commutative up to a higher homotopy, the only choice for that higher homotopy is the identity! Which means you’re just commutative without further qualification!

But you can imagine that, for $\infty$-categories that do have higher homotopies, we can have monoids that are commutative up to the data of an $n$-homotopy, and that’s exactly an $E_n$-algebra. In case you want something that really is commutative, you can take $n=\infty$ here. ↩

Proving Another "Real Theorem" with Topos Theory

Mon, 25 Mar 2024 00:00:00 +0000

Another day, another post that starts with “So I was on mse…”, lol. Somebody asked whether maximizing over a compact set is a continuous thing to do. That is, given a continuous function $f : K \times X \to \mathbb{R}$ is the function $x \mapsto \max_{k \in K} f(k,x)$ continuous?

If you’re me, this looks an awful lot like the usual extreme value theorem, but where “everything in sight depends continuously on a parameter from $X$”. Indeed, say we had a family of compact sets $K_x$ depending on a parameter from $X$, and a family of functions $f_x : K_x \to \mathbb{R}$ which is continuous both in the choice of $x$ and in the choice of $k \in K_x$. Then the usual extreme value theorem tells us that we have pointwise maxima $m_x = \max_{k \in K_x} f_x(k)$, and it’s natural to ask whether these maxima also vary continuously in the parameter $x$.

Depending on your experience with bundles, you might be aware that the usual way to encode a “family of sets $A_x$ continuously depending on $x \in X$” is with a single space $A$ (usually called the total space) equipped with a map $\pi : A \to X$. Then the fibres of $\pi$ give us the desired family of sets $A_x = \pi^{-1}(x)$¹. As a super concrete example, say we want to represent the constant family $A_x = \mathbb{R}$. Then we should use the trivial bundle $\mathbb{R} \times X$ with the obvious projection map $\pi : \mathbb{R} \times X \to X$².

I bet you weren’t expecting a cute exercise this early in the post!

Can you show that a family of functions $f_x : A_x \to B_x$ over $X$ is the same thing as a single function $f : A \to B$ making the following diagram commute:

Recall we’re thinking of the sets $A_x$ and $B_x$ as being the fibres $\pi_A^{-1}(x)$ and $\pi_B^{-1}(x)$.

To go from the family $f_x$ to a single function $f$ on the total spaces, it might be worth remembering that the total space $A$ is just the disjoint union of the fibres³ $\sum_{x \in X} A_x$ (suitably topologized).

So the constant family $K_x = K$ of compact sets is represented by the trivial bundle $K \times X$, and (by the above exercise) a family of functions $f_x : K_x \to \mathbb{R}$ is represented by a function $f : K \times X \to \mathbb{R} \times X$ with $\pi_2 (f(k,x)) = x$. Since we know what the second component must be, this is the same data as a function $K \times X \to \mathbb{R}$! This tells us that, if we can prove some “$X$-parameterized” version of the extreme value theorem (where the resulting maxima also depend continuously on a parameter $x \in X$), we can answer the OP’s question by looking at the special case where $K_x$ is a constant family!
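Here is a small Python sketch of that last step, in case it’s helpful. It converts back and forth between a plain function $K \times X \to \mathbb{R}$ and a map of total spaces $K \times X \to \mathbb{R} \times X$ over $X$. The function names and the example `f` are hypothetical, and nothing here knows about topology, so it only illustrates the bookkeeping.

```python
from typing import Callable, Tuple

Point = Tuple[float, float]  # a point (k, x) of the total space K x X


def to_bundle_map(f: Callable[[float, float], float]) -> Callable[[Point], Point]:
    """Turn f : K x X -> R into a map K x X -> R x X that keeps the parameter x."""
    def bundle_map(point: Point) -> Point:
        k, x = point
        return (f(k, x), x)
    return bundle_map


def from_bundle_map(F: Callable[[Point], Point]) -> Callable[[float, float], float]:
    """Forget the (already known) second component to recover f."""
    def f(k: float, x: float) -> float:
        value, _ = F((k, x))
        return value
    return f


def pi(point: Point) -> float:
    """Projection of either total space onto the base X."""
    return point[1]


# Hypothetical example family f_x(k) = k * x + 1.
def example_f(k: float, x: float) -> float:
    return k * x + 1.0


example_F = to_bundle_map(example_f)
```

The triangle commutes because `pi(example_F((k, x)))` is always `x`, and `from_bundle_map(example_F)` gives back a function agreeing with `example_f`.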

If you’ve been exposed to some ideas in topos theory, you should be screaming something to do with the “internal logic” right now! Indeed, a theorem inside the topos of sheaves on $X$ externalizes to give a version of that theorem where everything is automatically continuous in a parameter from $X$. So if we have access to the extreme value theorem inside $\mathsf{Sh}(X)$ (that is, if we have a constructive proof of the extreme value theorem) we would immediately get the parameterized version for free!

So then, we want to get our hands on a constructive proof of the extreme value theorem. Typically constructive theorems which assert the existence of something (in this case, a maximum) require some ~bonus assumptions~ which are invisible in the classical case. This is because existence is a much stronger phenomenon constructively⁴.

To start, we probably want to be working with locales instead of topological spaces. Since the person on mse was happy with $K$ and $X$ metrizable, they’ll certainly be ok with sobriety since hausdorff-ness already implies it! Since sober spaces and locales (with enough points) are the same thing, we lose nothing by moving to the world of locales. There’s a good notion of compactness for locales, and people have spent a lot of time proving classical theorems constructively in this setting! So, optimistically, we can google "extreme value theorem"+"locale" to see if someone has already done what we’re looking for, and our optimism is quickly rewarded! We find Graham Manuell’s slides from TACL 2022 which give a constructive proof of the following:

If $K$ is a compact, overt, positive locale, and $f : K \to \mathbb{R}$ is continuous, then $f$ has a maximum $\max_K f$ given by a dedekind real.

Let’s say a few words about what all these weird adjectives mean!

First, let’s handle the easiest one: Compact.

The most familiar definition of compactness is that every open cover has a finite subcover. Precisely, if $\mathcal{U}$ is a family of open subsets of $X$ with $X = \bigvee \mathcal{U}$, then there should be a (kuratowski-)finite subfamily $\mathcal{U}_0 \subseteq \mathcal{U}$ so that actually $X = \bigvee \mathcal{U}_0$.

However, constructively it’s better to avoid saying the word “finite” if you can. The notion of finiteness splits into different inequivalent notions (kuratowski-finite, bishop-finite, dedekind-finite, etc) which makes the whole thing a bit of a mess⁵. For this reason, there’s a different, equivalent definition of compactness which we often prefer:

We say $X$ is compact if for every directed open cover $\mathcal{U}$⁶ with $X = \bigvee \mathcal{U}$, we must actually have $X \in \mathcal{U}$!

As a cute exercise, prove these two definitions are equivalent! Try your hardest to make sure your proof is constructive!
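If you want a hint for one direction, here is a toy Python sketch (it’s not a proof, and it only works with finite families of finite sets, where compactness is automatic anyways). It closes a cover under finite unions to get a directed cover, and remembers which finite subfamily produced each union. The function names and the example cover are hypothetical.

```python
from itertools import combinations


def directed_closure(cover):
    """Map each union of a finite subfamily of `cover` to a subfamily producing it.

    The result is directed: it contains the empty union (so it is inhabited)
    and the union of any two members is again a member.
    """
    members = list(cover)
    closure = {}
    for size in range(len(members) + 1):
        for subfamily in combinations(members, size):
            union = frozenset().union(*subfamily)
            # Keep the first (hence smallest) subfamily found for each union.
            closure.setdefault(union, subfamily)
    return closure


def finite_subcover(space, cover):
    """If the whole space lies in the directed closure, return a witnessing subfamily."""
    return directed_closure(cover).get(frozenset(space))


# Hypothetical example: a cover of a three point space.
SPACE = {1, 2, 3}
COVER = [frozenset({1}), frozenset({1, 2}), frozenset({3}), frozenset({2, 3})]

witness = finite_subcover(SPACE, COVER)
```

The point is that "the whole space is a member of the directed closure" unpacks to "some finite subfamily of the original cover already covers".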

There’s one last angle on compactness that I want to emphasize here as well: We say a locale is compact if universal quantification is open. That is, if $U \subseteq K \times X$ is open, we want to know if $\{ x \mid \forall k . (k,x) \in U \} \subseteq X$ is open. This turns out to be true (for every $X$ and every $U$) if and only if $K$ is compact⁷!

The next adjective on the list is Overt. This is a kind of tricky notion to understand at first, since it’s invisible classically. We say a locale $O$ is overt iff for any locale $X$ and open subset $U \subseteq O \times X$, the projection $\{ x \mid \exists o \in O . (o,x) \in U \} \subseteq X$ is open. Note how this is dual to the universal quantifier present for compact sets. Also note that this set is exactly the image of $U$ under $\pi_X : O \times X \to X$. Since, classically, this projection map is always open, we learn that classically every locale is overt⁸.

There’s an equivalent characterization of overtness, which is often useful in practice: a locale $O$ is overt exactly when the unique map $O \overset{!}{\to} 1$ is open. Since open maps are preserved by pullbacks, this gives the original definition for free. It also shows how this definition is classically invisible, since every subset of the terminal locale is open when the terminal locale is $\{\star\}$.

However, recall that in a sheaf topos $\text{Sh}(X)$ the terminal locale is $X$! Then a locale in $\text{Sh}(X)$ is just an external locale with a map into $X$, and overtness of the internal locale corresponds to openness of the external map! So there’s obviously no reason to expect every locale to be overt constructively, since it’s easy to find continuous functions that aren’t open.

Lastly, we say that a locale is Positive iff every open cover must be inhabited. That is, if $X = \bigvee \mathcal{U}$ for $\mathcal{U}$ a family of open sets, the family $\mathcal{U}$ must be inhabited.

Recall we say $A$ is inhabited exactly when $\exists a \in A$ is true. In $\text{Sh}(X)$, this means that there’s an open cover of $X$ so that $A$ has a section on every element of the cover. That is, exactly when the structure map $A \to X$ is surjective⁹.

Now, let’s outline the proof from Manuell’s slides¹⁰. Recall a Dedekind Real is a pair of cuts $(L,U)$ where

$L \subseteq \mathbb{Q}$ is a lower cut of rational numbers in the sense that
1. $\exists p \in L$ ($L$ is inhabited)
2. if $p \lt q$ and $q \in L$, then $p \in L$ too ($L$ is downwards closed)
3. if $p \in L$, then $\exists q \in L . q \gt p$ ($L$ is upwards open)
$U \subseteq \mathbb{Q}$ is an upper cut of rational numbers in the sense that
1. $\exists p \in U$ ($U$ is inhabited)
2. if $p \gt q$ and $q \in U$, then $p \in U$ too ($U$ is upwards closed)
3. if $p \in U$, then $\exists q \in U . q \lt p$ ($U$ is downwards open)
$L$ and $U$ are compatible in the sense that
1. If $p \in L$ and $q \in U$, then $p \lt q$ ($L$ and $U$ don’t overlap)
2. If $p \lt q$ are any rationals, then $p \in L \lor q \in U$ ($L$ and $U$ have no gap between them)

If you haven’t seen this before, we think of $r = (L,U)$ as pinning down a real number by saying how it compares to every rational. If $q \in L$, we say $q \lt r$, and if $q \in U$ we say $q \gt r$. See the nlab page for more. Recall also¹¹ that a real number in $\text{Sh}(X)$ is a continuous function $X \to \mathbb{R}$. The lower/upper cuts are also useful separately, and externalize to lower/upper semicontinuous functions on $X$.
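As a quick sanity check on the definition, here is a Python sketch of the cuts for $\sqrt{2}$, using exact rational arithmetic. The function names are hypothetical, and this only spot checks the axioms on a few rationals rather than proving them.

```python
from fractions import Fraction


def in_lower(q: Fraction) -> bool:
    """q is in L, which we read as q < sqrt(2)."""
    return q < 0 or q * q < 2


def in_upper(q: Fraction) -> bool:
    """q is in U, which we read as q > sqrt(2)."""
    return q > 0 and q * q > 2


def located(p: Fraction, q: Fraction) -> bool:
    """The 'no gap' axiom: for rationals p < q, either p is in L or q is in U."""
    assert p < q
    return in_lower(p) or in_upper(q)


# Spot check the compatibility axioms on a grid of rationals with denominator 10.
SAMPLES = [Fraction(a, 10) for a in range(-30, 31)]
no_overlap = all(not (in_lower(q) and in_upper(q)) for q in SAMPLES)
no_gap = all(located(p, q) for p in SAMPLES for q in SAMPLES if p < q)
```

Notice that neither predicate ever mentions $\sqrt{2}$ itself. The real number is pinned down entirely by how rationals compare to it.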

Now, finally, for the proof idea:

$\ulcorner$ Say $f : K \to \mathbb{R}$ is a map from a positive, overt, compact locale $K$. First we’ll use positive overtness to build

\[L = \{ q \in \mathbb{Q} \mid \exists k \in K . q \lt f(k) \}.\]

Then, we’ll use positive compactness to build

\[U = \{ q \in \mathbb{Q} \mid \forall k \in K . f(k) \lt q \}.\]

Lastly, we need to show that the $L$ and $U$ we just built are compatible. It’s easy to show they don’t overlap, but showing they “have no gap” requires a small argument (which you can find on Manuell’s slide 8). $\lrcorner$
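To see the shape of these two cuts, here is a toy Python sketch where $K$ is replaced by a finite list of points, so that $\exists$ and $\forall$ become `any` and `all`. The list `K` and the function `f` are hypothetical stand-ins, and a finite list obviously hides everything interesting about overtness and compactness.

```python
from fractions import Fraction

# Hypothetical finite stand-in for the locale K.
K = [Fraction(0), Fraction(1, 2), Fraction(1)]


def f(k: Fraction) -> Fraction:
    """Hypothetical function to maximize; its max over K is 1/4, at k = 1/2."""
    return k * (1 - k)


def in_L(q: Fraction) -> bool:
    """q is in the lower cut: some point of K has value above q."""
    return any(q < f(k) for k in K)


def in_U(q: Fraction) -> bool:
    """q is in the upper cut: every point of K has value below q."""
    return all(f(k) < q for k in K)
```

Here `in_L` holds exactly for $q \lt \frac{1}{4}$ and `in_U` holds exactly for $q \gt \frac{1}{4}$, so the pair of cuts is the maximum. It’s also fun to see what goes wrong if `K` is the empty list: `in_L` is never true and `in_U` is always true, so we don’t get a dedekind real, which is one way to see why some positivity assumption is needed.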

Note how (after externalizing) this is really the same argument that Sangchul Lee gave in the accepted answer on mse!

First we show that $g(x) = \max_{k \in K} f(k,x)$ is lower semicontinuous (that is, we show that it’s a lower real). This doesn’t use any facts about $K$, since it’s only using positive overtness of $K \times X$, which is always true!

Then we show that $g(x)$ is upper semicontinuous (that is, we show it’s an upper real). This is where we crucially use compactness, both externally and internally.

The compatibility conditions basically amount to checking your upper and lower semicontinuous functions are the same, but since Sangchul’s answer is working with a single function the whole time there’s no need to verify that.

So now we have this constructive theorem. What does it actually tell us after we externalize?

Well, you can chase through the definitions of compact, overt, and positive inside (say) a sheaf topos $\text{Sh}(X)$ and see what you get!

First, as we alluded to in the introduction to this post, a locale $K$ internal to $\text{Sh}(X)$ is exactly a locale map¹² $\pi : \Gamma(K) \to X$ where $\Gamma$ is the usual global sections map.

From here, it’s not (too) hard to see that $K$ is compact if and only if $\pi$ is proper, that $K$ is overt if and only if $\pi$ is open, and that $K$ is positive if and only if $\pi$ is surjective¹³.

So altogether we learn that

If $\pi : A \to X$ is a proper, open, surjection and we have $f : A \to \mathbb{R}$, then

\[x \mapsto \max_{k \in \pi^{-1}(x)} f(k)\]

is continuous.

In particular, we answer the OP’s question! Taking $\pi : K \times X \to X$ to be the usual projection map (which is open and surjective, plus proper since $K$ is compact) we learn that

\[x \mapsto \max_{(k,x) \in \pi^{-1}(x)} f(k,x)\]

is continuous. As desired!
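If you’d like to watch this happen numerically, here is a rough Python sketch. It approximates the compact set $K = [0,1]$ by a finite grid and computes $g(x) = \max_k f(k,x)$ for one hypothetical choice of $f$. The grid, the function `f`, and the names are all assumptions made for illustration, and of course a numerical experiment is not a proof of continuity.

```python
def f(k: float, x: float) -> float:
    """Hypothetical continuous function on K x X with K = [0, 1] and X = R."""
    return -(k - x) ** 2


# A finite grid standing in for the compact set K = [0, 1].
K_GRID = [i / 1000 for i in range(1001)]


def g(x: float) -> float:
    """Approximate the parameterized maximum x |-> max over k in K of f(k, x)."""
    return max(f(k, x) for k in K_GRID)


# Nearby parameters give nearby maxima, even across the kink at x = 1.
jump_near_one = abs(g(1.0) - g(1.001))
```

For this $f$ the true maximum is $0$ when $x \in [0,1]$ and $-(x-1)^2$ when $x \gt 1$, so $g$ has a change of behavior at $x = 1$ but no jump.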

Ok! Thanks for reading, all! This felt like it took forever for what a short post it was, but I had a great time writing it. I’m flying home from the AMS Sectional today, where I gave a talk at the special session on Homotopy Theory and Category Theory in Interaction. I had a great time, and met a lot of really awesome people. It was a small group, which means we had lots of time to hang out and get to know each other.

I’ll write up some stuff about the conference (and my talk) soon, but for now I need to get ready to go to the airport! Stay safe ^_^.

I really don’t like Goldblatt’s Topoi as a book, but his section on the relationship between bundles over $X$ and sets continuously “indexed” by $X$ is super good. It’s chapter 4 section 5, and it has some really helpful pictures and examples! ↩
Note that it’s possible to have constant fibres, without being the trivial bundle! In this case, even though each fibre is the same as every other, we glue them together in an interesting way! To be the trivial bundle, you need to know the fibres are all the same, and that we glued them together in the most naive way possible. For instance, the trivial $\mathbb{R}$-bundle over the circle $S^1$ looks like a cylinder, since we glue the copies of $\mathbb{R}$ together in the simplest possible way. However, we could slowly “twist” our copies of $\mathbb{R}$ as we move around the circle as we glue them together to get a moebius strip! This is still a continuously varying family of copies of $\mathbb{R}$, but now the bundle is nontrivial! Here’s a picture from wikipedia:

As a cute exercise, can you come up with two bundles over $S^1$ where each fibre has two elements? One should be a trivial bundle, and the other shouldn’t be. ↩
Any type theorists in the room are probably screaming Dependent Sum right around now! That’s extremely fair, and I almost put something in the main post about this, but I ended up editing it out. ↩
As witnessed (pun intended) by the fact that, interpreted in suitable topoi, a constructive existence proof gives rise to free theorems saying there’s an algorithm that produces the desired object, or that the desired object can be defined continuously in a parameter, etc. This is exactly the phenomenon we’re trying to exploit, and we have to do work somewhere! ↩
I still haven’t taken the time to really familiarize myself with the various notions of finiteness, and how they “feel”. One day soon I want to, though. ↩
Recall a partial order $(D,\leq)$ is called directed if it’s inhabited and for any $x,y \in D$ ~~their join $x \lor y$ is in $D$ too~~ Edit: There’s a mutual upper bound $x,y \leq z$ also in $D$. Thanks to Tom de Jong for this correction!

This is really hiding kuratowski-finiteness again, since directedness guarantees that for any kuratowski-finite subset $F \subseteq D$ that $\bigvee F \in D$. This is the usual argument that binary joins imply finite (nonempty) joins. ↩
I was curious about this, and Pedro pointed out that one direction is basically the classical tube lemma. I don’t actually see how to do the other direction (at least quickly) and I don’t really have time to think about it right now. If someone figures it out I’d love to hear about it in the comments! ↩
Constructively it’s still true that every locale with enough points is automatically overt (see the nlab). It’s a very mild condition, see the discussion here, for instance, or Paul Taylor’s Overt Subspaces of $\mathbb{R}^n$. ↩
Edit: July 7, 2024:

Graham and I talked about this on the CT Zulip back in March, since it’s not 100% obvious how this works.

In Graham’s notes, theorem 3.15 is a constructive proof that $A$ is a positive locale if and only if the map $! : A \to 1$ is epic in the category of locales.

So this tells us that, $A$ is a positive locale internal to $\mathsf{Sh}(X)$ iff, in the internal category of locales in $\mathsf{Sh}(X)$, the map $A \to 1$ is epic.

We would love to use the “well known” equivalence between the category of internal locales in $\mathsf{Sh}(X)$ and the category of external locales over $X$ here to say that $A \to 1$ is epic iff externally the structure map $A \to X$ is epic, as claimed… But it’s actually slightly more subtle than that! After all, Graham’s notes prove a claim about the internal category of locales internal to $\mathsf{Sh}(X)$, and I don’t see an obvious way to relate this to the external category of locales internal to $\mathsf{Sh}(X)$. Morally the external category should be a kind of “global section” of the internal category, but I’m not sure how to make this precise… I think it’s something stacky.

That said, this particular situation is simple enough that we don’t need to worry about such things! In his notes, Graham actually proves that $!^* : \mathcal{O}(1) \to \mathcal{O}(A)$ is injective as a frame hom. Injectivity in the internal logic gives us injectivity on global sections, but we know the global sections of $\mathcal{O}(1)$ are just $\mathcal{O}(X)$ externally, and global sections of $\mathcal{O}(A)$ are just $\mathcal{O}(\Gamma(A))$ externally! So we get injectivity externally, thus the external map $\Gamma(A) \to X$ is epic, as desired.

There’s almost certainly a cleaner approach using Caramello and Zanfa’s recent machinery about relative topoi via stacks (see here), but I haven’t had time to learn any of these results. ↩
He, in turn, mentions it was based on the treatment in Paul Taylor’s A Lambda-Calculus for Real Analysis. I actually have this paper saved, but it’s long and I wasn’t sure how easy it would be to translate Taylor’s results into language I’m more familiar with. Now that I’ve seen Manuell do it, though, I have plans to read this paper pretty soon! ↩
I’ve always gotten slightly annoyed, or at least laughed quietly to myself, when authors ask the reader to “recall” some fact that they quite possibly don’t know.

I really want this post to be done, though (I’ve been working on it for almost a month) and if I have to explain why upper/lower reals correspond to upper/lower semicontinuous functions I’ll never finish… I know, I tried.

You can find an extremely detailed treatment in Mac Lane and Moerdijk’s Sheaves in Geometry and Logic Chapter VI.8. Understanding this well is probably enough to work out that upper/lower reals correspond to upper/lower semicontinuous functions yourself (which will make a great exercise!). Depending on your experience, you might be helped by my old post on externalizing formulas inside a topos.

There’s also a great treatment in Johnstone’s Sketches of an Elephant, Chapter D4.7. This works out the case of lower reals and lower semicontinuous functions explicitly, but does so quite quickly. ↩
In fact we can say exactly which map it is! We want a locale map from $\Gamma(K) \to X$, which means we want a frame homomorphism from $\mathcal{O}(X) \to \mathcal{O}(\Gamma(K)) = \Gamma(\mathcal{O}(K))$.

But given an open set $U \subseteq X$, we can look at the local sections $\Gamma(\mathcal{O}(K),U)$. This is again a frame, and the restriction map $\rho^X_U : \Gamma(\mathcal{O}(K)) \to \Gamma(\mathcal{O}(K),U)$ has a left adjoint $\sigma_U : \Gamma(\mathcal{O}(K),U) \to \Gamma(\mathcal{O}(K))$.

The desired map from $\mathcal{O}(X) \to \mathcal{O}(\Gamma(K))$ sends $U \subseteq X$ to $\sigma_U(\top)$, where $\top$ is the top element of $\Gamma(\mathcal{O}(K),U)$. ↩
I feel kind of bad not proving these facts, since they’re not so hard? But I really am trying to finish this post quickly.

You can find a lot of this in Johnstone’s Elephant. In particular,
- Getting a locale map $\pi : \Gamma(K) \to X$ from an internal locale is proposition chapter C1.6.2
- Compactness of $K$ agrees with properness of $\pi$ is theorem C3.2.8
- Overtness of $K$ agrees with openness of $\pi$ is lemma C3.1.17
- We’ve already talked about why positivity means $\pi$ is surjective
↩

Internal Group Actions as Enriched Functors

Sun, 18 Feb 2024 00:00:00 +0000

Earlier ~~today~~ this month on the Category Theory Zulip, Bernd Losert asked an extremely natural question about how we might study topological group actions via the functorial approach beloved by category theorists. The usual story is to treat a group $G$ as a one-object category $\mathsf{B}G$. Then an action $G \curvearrowright X$ is the same data as a functor $\mathsf{B}G \to \mathsf{Set}$ sending the unique object of $\mathsf{B}G$ to $X$. Is there some version of this story that works for topological groups and continuous group actions?

I wouldn’t be writing this post if the answer were “no”, so let’s get into it! This is a great case study in the ideas behind both internalization and enrichment, and I think it’ll make a great learning tool for future mathematicians wondering why you might care about such things.

Hopefully people find this helpful ^_^.

(Also, I’m especially sorry for the wait on this one, since I know at least one person has been waiting on it for two weeks now! Life got busy, but I’m excited to finally get this posted.)

First, let’s take a second to talk about Internalization.

The idea here is to take a construction that’s usually defined for sets, and interpret it inside some other category. For instance, a group $G$ is usually

a set $G$
a function $m : G \times G \to G$ (the multiplication)
a function $i : G \to G$ (the inversion)
a function $e : 1 \to G$ (the unit)

satisfying some axioms.

We can internalize this definition into the category $\mathsf{Top}$ of topological spaces by looking at

a topological space $G$
a continuous function $m : G \times G \to G$
a continuous function $i : G \to G$
a continuous function $e : 1 \to G$

satisfying the usual axioms. This recovers the usual definition of a topological group.

Similarly we could ask for a manifold $G$ with smooth maps $m,i,e$, and this would recover the definition of a lie group. At the most general, we ask for a Group Object Internal to $\mathcal{C}$. This is the data of:

a $\mathcal{C}$-object $G$
a $\mathcal{C}$-arrow $m : G \times G \to G$
a $\mathcal{C}$-arrow $i : G \to G$
a $\mathcal{C}$-arrow $e : 1 \to G$

satisfying the usual axioms¹.
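To make "axioms as equalities between composites" concrete, here is a minimal sketch in $\mathsf{Set}$, assuming we take $G = \mathbb{Z}/5$ under addition. The names `m`, `i`, `e` mirror the arrows above, and everything else (`diagonal`, `axioms_hold`, the choice of $5$) is made up for illustration.

```python
from itertools import product

N = 5
G = range(N)  # underlying set of Z/5 (an illustrative choice)


def m(pair):
    """Multiplication G x G -> G (here: addition mod N)."""
    g, h = pair
    return (g + h) % N


def i(g):
    """Inversion G -> G."""
    return (-g) % N


def e():
    """Unit 1 -> G, picking out the identity element."""
    return 0


def diagonal(g):
    """The diagonal map G -> G x G."""
    return (g, g)


def inverse_law_lhs(g):
    """The composite m . (1_G x i) . diagonal."""
    a, b = diagonal(g)
    return m((a, i(b)))


def inverse_law_rhs(g):
    """The composite e . !, which forgets g and returns the unit."""
    return e()


def axioms_hold():
    """Check associativity, unit, and inverse laws pointwise."""
    assoc = all(
        m((m((a, b)), c)) == m((a, m((b, c))))
        for a, b, c in product(G, repeat=3)
    )
    unit = all(m((e(), g)) == g == m((g, e())) for g in G)
    inverse = all(inverse_law_lhs(g) == inverse_law_rhs(g) for g in G)
    return assoc and unit and inverse
```

Checking the laws elementwise only makes sense because we're in $\mathsf{Set}$. In a general $\mathcal{C}$ the same laws are stated as equalities of arrows, with no elements in sight.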

As a quick aside, notice the crucial use of the terminal object $1$ and the product $G \times G$ in the above definition. This tells us that we can only define groups internal to a category with finite products².

Now just like we can internalize a group object, we can also internalize a group action! If $G$ is a group object internal to $\mathcal{C}$ and $X$ is some object of $\mathcal{C}$ (if you like, $X$ is a “set internal to $\mathcal{C}$”) then an Internal Group Action is a $\mathcal{C}$-arrow $\alpha : G \times X \to X$ satisfying the usual axioms.

So then a group action internal to $\mathsf{Top}$ is the usual notion of a continuous group action, and a group action internal to manifolds is a lie group action, etc.

Let’s change tack for a second and talk about the other half of the story.

Now we start with a (symmetric) monoidal closed category. Roughly speaking, this is a category where the set of arrows $\text{Hom}_{\mathcal{C}}(X,Y)$ can be represented by an object of $\mathcal{C}$!

For instance, the category of vector spaces $\mathsf{Vect}$ is monoidal closed since the homset $\text{Hom}(V,W)$ of linear maps is itself a vector space, which we’ll write as $[V,W]$.

Another example is the category of (nice³) topological spaces. The set of continuous functions $\text{Hom}(X,Y)$ can be given the compact-open topology, so that it is itself a topological space $[X,Y]$.

The fact that these categories can “talk about their own homsets” might make you wonder about other categories with structured homsets.

For instance, there are lots of categories in the wild whose homsets are vector spaces! If $R$ is any $k$-algebra, then for $R$-modules $M$ and $N$ the homset $\text{Hom}_{R\text{-mod}}(M,N)$ is a vector space. Similarly, if $G$ is any group, then the homset between any two $G$-representations is actually a vector space! We say that the categories $R\text{-mod}$ and $G\text{-rep}$ are Enriched over $\mathsf{Vect}$.

Similarly, you can ask about categories where each homset is a topological space. It turns out that these give a fantastic first-order approximation⁴ to the theory of $\infty$-categories!

More generally, we can define a $\mathcal{C}$-Enriched Category to be the data of

A set of objects
For each pair of objects $x,y$, a $\mathcal{C}$-object $\text{Hom}(x,y)$
For each triple of objects, a composition map in $\mathcal{C}$, $\circ : \text{Hom}(y,z) \otimes \text{Hom}(x,y) \to \text{Hom}(x,z)$
For each object $x$, a distinguished element⁵ $\text{id}_x \in \text{Hom}(x,x)$
Satisfying the usual axioms.

And what will be relevant for us, a $\mathcal{C}$-enriched groupoid, which moreover has an inverse map $i : \text{Hom}(x,y) \to \text{Hom}(y,x)$ showing that every arrow is an isomorphism.

Note that every (symmetric monoidal closed) $\mathcal{C}$ is enriched over itself in a canonical way. We take the $\mathcal{C}$-category whose objects are objects of $\mathcal{C}$, and define $\text{Hom}(x,y)$ to be the $\mathcal{C}$-object $[x,y]$. This specializes to the right notion for vector spaces and topological spaces which were examples earlier in this section.

We’ll also need the notion of a $\mathcal{C}$-enriched functor, which is exactly what you might expect given the above definition.

Given two $\mathcal{C}$-enriched categories $\mathbf{A}$ and $\mathbf{B}$, an Enriched Functor $F : \mathbf{A} \to \mathbf{B}$ sends objects of $\mathbf{A}$ to objects of $\mathbf{B}$. Moreover, for every pair of objects $x,y$ in $\mathbf{A}$ there should be a $\mathcal{C}$-arrow $F_{x,y} : \text{Hom}_{\mathbf{A}}(x,y) \to \text{Hom}_{\mathbf{B}}(Fx,Fy)$, and these arrows should be compatible with identities and composition.

For example, a $\mathsf{Vect}$-enriched functor is just a functor so that the map $\text{Hom}(x,y) \to \text{Hom}(Fx,Fy)$ is moreover a linear map (recall our homsets are vector spaces). Similarly, a $\mathsf{Top}$-enriched functor is a functor so that the maps on homsets $\text{Hom}(x,y) \to \text{Hom}(Fx,Fy)$ are continuous.

Now we have all the pieces we’ll need to prove

Theorem: Fix a cartesian closed category $\mathcal{C}$.

There is a natural bijection between group objects $G$ internal to $\mathcal{C}$ and $1$-object groupoids $\mathsf{B}G$ enriched over $\mathcal{C}$.

Moreover, for a fixed group object $G$, there is a bijection between internal $G$-actions and enriched functors $\mathsf{B}G \to \mathcal{C}$.

$\ulcorner$ Say that we have a group object $G$ internal to a (cartesian closed) category $\mathcal{C}$. Then let’s build a $\mathcal{C}$-enriched category, $\mathsf{B}G$ with a single object $\star$, where $\text{Hom}(\star,\star) = G$. Of course, we write $\text{id}_\star = e$, and composition is multiplication.

Note that $G$ is an object of $\mathcal{C}$, and the identity/composition/inverse maps are $\mathcal{C}$-arrows. So this really is a $\mathcal{C}$-enriched groupoid with one object!

Conversely, say we have a one-object $\mathcal{C}$-enriched groupoid $\mathcal{G}$. Then $\text{Hom}_\mathcal{G}(\star,\star)$ had better be an object of $\mathcal{C}$, and it’s easy to check that composition and inverse in $\mathcal{G}$ give this object an internal group structure!

So the data of an enriched $1$-object groupoid is exactly the data of an internal group!

Now, what is a $\mathcal{C}$-enriched functor $F : \mathsf{B}G \to \mathcal{C}$?

We have to send $\star$ to some object of $\mathcal{C}$, say $X$. Then we need a $\mathcal{C}$-morphism $\text{Hom}(\star,\star) \to \text{Hom}(X,X)$. But by the definitions of $\mathsf{B}G$ and $\mathcal{C}$ (enriched over itself) this is the data of a $\mathcal{C}$-arrow $G \to [X,X]$.

Now we use cartesian closedness! This arrow transposes (uncurries) to a $\mathcal{C}$-arrow $G \times X \to X$, and one can check that the identity and composition preservation for the functor corresponds exactly to the axioms for $G \times X \to X$ to be a group action internal to $\mathcal{C}$.

Of course, walking backwards through the above discussion shows that an internal group action $G \times X \to X$ in $\mathcal{C}$ is exactly the data of a $\mathcal{C}$-enriched functor $\mathsf{B}G \to \mathcal{C}$ sending $\star \mapsto X$! $\lrcorner$
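If you want to see the transposition step happen in the simplest possible case, here is a small sketch with $\mathcal{C} = \mathsf{Set}$. The choice of $G = \mathbb{Z}/3$ acting on a six element set by $x \mapsto x + 2g$ is an assumption made purely for illustration, as are all the function names.

```python
from itertools import product

G = range(3)  # Z/3, with identity element 0
X = range(6)  # the set being acted on


def mult(g, h):
    """Multiplication in G."""
    return (g + h) % 3


def alpha(g, x):
    """A candidate action G x X -> X."""
    return (x + 2 * g) % 6


def curry(action):
    """Transpose G x X -> X into G -> [X, X], storing each function as a table."""
    return {g: tuple(action(g, x) for x in X) for g in G}


def uncurry(hom_map):
    """Transpose G -> [X, X] back into G x X -> X."""
    return lambda g, x: hom_map[g][x]


def compose(f, k):
    """Composite f after k of two tables in [X, X]."""
    return tuple(f[k[x]] for x in X)


def is_functor(hom_map):
    """Does G -> [X, X] preserve the identity and composition?"""
    preserves_id = hom_map[0] == tuple(X)
    preserves_comp = all(
        hom_map[mult(g, h)] == compose(hom_map[g], hom_map[h])
        for g, h in product(G, repeat=2)
    )
    return preserves_id and preserves_comp


def is_action(action):
    """Does G x X -> X satisfy the unit and associativity axioms?"""
    unit_law = all(action(0, x) == x for x in X)
    assoc_law = all(
        action(mult(g, h), x) == action(g, action(h, x))
        for g, h in product(G, repeat=2)
        for x in X
    )
    return unit_law and assoc_law
```

The point is that `is_action(alpha)` and `is_functor(curry(alpha))` are checking the same equations, just written on opposite sides of the currying bijection.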

Your category-theorist senses should be tingling after reading the statement of the previous theorem!

Sure there’s a bijection of groups/group actions, but what about the arrows!?

As a cute exercise, prove that this theorem upgrades to an equivalence⁶ of categories between

\[\left \{ \begin{array}{c} \text{group objects internal to $\mathcal{C}$ with} \\ \text{internal group homs as arrows} \end{array} \right \} \simeq \left \{ \begin{array}{c} \text{$1$-object groupoids enriched over $\mathcal{C}$ with} \\ \text{enriched functors as arrows} \end{array} \right \}\]

and for fixed $G$

\[\left \{ \begin{array}{c} \text{Internal actions $G \times X \to X$ in $\mathcal{C}$ with} \\ \text{internal $G$-equivariant arrows} \end{array} \right \} \simeq \left \{ \begin{array}{c} \text{Enriched functors $\mathsf{B}G \to \mathcal{C}$ with} \\ \text{enriched natural transformations as arrows} \end{array} \right \}\]

Part of the puzzle is how to define some of these notions (such as “internal $G$-equivariant arrows”). You might find it helpful to read the preexisting definition of an enriched natural transformation.

Let’s take a second to meditate on the difference between “internalization” and “enrichment”. This difference is usually invisible, since for “ordinary” categories we’re always working both internal to $\mathsf{Set}$⁷ and enriched over $\mathsf{Set}$. That is, our categories are always sets with some structure, and our homsets are always… well, sets!

When you have some gadget and you think “Gee! I sure wish this gadget automatically had the structure of a $\mathcal{C}$-object!”⁸, you want to work internally to $\mathcal{C}$. Doing this means pretending that $\mathcal{C}$ is the universe of sets, and the $\mathcal{C}$-arrows are the universe of functions, and then just doing whatever we usually do but “inside $\mathcal{C}$”.

Figuring out exactly how to do this is the purview of much of categorical logic. We can construct standard ways of interpreting set theoretic constructions (such as “$\{ x \in \mathbb{N} \mid \exists y. y^2 = x \}$”, etc.) inside a (sufficiently structured) category $\mathcal{C}$. Then there’s a routine, but slightly annoying⁹, method for cashing out these set theoretic constructions for an “internal” version in $\mathcal{C}$! You can read all about this here or here. One of the reasons so many people care about topos theory is because a topos is a category with so much structure that we can actually internalize any concept we want inside it!

What about enrichment? This is useful when you have an otherwise “normal” category, but your homsets have ~bonus structure~ that you want to respect. For instance, your homsets might be abelian groups, or vector spaces, topological spaces, or chain complexes! Then enriched category theory tells you that, say, yoneda’s lemma still works when you ask that everything in sight respects this ~bonus structure~. This turns out to be the start of an incredibly interesting subject called formal category theory¹⁰.

To see how well you understand internal versus enriched things, here’s a cute exercise:

Write out, in some detail, the definition of a category internal to $\mathsf{Cat}$ (the category of categories). Then write out, in some detail, the definition of a category enriched over $\mathsf{Cat}$ (with $\times$ as its monoidal structure).

Both of these concepts are extremely useful in lots of ongoing research in algebra, logic, and applied category theory! A category internal to $\mathsf{Cat}$ is a double category (See Evan Patterson’s excellent blog post on the subject). A category enriched over $\mathsf{Cat}$ is a 2-category. These show up very naturally, as I’ll hopefully show in an upcoming blog post!

Ok, this blog post became something much longer than I originally intended (to nobody’s surprise), but let’s have one more cute puzzle before we go.

Another common way group actions get treated is as a group homomorphism $G \to \text{Aut}(X)$, where $\text{Aut}(X)$ is the group of automorphisms of $X$. Is there some way to make this perspective fit in with the internal/enriched perspectives we’ve been working with so far?

Again, the answer is yes, but now we need to work with a cartesian closed category with all finite limits.

Given a cartesian closed category with finite limits $\mathcal{C}$, and an object $X \in \mathcal{C}$, can you build a group object $\underline{\text{Aut}}(X)$ internal to $\mathcal{C}$ so that the global elements $1 \to \underline{\text{Aut}}(X)$ are in bijection with the usual automorphism group $\text{Aut}(X)$ (which is just a set)?

Then, once you’ve defined $\underline{\text{Aut}}(X)$, can you show that an internal group action $G \times X \to X$ is the same data as an internal group hom $G \to \underline{\text{Aut}}(X)$?

If you find this exercise hard, maybe that’s incentive to learn some categorical logic! The category $\mathcal{C}$ has enough structure¹¹ for its internal language to support a definition

\[\{ f : X \to X \mid \exists g : X \to X . fg = \text{id}_X \land gf = \text{id}_X \}\]

which we can then cash out for an honest object $\underline{\text{Aut}}(X)$ in $\mathcal{C}$.

Since the usual proof that this set is a group is constructive, we get for free that any object $\underline{\text{Aut}}(X)$ in any $\mathcal{C}$ is actually a group object in $\mathcal{C}$! Moreover, the usual proof that an action $G \curvearrowright X$ is a group hom $G \to \text{Aut}(X)$ is constructive. So we learn, for free, that in $\mathcal{C}$ an internal group action is the same thing as an internal group hom $G \to \underline{\text{Aut}}(X)$!
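As a sanity check on the set-builder definition, here is a sketch of what it cashes out to in $\mathsf{Set}$ for a three element $X$. Representing each $f : X \to X$ as a tuple of values, and the example hom `rho` out of $\mathbb{Z}/3$, are assumptions made for illustration.

```python
from itertools import product

X = (0, 1, 2)
identity = X  # the table of the identity function on X

# Every function X -> X, stored as its table of values (27 of them).
endomorphisms = list(product(X, repeat=len(X)))


def compose(f, g):
    """Composite f after g."""
    return tuple(f[g[x]] for x in X)


def inverses_of(f):
    """All g with fg = id and gf = id, straight from the set-builder definition."""
    return [
        g for g in endomorphisms
        if compose(f, g) == identity and compose(g, f) == identity
    ]


# Aut(X) = { f | there exists g with fg = id and gf = id }
aut = [f for f in endomorphisms if inverses_of(f)]


def rho(g):
    """A candidate group hom Z/3 -> Aut(X): rotate X by g."""
    return tuple((x + g) % 3 for x in X)


def is_group_hom(hom, order):
    """Check that hom lands in Aut(X) and turns addition mod `order` into composition."""
    lands_in_aut = all(hom(g) in aut for g in range(order))
    multiplicative = all(
        hom((g + h) % order) == compose(hom(g), hom(h))
        for g in range(order)
        for h in range(order)
    )
    return lands_in_aut and multiplicative
```

Notice that `inverses_of(f)` never returns more than one element. That uniqueness is what lets the existential quantifier be interpreted using only finite limits.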

Thanks for sticking around! This was a super fun post to write since it touches on a LOT of aspects of “more advanced” category theory that people might struggle with at first. I would normally give more of an outro, but I have some friends coming over in a half hour and I really want to get this posted!

Stay warm, and stay safe, all! We’ll talk soon ^_^

Note that the “usual axioms” can all be expressed as equalities between composites of these functions. For instance, the inverse law says that the composite
\[\begin{array}{ccccccc} G & \overset{\Delta}{\longrightarrow} & G \times G & \overset{1_G \times i}{\longrightarrow} & G \times G & \overset{m}{\longrightarrow} & G\\ g & \mapsto & (g,g) & \mapsto & (g, g^{-1}) & \mapsto & g \cdot g^{-1} \end{array}\]
is the same arrow as
\[\begin{array}{ccccc} G & \overset{!}{\longrightarrow} & 1 & \overset{e}{\longrightarrow} & G \\ g & \mapsto & \star & \mapsto & e \end{array}\]
The elementwise definitions in the lower lines are primarily for clarity in showing what these composites “are really doing”, but they can be made precise using the language of “generalized elements”. ↩
There turns out to be a deep connection between “algebraic theories” (like groups, rings, etc) and categories with finite products, which I want to write about someday. This is the start of the story of categorical logic, which is near and dear to my heart.

One can view this whole game of “internalization” as a subfield of categorical logic, where we focus in on the things we normally do to sets, and precisely say
1. what categories can interpret various set-theoretic constructions
2. how to figure out what the “right way” to internalize a given construction is. This turns out to be totally algorithmic!
↩
There’s a couple things I could mean by “nice” here. See here for some options, but if pressed I’d probably say compactly generated spaces. Keep in mind that this makes the compact-open topology I linked to incorrect in some edge cases, but morally what I’ve said is right, and I think it’s literally true for compactly generated hausdorff spaces. ↩
In fact, every $\infty$-category is equivalent (in the appropriate sense) to a category enriched in simplicial sets. See here, for instance.

The claim then follows from the fact that the category of simplicial sets (up to homotopy) is equivalent (again, in an appropriate sense) to the category of topological spaces (up to homotopy). ↩
Of course, by “element” here I mean a global element. That is, a map from the monoidal unit $\text{id}_x : I \to \text{Hom}(x,x)$. ↩
Isomorphism? ↩
Ignoring size issues ↩
Which for some reason I’m hearing in Tom Mullica’s voice ↩
In exactly the same way that, say, gaussian elimination, is routine but annoying. ↩
Of course, there’s other more exotic ways to use enriched categories where the hom-objects aren’t structured sets! See, for instance, the famous Lawvere Metric Spaces. These are still extremely interesting, but are of a different flavor to the enriched categories that fit into the story I’m trying to tell. ↩
Notice that, in this definition, the $g$ in the existential quantifier is provably unique. This is super important because it means we can make this definition using only finite limits. More complicated existential quantifiers require more structure on our category, namely regularity. For more information see the nlab pages on cartesian logic and regular logic. ↩

Talk -- 2-Categorical Descent and (Essentially) Algebraic Theories

Tue, 14 Nov 2023 00:00:00 +0000

A few weeks ago I gave a talk at the CT Octoberfest 2023 about some work I did over the summer that I’m really proud of. Unfortunately, while writing up the result I found a 1999 paper by Pedicchio and Wood that proves the same theorem (with roughly the same proof), so I wasn’t able to publish. Thankfully, the work is still extremely interesting, and I was more than happy to talk about it at a little online conference for other category theorists ^_^.

Recall an algebraic theory is something like groups, rings, modules, etc. It’s a structure that can be defined as a set (or possibly multiple sets) with some operations defined on it (allowing constants as $0$-ary operations) and equations specifying the behavior of those operations.

An essentially algebraic theory is something like categories. It’s a structure that can be defined as a set (or possibly multiple sets) with some operations defined on it, etc. The main superpower we get in the essentially algebraic world over the algebraic one is partially defined functions. Now our operations don’t have to be defined everywhere, they are allowed to be defined on subsets of the sorts. As long as those subsets are definable by equations!

For instance, the theory of categories is essentially algebraic since we have

Sets $O$ and $A$ (the sets of objects and arrows)
operations $\text{dom}, \text{cod} : A \to O$ taking an arrow to its domain/codomain
an operation $\text{id} : O \to A$ taking an object to the identity arrow at that object
an operation $\circ : \{ (f,g) \in A \times A \mid \text{dom}(f) = \text{cod}(g) \} \to A$
satisfying certain equational axioms, like $\text{dom}(\text{id}(x)) = x = \text{cod}(\text{id}(x))$, $(f \circ g) \circ h = f \circ (g \circ h)$, etc.

Notice that composition isn’t defined on the whole set $A \times A$. It’s only partially defined! But the set where it’s defined is easy to understand – it’s defined by an equation in the other functions ($\text{dom}(f) = \text{cod}(g)$).
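Here is a small sketch of what a model of this theory looks like as data, assuming a toy category with objects $A, B, C$ and non-identity arrows $f : A \to B$, $g : B \to C$ and $gf : A \to C$. The dictionary encoding and the function names are just one hypothetical way to write it down.

```python
objects = {"A", "B", "C"}

# dom and cod are total operations from arrows to objects.
dom = {"id_A": "A", "id_B": "B", "id_C": "C", "f": "A", "g": "B", "gf": "A"}
cod = {"id_A": "A", "id_B": "B", "id_C": "C", "f": "B", "g": "C", "gf": "C"}
arrows = set(dom)

# id is a total operation from objects to arrows.
ident = {"A": "id_A", "B": "id_B", "C": "id_C"}


def composable_pairs():
    """The equationally defined domain of composition: dom(p) = cod(q)."""
    return {(p, q) for p in arrows for q in arrows if dom[p] == cod[q]}


def compose(p, q):
    """p after q. This is a partial operation on pairs of arrows."""
    if dom[p] != cod[q]:
        raise ValueError(f"{p} and {q} are not composable")
    if q == ident[dom[q]]:
        return p
    if p == ident[cod[p]]:
        return q
    if (p, q) == ("g", "f"):
        return "gf"
    raise AssertionError("unreachable for this particular category")


def axioms_hold():
    """Check the dom/cod laws for identities and composites, plus associativity."""
    pairs = composable_pairs()
    ids_ok = all(dom[ident[x]] == x == cod[ident[x]] for x in objects)
    ends_ok = all(
        dom[compose(p, q)] == dom[q] and cod[compose(p, q)] == cod[p]
        for p, q in pairs
    )
    assoc_ok = all(
        compose(compose(p, q), r) == compose(p, compose(q, r))
        for p, q in pairs
        for r in arrows
        if dom[q] == cod[r]
    )
    return ids_ok and ends_ok and assoc_ok
```

Only 10 of the 36 pairs of arrows are composable here, and asking for `compose("f", "g")` raises an error rather than returning anything.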

Contrast this with fields, which have a partially defined inverse operation $(-)^{-1} : \{ x \in k \mid x \neq 0 \} \to k$. There is no way to write the domain of inversion as an equation¹.

Now, essentially algebraic theories are extremely nice, for lots of reasons I outlined in my talk (and mentioned on the nlab page I linked earlier), but they’re not quite as nice as honest algebraic theories.

For instance, the underlying set of a quotient of groups is a quotient of the underlying set. If we have a surjection $G \twoheadrightarrow H$, then there’s an equivalence relation² $\theta$ on $UG$ (the underlying set of $G$) so that $UH \cong (UG) \big / \theta$.

This is no longer the case for models of an essentially algebraic theory! That is, the underlying set of a quotient might not be a quotient of the underlying set³.

For example, consider the following category:

Notice its set of arrows (ignoring identities) is $\{ f, g \}$.

Now if we quotient to set $Y_1 = Y_2$, we get a new category

But now that $Y_1 = Y_2$, $f$ and $g$ are composable! So we had better add a composite!

So after quotienting, our underlying set of arrows (again, ignoring identities) is $\{ f, g, gf \}$, which isn’t a quotient of the set we started with! Also, note the role that partial operations played in this. The reason we got ~bonus elements~ in our underlying set is because after quotienting the domain for the partial operation got bigger, so we had to freely add stuff to make sure we were closed under composition.
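This is easy to play with on a computer. The sketch below assumes the category in question is the free category on the graph $X \overset{f}{\to} Y_1$, $Y_2 \overset{g}{\to} Z$ (my reading of the example), so that its non-identity arrows are exactly the nonempty paths. The function names are hypothetical.

```python
def nonidentity_arrows(edges):
    """Names of all nonempty paths in a finite acyclic graph.

    `edges` maps an edge name to its (source, target) pair. The path
    "f then g" is named "gf", matching the usual composition order.
    """
    paths = [((name,), src, tgt) for name, (src, tgt) in edges.items()]
    frontier = list(paths)
    while frontier:  # terminates because the graph is assumed acyclic
        extended = [
            (names + (name,), src, new_tgt)
            for names, src, tgt in frontier
            for name, (new_src, new_tgt) in edges.items()
            if new_src == tgt
        ]
        paths.extend(extended)
        frontier = extended
    return {"".join(reversed(names)) for names, _, _ in paths}


def quotient(edges, identify):
    """Rename vertices according to `identify`, leaving the edges alone."""
    return {
        name: (identify.get(src, src), identify.get(tgt, tgt))
        for name, (src, tgt) in edges.items()
    }


before = {"f": ("X", "Y1"), "g": ("Y2", "Z")}
after = quotient(before, {"Y1": "Y", "Y2": "Y"})
```

Comparing `nonidentity_arrows(before)` with `nonidentity_arrows(after)` shows the new composite appearing, even though `quotient` only ever touched the objects.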

Another reason to care about algebraic theories over essentially algebraic ones is that algebraic theories can be interpreted in any finite product category, while essentially algebraic theories make use of all finite limits! This shows up even for “real mathematicians”, since the category $\mathsf{Diff}$ of smooth manifolds doesn’t have finite limits! So we can define a lie group as a group object in $\mathsf{Diff}$ (since the theory of groups is algebraic) but we can’t define a lie groupoid as a groupoid object in $\mathsf{Diff}$ (since the theory of groupoids is merely essentially algebraic)⁴!

With this in mind, it’s natural to ask when we can recognize an algebraic theory amongst the essentially algebraic ones. It turns out we can, and the process requires a fair amount of category theory!

We’ve already touched on the relationship between

\[\{ \text{algebraic theories} \} \leftrightsquigarrow \{ \text{finite product categories} \}\]
\[\{ \text{essentially algebraic theories} \} \leftrightsquigarrow \{ \text{finite limit categories} \}\]

But it turns out the relationship goes much deeper! Indeed, one can show that the “sets” in the above bullets actually represent $2$-categories, and that the correspondences are (contravariant) bi-equivalences!

Given a finite product (resp. finite limit) category $\mathcal{C}$, we treat it as an (essentially) algebraic theory, and say its category of models is the category of finite product (resp. finite limit) functors $\mathcal{C} \to \mathsf{Set}$.

In fact, we can go further! Given finite product (resp. finite limit) categories $\mathcal{C}$ and $\mathcal{V}$, we say that the category of $\mathcal{C}$ models in $\mathcal{V}$ is the category of finite product (resp. finite limit) functors $\mathcal{C} \to \mathcal{V}$.

Conversely, given a category of models for some (essentially) algebraic theory, its category of ($\mathsf{Set}$-valued) finitely generated free algebras⁵ (resp. finitely presented algebras) has finite coproducts (resp. finite colimits). So if we take the opposite category of this, we get a category with finite products (resp. finite limits)!

It’s then not so hard to show that these operations are mutually inverse⁶!

Now if we have an algebraic theory $\mathbb{A}$, that corresponds to a finite product category $\mathcal{A}$ where the category of $\mathbb{A}$-models is the functor category $\mathsf{FinProd}(\mathcal{A}, \mathsf{Set})$.

To view this as an essentially algebraic theory, we want to find a finite limit category $\mathcal{E}$ which has the same models. That is, so that for every finite limit category $\mathcal{V}$:

\[\mathsf{FinLim}(\mathcal{E}, \mathcal{V}) \cong \mathsf{FinProd}(\mathcal{A}, U \mathcal{V})\]

where $U$ is the forgetful functor from finite limit categories to finite product categories.

This makes it clear that $\mathcal{E}$ should be the free finite limit completion of the finite product category $\mathcal{A}$ we started with! Since we already have products, all we have to do is freely add equalizers!

With this in mind, we see how to rephrase our problem of recognizing the algebraic theories among the essentially algebraic ones!

Fix an essentially algebraic theory $\mathbb{E}$, with (finite limit) classifying category $\mathcal{E}$.

Then $\mathbb{E}$ is actually algebraic if and only if $\mathcal{E}$ is equivalent to the free equalizer completion of a finite product category $\mathcal{A}$!

This means if we want to recognize the algebraic theories, we just need a way to recognize the essential image of the equalizer completion functor!

Thankfully, there’s a very heavy hammer we can use to understand the image of a left adjoint: Comonadic Descent!

I don’t want to say too much about (co)monadic descent here, mainly because I’m going to a friend’s concert tonight and I’ve already written quite a lot about it in my recent preprint. But here’s the short story. We have a diagram of categories

where $\mathsf{FinLim}_{\mathsf{Eq} \ U}$ is the category of coalgebras for the $\mathsf{Eq}\ U$ comonad, and the usual Barr-Beck yoga shows that everything in the image of $\mathsf{Eq}$ has a canonical coalgebra structure, which is where the top map (which I’m abusively also calling $\mathsf{Eq}$) comes from.

The adjunction $\mathsf{Eq} \dashv U$ is called comonadic exactly when this top map is an equivalence. In particular, this means we can recognize the image of $\mathsf{Eq}$ in $\mathsf{FinLim}$ as those categories admitting an $\mathsf{Eq} \ U$ coalgebra structure!

It turns out to not be too hard to prove that this adjunction is comonadic by using Beck’s famed (Co)monadicity Theorem! This comes down to some combinatorics⁷ involving Pitts’ explicit construction of the equalizer completion, first published in Bunge and Carboni’s The Symmetric Topos, which solves our problem!

Pedicchio and Wood, in their ‘99 paper A Simple Characterization of Theories of Varieties, give a nice characterization of the image of $\mathsf{Eq}$ as those categories with enough “effective projectives”⁸.

Let me say a quick word about the “2-categorical” in the title. In the last section, to make use of the descent machinery, we had to work with 1-categories $\mathsf{FinLim}$ and $\mathsf{FinProd}$. That is, with strict such categories. Of course, this result should really be 2-categorical in nature, working with all such categories, and we should be using a 2-categorical version of comonadic descent to prove the theorem… Unfortunately I don’t know of one!

I was kind of hoping that someone would ask about this during the talk – after all I put “2-categorical” in the title, but didn’t mention 2-categories at all! But my talk was the first talk of the day⁹, so it makes sense that people would have been nice and not asked that.

Regardless, I’m pretty sure an australian would have a reference for this kind of descent (and I might ask about it in the category theory zulip after posting this), because there’s no way I’m the first person to want to use it!

All in all, I’m happy with how the talk went. It was one of the shorter talks I’ve given, and I wanted to assume the audience didn’t know a ton of logic. I think I did a good job giving the flavor of the theorem, and some reasons to care about it, without necessarily getting bogged down in the details of the proof. Hopefully I didn’t come off as too upset that I was 24 years late to publish it myself!

As usual, here’s a copy of the slides, abstract, and recording. I’ll also encourage people to take a look at some of the other talks from the conference (which you can find here). There were a ton of interesting ones¹⁰ and if you like category theory I’m sure you’ll find something you enjoy!

Thanks again for reading. Stay warm, and try not to let the November darkness weigh on you too much! Talk soon ^_^

$2$-Categorical Descent and (Essentially) Algebraic Theories

An essentially algebraic theory is an algebraic theory that moreover allows certain partially defined operations. Since algebraic theories enjoy certain nice properties that essentially algebraic theories don’t, it’s natural to ask if we can recognize when an essentially algebraic theory is actually algebraic. In the language of functorial semantics, this amounts to recognizing when a finite limit category is the free completion of a finite product category, and the problem can be solved by considering a 2-categorical descent theory. This was independent work, but writing it up I learned that the same result can already be found in a 1999 paper of Pedicchio and Wood. This seems to be less well known than it should be, and I hope this talk brings attention to this fascinating subject.

The slides are here, and a recording is below:

In fact, we can prove this with category theory! It’s a theorem that the category of all models for an essentially algebraic theory has an initial object. But there isn’t an initial object in the category of fields! So no matter how clever we are, there won’t be an essentially algebraic axiomatization of fields. ↩
In fact there’s much more to be said here. The equivalence relation $\theta$ will be a congruence (meaning it’s compatible with the algebraic structure), and the study of such congruences is historically one of the biggest topics in universal algebra. I won’t say more here, but trust me that there’s much more to say. If you’re interested, I recommend Burris and Sankappanavar’s book, freely available here. ↩
This should be believable for a few reasons. Indeed, the “underlying set” functor is a right adjoint, so we shouldn’t expect it to play nicely with any kind of colimit (like a quotient).

Moreover, $U$ playing nicely with congruences is one of the defining features of an algebraic theory! This is the key criterion for monadicity! ↩
If you haven’t seen them before, it may come as a surprise that “real mathematicians” care about lie groupoids, since they sound quite abstract. But they’re really not esoteric at all! They model orbifolds, which are manifolds with certain mild singularities. They arise incredibly naturally when studying, say, manifolds with a group action. ↩
This is the slightest of fibs. The situation for finite product categories is actually slightly more subtle than I’m letting on, basically because a finite product category might not be cauchy complete. If you know you know, if you don’t, then trust me that it doesn’t really matter. If you’re interested in the details, you can find them in Adámek, Vitale, and Rosický’s book Algebraic Theories. ↩
But they’re only mutually inverse up to equivalence! If we start with a finite limit category $\mathcal{C}$, and then look at the opposite of the finitely presented objects in the functor category $[\mathcal{C},\mathsf{Set}]$, then we merely get something equivalent to $\mathcal{C}$!

In particular, we don’t get something isomorphic to $\mathcal{C}$, and we definitely don’t get something equal to $\mathcal{C}$! Some readers will likely say that concepts of “isomorphism” and “equality” aren’t even defined in a 2-category (they would say bicategory, of course), but that’s not a quibble I want to have right now.

What matters is that there’s something honestly 2-categorical happening here, and we need that language to make this notion of “sameness” precise in the same way we need 1-categories to make the notion of “isomorphism” precise. I’m literally so close to finishing a blog post on “2-categories and why you should care” that goes into this in more depth, but I wanted to say a word about it here. After all, rambling footnotes are a feature of this blog! ↩
As many theorems do, at the end of the day. Thankfully our combinatorics is also pretty polite. ↩
Though they work with the opposite of the categories we work with, so for us they’re more likely to be “effective injectives” after dualizing. ↩
Which put it at 6am my time… Thankfully I’ve recently become a morning person, so I only had to wake up an hour earlier than usual. ↩
And I haven’t even had time to watch them all yet! ↩

A truly incredible fact about the number 37

Wed, 08 Nov 2023 00:00:00 +0000

So I was on math stackexchange the other day, and I saw a cute post looking for a book which lists, for many many integers, facts that Ramanujan could have told Hardy if he’d taken a cab other than 1729. A few days ago OP answered their own question, saying that the book in question was Those Fascinating Numbers by Jean-Marie De Koninck. I decided to take a glance through it to see what kinds of facts lie inside (and also to see just how many integers are covered!). Not only was I overwhelmed by the number of integers and the number of facts about them, the preface already includes one of the single wildest facts I’ve ever heard, and I have to talk about it here! Here’s a direct quote from the preface:

37, the median value for the second prime factor of an integer; thus the probability that the second prime factor of an integer chosen at random is smaller than 37 is approximately $\frac{1}{2}$;

My jaw was on the floor when I read this, haha. First it sounded totally unbelievable, since 37 is a tiny number in the grand scheme of things. Then it started to sound slightly more plausible… After all, about half of all integers have $2$ as their smallest prime factor. It makes sense that smaller primes should be more frequent among the smallest factors of numbers! But then I thought “how can you possibly prove this!?”. I’m not much of an analytic number theorist¹, but I know that they have good estimates on a lot of facts like this. I decided it would be fun to try and find and understand a proof of this fact, and also write some sage code to test it!

So then let’s go ahead and do it ^_^

First, I think, the sage code. I want to know if this really works!

“Obviously” there’s no uniform distribution on the natural numbers, so what does it even mean to choose a “random” one? The way the number theorists usually solve this problem is by fixing a large number $N$ and looking at the probabilities when you pick a random number between $1$ and $N$. Then you look at the $N \to \infty$ limit of these probabilities.

So for us, we’ll want to first fix a large number $N$ and then work with numbers $\leq N$. For $N$ kind of small, we can just find the second prime factor of each number $\leq N$ and check the median!
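Here is a minimal sketch of that experiment in plain Python (the post itself uses sage, so this is a stand-in rather than the original code). The helper names `smallest_prime_factors`, `second_prime_factor`, and `median_second_prime` are made up for this illustration. One assumption to flag: numbers with fewer than two distinct prime factors (like $1$ and prime powers) are counted as having no second prime, but they still count towards the total $N$.

```python
def smallest_prime_factors(limit):
    """Sieve: spf[n] is the smallest prime dividing n, for 2 <= n <= limit."""
    spf = list(range(limit + 1))
    i = 2
    while i * i <= limit:
        if spf[i] == i:  # i is prime
            for multiple in range(i * i, limit + 1, i):
                if spf[multiple] == multiple:
                    spf[multiple] = i
        i += 1
    return spf


def second_prime_factor(n, spf):
    """Second-smallest distinct prime factor of n, or None if there is none."""
    if n < 2:
        return None
    p = spf[n]
    while n % p == 0:  # strip every copy of the smallest prime
        n //= p
    return spf[n] if n > 1 else None


def median_second_prime(limit):
    """Smallest prime p with at least half of 1..limit having second prime <= p.

    Returns (p, fraction). If no such prime exists, p is None.
    """
    spf = smallest_prime_factors(limit)
    counts = {}
    for n in range(1, limit + 1):
        p = second_prime_factor(n, spf)
        if p is not None:
            counts[p] = counts.get(p, 0) + 1
    running = 0
    for p in sorted(counts):
        running += counts[p]
        if 2 * running >= limit:
            return p, running / limit
    return None, running / limit


print(median_second_prime(10**5))
```

For very small `limit` the function returns `None` for the prime, since too many numbers have no second prime at all; the interesting behaviour only shows up once `limit` is reasonably large.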

When I first ran this code, it honestly felt like magic, haha. What the hell is going on here!?

The key idea, found in a paper of De Koninck and Tenenbaum², is that we can compute the density of numbers whose second prime is $p$ (which the authors denote $\lambda_2(p)$) by cleverly using the ideas in the Sieve of Eratosthenes!

Let’s do a simple example to start. What fraction of numbers have $5$ as their second prime? In the language of the paper, what is $\lambda_2(5)$?

Well it’s not hard to see that the numbers whose second prime is $5$ are those numbers whose prime factorization looks like

\[2^a 3^0 5^b \cdots\]

or

\[2^0 3^a 5^b \cdots\]

so we need to count the density of numbers of these forms.

But a number is of the first form ($2^a 3^0 5^b \cdots$) if and only if it has a factor of $2$, a factor of $5$, and no factors of $3$.

To bring this back to elementary school³, we can highlight all of our numbers with a factor of $2$ (in blue), numbers with no factors of $3$ (in orange), and numbers with a factor of $5$ (in pink).

Then the numbers whose prime factorization starts $2^a 3^0 5^b \cdots$ are exactly the numbers highlighted by all three of these colors!

It’s intuitively clear that $\frac{1}{2}$ the numbers are blue, $\frac{2}{3}$ are orange, and $\frac{1}{5}$ are pink. So taken together, $\frac{1}{2} \cdot \frac{1}{5} \cdot \frac{2}{3} = \frac{1}{15}$ of numbers are of this form!

So now we have our hands on the density of numbers of the form $2^a 3^0 5^b$, but this is only one of two ways that $5$ can be the second smallest prime. A similar computation shows that $\left ( 1 - \frac{1}{2} \right ) \cdot \frac{1}{3} \cdot \frac{1}{5} = \frac{1}{30}$ of numbers are of the form $2^0 3^a 5^b$.

It’s easy to see that these sets are disjoint, so their densities add, and $\frac{1}{15} + \frac{1}{30} = \frac{1}{10}$ of numbers have $5$ as their second smallest prime factor!
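As a quick sanity check on the warm-up, note that divisibility by $2$, $3$, and $5$ repeats with period $30$, so counting inside one block $1, \ldots, 30$ should give the densities exactly. The sketch below does that count; the variable names are just for illustration.

```python
from fractions import Fraction

period = 2 * 3 * 5
block = range(1, period + 1)

# Numbers of the form 2^a 3^0 5^b ...: a factor of 2, a factor of 5, no factor of 3.
form_one = [n for n in block if n % 2 == 0 and n % 3 != 0 and n % 5 == 0]
# Numbers of the form 2^0 3^a 5^b ...: no factor of 2, a factor of 3, a factor of 5.
form_two = [n for n in block if n % 2 != 0 and n % 3 == 0 and n % 5 == 0]

density_one = Fraction(len(form_one), period)
density_two = Fraction(len(form_two), period)
density_total = density_one + density_two

print(form_one, form_two, density_one, density_two, density_total)
```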

Now with the warm-up out of the way, let’s see how we can compute $\lambda_2(p)$ for our favorite prime $p$!

We’ll play exactly the same game. How can $p$ be the second smallest prime? Exactly if the prime factorization looks like

\[p^b q^a \prod_{q \neq r \lt p} r^0\]

for some $q \lt p$.

But we can count these densities as before! For each choice of $q$, we know that $\frac{1}{p}$ of numbers are multiples of $p$, $\frac{1}{q}$ are multiples of $q$, and for each other prime $r \lt p$ we know $\left (1 - \frac{1}{r} \right )$ of numbers are not multiples of $r$! For each $q$, then, we want to land in the intersection of all of these sets, and then we want to sum over our choices of $q$. Taken together, we see that

The density of numbers whose second prime is $p$ is

\[\lambda_2(p) = \sum_{q \lt p} \frac{1}{p} \frac{1}{q} \prod_{q \neq r \lt p} \left ( 1 - \frac{1}{r} \right )\]

We can rearrange this to

$\displaystyle \lambda_2(p) = \frac{1}{p} \left [ \prod_{q \lt p} \left ( 1 - \frac{1}{q} \right ) \right ] \sum_{q \lt p} \frac{1}{q} \left ( 1 - \frac{1}{q} \right )^{-1}$
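In case the rearrangement isn’t obvious, here is one way to spell out the intermediate step. The product over $r \neq q$ is the full product over primes $r \lt p$ with the $q$ factor divided back out:

\[\prod_{q \neq r \lt p} \left ( 1 - \frac{1}{r} \right ) = \left ( 1 - \frac{1}{q} \right )^{-1} \prod_{r \lt p} \left ( 1 - \frac{1}{r} \right )\]

The full product no longer depends on $q$, so it pulls out of the sum. It’s also handy to notice that each remaining summand simplifies, since $\frac{1}{q} \left ( 1 - \frac{1}{q} \right )^{-1} = \frac{1}{q-1}$.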

As a cute exercise, write $\lambda_k(p)$ for the density of numbers whose $k$th prime is $p$.

De Koninck and Tenenbaum mention in passing that

\[\displaystyle \lambda_k(p) = \frac{1}{p} \left [ \prod_{q \lt p} \left ( 1 - \frac{1}{q} \right ) \right ] s_{k-1}(p)\]

where $s_j(p) = \sum \frac{1}{m}$ is a sum over all $m$ which have exactly $j$ prime factors, all of which are $\lt p$.

Can you prove that this formula is correct⁴?
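If you’d like to poke at the formula numerically before proving it, here is a rough sketch (mild spoiler warning, since it leans on the same geometric series idea as the footnote). It assumes “exactly $j$ prime factors” means exactly $j$ distinct primes, each to any positive power, so that $s_j(p)$ becomes the degree $j$ elementary symmetric polynomial in the values $\frac{1}{q-1}$ for primes $q \lt p$. That reading, and the names `primes_below`, `s`, and `lam`, are assumptions of this sketch and not taken from the paper.

```python
from fractions import Fraction


def primes_below(p):
    """All primes q < p, by trial division (fine for small p)."""
    return [q for q in range(2, p)
            if all(q % d for d in range(2, int(q ** 0.5) + 1))]


def s(j, p):
    """Sum of 1/m over m with exactly j distinct prime factors, all < p."""
    # e[i] is the degree-i elementary symmetric polynomial in the values
    # 1/(q-1), where 1/(q-1) = 1/q + 1/q^2 + ... sums over all powers of q.
    e = [Fraction(1)] + [Fraction(0)] * j
    for q in primes_below(p):
        x = Fraction(1, q - 1)
        for i in range(j, 0, -1):
            e[i] += x * e[i - 1]
    return e[j]


def lam(k, p):
    """Conjectured density of numbers whose k-th prime is p."""
    product = Fraction(1)
    for q in primes_below(p):
        product *= 1 - Fraction(1, q)
    return Fraction(1, p) * product * s(k - 1, p)


# The densities over all k should add up to 1/p, the density of multiples of p.
print([lam(k, 5) for k in (1, 2, 3)], sum(lam(k, 5) for k in (1, 2, 3)))
```

A passing check here is only evidence that the formula is plausible, of course. The proof is still the exercise!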

But remember the goal of all this! We want to know the prime $p^*$ so that half of all numbers have their second prime $\leq p^*$. That is, so that the sum of densities

\[\lambda_2(2) + \lambda_2(3) + \lambda_2(5) + \ldots + \lambda_2(p^*) \approx \frac{1}{2}.\]

But we can implement $\lambda_2(-)$ and just check for which prime this happens!
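Here is a minimal sketch of that check in plain Python with exact fractions (a stand-in for the sage code in the post). It uses the rearranged formula for $\lambda_2(p)$, with each summand $\frac{1}{q} \left ( 1 - \frac{1}{q} \right )^{-1}$ simplified to $\frac{1}{q-1}$. The function names and the `search_limit` cutoff are choices made for this illustration.

```python
from fractions import Fraction


def primes_up_to(n):
    """All primes <= n, by trial division (fine for small n)."""
    return [q for q in range(2, n + 1)
            if all(q % d for d in range(2, int(q ** 0.5) + 1))]


def lambda_2(p):
    """Density of numbers whose second-smallest prime factor is p."""
    smaller = primes_up_to(p - 1)
    product = Fraction(1)
    for q in smaller:
        product *= 1 - Fraction(1, q)
    # (1/q) * (1 - 1/q)^(-1) simplifies to 1/(q - 1)
    total = sum(Fraction(1, q - 1) for q in smaller)
    return Fraction(1, p) * product * total


def median_second_prime(search_limit=100):
    """First prime where the running sum of lambda_2 reaches 1/2."""
    running = Fraction(0)
    for p in primes_up_to(search_limit):
        running += lambda_2(p)
        if running >= Fraction(1, 2):
            return p, running
    return None, running


p_star, density = median_second_prime()
print(p_star, float(density))
```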

Again we see that $37$ is the prime where roughly half of all numbers have something $\leq 37$ as their second prime! So we’ve proven that $37$ is the median second prime!

Also, this shows that we expect the actual density to be $\approx .5002$. If we set $N = 10^7$ in the code from the first half⁵ to get a better approximation, we get $.5002501$, which is remarkably close to the truth!

As another cute exercise – using the ideas in this post, can you compute the median third prime?

As a (much) harder exercise⁶, can you get asymptotics for how the median $k$th prime grows as a function of $k$?

Thanks for hanging out, all! This was a really fun post to write up, and I’m really excited to share it with everybody! This fact about $37$ was all I could think about for like a week, haha.

I have more blog posts coming, of course, so I’ll see you all soon!

Stay safe, and stay warm ^_^

Absolutely the understatement of the year ↩
Sur la loi de répartition du k-ième facteur premier d’un entier

Yes, this paper is in french, but it’s really not so hard to read, especially with liberal use of google translate. Though if you want to avoid reading it, I’ve done the hard work for you, and everything in this blog post is in english.

It also wasn’t too hard to find this paper, thankfully. It’s mentioned in a footnote in the entry for $37$ in Those Fascinating Numbers, so I had a decent starting point. ↩
I literally got the base image by googling “grid of numbers high res” and clicking the first result, which was for elementary schoolers ↩
It might be helpful to remember a generating function trick that shows up fairly often (for instance in partitions and the riemann zeta function):
\[\sum \frac{1}{n} = \prod_p \left ( 1 - \frac{1}{p} \right )^{-1}\]
Don’t worry that this sum diverges for now. Just take note of why these two sides are equal. You should expand each term of the right hand side as a geometric series, then check what happens when you foil. ↩
(and run it locally, since factoring numbers that big takes so long that the online sagecell times out) ↩
If you read french, the De Koninck and Tenenbaum paper we’ve been referencing all post (Sur la loi de répartition du k-ième facteur premier d’un entier) is actually all about analyzing these asymptotics!

If we write $p_k^*$ for the median $k$th prime, then they show:
\[\log \log p_k^* = k - b + O \left ( \frac{1}{\sqrt{k}} \right )\]
where $b = \frac{1}{3} + \gamma - \sum_p \left ( \log ((1-1/p)^{-1}) - 1/p \right )$ and $\gamma$ is the Euler-Mascheroni Constant. ↩

Preprint -- The RAAG Functor as a Categorical Embedding

Thu, 05 Oct 2023 00:00:00 +0000

After almost a year of sitting on my hard drive, I finally had time in August to finish revising my new preprint on Right Angled Artin Groups (Raags). And in September I had time to put it on the arxiv for people to see! Within 24 hours I had an email from somebody who had read it, and was interested in reading it closely! It’s super exciting to see that people are actually reading something I wrote, and it’s really validating to feel like the math I did last year was worth it ^_^. In this post, I’d love to give a quick informal description of what’s going on in that paper.

First, what’s a right angled artin group? We’ve actually talked about them before in the second ever post on this blog (!) but here’s a quick tl;dr.

Fix a (simple, undirected) graph $\Gamma$ with underlying vertex set $V$.

The Right Angled Artin Group $A\Gamma$ is the group

\[\langle v \in V \mid [v_1, v_2] \text{ for } \{v_1, v_2\} \in \Gamma \rangle\]

freely generated by the vertices, where two vertices commute if and only if they’re adjacent in $\Gamma$.

For example, in case $\Gamma$ is a complete graph on $n$ vertices, $A\Gamma$ is a free abelian group of rank $n$. If $\Gamma$ has $n$ vertices and no edges, then $A \Gamma$ is a free group of rank $n$. As a quick exercise, can you convince yourself that for $\Gamma$ as below, $A \Gamma \cong F_2 \times F_2$ is a direct product of free groups?

Now, it’s not hard to see that the right angled artin group construction $A$ is actually a functor $A : \mathsf{Gph} \to \mathsf{Grp}$ from the category of graphs to the category of groups. After all, if $\varphi : \Gamma \to \Delta$ is a graph homomorphism¹ then we get a group homomorphism $A \varphi : A \Gamma \to A \Delta$ given by sending each generator $v \in A \Gamma$ to the generator $\varphi v \in A \Delta$.

Moreover, $A : \mathsf{Gph} \to \mathsf{Grp}$ admits a right adjoint! This makes precise the idea that the raag on $\Gamma$ is a kind of “free group” associated to $\Gamma$. Given a group $G$, we define its Commutation Graph $CG$ to be the graph² whose vertices are the elements of $G$, where $\{g_1, g_2\} \in CG$ is an edge if and only if $g_1$ and $g_2$ commute in $G$.
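To make the commutation graph concrete, here is a small sketch that computes it for the symmetric group $S_3$, with permutations stored as tuples. The names `compose` and `commutation_graph` are made up for this example, and the self loops that every vertex carries are left implicit, so only edges between distinct elements are listed.

```python
from itertools import combinations, permutations


def compose(g, h):
    """The permutation 'g after h': apply h first, then g."""
    return tuple(g[h[i]] for i in range(len(h)))


def commutation_graph(elements, multiply):
    """Edges {g1, g2} of CG between distinct elements that commute."""
    return [
        (g1, g2)
        for g1, g2 in combinations(elements, 2)
        if multiply(g1, g2) == multiply(g2, g1)
    ]


s3 = list(permutations(range(3)))
edges = commutation_graph(s3, compose)

for g1, g2 in edges:
    print(g1, "--", g2)
```

In this example the identity is adjacent to everything, the two 3-cycles are adjacent to each other, and no other pair of distinct elements commutes.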

This allows us to bring much more powerful category theory to the table. In particular, it lets us use the machinery of comonadic descent to characterize the essential image of $A$. That is, to understand which groups are raags, and to understand which homomorphisms between raags are $A \varphi$ for some graph homomorphism! In fact, we show that $A$ is a categorical embedding, so that $\mathsf{Gph}$ is a (non-full) subcategory of $\mathsf{Grp}$!

Moreover, the characterization is something we can really calculate with!

The main theorem of the paper states that there is a coalgebra structure we can put on groups so that

The coalgebra structures on a group $G$ are in bijection with the graphs $\Gamma$ so that $G \cong A\Gamma$
A homomorphism $f : G \to H$ respects this coalgebra structure if and only if $f = A \varphi$ for a graph homomorphism.

In particular, the raags are exactly the groups admitting such a coalgebra structure.

One thing that I wish more mathematicians would talk about is the kind of meandering nature of research. When you see a paper written in definition-theorem-proof style, it’s hard to imagine what the discovery process must have looked like. It’s almost always much messier than the final paper would lead you to believe, so I want to take a minute to talk about the history of this paper.

Why was I thinking about all this?

There’s an important open question in the theory of raags about the “nonstandard” embeddings between two raags.

Obviously if $\Gamma$ is a (full) subgraph of $\Delta$, then $A\Gamma$ is a subgroup of $A \Delta$ in a canonical way. But it turns out we can have pairs so that $A \Gamma$ embeds into $A \Delta$, but $\Gamma$ is not a subgraph of $\Delta$! Motivated by questions in geometric group theory, it’s natural to want to understand when these “nonstandard” embeddings are possible.

Now the adjunction comes into play. If we have an embedding $A \Gamma \to A \Delta$, then our adjunction gives us a map of graphs $\Gamma \to CA \Delta$. Since $CA : \mathsf{Gph} \to \mathsf{Gph}$ is a monad on the category of graphs, we (not very creatively) call this construction the Monad Graph of $\Delta$.

It would be nice to have a combinatorial characterization of when a map $\Gamma \to CA \Delta$ transposes to an embedding $A \Gamma \to A \Delta$. But such a characterization is as yet unknown³.

I actually had most of these thoughts back in 2020 when I was first thinking about raags, but they didn’t really go anywhere. Especially since I was focused on life things like passing my quals and finding an advisor. But that all changed in 2022 when I started really trying to understand stacks…

One way to understand stacks⁴ is as “categories satisfying descent”. In the same way that a sheaf on $X$ assigns a set to each open in a way that elements of the set glue along open covers, a stack on $X$ assigns a category to each open in a way that objects and arrows of the category glue along open covers!

This story is closely tied up with the story of descent theory, so I took a detour through understanding that⁵. Along the way I found out about comonadic descent which (among other things) lets you characterize the image of a left adjoint. I remembered this raags adjunction that I worked on, and knew that its image was pretty easy to understand. Maybe the adjunction was comonadic and I could use this to attack the nonstandard embedding problem!

Once I knew the right question to ask, the proof itself was surprisingly simple, and I had a rough draft within a day or two⁶.

Next came the process of writing everything up in a way that doesn’t require a ton of category theoretic background. Working out the exposition for the paper was super enlightening for me⁷. Hopefully it also makes it more accessible for people new to the subject!

Ok, with the history out of the way, let’s talk about

What’s in the paper

This is pretty quick to outline, since I’m assuming a category theory background of my blog readers that I wasn’t assuming of my paper readers. That said, if it ever gets too heavy, feel free to read the first few sections of the paper, since I go into much more detail there.

The main result is implied by the categorical statement that the adjunction $A \dashv C$ is Comonadic. This says that $\mathsf{Gph}$ is equivalent to the category of $AC$-coalgebras on $\mathsf{Grp}$, and moreover that the equivalence intertwines the adjunctions $A \dashv C$ (on the $\mathsf{Gph}$ side) and the co-free/forgetful adjunction (on the $\mathsf{Grp}_{AC}$ side).

That is, we have a situation as below:

Really the equivalence $A$ sends a graph $\Gamma$ to the group $A\Gamma$ with the coalgebra structure $A \eta_\Gamma : A \Gamma \to ACA \Gamma$, so that after forgetting this structure we get $A : \mathsf{Gph} \to \mathsf{Grp}$.

How do we show that this really is an equivalence? The answer is Beck’s (Co)monadicity Theorem! It says that for any adjunction $A \dashv C$ the base of that triangle is an equivalence if and only if $A$ reflects isomorphisms and preserves equalizers of “$A$-split parallel pairs”.

In the paper we use a different version of the comonadicity theorem which is easier to check, but it boils down to the same proof.

It’s “well known” in raag circles that $A$ is conservative (this can already be found in a 1987 paper of Droms), so we need to check that $A$ preserves certain equalizers. We can do this with a slightly technical argument of combinatorics on words. The key fact is that we have a good understanding of normal forms for the elements in $A\Gamma$⁸.

The last section of this paper was a small application of this machinery. We’re able to reprove a result that we can effectively recover $\Gamma$ from the isomorphism type of $G \cong A \Gamma$, as long as we’re promised that $G$ really is a raag⁹. We also show that if we have any concrete examples of groups $G$ with $AC$-coalgebra structures, we can really do all the computations that we would want to do!

I think I like this format of blog posts putting more emphasis on the things that were on my mind when I was working on a paper, rather than the contents of the paper itself. Maybe if I write a more detailed paper I can say some informal words about what’s in it, but I think the historical perspective might help younger mathematicians see how messy research can be, and how incidental things can all blend together into a result. I just happened to be thinking about raags a few years before I happened to learn about stacks and descent, which opened the door to a result on descent for raags. Hopefully you all also like the historical perspective ^_^.

I’m also going to try to keep these paper announcement posts a bit less polished. It’s easy to get paralysis and revise posts forever and never finish them (ask me how I know…) I want to make sure that these actually get out, so I’ll try to keep them light on the revision. That shouldn’t be hard if the main point is the history!

Anyways, thanks for hanging out, all. I’m super excited to have a result, and I’ll be submitting to journals in the near future. Stay warm, and we’ll talk soon ^_^.

In the sense that $\varphi : V \to W$ is a function on the underlying vertices preserving the edge relation. So $\{v_1, v_2\} \in \Gamma$ implies $\{ \varphi v_1, \varphi v_2 \} \in \Delta$. ↩
In order to make this an honest adjunction, we need to require every vertex of our graph have a self loop. That is, we actually work with reflexive simple undirected graphs. This is a kind of technical point, since if every vertex has a self loop we don’t need to draw them! So the combinatorics remains basically unchanged. ↩
There’s reason to suspect this is a good idea, though! Not only is it category-theoretically natural, but (as far as I know) the best characterization of raag embeddings we have is due to Kim and Koberda, who show that whenever $\Gamma$ embeds into their extension graph $\Delta^e$, $A\Gamma$ embeds into $A\Delta$. The interesting relationship is that $\Delta^e$ is the full subgraph of $CA\Delta$ on the conjugates of the generators!

In fact, Kim and Koberda conjectured that the converse is true too, so that $\Delta^e$ is the right graph to consider to understand the nonstandard embeddings, but this was disproven in a 2013 paper of Casals-Ruiz, Duncan, and Kazachkov. (Many thanks to Carl-Fredrik Nyberg-Brodda for telling me about this)

This is actually good news for my paper, since it means that we’ll want to understand more of the monad graph in order to better understand the embeddings, instead of being satisfied understanding a subgraph. ↩
And there are many! I want to write a blog post about them sometime soon. ↩
I was especially drawn to this because I learned that descent is closely related to monads. Even though objectively I’m very comfortable with monads nowadays, deep inside me there’s still a teenage programmer fascinated and confused by monads while learning haskell. It’s kind of wild to think that almost 10 years ago haskell launched me on my category theory journey. It’s even wilder to think of how far I’ve come since then! ↩
I once heard some advice that once you’ve climbed a high mathematical mountain (like getting some comfort with descent theory) it’s worth looking around to see if there’s anything else to do while you’re up there.

This led me to another question about understanding which essentially algebraic theories are secretly algebraic. This amounts to understanding the image of the left (2-)adjoint from finite product categories to finite limit categories.

I was actually able to work this out as well during CT2023, but unfortunately I learned while writing it up that Pedicchio and Wood had published the same result in ‘99. I was sad, of course, but I learned a TON while working on that project, about essentially algebraic theories, locally finitely presentable categories, 2-categories, and lots more! That’s actually coming in super handy with my thesis project, since locally presentable categories are the target for factorization homology!

Again, research is a messy and winding road. ↩
For much the same reason that writing these blog posts is often good for me! Teaching is a great way to learn, and to make sure you really understand a subject. ↩
In fact, in multiple of these comonadicity proofs I’ve done, the key to checking this equalizer condition is an understanding of normal forms for the free objects!

This is yet another reason that the search for “normal form” theorems for various algebras is an incredibly useful pursuit! ↩
Determining whether or not a group $G$ is a raag, without being promised that it is one, is undecidable. ↩