Using Ultrapowers to Solve Problems
05 Dec 2021
An Ultraproduct is a construction from mathematical logic that lets us prove (first order) statements about some family of structures by studying a single object which somehow captures the behavior of the family. In this post we’ll talk about ultraproducts – what are they, what do they do, and most importantly how can we actually use them to solve problems?
First, what are ultraproducts anyways?
Fix a sequence of groups (say)
- Elements of
are (equivalence classes of) tuples , where . thinks that some first-order property is true if and only if all but finitely many of the s do1. For example, if all but finitely many of the are abelian, then the ultraproduct will be abelian too, since abelianness is defined by the first order sentence .
One special case of interest is where all the
It might seem strange that this construction is useful. After all, by the
bullet point above, we know that
The answer is caught up in the “first-order” qualifier! It turns out that there
can be lots of differences between
To know which properties we can transfer back and forth between
Ok, so let’s quickly write down what an ultraproduct is. Even though I’m assuming some familiarity with logic, it’s nice to have a refresher4. For concreteness, we’ll work with countable ultraproducts, since they’re a bit simpler to think about and they’re the only ones I’ve ever personally needed.
Fix a first order language
- No finite set is large
- If
is large, then every is also large - If
and are both large, then so is - For every set
, either or is large
Then we can look at the Ultraproduct5
where we say that
The critical result in this area is Łoś’s Theorem (pronounced roughly like “Wash’s Theorem”) which says that:
That is, truth in the ultraproduct is exactly truth in “almost all” of the
As a fairly quick exercise, you should prove this if you have some experience with model theory.
The main character of today’s post is the ultrapower, but I can’t help but say a few words about why ultraproducts are useful. I’ll contain myself, though, and just give the following idea:
If you have a sequence of objects you want to study, say the groups
As a concrete example, consider the family
Then the ultraproduct
- cardinality continuum
- algebraically closed
- characteristic
Thus it’s (noncanonically!) isomorphic to
Since any question about polynomials and their roots is expressible in the
first order language of fields, this gives one formalization of the
informal “Lefschetz Principle”, which says that doing algebraic geometry
over
So then, let’s move on to ultrapowers!
If
sending
Moreover, since
Of course, we, from the outside looking in, can tell that
The canonical example of this phenomenon is nonstandard analysis, where we study the “hyperreal numbers”
which have lots of nice properties7. For instance, the hyperreal number
is an infinitesimal in the sense that
for any real number .
Then nonstandard analysis uses the extra elements given to us by the ultrapower in order to give us neat and intuitive definitions like the following:
If
Not only do these new definitions give us a new way of thinking9 about classical analysis, they’re even useful for letting us prove new things! Nowhere is this clearer than on Terry Tao’s blog, where he uses nonstandard analysis to great effect!
The trick behind ultrapowers, then, is to use our new
~bonus elements~ in
Following the example of prime fields, we can also use ultrapowers to kill
certain “bad properties” of elements. If we have a sequence of elements of
We can also use ultrapowers to kill “bad properties” of
More speculatively, maybe there’s
a proof in
As a last aside, I’ll leave you with an old example from the class where I met my undergraduate advisor. I’ll leave it as an exercise that might be somewhat tricky depending on how much nonstandard analysis you’ve seen:
We work inside
Say you were able to prove that there is a nonstandard twin prime.
That is, a pair of numbers
Then the twin prime conjecture is true10.
We’ve “simplified” the search for infinitely many twin primes into the search for one nonstandard twin prime!
Of course, I have no idea how one would go about finding a nonstandard twin prime! But this is a cute problem nonetheless ^_^.
-
Technically this is only true of nonprincipal ultraproducts. ↩
-
Pun intended ↩
-
Let
be a collection of “primitive symbols” which are necessary for talking about a given object of study. For example:- If we’re interested in groups, we might take
- for ordered rings we might take
- If we’re interested in graphs, we might take
(where says that and are adjacent)
Then the First Order Language associated to
(often written (sometimes or if we’re working with multiple different logics at once) is the collection of formulas we can build using- primitive symbols from
- connectives:
, , , , , etc. - variables like
, , etc. - quantifiers:
,
⚠ our quantifiers can only quantify over elements of our structure! This is the “first” in “first order”.
Now we see that there’s no way to write down “
is cyclic” in a first order way. The obvious ideadoesn’t work because we’re quantifying over
, which is not our structure. Similarly, we can’t writesince that is an infinite string of symbols. We only allow finite conjunctions or disjunctions.
Lastly, given a formula
and a Model (that is, a set equipped with constants, operations, and relations for each symbol in ) we write to mean that, when is interpreted using the operations of , it becomes true.Here are some nice exercises you might try to get familiar with first order logic:
For
the language of graphs, for each exercise below, write a formula so that a graph if and only if it satisfies the criterion in the exercise: is complete. contains a triangle.- For any (fixed) finite graph
, express contains a copy of as a subgraph. What about as a full subgraph? Can you make if and only if ?
For
the language of groups: . What about ? is -step nilpotent (think about iterated commutators)- For any (fixed) finite group
, express . What about ? (Think about the (finite!) multiplication table for ).
- If we’re interested in groups, we might take
-
Also, not all model theory courses include ultraproducts! For instance, I know the graduate model theory classes at CMU, (for some inexplicable reason), don’t talk about them. ↩
-
This seems to depend only very mildly on the choice of ultraproduct, and it’s reasonable to ask how the choice of ultraproduct effects the resulting structure. This tends to be quite subtle, see here, say, and for most use cases we don’t worry too much about which ultrafilter to take (even though it does matter. See here)
That said, there are certain applications where we really do want to choose an ultrafilter with special properties, say regularity. You can read about what kind of bonus information we get when we take ultrapowers over regular ultrafilters here, say. ↩
-
In order,
- is a fairly easy computation, see here
- is because we can write down sentences
which say that every polynomial of degree has a root. - is because for every
, cofinitely many think that . So the ultraproduct is a field, but cannot be characteristic for any .
-
Properties that you can (and should!) read more about in the fantastic Lectures on the Hyperreals by Goldblatt.
I have particularly fond memories of this book, because it was one of the first math textbooks that I read by myself, rather than having to read it for a class. ↩
-
It’s important here that
. If we allow , we actually get uniform continuity!This may appear counterintuitive at first, but does make some intuitive sense after a while. Again, I’ll point you to Goldblatt’s book for more. ↩
-
An old way of thinking? This is much closer to how analysis was thought about before it was placed on a truly rigorous footing through limits. ↩
-
As a hint, first show that a nonstandard natural number must be bigger than every standard natural number. For instance, the standard number
satisfies the formulaso the only numbers less than
are standard. Obviously we can do this for any standard number, and the claim follows.Next, show that, since
is nonstandard, we must haveetc. in
.Do you see why (after transferring these theorems back to
) this means there must be infinitely many twin primes? ↩