Reducing to -- Permanence and Concrete Proofs
05 May 2021
There are lots of ways in which good notation can make results seem obvious. There are also lots of ways in which “illegally” manipulating expressions can give a meaningful answer at the end of the day. It turns out that in many cases our illegal manipulations are actually justified, and this is codified in the principle of Permanence of Identities! This is one place where category theory and model theory conspire in a particularly beautiful (and powerful) way.
In this post we’ll talk about how to prove statements in general rings by
proving analogous statements for polynomials with integer coefficients.
This is nice because we often have access to ~bonus tools~ when working in
I think this technique is best shown by example, so I’ll give a smattering of proofs using this idea. Hopefully by the end you’ll be convinced of its flexibility ^_^.
Example – The Binomial Theorem
Let’s start simple, and build from here. In any of my favorite rings
where we (as usual) interpret scaling by an integer as repeated addition.
Many authors prove this by saying something like “the usual proof goes through unchanged”, but we can actually be a bit cleverer.
So since we proved
Notice that it doesn’t matter what “the usual proof” is! Maybe you like to
prove the binomial theorem by looking at the taylor series of
There are lots of situations where we run this same kind of argument.
Example – Sylvester’s Identity
This one is a favorite example of Bill Dubuque on mse. I’ve seen him evangelize it so often I felt obligated to include it. It helps that it’s such a great example! He actually has a fantastic explanation of this same princple here, which I definitely recommend reading if you’re interested!
Sylvester’s identity says that
(for
Since the determinant is a polynomial in the entries of a matrix1, this is really expressing the equality of polynomials with integer coefficients!
So we have a polynomial equation
Notice that we’ve, again, used a special property of integer polynomials in
this proof! We can cancel polynomials with reckless abandon because we’re working
in a domain. Once we prove this polynomial identity, though, the result remains
true after we evaluate! In particular, even if
the specific
Whatever tools we want to use inside
As a simple exercise, can you extend this result to the case where
Example – Computing Inverses
I said we would be focusing on commutative rings in this post, and that’s true. But there’s a really cool noncommutative example that follows the same principle and is worth showing. I learned about this on mse (where else?) in a different excellent post by Bill Dubuque.
Even in noncommutative rings2, if
Now for the clever trick:
we can embed this into the ring of noncommutative power series
In
But this means in
which, under our embedding, gives us the following identity in
But then since this ring is initial
among all noncommutative rings with
Notice the extra power we got, both by using quotient rings to model some hypotheses in our theorem and by passing to formal power series. This is part of what’s so nice about embeddings! They let us prove statements in some smaller setting by using techniques from a bigger setting3. We’ve been implicitly using this idea throughout the post, but I wanted to make it explicit at least once. After all, once we’re aware of it, we can intentionally use it in other settings as well4.
A Sentimental Interlude – Seven Trees in One
The first paper I ever read5 opens with the following beautiful passage:
Consider the following absurd argument concerning planar, binary, rooted, unlabelled trees. Every such tree is either the trivial tree or consists of a pair of trees joined together at the root, so the set
of trees is isomorphic to . Pretend that is a complex number and solve the quadratic to find that is a primitive sixth root of unity and so . Deduce that is a one-element set; realize immediately that this is wrong. Notice that is, however, not obviously wrong, and conclude that it is therefore right. In other words, conclude that there is a bijection built up out of copies of the original bijection : a tree is the same as seven trees. The point of this paper is to show that ‘nonsense proofs’ of this kind are, actually, valid.
You can see that we’ve “proven” a claim about trees by proving a polynomial
implication in
In the paper, the authors show that homomorphisms of certain polynomial
implications are also preserved6 for rigs (that is, rings without negatives).
Here
Since the objects of a category (up to isomorphism) with products and coproducts
forms a rig, this tells us there is a hom from
Since this polynomial implication is of the variety that’s preserved, and in
the category of datatypes we have
This follows the spirit of what we’re doing in this post, but is a bit more detailed because general homomorphisms don’t preserve all implications. A model theorist might jump straight to elementary embeddings, but that’s far too restrictive for our purposes here. The authors of the above paper do a great job finding (only slightly technical) conditions which make this argument go through. I’ve included it both to show what’s possible when you extend the ideas in this post, and also because it was my first paper and I feel a certain amount of love for it.
Example – Mutliplicative Determinants
Say we want to prove that
These arguments don’t go through for a general ring
Of course, once we remember
We first note that the entries of
But since the determinant is a polynomial in the entries, we see the equation
is actually a polynomial equation (albeit a complicated one) in the
But
This is a fun and powerful technique, and it’s really useful in a lot of situations! Here are some quick exercises for you to play around with, but I encourage you to look for your own as well!
Pick your favorite two facts about matrix algebra and see if they’re true over arbitrary rings. If you stick to facts about determinants, matrix multiplication, row operations, etc. you should be able to choose pretty much anything!
Show that the quadratic formula always works, unless it obviously doesn’t.
As a hint, you’ll want to work in the ring
Show that
(the ring of continuous functions on ) (the ring of continuous functions on (the ring of smooth functions on ) (the ring of entire functions on )
so if we prove an polynomial identity using real or complex analysis it
is true in
As another powerful tool in your arsenal, say you want to prove some
polynomial identity in one variable:
-
Show that there’s some finite number
(depending on and ) so that if and only if it’s true for many choices of . -
Show that that
is a polynomial identity in . Verify it by hand for choices of . Argue that this verification proves this identity holds in all rings where you can divide by .
It’s wild to me that some finite verification like this is enough to prove an identity (even a simple one like this) for all rings. If you want to see more of this proof technique you should check out Petkovšek, Wilf, and Zeilberger’s book A = B (section 1.4, to start).
Does this technique work for polynomials with more than one variable?
As one last aside, I’m really interested in figuring out when we can do this with power series. Over a year ago now I asked a question about this, though I accepted the answer too quickly (and in hindsight it isn’t really the kind of answer I was looking for).
With the extra year to think about it, though, I think I have a better idea how to make it work. I meant this to be a kind of prequel where we work in the simpler setting of polynomials to get practice before we jump into proving identities with power series.
-
This is the one place where it’s helpful to remember that horrible definiton of the determinant in terms of a sum over permutations of products of the entries. It’s obviously a polynomial (albeit a gross one). ↩
-
They still have
, though. We aren’t animals. ↩ -
If you like the model theoretic language, embeddings reflect truth. ↩
-
We really do use this kind of machinery ALL the time, though. Whenver we use complex numbers to prove things about
, , etc. for instance.More excitingly, this is part of the power of the yoneda lemma – We embed any (small) category into a topos of presheaves. Then if we can prove some fact (which doesn’t refer to any topos-y things) using this high powered machinery, it reflects down to our original category!
This is also why model theorists care so much about elementary embeddings, which I’ve given a quick introduction to here. The tl;dr is that embeddings don’t need to preserve or reflect the truth of formulas involving quantifiers. Elementary embeddings, on the other hand, do both. ↩
-
“Objects of Categories as Complex Numbers” by Fiore and Leinster. See here for an arxiv link. It’s really excellent, and readable too! ↩
-
In fact, we do this by quotienting by the assumptions of our impliction, just like we did with the
example. So the relevant rig for binary trees is .The issue is that for some equations
(and also ) might not embed into !For instance, when we quotient by some non-monic identity, we get something that isn’t finitely generated as a
-module. In particular, the powers of some root we added won’t satisfy any integer linear combinations. This is a problem since in the subfield generated by the roots of some integer polynomial will be finite dimensional over , and thus powers of any root will satisfy some integer linear combination!Since we no longer have an embedding, truth is no longer reflected! The core of Fiore and Leinster’s paper is giving conditions where this doesn’t happen. ↩
-
Knapp also happens to spend some time talking about Permanence of Identities (in ch. V.2), which is why I chose this book in particular to mention. It turns out the exact example we’re about to work out is also in Artin’s “Algebra” (ch. 12.3) alongside a discussion of Permanence of Identities! So if you want to see some other perspectives on this topic, you can read about it there too. ↩