
An Explicit Example of the Proof of the Nullstellensatz

11 Jan 2022 - Tags: sage

I’m in an algebraic geometry class right now, and a friend was struggling conceptually with the proof of the strong nullstellensatz. I thought it might be helpful to see a concrete example of the idea, since the proof is actually quite constructive! Which brings us to this post:

Formally, we’re going to work over an algebraically closed field $k$, assume the weak nullstellensatz, and use it to show the strong nullstellensatz. That is, we’ll assume

The Weak Nullstellensatz

$V(\mathfrak{a}) = \emptyset$ if and only if $\mathfrak{a} = (1)$

Purely algebraically, this says:

“The only way $f_1, f_2, \ldots, f_r$ can fail to have a common zero is if $(f_1, f_2, f_3, \ldots, f_r) = (1)$.”

and we’ll show

The Strong Nullstellensatz

$I(V(\mathfrak{a})) = \sqrt{\mathfrak{a}}$

Again, purely algebraically, this says:

“The only way $g$ can vanish at every common zero of $f_1, f_2, \ldots, f_r$ is if (for some $n$) $g^n$ is a linear combination of the $f_i$.”

Both of these are theorems of the form “the obvious issue is the only one”. Obviously if $1 = p_1 f_1 + p_2 f_2 + \ldots + p_r f_r$, then the $f_i$ cannot all be $0$ simultaneously. Indeed, if \(f_i(x^*) = 0\) for all $i$, then evaluating both sides of the above at \(x^*\) gives $1 = 0$, which is a problem. The weak nullstellensatz says that this is the only reason a family of polynomials won’t have a common root.

Similarly, it’s obvious that if $g^n = p_1 f_1 + \ldots + p_r f_r$, then at any common zero of the $f_i$, $g = 0$ too. Again, if we evaluate both sides at $x^*$ we find \(g(x^*)^n = 0\), and so \(g(x^*) = 0\) too (since a field has no nontrivial nilpotents). The strong nullstellensatz says that this is the only way for $g$ to vanish at every common zero of the $f_i$.


Now, it turns out the weak nullstellensatz has computational content. That is, if $f_1, \ldots, f_r$ don’t have a common zero, there’s a computer program¹ that will actually find the $p_i$ so that $1 = p_1 f_1 + \ldots + p_r f_r$.

For instance, let’s take a simple example:

\[f_1 = xy - 1 \quad \quad f_2 = x+y \quad \quad f_3 = xy^3 \quad \quad f_4 = yx^3\]

First, let’s check that these polynomials really don’t have any points in common:
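One quick way to do this is to ask Sage’s symbolic solver (just a sketch of the check, but it gets the point across):

```
x, y = var('x y')
solve([x*y - 1 == 0, x + y == 0, x*y^3 == 0, y*x^3 == 0], [x, y])
# [] -- no common solutions, even over the complex numbers
```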

Next we can see that they generate the ideal $(1)$.
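In Sage this is a one-line check once we set up the polynomial ring (here I’ll work over $\mathbb{Q}$, which is all the ideal arithmetic needs):

```
R.<x,y> = PolynomialRing(QQ)
f1, f2, f3, f4 = x*y - 1, x + y, x*y^3, y*x^3

I = R.ideal([f1, f2, f3, f4])
I == R.ideal(1)     # True
I.groebner_basis()  # [1]
```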

Of course, this means we should be able to write $1$ as a linear combination of the $f_i$:
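Sage’s `lift` method does exactly this for us. The precise polynomials it returns depend on the Gröbner basis it computes along the way, so treat this as a sketch (reusing `R` and `I` from above):

```
coeffs = R(1).lift(I)   # [p1, p2, p3, p4] with 1 = p1*f1 + p2*f2 + p3*f3 + p4*f4
coeffs
```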

and indeed, these coefficients really do combine to give $1$.
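We can double check this directly, reusing `coeffs` and `I` from the sketch above:

```
sum(p*f for p, f in zip(coeffs, I.gens())) == 1   # True
```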


So now what about the strong nullstellensatz?

Let’s take $g = yx + x + 1$, which vanishes at every point of the variety defined by $x^2 + 2x + 1$ and $y$ (do you see why?).

Then we expect $g^n \in (x^2 + 2x + 1, y)$ for some $n$. Writing $f_1 = x^2 + 2x + 1$ and $f_2 = y$, we’ll get there by the Rabinowitsch trick:

We’ll add a variable $z$ to the mix, and notice that $x^2 + 2x + 1$, $y$, and $(yx + x + 1)z - 1$ don’t have any common zeroes.

Indeed, if $x^2 + 2x + 1$ or $y$ is nonzero at some point, then we’re done. But if they’re both zero, then we know $g = yx + x + 1$ is zero as well. Then $(yx + x + 1)z - 1 = gz - 1 = 0z - 1 = -1 \neq 0$.

But then, by the weak nullstellensatz, that means these three polynomials must generate the ideal $(1)$ in $k[x,y,z]$!

Indeed,
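Here’s the computation in Sage. (Again a sketch: the exact coefficients Sage prints depend on its Gröbner basis computation, but the ones quoted below are one valid answer.)

```
R.<x,y,z> = PolynomialRing(QQ)
f1 = x^2 + 2*x + 1
f2 = y
g  = y*x + x + 1

J = R.ideal([f1, f2, g*z - 1])
J == R.ideal(1)    # True -- they really do generate (1)

# p1, p2, p3 with 1 = p1*f1 + p2*f2 + p3*(g*z - 1)
p1, p2, p3 = R(1).lift(J)
sum(p*h for p, h in zip([p1, p2, p3], J.gens())) == 1   # True
```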

So we know that

\[1 = p_1 f_1 + p_2 f_2 + p_3 (gz - 1)\]

or

\[1 = z^2 (x^2 + 2x + 1) + (x^2 z^2 + xz^2 + xz) y + (-xz - z - 1)(gz - 1)\]

Now for the slick trick! We’re working in an ideal containing $zg - 1$, which means that $z = \frac{1}{g}$ in all of our computations²! So let’s take this expression and plug in $z = \frac{1}{g}$ to get

\[1 = \frac{1}{g^2} f_1 + \left ( \frac{x^2}{g^2} + \frac{x}{g^2} + \frac{x}{g} \right ) f_2 + \left ( -\frac{x}{g} - \frac{1}{g} - 1 \right ) 0\]

Of course, we can clear the denominators by multiplying through by $g^2$ to see

\[g^2 = f_1 + (x^2 + x + xg) f_2 \in (f_1, f_2)\]

So we found that $g^n \in (f_1, f_2)$ for $n = 2$. As desired.
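If you’d like to double check that last identity, Sage is happy to oblige (reusing `f1`, `f2`, and `g` from the session above):

```
g^2 == f1 + (x^2 + x + x*g)*f2   # True
```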


It turns out that this is exactly how the proof goes in general!

Say you give me polynomials $f_1, \ldots, f_r, g \in k[x_1, \ldots, x_m]$ so that $g$ vanishes whenever all the $f_i$ do.

Then we look at the ideal (in $k[x_1, \ldots x_m, z]$)

\[(f_1, \ldots, f_r, zg - 1)\]

which must equal $(1)$ by the weak nullstellensatz.

Then a computation, which Sage will happily do for us, gives us polynomials $p_1, \ldots, p_{r+1} \in k[x_1, \ldots, x_m, z]$ so that

\[1 = p_1 f_1 + \ldots + p_r f_r + p_{r+1} (zg - 1)\]

Then we plug in $\frac{1}{g}$ for $z$ to get a new expression

\[1 = p_1 \left ( \vec{x}, \frac{1}{g} \right ) f_1(\vec{x}) + \ldots + p_r \left ( \vec{x}, \frac{1}{g} \right ) f_r(\vec{x})\]

This is now an identity of rational functions, with powers of $g$ in the denominators. So we multiply both sides by a large enough power $g^n$ to clear denominators, and we find

\[g^n = g^n p_1 \left ( \vec{x}, \frac{1}{g} \right ) f_1 + \ldots + g^n p_r \left ( \vec{x}, \frac{1}{g} \right ) f_r\]

Notice that each $g^n p_i \left ( \vec{x}, \frac{1}{g} \right )$ is a genuine polynomial in $x_1, \ldots, x_m$: as long as $n$ is at least the highest power of $z$ showing up in any $p_i$, multiplying by $g^n$ clears every $\frac{1}{g}$ left over from the substitution. This means we’ve shown $g^n$ is a linear combination of the $f_i$, so $g^n \in (f_1, \ldots, f_r)$, as desired.
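In fact, the whole recipe fits in a few lines of Sage. Here’s a sketch of it as a function (the name `nullstellensatz_certificate` and its internals are mine, just to illustrate the proof; it assumes $g \neq 0$ and that none of your variables is already called `z`):

```
def nullstellensatz_certificate(fs, g):
    """
    Given fs = [f1, ..., fr] and g in k[x1, ..., xm], where g vanishes at
    every common zero of the fi, return (n, qs) with g^n = sum(qi * fi).
    """
    R = g.parent()

    # Move to k[x1, ..., xm, z] for the Rabinowitsch trick
    S = PolynomialRing(R.base_ring(), list(R.variable_names()) + ['z'])
    z = S.gens()[-1]
    fs_S = [S(f) for f in fs]
    g_S  = S(g)

    # By the weak nullstellensatz this ideal is (1), so we can lift 1
    J  = S.ideal(fs_S + [z*g_S - 1])
    ps = S(1).lift(J)    # 1 = p1*f1 + ... + pr*fr + p_{r+1}*(z*g - 1)

    # n needs to be at least the highest power of z among the pi, so that
    # multiplying by g^n clears every 1/g after substituting z = 1/g
    n = max([p.degree(z) for p in ps[:-1]] + [0])

    # g^n * pi(x, 1/g): replace each z^k in pi by g^(n-k)
    qs = []
    for p in ps[:-1]:
        q = sum(p.coefficient({z: k}) * g_S^(n - k) for k in range(p.degree(z) + 1))
        qs.append(R(q))  # q no longer involves z, so it lives in k[x1, ..., xm]

    return n, qs
```

For instance, on the example from earlier:

```
R.<x,y> = PolynomialRing(QQ)
n, qs = nullstellensatz_certificate([x^2 + 2*x + 1, y], y*x + x + 1)
(y*x + x + 1)^n == qs[0]*(x^2 + 2*x + 1) + qs[1]*y   # True
```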


Another quick post today! Hopefully other people find this helpful too ^_^.

I do have a few bigger ones in the pipeline, but I won’t say exactly what. I’ve learned that saying what post you’re planning to write next is a guaranteed way to not actually write it, haha.

See you soon!


  1. If you’re interested in this, you’ll want to read about Gröbner bases. The actual algorithm for computing with these is Buchberger’s algorithm.

    I really liked Adams and Loustaunau’s An Introduction to Gröbner Bases, which is a very polite introduction. I’ve heard great things about Cox, Little, and O’Shea’s Ideals, Varieties, and Algorithms: An Introduction to Computational Algebraic Geometry and Commutative Algebra, though I haven’t gotten around to reading it myself. 

  2. There’s a lot to be said about precisely why this trick works. It’s really because we’re looking at the homomorphism

    \[k[x,y,z] \to k(x,y)\]

    sending $x \mapsto x$, $y \mapsto y$, and $z \mapsto \frac{1}{g}$.

    This map isn’t injective (its kernel contains $zg - 1$), but its restriction to $k[x,y]$ is the usual inclusion $k[x,y] \hookrightarrow k(x,y)$, which is. We solve our problem in $k(x,y)$, but end up with an identity between honest polynomials in $x$ and $y$, which then gets reflected back along this embedding to an identity in $k[x,y]$.

    For more information about this technique of “permanence of identities”, you can see this blog post of mine.