A Proof that there's No Constructive Proof of the Intermediate Value Theorem

10 Jun 2025 - Tags: topos-theory , sage

The other day my friend Lucas Salim was asking me some questions about categorical logic and constructive math, and he mentioned he’d never seen a proof that there’s no constructive proof of the intermediate value theorem before. I showed him the usual counterexample, and since my recent blog post about choice was so quick to write I decided to quickly write up a post about this too, since I remember being confused by it back when I was first learning it.

The key fact is Soundness and Completeness of the topos semantics of constructive logic. This says that there is a way of interpreting the usual syntax of mathematics into a topos in such a way that

(Soundness) If you can constructively prove a statement, then its interpretation in every topos is true
(Completeness) If a statement is interpreted as true in every (elementary¹) topos, then there must exist a constructive proof

For a proof, see Chapter II of Lambek and Scott’s Introduction to Higher Order Categorical Logic or Section D4.3 of The Elephant.

This means that as long as we’re careful to avoid choice and excluded middle, anything we prove will be true when interpreted in any topos we like! Then there’s a mechanical procedure that lets us convert this interpretation into a corresponding statement in the “real world²”, and this gives us lots of “theorems for free” for each individual constructive theorem! A nice case study is given by the Weierstrass Approximation Theorem, which I gave a talk on years ago³. Since this theorem is constructively provable⁴…

by interpreting it in the effective topos we learn there’s a computer program $Approx$ which takes as input a function $f : [0, 1] \to R$ ⁵ and an $ϵ > 0$ and outputs the coefficients of a polynomial approximating $f$ .
by interpreting it in a sheaf topos $Sh (Θ)$ we learn that for any continuous family of functions $f_{θ} (x) : Θ \times [0, 1] \to R$ , there’s locally⁶ a polynomial $p_{θ} (x)$ whose coefficients are continuous functions of $θ$ which approximates each $f_{θ} (x)$
etc.

If you’re interested in learning how to externalize a statement in a topos to a statement about the real world, I highly recommend Ingo Blechschmidt’s excellent paper Exploring mathematical objects from custom-tailored mathematical universes which gives a high level overview of the topic, while still giving enough details to let you externalize a few statements of your own!

But where were we? The usual proofs of the intermediate value theorem aren’t constructive. See, for instance, Bauer’s Five Stages of Accepting Constructive Mathematics or Section 1 of Taylor’s A Lambda Calculus for Real Analysis for a discussion of how some common proofs fail (as well as great lists of constructively provable alternatives). Since the usual proofs seem to fail, we might guess that IVT is not provable constructively… but how could we prove this?

Say, towards a contradiction, that there were a constructive proof of the intermediate value theorem. Then it would be true in every topos, and thus its various externalizations would all be true in the real world. So to show that there isn’t a constructive proof, all we have to do is find a topos which doesn’t think it’s true!

Following many who came before me⁷, we’re going to use the topos of sheaves on $(- 1, 1)$ . It’s been a minute since we’ve externalized a statement together, so let’s do it now!

In full symbolic glory⁸, the IVT says

\forall f : R \to R . \forall a, b \in R . (a < b \land f (a) < 0 \land f (b) > 0) \to (\exists x \in R . a < x < b \land f (x) = 0)

So now using the forcing language for $Sh ((- 1, 1))$ (check out Theorem $1$ in Chapter VI.7 of Mac Lane and Moerdijk’s Sheaves in Geometry and Logic if you’re not sure what this means), we compute:

1 ⊩ \forall f : R \to R . \forall a, b \in R . (a < b \land f (a) < 0 \land f (b) > 0) \to (\exists x \in R . a < x < b \land f (x) = 0)

We cash out the universal quantifiers to get “for every open $U \subseteq (- 1, 1)$ , and for every continuous $f : U \times R \to R$ , $a : U \to R$ , $b : U \to R$ , we have…”

U ⊩ (a < b \land f (a) < 0 \land f (b) > 0) \to (\exists x \in R . a < x < b \land f (x) = 0)

Next, cashing out the implication gives “for every open $U \subseteq (- 1, 1)$ , and for every continuous $f : U \times R \to R$ , $a : U \to R$ , $b : U \to R$ , so that for all $t \in U$ we know $a (t) < b (t)$ , $f (t, a (t)) < 0$ , and $f (t, b (t)) > 0$ , we have…”

U ⊩ \exists x \in R . a < x < b \land f (x) = 0

Finally, cashing out the existential quantifer and the stuff inside it we get the external statement:

The IVT is true inside $Sh ((- 1, 1))$ if and only if the following is true externally:

For every open $U \subseteq (- 1, 1)$
for every continuous $f : U \times R \to R$ , $a : U \to R$ , $b : U \to R$
so that for all $t \in U$ we know $a (t) < b (t)$ , $f (t, a (t)) < 0$ , and $f (t, b (t)) > 0$
there is an open cover ${U_{α}}$ covering $U$ with continuous functions $x_{α} : U_{α} \to R$ so that
for all $t \in U_{α}$ , $a (t) < x_{α} (t) < b (t)$ and $f (t, x_{α} (t)) = 0$ .

What a mouthful!

Of course, we’re trying to prove this fails, so all we have to do is find an open set $U$ and functions $f$ , $a$ , and $b$ satisfying the assumptions so that the conclusion fails. We’ll choose $U = (- 1, 1)$ to be the whole set, $a (t) = - 2$ and $b (t) = 2$ to be constant functions, and $f (t, x) : (- 1, 1) \times R \to R$ to be

f (t, x) = max (min (t + x + 1, t), t + x - 1)

Then we see that, indeed, $f (t, a) < 0$ and $f (t, b) > 0$ for all $t \in (- 1, 1)$ , so to prove the IVT fails in this topos we just need to show there’s no open cover on which $x (t)$ with $f (t, x (t)) = 0$ can be chosen continuously.

The idea is that no matter how hard we try, $x (t)$ cannot be continuous in a neighborhood of $t = 0$ . Indeed, here’s an animation showing how $x (t)$ changes as we change $t$ :

xxxxxxxxxx
 
x,t = var('x,t')
​
f(t,x) = max_symbolic(min_symbolic(t+x+1, t), t+x-1)
​
def mkFrame(t):
    graph = plot(lambda x: f(t,x), (x,-2,2), ymin=-2, ymax=2)
    zero = point((1-t if t<0 else -1-t, 0), size=40, color="orange")
    txt = text("t={}".format(t), (-1.5,1.5))
    return graph+zero+txt
​
frames = [mkFrame(t/10) for t in ([-5..5] + [-s for s in [-4..4]])]
a = animate(frames)
a.show()

Help | Powered by SageMath

Messages

You can see that when $t = 0$ the root $x (t)$ jumps between $\pm 1$ !

Indeed, if we plot $x (t)$ we get:

xxxxxxxxxx
 
x,t = var('x,t')
​
f(t,x) = max_symbolic(min_symbolic(t+x+1, t), t+x-1)
​
p = implicit_plot(f(t,x), (t,-1,1), (x,-2,2))
p.axes_labels(['$t$', '$x$'])
p.show()

Help | Powered by SageMath

Messages

and it’s obvious that this is not the graph of a continuous function in any neighborhood of $t = 0$ .

As long as we’re showing pretty graphics, you can also visualize this whole function $f$ as a surface over the strip $(- 1, 1) \times R$ . Then choosing a $t$ amounts to choosing a “slice” of the surface, and we can see that where that slice intersects the $(t, x)$ -plane jumps suddenly as we cross $t = 0$ . In this example the axes are labeled $x$ and $y$ rather than $x$ and $t$ :

So where did we start, and where did we end? If the IVT were constructively provable, it would be true inside $Sh ((- 1, 1))$ and thus for our $f (t, x)$ we could find an open cover on which the zero $x (t)$ varies continuously in $t$ . But this can’t possibly happen in a neighborhood of $t = 0$ , so we learn there is no constructive proof!

Buuuuut, all is not lost! Usually classical theorems do have constructive analogues, either by adding new assumptions, weakening the conclusion, or by finding a different statement of the theorem that’s more positive. Andrej Bauer’s paper Five Stages of Accepting Constructive Mathematics lists many possibilities.

For instance, one way to weaken the conclusion is to prove that for any $ϵ$ you like, there’s an $x$ with $| f (x) | < ϵ$ . In our example, if we plot those $x$ so that $| f (t, x) | < ϵ$ we get

xxxxxxxxxx
 
x,t = var('x,t')
​
f(t,x) = max_symbolic(min_symbolic(t+x+1, t), t+x-1)
​
p = region_plot(abs(f(t,x)) <= 0.1, (t,-1,1), (x,-2,2))
p.axes_labels(['$t$', '$x$'])
p.show()

Help | Powered by SageMath

Messages

and it’s easy to fit the graph of a continuous selection function $x (t)$ inside this thickened region.

Another approach is to recognize that the problem comes from $f$ “hovering” at $0$ when $t = 0$ . If we forbid this hovering, for instance by assuming $f$ is strictly monotone, then we can constructively prove the IVT (See Bauer’s Five Stages paper again).

There’s yet another version, coming from Abstract Stone Duality, where we say that whenever $f (a) < 0 < f (b)$ , the compact subspace $Z_{f} = {x \in [a, b] ∣ f (x) = 0}$ is occupied (Cor 13.11 in A Lambda Calculus for Real Analysis). This is a condition that’s weaker than inhabited but stronger than nonempty, which you can read about in Section 8 of the same paper. I don’t understand this condition very well, because I haven’t spent as much time thinking about ASD as I would like. Hopefully sometime soon I’ll find some time to work through some examples!

Edit (July 7, 2025):

Jim Kingdon recently started a thread in the metamath github talking about this, and told me about it over mastodon. In this thread, Mario Carneiro gave a slick proof that the IVT implies (analytic) LLPO, which should feel familiar. Recall that Analytic LLPO is the statement that $\forall t \in R . t \geq 0 \lor t \leq 0$ .

$⌜$ Fix $t \in R$ , and again consider the function $f_{t} (x) : R \to R$ by

f_{t} (x) = max (min (t + x + t, t), t + x - 1)

Then by the IVT, $f_{t} (x)$ has a zero $f_{t} (z) = 0$ , and constructively we know that $z < \frac{1}{2} \lor z > \frac{- 1}{2}$ . But now if $z < \frac{1}{2}$ then $t \leq 0$ and if $z > \frac{- 1}{2}$ then $t \geq 0$ , giving the claim. $⌟$

Mario also mentions that LLPO and Countable Choice is enough to prove IVT, so that in any settings where CC holds LLPO and IVT are equivalent⁹. Indeed

$⌜$ Say $f (a) < 0$ and $f (b) > 0$ .

For every $r \in Q$ , decide if $f (a + r (b - a))$ is $\geq 0$ or $\leq 0$ . For each $r$ this is an application of LLPO, but there might be two options (if $f$ outputs a value that is both $\geq 0$ and $\leq 0$ ), so we have to to choose one of these for each of our countably many $r \in Q$ . So we’ve used both LLPO and CC to do this.

Now we can do binary search in $[a, b]$ to find a zero. The midpoints $m$ we check will always be of the form $a + r (b - a)$ for some $r \in Q$ , so to split our interval in half we can use our pre-made choices – recurring into the upper half of the interval if $f (m) \leq 0$ or the lower half if $f (m) \geq 0$ .

Then the sequence of midpoints for our intervals is a cauchy sequence whose modulus of convergence we can compute, since after the first $n$ bisections all the future midpoints lie in an interval of width $2^{- n} (b - a)$ . Since the dedekind reals are complete for these kinds of explicit cauchy sequences (see here, for instance), these converge to a real number $z$ . As usual, continuity of $f$ implies that $f (z) = 0$ . $⌟$

Ok, thanks for reading, all! It’s nice to get a few quick posts up while I’m working on some longer stuff. I’m still thinking a lot about a cool circle of ideas involving Fukaya Categories, Skein Theory and T(Q)FTs, and Hall Algebras, and I’m slowly making progress on writing posts about all these fun things.

Now, though, I have to go run a review session for a calculus class, haha. I’ll try to resist telling them about the fascinating subtleties that show up when you try to do everything constructively.

Stay safe, and we’ll talk soon 💖

Usually on this blog when I talk about topoi I mean grothendieck topoi, but for this completeness result we really do need to allow more general elementary topoi with NNO. Indeed there are statements true in all grothendieck topoi that are not constructively provable (since they fail in some elementary topos). See here for a partial list.

I would actually love to know if there’s a reference for what one has to add to IHOL to get something sound and complete for grothendieck topoi… I spent some time looking, but the only thing I found was Topological Completeness for Higher-Order Logic by Awodey and Butz, but it seems like they use $1 + 1$ in place of $Ω$ , so this isn’t the usual interpretation of logic in a grothendieck topos (which is also where the classical completeness comes from). ↩
Maybe “base topos” would be less philosophically charged ↩
I was really fumbling around with topos theory back then, haha. I’m much more confident now, and in the last few years I’ve just worked through a lot more examples and done more computations and read more papers and generally just learned a lot. Rereading that post was surprisingly… nostalgic isn’t the right word… but it’s fun to see how much I’ve grown! ↩
The usual proof with bernstein polynomials works if we’re careful to check some constructively relevant details. I’ll copy it here in case the wikipedia article changes someday:

Fix $ϵ > 0$ .

We write $b_{k, n} (x) = (\binom{n}{k}) x^{k} (1 - x)^{n - k}$ , and note that:
1. $\sum_{k = 0}^{n} b_{k, n} = 1$
2. $\sum_{k = 0}^{n} {(x - \frac{k}{n})}^{2} b_{k, n} = \frac{x (1 - x)}{n}$
These are all provable by just expanding the left hand side, which is constructive.

We also fix a $δ$ so that whenever $| x - y | < δ$ we have $| f (x) - f (y) | < ϵ$ . This is because, constructively, every continuous function on a compact sublocale of $R$ is uniformly continuous. Note that here we crucially need to be working with locales! (see, eg, Thm 10.7 in Taylor’s A Lambda Calculus for Real Analysis)

Lastly, we fix $M$ an upper bound for $| f |$ . This is possible since $[0, 1]$ is compact, overt, and inhabited (see Rmk 10.4 in Bauer and Taylor’s The Dedekind Reals in Abstract Stone Duality) thus the continuous $| f |$ admits a maximum (Thm 12.9 in A Lambda Calculus for Real Analysis).

Now let $B_{n} f = \sum_{k = 0}^{n} f (k / n) b_{k, n}$ . We compute:
$\begin{aligned} | f (x) - (B_{n} f) (x) | & \overset{(a)}{=} | \sum_{k = 0}^{n} (f (x) - f (k / n)) b_{k, n} (x) | \\ \leq \sum_{k = 0}^{n} | f (x) - f (k / n) | b_{k, n} (x) \\ \overset{(b)}{\leq} \sum_{k s.t. | x - k / n | < δ} | f (x) - f (k / n) | b_{k, n} (x) + \sum_{k s.t. | x - k / n | > \frac{1}{2} δ} | f (x) - f (k / n) | b_{k, n} (x) \end{aligned}$
In step (a) we use (1), and in step (b) we use the fact that $\forall x \in R . x < δ \lor x > \frac{δ}{2}$ is constructively true (since the intervals overlap we don’t need excluded middle here!)

But we can bound the first sum by noticing in this region $| x - k / n | < ϵ$ so that (using (1) again)
$\begin{aligned} \sum_{k s.t. | x - k / n | < δ} | f (x) - f (k / n) | b_{k, n} (x) & \leq \sum_{k s.t. | x - k / n | < δ} ϵ b_{k, n} (x) \\ \leq \sum_{k = 0}^{n} ϵ b_{k, n} (x) \\ = ϵ \end{aligned}$
And since $| f (x) - f (k / n) | \leq | f (x) | + | f (k / n) | \leq 2 M$ , we compute:
$\begin{aligned} \sum_{k s.t. | x - k / n | > \frac{1}{2} δ} | f (x) - f (k / n) | b_{k, n} (x) & \leq \sum_{k s.t. | x - k / n | > \frac{1}{2} δ} 2 M b_{k, n} (x) \\ \overset{(c)}{\leq} 2 M \sum_{k s.t. | x - k / n | > \frac{1}{2} δ} {(\frac{δ}{2})}^{- 2} {(x - \frac{k}{k})}^{2} b_{k, n} (x) \\ \leq \frac{8 M}{δ^{2}} \sum_{k = 0}^{n} {(x - \frac{k}{k})}^{2} b_{k, n} (x) \\ \overset{(d)}{=} \frac{8 M}{δ^{2}} \frac{x (1 - x)}{n} \\ \leq \frac{8 M}{δ^{2}} \frac{1}{4 n} \end{aligned}$
In step (c) we use $| x - k / n | > \frac{δ}{2}$ to say that ${(\frac{δ}{2})}^{- 2} {(x - \frac{k}{k})}^{2} \geq 1$ , so that we can multiply through by it and make our sum bigger. In step (d) we use (2), and at the end we use the fact that $x (1 - x) \leq \frac{1}{4}$ on $[0, 1]$ .

Now since $R$ is constructively archimedian (Defn. 1.1 in The Dedekind Reals in Abstract Stone Duality) we see $\exists n \in N . \frac{2 M}{δ^{2} n} < ϵ$ .

Since these bounds were uniform in $x$ , we learn that
$\forall ϵ > 0 . \exists n . ‖ f - B_{n} f ‖_{\infty} < ϵ$
as desired. ↩
A real number $x$ in this topos is a program that eats a natural number $n$ and outputs a rational number $x (n)$ . We think about this as a sequence of rational approximations $x (n)$ converging to some real number $x$ . So a function $[0, 1] \to R$ in this topos is a program that takes as input a program $x$ (outputting rational approximations between $0$ and $1$ ) and outputs a new program $f (x)$ which outputs rational approximations. ↩
Because our statement of Weierstrass $\forall ϵ . \forall f . \exists p . ‖ f - p ‖_{\infty} < ϵ$ includes an existential quantifier there’s no way to get around the fact that the $p$ we build is only defined locally on an open cover.

If you were more careful and gave a type theoretic proof that $\prod_{ϵ} \prod_{f} \sum_{p} ‖ f - p ‖_{\infty} < ϵ$ then you could take $p$ to be a single polynomial defined on the whole of $Θ$ … I haven’t thought very hard about how possible this is (mainly because I haven’t spent much time thinking about what theorems about locales are provable in type theory), but I’m sure a talented undergraduate could figure it out. ↩
The earliest version of this example which I’ve seen comes from Stout’s Topological Properties of the Real Numbers Object in a Topos back in 1976. I think basically every other paper I’ve cited in this post gives some version of this example too, so it’s very well trodden ground! ↩
Well, not FULL glory I guess. Over on mastodon, Antoine Chambert-Loir pointed out that I (rather embarrassingly) forgot to mention the assumption that $f : R \to R$ is continuous!

Thankfully in my externalization I mention that we want $f : U \times R \to R$ to be continuous, which is what that would have externalized to, if I’d remembered to mention it! ↩
Recall that in the presence of countable choice, the dedekind and cauchy reals agree so that LLPO and analytic LLPO agree. ↩