The Fundamental Theorem of Calculus from Integration Axioms

Three simple axioms for integration that force the FTC to be true.

Here are a simple set of axioms that completely characterize integration on continuous functions. I learned this from Peter Clark’s analysis notes and Spivak’s book, and decided to teach integration this way in Real Analysis this year.

I really like the idea of axiomatizing integration - it makes particularly clear what we mean by different theories of integration (Riemann/Darboux vs Lebesgue, etc) they are simply different constructions that satisfy the axioms. It also neatly separates essential properties of the integral from contingent ones giving a clear distinction between things that hold in all integration theories from things that one may hope to improve by building a new integral.

Integration Axioms

For any closed interval J=[a,b]J=[a,b] we denote by I(J)\mathcal{I}(J) the set of integrable functions on JJ. Then a collection of functions I(J)R\mathcal{I}(J)\to \RR is an integral, and denoted fJff\mapsto \int_J f if it satisfies the following axioms:

  • If kRk\in\RR then f(x)=kf(x)=k is an element of I([a,b])\mathcal{I}([a,b]) for any interval [a,b][a,b] and [a,b]k=k(ba).\int_{[a,b]}k = k(b-a).

  • If f,gI([a,b])f,g\in \mathcal{I}([a,b]) and f(x)g(x)f(x)\leq g(x) for all x[a,b]x\in[a,b] then [a,b]f[a,b]g\int_{[a,b]}f\leq\int_{[a,b]}g

  • If [a,b][a,b] is an interval and c[a,b]c\in[a,b], then fI([a,b])f\in\mathcal{I}([a,b]) if and only if fI([a,c])f\in\mathcal{I}([a,c]) and fI([c,b])f\in\mathcal{I}([c,b]). Furthermore, in this case their values are related by [a,b]f=[a,c]f+[c,b]f\int_{[a,b]}f = \int_{[a,c]}f +\int_{[c,b]}f

Part of the subtlety of the theory of integration is that the axioms do not specify what functions are integrable. They do insist constants are integrable, but otherwise just apply conditionally. This means we can construct rather uninteresting integration theories if we like - for instance we could declear that only constant functions are integrable, with their integrals given by Axiom I, and then observe that this satisfies II and III so defines an integral. This motivates an additional definition

Interesting Integral

An integral is interesting if all continuous functions on all intervals are integrable.

From now on, we will think only about interesting integrals.

Preliminaries

We begin with a useful edge case of the definition, and look at the integral over a degenerate closed interval {a}=[a,a]\{a\}=[a,a]:

Integral Over a Point

If {a}\{a\} is the degenerate closed interval containing a single point, and ff is a function which is integrable on any interval containing aa, then {a}fdx=0\int_{\{a\}}f\,dx = 0

Let ff be integrable on the interval [u,v][u,v] and a[u,v]a\in[u,v] be a point. Without loss of generality we can in fact take aa to be one of the endpoints of the interval, by subdivision: if a(u,v)a\in(u,v) then Axiom III implies that ff is integrable on [u,a][u,a] and on [a,v][a,v] as well.

Thus, we assume ff is integrable on [a,v][a,v], and further subdivide this interval as [a,v]=[a,a][a,v]={a}[a,v][a,v]=[a,a]\cup [a,v]=\{a\}\cup [a,v]

By subdivision, we see that ff is integrable on {a}\{a\} and that

[a,v]fdx={a}fdx+[a,v]fdx\int_{[a,v]}f\,dx =\int_{\{a\}}f\,dx +\int_{[a,v]}f\,dx

Subtracting the common integral over [a,v][a,v] from both sides yields the result,

{a}fdx=0\int_{\{a\}}f\,dx=0

The fundamental theorem relates a function ff to its integral F(x)=[a,x]fF(x)=\int_{[a,x]}f, so we need to be able to make sense of this expression. The proposition below shows such integration on a variable interval produces a well defined function.

The Integral as a Function

If fI([a,b])f\in\mathcal{I}([a,b]) is an integrable function, then there exists a function F ⁣:[a,b]RF\colon [a,b]\to\RR defined by F(x)=[a,x]fF(x)=\int_{[a,x]}f

Let ff be integrable on the interval [a,b][a,b] and x[a,b]x\in[a,b] be a point. Then [a,b]=[a,x][x,b][a,b]=[a,x]\cup [x,b], so Axiom III (subdivision) implies that ff is integrable on [a,x][a,x] (and on [x,b][x,b]) and the number [a,x]f\int_{[a,x]}f is defined. This assignment describes a real valued function

x[a,x]fx\mapsto \int_{[a,x]}f

The above proposition has a short proof because it did not claim much: we learned nothing about the nature of the area function F(x)=[a,x]fF(x)=\int_{[a,x]}f. In fact, the axioms constrain the behavior of this function much stronger than it might seem at first: below we prove a fact needed for the Fundamental Theorem but also interesting in its own right - that the integral of bounded functions are continuous.

Integrals are Continuous

If fI([a,b])f\in\mathcal{I}([a,b]) is a bounded integrable function, then its integral F(x)=[a,x]fdxF(x)=\int_{[a,x]}f\,dx is continuous.

Let ff be integrable and bounded by MM on [a,b][a,b], and set F(x)=[a,x]fdxF(x)=\int_{[a,x]}f\,dx. Begin by choosing an ϵ>0\epsilon>0. We will prove something even strong than asked - that ff is uniformly continuous by finding a δ>0\delta>0 where if yx<δ|y-x|<\delta we have F(y)F(x)<ϵ|F(y)-F(x)|<\epsilon. Let’s unpack this a bit: if x<yx < y are two points of [a,b][a,b],

F(y)F(x)=[a,y]f[a,x]fF(y)-F(x)=\int_{[a,y]}f-\int_{[a,x]}f

But subdivision (Axiom III) implies

F(y)=[a,y]f=[a,x]f+[x,y]f=F(x)+[x,y]f\begin{align*} F(y)&=\int_{[a,y]}f\\ &= \int_{[a,x]}f+\int_{[x,y]}f\\ &=F(x)+\int_{[x,y]}f \end{align*}

Thus F(y)F(x)F(y)-F(x) is just the integral of ff on the subinterval [x,y][a,b][x,y]\subset[a,b]. Because ff is bounded by MM we know Mf(x)M-M\leq f(x)\leq M. By subdivsion, ff is then integrable on every sub-interval I[a,b]I\subset [a,b], and by comparison (Axiom II) this implies MIIfMI-M |I|\leq \int_{I}f\leq M|I|

So, we choose δ=ϵ/M\delta=\epsilon/M. This immediately yields what we want, as if yx<δ|y-x|<\delta,

ϵ=Mδ<Myx[x,y]fMyx<Mδ=ϵ-\epsilon = -M\delta < -M|y-x|\leq \int_{[x,y]}f\leq M|y-x|< M\delta =\epsilon

Thus F(y)F(x)=[x,y]f<ϵ|F(y)-F(x)|=\Big|\int_{[x,y]}f\Big|<\epsilon.

Note: careful readers may have realized that we proved more than asked - the above actually shows that FF is uniformly continuous on [a,b][a,b]. Of course, this is not really stronger than what was asked, since we began on a closed interval, and we know that continuous on a closed interval implies uniformly continuous. However, if you look carefully at the proof you see we nowhere used that the original domain was a closed interval! So what we have really proven is that the area function F(x)=[a,b]fF(x)=\int_{[a,b]}f is uniformly continuous anytime ff is bounded!

The Fundamental Theorem

We’ve already seen that these meager axioms hide great power: we could prove that the integral of a bounded function was continuous directly without anything else! But this is only the start of an incredible story. Here, we jump straight to the main event - and prove that these axioms characterize the fundamental theorem of calculus!

FTC I

Let ff be a continuous function and assume that ff is integrable on [a,b][a,b]. Denote its area function by F(x)=[a,x]fF(x)=\int_{[a,x]}f

Then FF is differentiable, and for all points x(a,b)x\in(a,b),

F=fF^\prime=f

Because ff is continuous on a closed interval, it is bounded (by the Extreme Value theorem), and so the area function FF is continuous by our work above.

Choose an arbitrary c(a,b)c\in(a,b). We wish to show that F(c)=f(c)F^\prime(c)=f(c): that is, we need limxcF(x)F(c)xc=f(c)\lim_{x\to c}\frac{F(x)-F(c)}{x-c}=f(c)

In terms of ϵ\epsilons and δ\deltas, this means for arbitrary ϵ\epsilon we need to find a δ\delta such that if xx is within δ\delta of cc, this difference quotient is within ϵ\epsilon of f(c)f(c).

It will be convenient to separate this argument into two cases, depending on if x<cx < c or c<xc < x (both arguments are analogous, all that changes is whether the interval in question is [c,x][c,x] or [x,c][x,c]). Below we proceed under the assumption that c<xc < x. In this case, looking at the numerator, we see by subidvision (Axiom III) that

F(x)=[a,x]f=[a,c]f+[c,x]f=F(c)+[c,x]f\begin{align*} F(x)&=\int_{[a,x]}f\\ &=\int_{[a,c]}f+\int_{[c,x]}f\\ &= F(c)+\int_{[c,x]}f \end{align*}

    F(x)F(c)=[c,x]f\implies F(x)-F(c)=\int_{[c,x]}f

Thus the real quantity of interest is this integral over [c,x][c,x]. Choose ϵ>0\epsilon>0. Since ff is continuous, there is some δ>0\delta>0 where xc<δ|x-c|<\delta implies f(x)f(c)<ϵ|f(x)-f(c)|<\epsilon. Equivalently, for all x[cδ,c+δ]x\in[c-\delta, c+\delta] we have f(c)ϵ<f(x)<f(c)+ϵf(c)-\epsilon < f(x) < f(c)+\epsilon

By subdivison (Axiom III), we know that ff is integrable on [c,x][c,x], and so by comparison (Axiom II) and the area of rectangles (Axiom I) we have (f(c)ϵ)(xc)[c,x]f(f(c)+ϵ)(xc)(f(c)-\epsilon)(x-c)\leq \int_{[c,x]}f \leq (f(c)+\epsilon)(x-c)

Dividing through by xcx-c

f(c)ϵ[c,x]fxcf(c)+ϵf(c)-\epsilon \leq \frac{\int_{[c,x]}f}{x-c} \leq f(c)+\epsilon

and subtracting f(c)f(c) ϵ[c,x]fxcf(c)ϵ-\epsilon\leq \frac{\int_{[c,x]}f}{x-c}-f(c) \leq \epsilon

We arrive at the inequality

[c,x]fxc<ϵ\left|\frac{\int_{[c,x]}f}{x-c}\right| < \epsilon

But the numerator here is none other than F(x)F(c)F(x)-F(c)! So, we’ve done it: for all x>cx > c with xc<δ|x-c| < \delta, we have the difference quotient within ϵ\epsilon of f(c)f(c). This implies the limit exists, and that F(c)=f(c)F^\prime(c)=f(c)

This tells us that the area function of ff is one of its antiderivatives! The theory of area is the inverse of the theory of rates of change. But which antiderivative? The mean value theorem assures us that the collection of all possible antiderivatives are easy to understand - any two differ by a constant. So to uniquely specify an antiderivative its enough to give its value at one point. And we can do this!

Let ff be a continuous function which is integrable on [a,b][a,b]. Then the function F(x)=[a,x]fdxF(x)=\int_{[a,x]}f\,dx is uniquely determined as the antiderivative of FF such that F(a)=0F(a)=0.

This connection of integration with antidifferentiation and the classification of antiderivatives has a useful corollary for computation, which is often called the second fundamental theorem

FTC II

Let ff be continuous and integrable on [a,b][a,b] and let FF be any antiderivative of ff. Then [a,b]f=F(b)F(a)\int_{[a,b]}f = F(b)-F(a)

Denote the area function for ff as A(x)=[a,x]fA(x)=\int_{[a,x]}f. Then the quantity we want to compute is A(b)A(b).

Now, let FF be any antiderivative of ff. The first part of the fundamental theorem assures us that AA is an antiderivative of ff, and any two antiderivatives of the same function differ by a constant.
Thus there is some constant CC such that A(x)F(x)=CA(x)-F(x)=C, or F(x)=A(x)+CF(x)=A(x)+C. Now computing,

F(b)F(a)=(A(b)+C)(A(a)+C)=A(b)A(a)+(CC)=A(b)A(a)=A(b)\begin{align*} F(b)-F(a)&=\left(A(b)+C\right)-\left(A(a)+C\right)\\ &=A(b)-A(a)+(C-C)\\ &= A(b)-A(a)\\ &=A(b) \end{align*}

Where the last equality comes from the fact that A(a)={a}fdx=0A(a)=\int_{\{a\}}f\,dx = 0 which we proved in the preliminaries.

The fundamental theorem of calculus is a beautiful result for many different reasons. One of course, is that it forges a deep connection between the theory of areas and the theory of derivatives - something missed by the ancients and left undiscovered until the modern advent of the calculus. But second, it shows how incredibly constraining our simple axioms are: we did not prove the fundamental theorem of calculus for any particular definition of the integral (Riemann’s, Lebesgue’s, Darboux’s, etc) but rather showed that for any interesting integral your theory of integration has no choice whatsoever on how to integrate them!

Application: Integration Techniques

Given the fundamental theorem holds for continuous functions, its immediate to build up a strong theory of integration

substitution

Let \int be an interesting integral, and ff be continuous, gg be continuously differentiable on [a,b][a,b]. Then [a,b](fg)g=[g(a),g(b)]f\int_{[a,b]}(f\circ g)g^\prime = \int_{[g(a),g(b)]}f

Because gg is differentiable it is continuous, so fgf\circ g is the composition of continuous functions, which is continuous. And as the product of continuous functions is continuous, f(g(x))g(x)f(g(x))g^\prime(x) is also continuous. Thus this function is integrable on [a,b][a,b].

By the Fundamental Theorem, we can evaluate this by antidifferentiation: let F(x)F(x) be any antiderivative of ff, then the chain rule gives (F(g(x)))=F(g(x))g(x)=f(g(x))g(x)\left(F(g(x))\right)^\prime =F^\prime(g(x))g^\prime(x)=f(g(x))g^\prime(x)

Using this antiderivative yields [a,b](fg)g=F(g(x))[a,b]=F(g(b))F(g(a))\int_{[a,b]}(f\circ g)g^\prime = F(g(x))\Big|_{[a,b]}=F(g(b))-F(g(a))

A crucial but seemingly simple observation is to note this is the same value one would get by evaluating the function FF on the endpoints of the interval [g(a),g(b)][g(a),g(b)]:

F(g(x))[a,b]=F(x)[g(a),g(b)]F(g(x))\Big|_{[a,b]}=F(x)\Big|_{[g(a),g(b)]}

And as F=fF^\prime=f, this second expression is exactly what one would get from integrating ff on the interval [g(a),g(b)][g(a),g(b)] using the Fundamental Theorem.

Similarly without any further theory we can construct the other main integration technique of the calculus: integration by parts!

Integration by Parts

Let \int be an interesting integral, and f,gf,g be two continuously differentiable functions on [a,b][a,b]. Then [a,b]fg=fg[a,b][a,b]fg\int_{[a,b]}fg^\prime = fg\Big|_{[a,b]}-\int_{[a,b]}f^\prime g

Are We Cheating?

I thought analysis - especially the theory of integration - was supposed to be difficult! But these proof’s aren’t that bad: what’s the trick? The catch is that we’ve derived properties that an integral might have without actually constructing one. Perhaps there are no interesting integrals - if it were impossible to satisfy these axioms on function sets I(J)\mathcal{I}(J) containing the continuous functions, then this theoretical work would be true only vacuously, and useful to no one.

So a very important matter still lies ahead, we need to show that there exists an interesting integral. A time-honored means of proving existence thoerems is construction, and so the natural path forward is to attempt and explicitly construct some assignment \int_{-} obeying the axioms on all continuous functions. Any of the standard approaches (Riemann, Darboux, Lesbegue) work fine here, and you can plug-and-play whichever you like from your favorite analysis text.

This is what I appreciate so much about the axiomatic approach. It makes the goal clear before jumping into a construction: we need to find a way to satisfy these axioms. Additionally, it makes clear what doesn’t matter: we (in an undergraduate class) are primarily concerned with the integration of (piecewise) continuous functions, and have already proven that any interesting integrals completely agree on the answer. Thus we needn’t spend time fretting about small details in our definition; the final answers will be completely independent of the underlying construction after all.

← All posts