Note: Exponential Differentiation

Here’s the calculation that shows if you define an exponential as a merely continuous solution to E(x+y)=E(x)E(y)E(x+y)=E(x)E(y) then in fact it is differentiable. Even better we can explicitly calculate and find the derivative, and do so inductively, proving exponentials are smooth.

Exponential Functions

An exponential function is a continuous nonconstant function E ⁣:RRE\colon\mathbb{R}\to\mathbb{R} satisfying the law of exponents E(x+y)=E(x)E(y)E(x+y)=E(x)E(y) for all x,yRx,y\in\mathbb{R}.

This is a functional characterization of exponentials; we don’t specify a formula for how to compute the function but instead we specify how the function ought to behave. There are several facts about exponentials we will need, that follow directly from this definition:

These are proven in a previous note introducing the functional characterization. We also need some general theorems from analysis about such functions

We do not prove these, but they can be found in many undergraduate analysis texts. Alright lets get on with it!

Differentiating Exponentials

Let E(x)E(x) be an exponential function. Then EE is differentiable on the entire real line, and E(x)=E(0)E(x)E^\prime(x) = E^\prime(0)E(x)

First we show that this formula holds so long as EE is actually differentiable at zero. Thus, differentiability at a single point is enough to ensure differentiability everywhere and fully determine the formula!

Let xRx\in\RR, and hn0h_n\to 0. Then we compute E(x)E^\prime(x) by the following limit: E(x)=limE(x+hn)E(x)hnE^\prime(x)=\lim \frac{E(x+h_n)-E(x)}{h_n}

Using the property of exponentials and the limit laws, we can factor an E(x)E(x) out of the entire numerator:

=limE(x)E(hn)E(x)hn=E(x)limE(hn)1hn=\lim \frac{E(x)E(h_n)-E(x)}{h_n}=E(x)\lim \frac{E(h_n)-1}{h_n}

But, E(0)=1E(0)=1 so the limit here is actually the derivative of EE at zero!

E(x)=E(x)E(0)E^\prime(x)=E(x)E^\prime(0)

Next, we tackle the slightly more subtle problem of showing that EE is in fact differentiable at zero. This is tricky because all we have assumed is that EE is continuous and satisfies the law of exponents: how are we going to pull differentiability out of this? The trick is two parts (1) show the right and left hand limits defining the derivative exist, and (2) show they’re equal.

STEP 1: Show that the left and right hand limits defining the derivative exist: EE is convex so the difference quotient is monotone increasing, and so the limit limx0\lim_{x\to 0^-} exists (as a sup) and limx0+\lim_{x\to 0^+} exists (as an inf).

STEP2: Now that we know each of these limits exist, let’s show they are equal using the definition:

To compute the lower limit, we can choose any sequence approaching 00 from below: let hnh_n be a positive sequence with hn0h_n\to 0, then hn-h_n will do:

limh0E(h)1h=limE(hn)1hn\lim_{h\to 0^-}\frac{E(h)-1}{h}=\lim \frac{E(-h_n)-1}{-h_n}

We can calculate E(hn)E(-h_n) by the law of exponents: we know E(0)=1E(0)=1 so E(hnhn)=1E(h_n-h_n)=1, but this implies E(hn)E(hn)=1E(h_n)E(-h_n)=1 so E(hn)=1/E(hn)E(-h_n)=1/E(h_n). Thus

limE(hn)1hn=lim1E(hn)1hn=lim1E(hn)hn1E(hn)=limE(hn)1hn1E(hn)\begin{align*} \lim \frac{E(-h_n)-1}{-h_n}&=\lim \frac{\frac{1}{E(h_n)}-1}{-h_n}\\ &=\lim\frac{1-E(h_n)}{-h_n}\frac{1}{E(h_n)}\\ &=\lim \frac{E(h_n)-1}{h_n}\frac{1}{E(h_n)} \end{align*}

But, since EE is continuous (by definition) and E(0)=1E(0)=1 the limit theorems imply lim1E(hn)=1limE(hn)=1E(limhn)=1E(0)=1\lim \frac{1}{E(h_n)}=\frac{1}{\lim E(h_n)}=\frac{1}{E(\lim h_n)}=\frac{1}{E(0)}=1 Thus,

lim(E(hn)1hn1E(hn))=(limE(hn)1hn)(lim1E(hn))=limE(hn)1hn\begin{align*} &\lim \left(\frac{E(h_n)-1}{h_n}\frac{1}{E(h_n)}\right)\\&= \left(\lim \frac{E(h_n)-1}{h_n}\right)\left(\lim\frac{1}{E(h_n)}\right)\\ &=\lim \frac{E(h_n)-1}{h_n}\\ \end{align*}

But this last limit evaluates exactly to the limit from above since hn>0h_n>0 and hn0h_n\to 0. Stringing all of this together, we finally see limh0E(h)1h=limh0+E(h)1h\lim_{h\to 0^-}\frac{E(h)-1}{h}=\lim_{h\to 0^+}\frac{E(h)-1}{h} So both one sided limits exist and are equal, which implies the entire limit exists: EE is differentiable at 00.

Consequences

This theorem tells us that the exponential functions have a remarkable property: they are their own derivatives, up to a constant multiple. This has several useful consequences:

Derivative is Nonzero

Let EE be an exponential function, so E(x)=kE(x)E^\prime(x)=kE(x) for k=E(0)k=E^\prime(0). Then k0k\neq 0.

If k=0k=0 then E(x)=0E^\prime(x)=0 for all xx. But then the mean value theorem implies that EE is constant, contradicting the definition of exponential.

Also, because we know how to differentiate constant multiples, this lets us calculate arbitrary derivatives of exponentials:

E(x)=[E(x)]=[kE(x)]=k[E(x)]=k2E(x)E^{\prime\prime}(x)=\left[E^\prime(x)\right]^\prime=\left[kE(x)\right]^\prime = k[E(x)]^\prime = k^2 E(x)

Continuing this inductively we get arbitrary derivatives E(n)(x)=knE(x)E^{(n)}(x)=k^n E(x), where we have written kk for the value of E(0)E^\prime(0)

Exponentials are smooth functions.

Next we see that the theory of exponentiation alone does not further the possible values of kk:

Composing with Linear

Composing an exponential with (x)=kx\ell(x)=kx for k0k\neq 0 results in another exponential.

Let EE be an exponential and (x)=kx\ell(x)=kx be linear. Since \ell is continuous and (x+y)=k(x+y)=kx+ky=(x)+(y)\ell(x+y)=k(x+y)=kx+ky=\ell(x)+\ell(y), the composition EE\circ \ell is a continuous nonconstant solution to the Law of Exponents:

(E)(x+y)=E((x+y))=E((x)+(y))=E((x))E((y))=(E)(x)(E)(y)(E\circ\ell)(x+y)=E(\ell(x+y))=E(\ell(x)+\ell(y))=E(\ell(x))E(\ell(y))=(E\circ \ell)(x)(E\circ\ell)(y)

If there exists at least one exponential function, then there is an exponential with E(x)=kE^\prime(x)=k for every nonzero kRk\in\mathbb{R}.

Let EE be an exponential function, and kk nonzero. Then define (x)=kE(0)\ell(x)=\frac{k}{E^\prime(0)}, and note the composition E=E\mathcal{E}=E\circ \ell is an exponential by our previous work. But differentiating this with the chain rule yields

E(x)=(E)(x)=E((x))(x)=E(0)E((x))kE(0)=kE((x))=kE(x)\begin{align} \mathcal{E}^\prime(x)&=(E\circ \ell)^\prime(x)\\ &=E^\prime(\ell(x))\ell^\prime(x)\\ &= E^\prime(0)E(\ell(x)) \frac{k}{E^\prime(0)}\\ &= k E(\ell(x))\\ &= k\mathcal{E}(x) \end{align}

So E\mathcal{E} is an exponential whose derivative is kk times itself, as required.

While the functional equation alone did not provide us any means of distinguishing between different exponential functions, differentiation selects a single best, or simplest exponential out of the lot: the one where that constant multiple is just 11! So long as there is any exponential at all, this must also exist, by the above. So we give it a name

We write exp(x)\exp(x) for the exponential function which has exp(0)=1\exp^\prime(0)=1.

Application: Uniqueness of Solutions to y=yy^\prime =y

Above we’ve seen that all exponential functions are differentiable and satisfy E(x)=kE(x)E^\prime(x)=kE(x) for some nonzero kk. But we can go beyond this, and actually show that every solution to this differential equation with E(0)=1E(0)=1 is an exponential. Of course, this would follow trivially if we had the existence and unqiueness for first order ODEs handy, but one doesn’t usually have that by this point in an analysis course, so I’ll just record the independent argument here, specialized to the case exp(x)\exp(x).

Assuming the existence of exponential functions, the only solution to y=yy^\prime = y with y(0)=1y(0)=1 is the exponential exp(x)\exp(x)

If any exponential exists we know that exp\exp exists by the above. Let ff be any solution to our differential equation, so f(x)=f(x)f^\prime(x)=f(x) and f(0)=1f(0)=1. The trick is to consider the function g(x)=f(x)/exp(x)g(x)=f(x)/\exp(x) and show its constant. Taking the derivative, g(x)=(f(x)exp(x))=f(x)exp(x)f(x)exp(x)exp(x)2g^\prime(x)=\left(\frac{f(x)}{\exp(x)}\right)^\prime =\frac{f^\prime(x)\exp(x)-f(x)\exp^\prime(x)}{\exp(x)^2} using the fact that exp=exp\exp^\prime =\exp by above, and f=ff^\prime = f by assumption, the numerator is zero: f(x)exp(x)f(x)exp(x)=f(x)exp(x)f(x)exp(x)=0f^\prime(x)\exp(x)-f(x)\exp^\prime(x)=f(x)\exp(x)-f(x)\exp(x)=0 Thus g(x)=0g^\prime(x)=0 for all xx, and gg is constant. To find the value of gg we need only evaluate it at a point g(0)=f(0)exp(0)=11=1g(0)=\frac{f(0)}{\exp(0)}=\frac{1}{1}=1

Thus f(x)/exp(x)=1f(x)/\exp(x)=1 for all xx, so

f(x)=exp(x)f(x)=\exp(x)

← All posts