The Lie Derivative, Geometrically

Measuring change along a flow without choosing coordinates.

The Lie derivative is a coordinate invariant, and metric-independent notion of differentiation on a smooth manifold. It differentiates with respect to a vector field, and its goal is to take some geometric object on the manifold and quantify how that object changes if you were to watch it as you traveled around the manifold, flowing along the field. The goal of this note is to give a brief formal definition. Note for vector and tensor fields I will give a definition that is slightly different than the usual one occurring in textbooks, from which I find the the connection with the Lie bracket more natural to motivate. Then of course we show equivalence with the usual form of the definition.

Lie Derivatives of Functions

We warm up by discussing the Lie derivative of a function f ⁣:MRf\colon M\to \mathbb{R}, which gives us opportunity to formalize the notion of watching ff while flowing along the vector field. Given a vector field VV on MM, we can define a flow: where every point pMp\in M is carried along an integral curve of VV. By the existence and uniqueness of solutions to ODEs, doing this at each point generates1 a 1-parameter family of diffeomorphisms2 Φt ⁣:MM\Phi_t\colon M\to M, satisfying ddtt=0Φt=V\frac{d}{dt}\Big|_{t=0}\Phi_t=V

If we were to start at the point pp and flow along VV for time tt we would end up at Φt(p)\Phi_t(p), and so if we were watching ff, the value we would see is f(Φt(p))f(\Phi_t(p)). To measure how quickly we see ff change as we flow care-free along the manifold, we simply differentiate this quantity with respect to tt:

Lie Derivative of a Function

Let f ⁣:MRf\colon M\to \mathbb{R} and VV be a vector field on MM generating the flow Φt\Phi_t. Then LVf:=ddtt=0f(Φt())\mathcal{L}_Vf :=\frac{d}{dt}\Big|_{t=0}f(\Phi_t(-))

For comparison with the coming generalizations, it will be useful to write out the difference quotient defining the derivative explicitly:

(LVf)(p)=limt0f(Φt(p))f(p)t(\mathcal{L}_Vf)(p)=\lim_{t\to 0}\frac{f(\Phi_t(p))-f(p)}{t}

Finally, while our goal here is to understand the geometric definition and not particularly to find convenient computational formulas, its worth pointing out that the Lie derivative of functions coincides3 with the directional derivative:

(LVf)(p):=ddtt=0f(Φt(p))=(df)Φ0(p)(ddtt=0Φt(p))=(df)Φ0(p)(V(p))=dfp(V(p))=[V(f)](p)\begin{align} (\mathcal{L}_Vf)(p):&=\frac{d}{dt}\Big|_{t=0}f(\Phi_t(p))\\ &= (df)_{\Phi_0(p)}\left(\frac{d}{dt}\Big|_{t=0}\Phi_t(p)\right)\\ &=(df)_{\Phi_0(p)}\left(V(p)\right)\\ &= df_p(V(p))\\ &= [V(f)](p) \end{align}

This is why you don’t often hear about the Lie derivative of functions as a separate concept.

Lie Derivatives of Vector Fields

Moving on to perhaps the most common case, what is the Lie derivative of a vector field? Imagine again that we are lazily flowing along MM following the integral curves of VV, but this time instead of watching the values of a scalar function ff we are observing a vector field XX as we pass by. At time tt, we hae flowed to the point Φt(p)\Phi_t(p) and so we observe the vector XΦt(p)X_{\Phi_t(p)}. And we may be tempted to say that the change we observe in XX is simply the time derivative of this quantity ddtt=0XΦt(p)\tfrac{d}{dt}|_{t=0}X_{\Phi_t(p)}. But we must be careful! This quantity actually does not make any sense: if we wrote out the defining difference quotient we would find the numerator contains the term XΦt(p)XpX_{\Phi_t(p)}-X_{p}, and these two vectors live in two different4 vector spaces!

To fix this we need to think harder about the picture we are trying to formalize, of us flowing along the vector field and comparing the vector we see alongside our boat with the original vector we saw at the start. To be able to make the comparison, we need to actually carry the original vector with us so we can place it directly alongside the vectors we flow past and make an accurate comparison.

Happily, there is a natural means5 of carrying a vector along the flow with us; which is to use the pushforward (Φt)(\Phi_t)_\star. If XX is any vector based at pp then (Φt)X(\Phi_t)_\ast X is the result carrying XX along the flow to the point Φt(p)\Phi_t(p).

So now that we have brought our vector with us, we can legitimately make a comparison of two vectors in the same tangent space: XΦt(p)X_{\Phi_t(p)} and (Φt)Xp(\Phi_t)_\ast X_p. Thus, one may wish to attempt a definition of the form

Lie Derivative of a Vector Field

Let VV be a vector field on MM with associated flow Φ\Phi, and XX another vector field. Then for each pMp\in M the Lie derivative of XX along VV is given by (LVX)p=limt0XΦt(p)(Φt)Xpt(\mathcal{L}_V X)_p=\lim_{t\to 0 }\frac{X_{\Phi_t(p)}-(\Phi_t)_\ast X_p}{t}

This is both correct and matches perfectly well our mental image, but the formula itself raises some new technical difficulties. Particularly, while for each fixed tt the difference quotient is perfectly well defined, it is occurring in the tangent space at Φt(p)\Phi_t(p), and so to compute our limit we cannot work in a single vector space but rather need to deal with the full machinery of the tangent bundle.

This is not a serious issue, as in differential geometry, a vector field is an object which acts on functions. So, to ‘compute6’ the Lie derivative LVX\mathcal{L}_VX means to be able to compute the number (LVX)p(f)(\mathcal{L}_VX)_p(f) for an arbitrary smooth function ff at each pMp\in M.

(LVX)p(f)=limt0XΦt(p)(f)[(Φt)Xp](f)t=limt0XΦt(p)(f)Xp(fΦt)t\begin{align} (\mathcal{L}_VX)_p(f) &=\lim_{t\to 0 }\frac{X_{\Phi_t(p)}(f)-\left[(\Phi_t)_\ast X_p\right](f)}{t}\\ &=\lim_{t\to 0 }\frac{X_{\Phi_t(p)}(f)-X_p(f\circ \Phi_t)}{t}\\ \end{align}

While each term in the numerator here is being computed in a different tangent space the result of each computation is just a real number. So, this is a real-valued limit, and can be computed without thinking in tangent-bundle terms.

An Alternative Formulation

We should also note that there is a means of getting around this technical difficulty (of the vectors for each tt occuring in a different tangent space) by proposing an alternative definition. Instead of carrying the original vector along for the journey, why don’t we just pull the new vector back to the starting point? Now every difference quotient occurs in the fixed tangent space TpMT_pM.

This results in the expression below, where we have introduced the tilde to distinguish it from our definition above

(LVX~)p:=limt0ΦtXΦt(p)Xpt(\widetilde{\mathcal{L}_VX})_p:=\lim_{t\to 0}\frac{\Phi_t^\ast X_{\Phi_t(p)}-X_p}{t}

Here Φt\Phi_t^\ast is the pullback along the diffeomorphism Φt\Phi_t. This however introduces a new complication : vector fields are objects which naturally push forward under maps rather than pull back. For a diffeomorphism like Φt\Phi_t of course, both are defined, but it is still simplest to compute the pullback as the pushforward along the inverse. And, as the inverse of the time-tt flow is simply to flow backwards for the same amount of time, Φt1=Φt\Phi_t^{-1}=\Phi_{-t}, one often sees this second definition appearing in textbooks as

(LVX~)p=limt0(Φt)XΦt(p)Xpt(\widetilde{\mathcal{L}_VX})_p=\lim_{t\to 0}\frac{(\Phi_{-t})_\ast X_{\Phi_t(p)}-X_p}{t}

Also note that while the definition itself cleanly all occurs in a single tangent space, when one goes to use it, of course the computations go back to occuring in different tangent spaces, much like for our definition above. Below we apply this to a smooth function ff:

(LVX~)p(f)=limt0(Φt)XΦt(p)(f)Xp(f)t=limt0XΦt(p)(fΦt)Xp(f)t\begin{align} (\widetilde{\mathcal{L}_VX})_p(f)&=\lim_{t\to 0}\frac{(\Phi_{-t})_\ast X_{\Phi_t(p)}(f)-X_p(f)}{t}\\ &=\lim_{t\to 0}\frac{X_{\Phi_t(p)}(f\circ \Phi_{-t})-X_p(f)}{t} \end{align}

While the limit is again real valued, the first term in the numerator is computed in the tangent space at Φt(p)\Phi_t(p) and the second is computed directly at pp.

Of course, the only reason we are carrying on speaking about two Lie derivatives is that they are one in the same thing: either can be used depending on convenience.

Equivalence of Definitions

For any vector fields X,VX, V and any point pp, (LVX)p=(LVX~)p(\mathcal{L}_VX)_p=(\widetilde{\mathcal{L}_VX})_p

Let Δt,Δt~\Delta_t, \widetilde{\Delta_t} denote the difference quotients occurring in the definitions of LVX\mathcal{L}_VX and LVX~\widetilde{\mathcal{L}_VX} respectively. A quick computation shows ΦtΔt=Δt~\Phi_t^\ast \Delta_t = \widetilde{\Delta_t}:

ΦtΔt=Φt(XΦt(p)(Φt)Xpt)=ΦtXΦt(p)Φt(Φt)Xpt=ΦtXΦt(p)(Φt)(Φt)Xpt=ΦtXΦt(p)(Φtt)Xpt=ΦtXΦt(p)Xpt=Δt~\begin{align} \Phi_t^\ast \Delta_t &=\Phi_t^\ast \left(\frac{X_{\Phi_t(p)}-(\Phi_t)_\ast X_p}{t}\right)\\ &= \frac{\Phi_t^\ast X_{\Phi_t(p)}-\Phi_t^\ast(\Phi_t)_\ast X_p}{t}\\ &= \frac{\Phi_t^\ast X_{\Phi_t(p)}-(\Phi_{-t})_\ast (\Phi_t)_\ast X_p}{t}\\ &= \frac{\Phi_t^\ast X_{\Phi_t(p)}-(\Phi_{t-t})_\ast X_p}{t}\\ &= \frac{\Phi_t^\ast X_{\Phi_t(p)}-X_p}{t}\\ &= \widetilde{\Delta_t} \end{align}

Where we have used that pulling back along Φt\Phi_t is equal to pushing forward along Φt\Phi_{-t}, and that pushforwards along a 1-parameter family of diffeomorphisms compose via (Φs)(Φt)=(Φs+t)(\Phi_s)_\ast(\Phi_t)_\ast = (\Phi_{s+t})_\ast, and that Φ0=id\Phi_0=\mathrm{id}. Thus, LVX~=limt0Δt~=limt0(Φt)Δt=limt0(DΦt)Δt\widetilde{\mathcal{L}_VX}=\lim_{t\to 0}\widetilde{\Delta_t}=\lim_{t\to 0}(\Phi_t)_\ast \Delta_t=\lim_{t\to 0}(D\Phi_t)\Delta_t

Where we’ve realized the pushforward by the derivative of the diffeomorphism Φt\Phi_t. As t0t\to 0, the limit of Δt\Delta_t as well as the limit of (DΦt)(D\Phi_t) exist, so we can compute the limit termwise:

LVX~=limt0(DΦt)Δt=(limt0DΦt)(limt0Δt)=(IdTpM)LVX=LVX\begin{align} \widetilde{\mathcal{L}_VX} &= \lim_{t\to 0}(D\Phi_t)\Delta_t\\ &= \left(\lim_{t\to 0}D\Phi_t\right)\left(\lim_{t\to 0}\Delta_t\right)\\ &=\left(\mathrm{Id}_{T_pM}\right)\mathcal{L}_VX\\ &=\mathcal{L}_VX \end{align}

So the two are equal.

Lie Derivative of Tensors

The same geometric intuition carries over to general tensors TT: starting at a point pp we carry the tensor TpT_p along with us as we flow, and compare the result to the tensor TΦt(p)T_{\Phi_t(p)} we find at time tt. The Lie derivative is the infinitesimal difference between these quantities:

Lie Derivative of a Tensor

Let VV be a vector field on MM with associated flow Φ\Phi, and TT a tensor field. Then for each pMp\in M the Lie derivative of TT along VV is given by (LVT)p=limt0TΦt(p)(Φt)Tpt(\mathcal{L}_V T)_p=\lim_{t\to 0 }\frac{T_{\Phi_t(p)}-(\Phi_t)_\ast T_p}{t}

As for vectors, the usual textbook definition is not as written here but rather formulated to pull all vectors back to the original tangent space, instead of push the original forward along the flow:

(LVT)p=limt0(Φt)TΦt(p)Tpt=limt0(Φt)TΦt(p)Tpt(\mathcal{L}_V T)_p=\lim_{t\to 0 }\frac{(\Phi_t)^\ast T_{\Phi_t(p)}-T_p}{t}=\lim_{t\to 0 }\frac{(\Phi_{-t})_\ast T_{\Phi_t(p)}-T_p}{t}

Footnotes

  1. If MM is compact, we can define this flow globally for some interval [0,ϵ][0,\epsilon] by a compactness argument. And to calculate the Lie derivative at any point pMp\in M, its enough to have the flow on some compact neighborhood of pp.

  2. To see the flow actually generates a diffeomorphism at each time step requires some analysis and topology. By the uniqueness of solutions to ODEs, for each fixed tt the map Φt\Phi_t is injective (if we have two points pp, qq where Φt(p)=Φt(q)\Phi_t(p)=\Phi_t(q): call this common point rr. Flowing backwards along VV is the same as flowing along the vector field V-V, and this would imply that the point rr is carried under this flow to both pp and qq, so by uniqueness of solutions p=qp=q). Thus, on any compact subset of MM, Φt\Phi_t is an injection from a compact space into a hausdorff space, so its a homeomorphism onto its image. And in fact, by the assumed smoothness of VV and MM its a local diffeomorphism. This implies its a covering map, but every point has a unique preimage (recall its injective), so its a 1-sheeted cover, or a global diffeomorphism.

  3. We have used that the flow at time zero is the identity, that the derivative of a function satisfies the identity df(X)=X(f)df(X)=X(f) for any vector field XX, and the defining property of the flow Φt\Phi_t, that ddtt=0Φt(p)=V(p)\frac{d}{dt}|_{t=0}\Phi_t(p)=V(p).

  4. This poses a serious problem, as there is no unique natural way to identify vectors in TpMT_pM with those in TqMT_qM for pqp\neq q. And we can’t just choose some random identification, as we are looking to define an intrinsic geometric notion, something that depends only on the vector field XX and the flow Φ\Phi.

  5. If you’ve seen enough geometry you might have an idea here - we could carry the original vector along with us by parallel transport! But this won’t do here: we are looking for something metric independent. Parallel transport aims to keep everything rigid as it moves, whereas we are very much trying to go with the flow. Instead of imagining yourself and your boat as a rigid object which is pushed and pulled by the vector field VV, we instead allow ourselves to deform freely following the flow Φt\Phi_t.

  6. This fits well with the more practical-minded concern of computing the components of LVX\mathcal{L}_V X in a coordinate basis. Indeed, if x,y,zx,y,z are coordinates on MM near pp and one is looking for an expression of the form (LVX)p=ax+by+cz(\mathcal{L}_V X)_p = a\partial_x + b\partial_y + c\partial_z, one can compute the coefficients a,b,ca,b,c using the dual 11-form basis dx,dy,dzdx, dy, dz as a=dx(LVX)a=dx(\mathcal{L}_V X), b=dy(LVX)b=dy(\mathcal{L}_V X) and c=dz(LVX)c=dz(\mathcal{L}_V X). But, as df(v)=v(f)df(v)=v(f) for any vector vv and smooth function ff these become a=LVXp(x)a=\mathcal{L}_V X_p(x), b=LVXp(y)b=\mathcal{L}_V X_p(y) and c=LVXp(z)c=\mathcal{L}_V X_p(z) respectively; all quantities of the form we computed above.

← All posts