A Lie group GG is a group that is also a (continuous, differentiable) topological space. An example to keep in mind is G=RnG=\mathbb R^n which is a group under vector addition and has well-defined notions of continuity and differentiation. To measure lengths and volumes (and relatedly, to define and integrate probability densities) we need to endow GG with additional structure so that it is not merely a manifold, but a Riemannian manifold. Luckily for us, we only need to define an inner product for the Lie algebra, after which there is a natural definition of length and volume that can be made for the entire group manifold. I say that the resulting choice of volume (called the Haar measure) is natural because it is compatible with the group structure of GG as well as its differential structure as a manifold. This can be compared to how the standard notion of volume for Rn\mathbb R^n, the Lebesgue measure, is compatible with vector addition; we have for a (measurable) set ARnA\subseteq \mathbb R^n and for every x0Rn\mathbf x_0 \in \mathbb R^n,

Vol(A)=Vol({x+x0  xA}). \textrm{Vol}(A) = \textrm{Vol}(\lbrace \mathbf x + \mathbf x_0 \ \vert\ \mathbf x \in A \rbrace).

Thus begins our journey into making sense of this compatibility in the general context of a Lie group. If you are seeking a more application-oriented approach and/or aren’t all that interested in this sort of abstraction, this book has all the formulae worked out, and my previous post introduces the idea of invariant metrics and measures on Lie groups. Note that I will be using the Einstein summation convention throughout.

Lengths and Volumes ✨

Let MM be an nn-dimensional smooth manifold. A covariant 22-tensor field on MM is a bilinear map that takes 22 smooth vector fields as its arguments and produces a smooth C(M)C^\infty(M) function.1 A Riemannian metric is a covariant 22-tensor field that is symmetric:

g(v1,v2)=g(v2,v1)C(M), \begin{align} g(\mathbf{v}_1, \mathbf{v}_2) &= g(\mathbf{v}_2, \mathbf{v}_1) \in C^\infty(M), \end{align}

and positive-definite:

g(v1,v1)(p)>0v1(p)0. \begin{align} g(\mathbf{v}_1, \mathbf{v}_1)(p) > 0 &\quad \Leftrightarrow \quad \mathbf{v}_1(p) \neq \mathbf 0. \end{align}

where v1,v2X(M)\mathbf{v}_1, \mathbf{v}_2 \in \mathfrak X(M) are smooth vector fields on MM. At some point pMp\in M, the number g(v1,v2)(p)g(\mathbf{v}_1, \mathbf{v}_2)(p) is interpreted as the inner product between the tangent vectors v1(p)\mathbf{v}_1(p) and v2(p)\mathbf{v}_2(p), often written as v1,v2pg(v1,v2)(p)\langle \mathbf v_1, \mathbf v_2 \rangle_p \coloneqq g(\mathbf{v}_1, \mathbf{v}_2)(p). With such a mathematical structure imposed on MM, we call (M,g)(M,g) a Riemannian manifold.

While a metric tensor is a symmetric covariant 22-tensor field, a volume form ω\omega is an alternating covariant nn-tensor field, also called as a differential nn-form. By combining its alternating property with its linearity, ω\omega can be shown to be antisymmetric in its arguments. That is, if v1,v2,,vnX(M)\mathbf{v}_1,\mathbf{v}_2, \cdots, \mathbf{v}_n \in \mathfrak X(M) are smooth vector fields, then

ω(v1,v2,,vn)=ω(v2,v1,,vn)=νC(M) \omega(\mathbf{v}_1, \mathbf{v}_2, \cdots, \mathbf{v}_n) = -\omega(\mathbf{v}_2, \mathbf{v}_1, \cdots, \mathbf{v}_n) = \nu \in C^\infty(M)

The function ν\nu that is spit out by ω\omega (after it eats nn vector fields) assigns the volume ν(p)R\nu(p)\in \mathbb R to the parallelopiped spanned by the vectors v1(p),v2(p),,vn(p)TpM\mathbf{v}_1(p), \mathbf{v}_2(p), \cdots, \mathbf{v}_n(p)\in T_pM. Thus, ω\omega is sort of like a ‘volume meter’ affixed to each point of MM. It is the authority on what counts as a positive volume, what counts as a small or a large volume, and so on. Its antisymmetry can be compared with the fact that abf(x)dx=baf(x)dx\int_a^b f(x) dx= -\int_b^a f(x) dx.

Either of these maps can be written as a tensor in local coordinates on UMU\subseteq M, e.g., g=gijdxidxjg=g_{ij}dx^i \otimes dx^j, where the Einstein summation convention is used and gijC(U)g_{ij}\in C^\infty (U). Since the standard tensor notation doesn’t reflect the symmetry/antisymmetry of the tensors, one typically drops it in favor of notation that does:

g=gi1i2dxi1dxi2ω=ωi1i2indxi1dxi2dxin \begin{align} g &= g_{i_1 i_2}dx^{i_1} dx^{i_2}\\ \omega &= \omega_{i_1 i_2 \cdots i_n} dx^{i_1} \wedge dx^{i_2} \wedge \cdots \wedge dx^{i_n} \end{align}

This helps us remember that the dxi1dx^{i_1} and dxi2dx^{i_2} of gg can be swapped without consequence, whereas swapping the dxi1dx^{i_1} and dxikdx^{i_k} of ω\omega may or may not incur a sign-change depending on the parity of the permutation.

With this notation, observe that if v=vkxk\mathbf{v} = v^k \frac{\partial}{\partial x^k} and w=wlxl\mathbf{w} = w^l \frac{\partial}{\partial x^l} are vector fields on UU, where (xi)i(\frac{\partial}{\partial x^i})_i is the dual basis of (dxi)i(dx^i)_{i}, then

g(v,w)=gijdxidxj(vkxk,wlxl)=gijvkwl [dxidxj(xk,xl)]=gijvkwlδkiδlj=gijviwj. \begin{align} g(\mathbf{v},\mathbf{w}) &= g_{ij} dx^i dx^j \left(v^k \frac{\partial}{\partial x^k}, w^l \frac{\partial}{\partial x^l}\right) \\ &= g_{ij}v^k w^l \ \left[dx^i dx^j \left(\frac{\partial}{\partial x^k}, \frac{\partial}{\partial x^l}\right)\right] \\ &= g_{ij} v^k w^l \delta^i_k \delta^j_l \\ &= g_{ij} v^i w^j. \end{align}

Since it is a (pointwise) sum of (pointwise) products of C(U)C^\infty(U) functions, gijviwjC(U)g_{ij} v^i w^j\in C^\infty(U). The metric tensor coefficients (gij)i,j=1n(g_{ij})_{i,j=1}^n play a role similar to that of a weighting matrix WRn×nW \in \mathbb R^{n \times n} that is introduced when defining a non-standard inner product in Rn\mathbb R^n, as v,wvWw\langle \mathbf{v}, \mathbf{w}\rangle \coloneqq \mathbf{v}^\top W \mathbf{w}.

Frames ✨

In the above expressions, we used a frame (a system of vector fields) that arises from a coordinate chart. However, there may arise situations where we prefer to work with a frame that is not only not induced by a coordinate chart, but also cannot be induced by a coordinate chart. The frame of left-invariant vector fields on a (non-Abelian) Lie group is a prime example of this.

A local frame in an open set UU of MM is a set of tangent vector fields in X(U)\mathfrak X(U), enumerated as (Ei)i=1n(E_i)_{i=1}^n, such that (Ei(p))i=1n(E_i(p))_{i=1}^{n} is a basis of TpMT_p M for all pUMp\in U\subseteq M. A local frame is orthonormal if g(Ei,Ej)=δijg(E_i, E_j) = \delta_{ij}, where δij\delta_{ij} is the Kronecker delta considered as a constant-valued C(U)C^\infty(U) function. A global frame is one that is defined on all of MM, with U=MU=M.

The dual coframe to (Ei)i=1n(E_i)_{i=1}^n is the collection of cotangent (or covariant) vector fields (εi)i=1n(\varepsilon^i)_{i=1}^n, such that εi(Ej)=δji\varepsilon^i(E_j) = \delta^i_{j}. These cotangent vector fields form a basis for differential 11-forms. We can take their tensor products to obtain a basis for covariant kk-tensor fields:

{εi1εi2εik  1i1,i2,,ikn} \begin{align} \lbrace \varepsilon^{i_1} \otimes \varepsilon^{i_2} \otimes \cdots \otimes \varepsilon^{i_k}\ \vert\ 1\leq i_1, i_2, \cdots, i_k \leq n \rbrace \end{align}

or their exterior /wedge products to obtain a basis for the space of differential kk-forms:

{εi1εi2εik  1i1<i2<<ikn} \begin{align} \lbrace \varepsilon^{i_1} \wedge \varepsilon^{i_2} \wedge \cdots \wedge \varepsilon^{i_k}\ \vert\ 1\leq i_1 < i_2 < \cdots < i_k \leq n \rbrace \end{align}

🍩 Orthonormal Frames

In the coordinate coframe (dxi)i=1n(dx^i)_{i=1}^n, we expressed the metric tensor as g=gijdxidxjg = g_{ij}dx^i dx^j. Let’s now try to express it in a local coframe (εi)i=1n(\varepsilon^i)_{i=1}^n on UMU \subseteq M that is dual to an orthonormal one:

g=gijdxidxj=g~ijεiεj \begin{align} g = g_{ij}dx^i dx^j = \tilde g_{ij} \varepsilon^i \varepsilon^j \end{align}

We have then, that

δkl=g(Ek,El)=g~ijεiεj(Ek,El)=g~ijεi(Ek)εj(El)=g~ijδkiδlj=g~kl. \begin{align} \delta_{kl} = g(E_k, E_l) &= \tilde g_{ij} {\varepsilon}^i {\varepsilon}^j \left(E_k, E_l\right) \\ &= \tilde g_{ij} {\varepsilon}^i \left(E_k\right) {\varepsilon}^j \left(E_l\right) \\ &= \tilde g_{ij} \delta^i_k \delta^j_l = \tilde g_{kl}. \end{align}

This means that the metric tensor, when expressed in a coframe dual to an orthonormal one, has the trivial representation: g=δijεiεjg = \delta_{ij} \varepsilon^i \varepsilon^j. By writing out the summation explicitly, this takes a more familiar form:

g=(ε1)2+(ε2)2++(εn)2. g = (\varepsilon^1)^2 + (\varepsilon^2)^2 + \cdots + (\varepsilon^n)^2.

Given a local orthonormal frame of vector fields (Ei)i=1n(E_i)_{i=1}^n whose dual coframe is (εi)i=1n(\varepsilon^i)_{i=1}^n, the unique (up to choice of orientation) Riemannian volume form ωg\omega_g is given by

ωg=ε1ε2εn \begin{align} \omega_g = \varepsilon^1 \wedge \varepsilon^2 \wedge \cdots \wedge \varepsilon^n \end{align}

so that ω(E1,E2,,En)=1\omega(E_1, E_2, \cdots, E_n) = 1. The above statements will look identical in any of the local orthonormal frames of MM.

🍩 Coordinate Frames

The allure of orthonormal frames is that gg and ωg\omega_g can be represented quite succinctly in them. However, the existence of an orthonormal frame that arises as the coordinate frame of a chart is a very rare occasion: such a frame only exists when the Riemannian manifold (M,g)(M, g) is locally flat. If we would rather work with a frame that arises from coordinates, then we must resort to computing the components of a non-flat metric tensor (one that is not simply the Kroenecker delta). Also see my post on the non-flatness of the sphere .

Let xiq\frac{\partial}{\partial x^i}\Big\vert_{q} be the usual coordinate-wise partial derivative operators in Rn\mathbb R^n, q(q1,q2,,qn)q \coloneqq (q_1, q_2, \dots, q_n), and qVRnq\in V \subseteq \mathbb R^n. Given a smooth function fC(V)f\in C^\infty(V), the partial derivative operators of Rn\mathbb R^n operate on ff as follows:

x2qf=limh0f(q1,q2+h,,qn)f(q1,q2,,qn)h \begin{align*} \frac{\partial}{\partial x^2}\bigg\vert_{q} f = \lim_{h\rightarrow 0} \frac{f(q_1, q_2+h, \dots, q_n) - f(q_1, q_2, \dots, q_n)}{h} \end{align*}

Recalling that the job of a vector field is to map a smooth function to a real number at every point (in a smooth manner), the following object is in fact a vector field on VV: xi(  ):VTV.\frac{\partial}{\partial x^i}\bigg\vert_{(\ \cdot\ )}:V \rightarrow TV. Thus, {xi(  )}i=1n\left\lbrace\frac{\partial}{\partial x^i}\big\vert_{(\ \cdot\ )}\right\rbrace_{i=1}^n is a set of vector fields on VV, and may be visualized as a “fisherman’s net” spread across VV.

Ultimately, we want a coordinate frame on a subset UU of MM, rather than on VV, which is in Rn\mathbb R^n. Let φ:UV\varphi:U \rightarrow V be a smooth chart containing some point pp of MM. Its differential (φ)p(\varphi_*)_p is a vector space isomorphism (i.e., an invertible linear map) between TpUT_p U and TqVT_q V. The pushforward of the “partial derivative vector fields” of VV under φ1\varphi^{-1} gives us a coordinate frame on UU. That is, φ1\varphi^{-1} maps the fisherman’s on VV to one on UU. By an abuse of notation, I and many others use (xi(  ))i=1n\left(\frac{\partial}{\partial x^i}\big\vert_{(\ \cdot\ )}\right)_{i=1}^n to refer to either frame; the subscript, the function being operated on, and/or the context will make it clear which frame is being used. This means that if f~C(U)\tilde f\in C^\infty (U), then we write

xipf~=((φ1)φ(p)xiφ(p))f~=xiφ(p)f~φ1=xiφ(p)(φ1f~). \begin{align} \frac{\partial}{\partial x^i}\Big\vert_{p} \tilde f &= \left((\varphi_*^{-1})_{\varphi(p)} \frac{\partial}{\partial x^i}\Big\vert_{\varphi(p)}\right) \tilde f\\ &= \frac{\partial}{\partial x^i}\Big\vert_{\varphi(p)} \tilde f \circ \varphi^{-1} = \frac{\partial}{\partial x^i}\Big\vert_{\varphi(p)} (\varphi^{-1 *} \tilde f). \end{align}

where φ1f~\varphi^{-1 *} \tilde f is called the pullback of f~\tilde f under φ1\varphi^{-1}; it pulls the domain of f~\tilde f back to VV.

🍩 Orthonormal Coordinate Frames

I reiterate that there are Riemannian manifolds where such a coordinate frame couldn’t possibly be orthonormal at all pUp\in U. Theorem 13.14 of Lee’s Introduction to Smooth Manifolds says that this is only possible when UU is flat, i.e., MM is locally flat.


Pullbacks ✨

Let f:MNf:M \rightarrow N be a diffeomorphism between manifolds (though it is possible to generalize the forthcoming discussion to other kinds of smooth maps).

Recall that tangent and cotangent vectors are dual to each other, and so are their kthk^{th} exterior powers: alternating kk-vector fields and differential kk-forms. Whenever we have a morphism for an object going one way, we expect a dual morphism for the corresponding dual object going the other way. Using this intuition, we deduce that if f:TMTNf_*: TM \rightarrow TN allows us to push forward vector fields, there must be a dual morphism f:TNTMf^*: T^* N \rightarrow T^* M that allows us to pull back covector fields. Similarly, f:Ωk(N)Ωk(M)f^*: \Omega^k(N) \rightarrow \Omega^k(M)2 pulls back differential forms from NN to MM (we use the same notation for either map, ff^*). In particular, metrics and volume forms on NN can be pulled back to define metrics and volume forms on MM.

The covariant tensor field thus obtained on MM is called as the pullback of the covariant tensor field on NN under ff. For example, consider M=S2M=S^2 to be the unit 22-sphere and ff to be its usual submersion into N=R3N=\mathbb R^3. Then, the pullback of the Euclidean (‘dot product’) metric gˉ\bar g of R3\mathbb R^3 under ff, fgˉf^* \bar g, is called the round metric, and it is a bonafide Riemannian metric for S2S^2 (I compute its components in the next post ). When the Euclidean metric is pulled back onto a submanifold MR3M\subseteq \mathbb R^3 in this manner, the pullback metric is called the induced metric or the first fundamental form of MM. More generally, we can pull back covariant tensor fields from arbitrary manifolds, as long as we have a smooth map onto it.

The pullback of a differential form is defined such that it must be in concordance with the pushforwards of vector fields. Specifically, if ωΩk(N)\omega \in \Omega^k(N) is a differential kk-form and v1,v2,,vkX(M)\mathbf{v}_1, \mathbf{v}_2, \cdots, \mathbf{v}_k \in \mathfrak X(M), then

(fω)(v1,v2,,vk)=ω(fv1,fv2,,fvk). \begin{align} (f^*\omega)(\mathbf{v}_1, \mathbf{v}_2, \cdots, \mathbf{v}_k) = \omega(f_*\mathbf{v}_1, f_*\mathbf{v}_2, \cdots, f_*\mathbf{v}_k). \end{align}

I like to read this as: fωf^*\omega eats vector fields on MM by imitating how ω\omega might eat the corresponding pushforward vector fields on NN. Thus, fωf^*\omega is a differential kk-form on MM; the domain of ω\omega has been pulled back by ff^*.


Lie Groups ✨

The tangent space at the identity of a Lie group GG can be given one of infinitely many possible inner products. However, there is a unique way to extend this inner product to a Riemannian metric by requiring that it be compatible with the group structure of GG (and as a consequence, compatible with the differential structure of GG as a manifold). For the same reason, there is also a unique choice of volume form (or equivalently, measure) with respect to which one can define the integral. This will be called the Haar integral, and it specializes to the Lebesgue integral when G=RnG=\mathbb R^n.

Yet another useful property of Lie groups is that there is a way to construct global orthonormal frames for it: we choose an orthonormal basis of TeGT_e G and extend it to a set of left/right-invariant vector fields. Even among Lie groups, orthonormal coordinate frames are a rare occurrence; if GG is non-Abelian, then an orthonormal frame could not possibly come from a coordinate system (also see this ). Nevertheless, the fact that a global orthonormal frame exists is already quite a special property.

🍩 Preliminaries

Let GG be a Lie group, eGe\in G its identity element, and g\mathfrak g its Lie algebra.3 Consider an inner product on g\mathfrak g, ,e\langle \cdot, \cdot \rangle_e, and use the Gram-Schmidt process to construct an orthonormal basis for g\mathfrak g. Denote one such orthonormal basis by (E~i)i=1n(\tilde E_i)_{i=1}^n, where E~ig\tilde E_i \in \mathfrak g and E~i,E~je=δij\langle \tilde E_i, \tilde E_j \rangle_e = \delta_{ij}. Its corresponding dual basis is denoted as (ε~i)i=1n(\tilde \varepsilon^i)_{i=1}^n, where ε~ig\tilde \varepsilon^i \in \mathfrak g^*. We can then express ,e\langle \cdot, \cdot \rangle_e by the tensor δijε~iε~j\delta_{ij} \tilde \varepsilon^i \tilde \varepsilon^j.

Let Lg:GG\mathcal L_{g}:G\rightarrow G denote the left-multiplication map, Lg(h)=gh\mathcal L_{g}(h) = gh, and similarly define Rg\mathcal R_{g}4; observe that these maps are diffeomorphisms from GG to GG, and can therefore push and pull tensors and tensor fields from one point of GG to another. For instance, the orthonormal basis (E~i)i=1n(\tilde E_i)_{i=1}^n can be extended to a global orthonormal frame (Ei)i=1n(E_i)_{i=1}^n on GG:

Ei(g)(Lg)E~i. E_i(g) \coloneqq \left(\mathcal L_{g}\right)_* \tilde E_i.

Such a global orthonormal frame on GG also serves as a “basis” of the space (or more rigorously, a generating set of the C(G)C^\infty(G)-module) of vector fields on GG, since any vector field VX(G)V\in\mathfrak X(G) can be uniquely expressed as V=viEiV = v^i E_i with viC(G)v^i \in C^\infty(G). Vector fields of the form ciEi\text{c}^i E_i (where ci\text{c}^i are constants) are precisely the left-invariant vector fields of GG.

One can similarly extend (ε~i)i=1n(\tilde \varepsilon^i)_{i=1}^n to a left-invariant global coframe (εi)i=1n(\varepsilon^i)_{i=1}^n on GG:

εi(g)(Lg1)ε~i. \varepsilon^i(g) \coloneqq \left(\mathcal L_{g^{-1}}\right)^* \tilde \varepsilon^i.

Immediately, we have the following property at all gGg\in G:

εi(Ej)(g)=[(Lg1)ε~i]((Lg)E~j)=ε~i((Lg1)(Lg)E~j)=ε~i(E~j)=δij. \begin{align} \varepsilon^i(E_j)(g) &= \Big[\left(\mathcal L_{g^{-1}}\right)^* \tilde \varepsilon^i\Big]\Big(\left(\mathcal L_{g}\right)_* \tilde E_j \Big) \\ &= \tilde \varepsilon^i\Big(\left(\mathcal L_{g^{-1}}\right)_*\left(\mathcal L_{g}\right)_* \tilde E_j \Big) =\tilde \varepsilon^i\Big(\tilde E_j\Big) = \delta_{ij}. \end{align}

In the following, we assume that (Ei)i=1n(E_i)_{i=1}^n and (εi)i=1n(\varepsilon^i)_{i=1}^n are left-invariant. Analogous arguments follow for the right-invariant case. The only caveat is that the left-invariant and right-invariant metrics and volume forms may or may not turn out to be the same, as discussed in my previous post .

🍩 Left-Invariance of Vector Fields

Let’s scrutinize the left-invariance of EiE_i. Pretend that the left-multiplication map LgL_g sends GG to another copy of itself, denoted as G💧G^💧!

If we view EiE_i as a vector field on GG, its pushforward on G💧G^💧, (Lg)Ei(\mathcal L_{g})_*E_i, should act on a function fC(G💧)f\in C^\infty (G^💧) by mimicking whatever EiE_i would have done in its place. Given some point hGh\in G, with Lg(h)=ghG💧\mathcal L_g(h) = gh \in G^💧, we have

[(Lg)Eif](gh)=[Ei(fLg)](h) \begin{align} \big[(\mathcal L_{g})_*E_i \,f\big](gh) &= \big[E_i(f \circ \mathcal L_g)\big](h) \end{align}

Moreover,

[Ei(fLg)](h)=ddt[fLg](hexp(tE~i))t=0=ddtf(ghexp(tE~i))t=0=[Eif](gh)=[(Eif)Lg](h). \begin{align} \big[E_i(f \circ \mathcal L_g)\big](h) &= \frac{d}{dt}[f\circ \mathcal L_g]\big(h \exp(t\tilde E_i)\big)\Big\vert_{t=0}\\ &= \frac{d}{dt}f\big(g h\exp(t\tilde E_i)\big)\Big\vert_{t=0}\\ &= [E_i f](gh)\\ &= [(E_i f)\circ \mathcal L_g](h). \end{align}

Now let G=G💧G=G^💧 (as we have done implicitly in the calculation above). Observe that the calculation above involved the following maps:

f:GREif:GR \begin{align} f: G &\rightarrow \mathbb R\\ E_i f : G &\rightarrow \mathbb R\\ \end{align}

i.e., ff and its derivative. Then, we notice that we can perform either of these maps on G💧G^💧 as well. That is, we do Lg:GG\mathcal L_g: G \rightarrow G first, and then perform either of the above maps. This gives us two more maps:

fLg:GR(Eif)Lg:GR \begin{align} f\circ \mathcal L_g: G &\rightarrow \mathbb R\\ (E_i f)\circ \mathcal L_g : G &\rightarrow \mathbb R\\ \end{align}

Finally, we note that EiE_i can act on the function fLgf\circ \mathcal L_g, giving us yet another function

Ei(fLg):GR.E_i(f\circ \mathcal L_g): G \rightarrow \mathbb R. That Ei(fLg)E_i(f\circ \mathcal L_g) and Ei(f)LgE_i(f)\circ \mathcal L_g are the same function, is what we showed, which is not true unless EiE_i is left-invariant. The fact that Lg\mathcal L_g moves in and out of the differentiation is what “left-invariant” refers to (also see the commutative square here ). An analogous property is exhibited by εi\varepsilon^i.

🍩 Left-Invariance of Geometric Structure

Now consider what should happen if we define the Riemannian metric of GG as

,=kijεiεj \langle \hspace{1pt}\cdot\hspace{2pt},\hspace{1pt}\cdot\hspace{2pt}\rangle = \textrm{k}_{ij}\hspace{1pt}\varepsilon^i \varepsilon^j

where kij\textrm{k}_{ij} are constants that should be thought of as a “weighting matrix”. Clearly, this metric should inherit the left-invariance properties of (εi)i=1n(\varepsilon^i)_{i=1}^n. Indeed, we can use similar arguments as before to show that if v,wTgG\mathbf v, \mathbf w \in T_gG, then

v,wg=(Lh)gv,(Lh)gwhg \langle \mathbf v,\mathbf w\rangle_g = \langle (\mathcal L_h)_{\ast_g}\mathbf v, (\mathcal L_h) _{\ast_g}\mathbf w\rangle _{hg}

To see this, we evaluate the right hand side:

(Lh)gv,(Lh)gwhg=kijεhgiεhgj((Lh)gv,(Lh)gw)=kijεhgi((Lh)gv)εhgj((Lh)gw) \begin{align*} \langle (\mathcal L_h)_{\ast_g}\mathbf v, (\mathcal L_h) _{\ast_g}\mathbf w\rangle _{hg} &= \textrm{k}_{ij}\hspace{1pt}\varepsilon^i_{hg} \varepsilon^j_{hg} \big((\mathcal L_h)_{\ast_g}\mathbf v, (\mathcal L_h) _{\ast_g}\mathbf w\big)\\ &= \textrm{k}_{ij}\hspace{1pt}\varepsilon^i_{hg} \big((\mathcal L_h)_{\ast_g}\mathbf v\big) \varepsilon^j_{hg} \big((\mathcal L_h) _{\ast_g}\mathbf w\big)\\ \end{align*}

Then, use the duality between pushforwards and pullbacks to show that

εhgi((Lh)gv)=[(Lh)hgεhgi](v)=[(Lh)hg(Lh1)gεgi](v)=εgi(v). \begin{align*} \varepsilon^i_{hg} \big((\mathcal L_h)_{\ast_g}\mathbf v\big) &= \big[(\mathcal L_h)^\ast_{_{hg}}\varepsilon^i_{hg}\big] \big(\mathbf v\big) \\ &=\big[(\mathcal L_h)^\ast_{_{hg}} (\mathcal L_{h^{-1}})^\ast_{_g} \varepsilon^i_{g}\big] \big(\mathbf v\big) \\ &= \varepsilon^i_g \big(\mathbf v\big). \end{align*}

The notation may seem cumbersome, but given how light-yet-powerful the notation of differential geometry is already, it’s not too bad (depending on what you intend to do with it). Drawing a diagram involving the points gg and hghg, as well as the maps Lg\mathcal L_g and Lh\mathcal L_h, can help in understanding the above calculation.

A left-invariant volume form can be defined as ω=ε1ε2εn\omega = \varepsilon^1 \wedge \varepsilon^2 \wedge \cdots \wedge \varepsilon^n, and has analogous invariance properties.


🍩 Some Actual Examples

We could also have chosen to work with a coordinate coframe on an open set UU containing gg in order to express ,\langle \cdot, \cdot \rangle. In this case, we should compute the metric tensor coefficients since they will no longer be trivial. In this post , I computed the metric tensor coefficients for the sphere (not a Lie group!) in spherical polar coordinates. In the next one , I pull back the geometric structure of SO(3)SO(3) to its axis-angle parametrization .


  1. Formally, such an object is an element of Γ(TMTM)\Gamma (T^*M \otimes T^*M), i.e., it is ‘a smooth section of the 2nd tensor power of the cotangent bundle of MM’. There is a sense in which covariant kk-tensor fields are elements of the dual space corresponding to the module of contravariant kk-tensor fields on MM, where instead of a field of scalars, we have a ring of C(M)C^\infty(M) functions. With this linear algebraic perspective, we recognize that a vector and its dual should combine to give a scalar. ↩︎

  2. The space of differential kk-forms on MM is denoted by Ωk(M)\Omega^k(M), which is also the space Γ(ΛkTM)\Gamma (\Lambda^k T^* M) of smooth sections of the kthk^{th} exterior power of the cotangent bundle, TMT^*M. Note that Ωk(M)\Omega^k(M) is a subspace (specifically, a submodule) of the space of all the covariant kk-tensor fields on MM (viewed as a C(M)C^\infty(M)-module). ↩︎

  3. We conflate g\mathfrak g with TeGT_e G for convenience. (the latter does not come with a Lie bracket). ↩︎

  4. Beware: In the first half of this post, gg denoted the Riemannian metric, whereas in the latter half, it represents an arbitrary element of GG↩︎