Rolling Geodesics, Mechanical Systems and Elastic Curves

Jurdjevic, Velimir

doi:10.3390/math10244827

Open AccessFeature PaperArticle

Rolling Geodesics, Mechanical Systems and Elastic Curves

by

Velimir Jurdjevic

Department of Mathematics, University of Toronto, Toronto, ON M5S 3G3, Canada

Mathematics 2022, 10(24), 4827; https://0-doi-org.brum.beds.ac.uk/10.3390/math10244827

Submission received: 20 September 2022 / Revised: 10 December 2022 / Accepted: 12 December 2022 / Published: 19 December 2022

(This article belongs to the Special Issue Variational Methods on Riemannian Manifolds: Theory and Applications)

Download Versions Notes

Abstract

:

This paper defines a large class of differentiable manifolds that house two distinct optimal problems called affine-quadratic and rolling problem. We show remarkable connections between these two problems manifested by the associated Hamiltonians obtained by the Maximum Principle of optimal control. We also show that each of these Hamiltonians is completely intergrable, in the sense of Liouville. Finally we demonstrate the significance of these results for the theory of mechanical systems.

Keywords:

Lie groups; Lie algebras; homogeneous manifolds; Hamiltonians; Poisson bracket; mechanical tops

MSC:

49J15; 53A17; 53A35; 58A05; 58A30; 70B15

1. Introduction

This paper is a continuation of my long-standing interest in the role of Lie groups and Lie algebras in the theory of integrable systems and the equations of mathematical physics. The interest in this topic originated in two seemingly unrelated phenomena, the presence of elastica in the theory of rolling spheres ([1,2]), and the presence of the heavy top in the equations describing the equilibrium configurations of an elastic rod ([3,4]). My interest in these phenomena was further renewed by the subsequent studies ([5,6,7]) that showed intriguing connections between rolling problems, elastic curves and problems in mechanics. These studies also identified a class of variational problems on Lie groups, called affine-quadratic that not only played a pivotal role in this theory, but also made a significant impact on the theory of integrable systems ([8], Chapters 9, 10 and 11).

In this paper, we will shift emphasis to a new class of rolling problems associated with homogeneous Riemannian spaces rolling isometrically on their tangent planes (based on our recent study [9,10] ). We will show that each such isometric rolling has a well defined length which then leads to natural definition for a rolling geodesic. The rolling problem then consists of finding some necessary differential conditions that the rolling geodesics must satisfy.

We will show that each rolling problem can be recast as a left-invariant optimal control problem on a Lie group, and consequently, we will be able to regard the rolling geodesics as the projections of the extremal curves generated by a suitable Hamiltonian obtained through Pontryagin’s Maximum Principle. We will show several remarkable properties of the aforementioned Hamiltonian. First we will show that any such Hamiltonian is completely integrable, and secondly, we will show that the Hamiltonian system associated with an affine-quadratic system may be regarded as an invariant subsystem of the Hamiltonian differential system associated with the rolling problem. This discovery sheds new light on the geometric origins of the affine-quadratic systems and their connections to mechanical systems ([11,12]). These findings seem particularly remarkable considering the fact that the control functions that define these optimal problems lie in mutually orthogonal spaces of each other.

The general setting of the paper in which the above-mentioned problems will be analyzed is defined by a semi-simple Lie group G and a compact subgroup K with a finite centre. Any such pair

(G, K)

is reductive in the sense that the Lie algebra

g

of G admits a splitting

g = k \oplus p

where

p

is a vector space complementary to the Lie algebra

k

of K. In this paper,

p

will be the orthogonal complement of

k

relative to the Killing form

K l (X, Y) = T r (a d X \circ a d Y)

. We recall that the Killing form is non-degenerate on G and also satisfies

K l (X, [Y, Z]) = K l ([X, Y], Z)

for any elements

X, Y, Z

in

g

. This implies that

[p, k] \subset p

. We shall make another assumption that

k

and

p

satisfy strong Cartan’s Lie algebraic conditions

[p, p] = k, [p, k] \subseteq p, [k, k] \subseteq k .

(1)

Finally, we will assume that the Killing form is of definite sign on

p

. This last condition is automatically satisfied when G is compact and is also satisfied by irreducible symmetric Riemannian pairs

(G, K)

in the theory of symmetric spaces.

Let us now recall the definition of the affine-quadratic problem in this general setting ([8]).

1.1. Affine-Quadratic Problem

Any element A in

p

generates an affine set

Γ = {A + U : U \in k}

in

g

, and this set defines a left invariant differential system

\frac{d g}{d t} = g (t) (A + U (t)), g (t) \in G,

(2)

where

U (t)

is a bounded and measurable curve in

k

. We will think of (2) as a control system with

U (t)

playing the role of control functions. We will assume that A is regular

p

, that is, that the set of elements in

p

that commute with A forms an abelian subalgebra in

g

. Our assumption

[p, p] = k

implies that

g = g_{1} \oplus g_{2} \dots \oplus g_{m}

where each factor

g_{i}

is a simple ideal of the form

g_{i} = p_{i} + [p_{i}, p_{i}]

. It then follows that the projection of a regular element A on each factor

g_{i}

in (4) is non-zero which, in turn, implies that (2) is controllable, in the sense that for any two points

g_{0}, g_{1}

in G there is a solution

g (t)

on an interval

[0, T]

that satisfies

g (0) = g_{0}

and

g (T) = g_{1}

(see [8], page 162 for a proof). Since any two Cartan subalgebras in

p

are

A d_{K}

conjugate, so are the systems defined by any two regular elements

A_{1}

and

A_{2}

.

We will now let

\frac{1}{2} \int_{0}^{T} 〈 U (t), U (t) 〉 d t

be the energy functional associated with any solution

g (t)

of (2) generated by a control

U (t)

, where

〈 A, B 〉 = - K l (A, B)

. Note that the Killing form is negative semi-definite on the Lie algebra

k

of K when K is compact, and is strictly negative when K has a finite centre (2). Therefore, our energy functional is positive for any non-zero control

U (t)

. This energy functional is called canonical relative to a more general one

\frac{1}{2} \int_{0}^{T} 〈 P (U (t)), U (t) 〉 d t

defined by any positive linear operator

P

on

k

.

The above data induce a natural optimal control problem: find the solutions

g (t)

of (2) that satisfy the given boundary conditions

g (0) = g_{0}

,

g (T) = g_{1}

for which the energy of transfer

\frac{1}{2} \int_{o}^{T} 〈 P (U (t)), U (t) 〉 d t

is minimal. The above optimal control problem will be referred to as the affine-quadratic problem (reminiscent of linear-quadratic problems in the control theory literature). In this paper we shall be interested only in the canonical case

P = I

.

As we mentioned earlier, the pair

(G, K)

is reductive. Any reductive semi-simple Lie algebra

g

also carries along a “hidden” semi-direct product

g_{0} = p ⋊ k

for the following reasons. Since

[p, k] \subseteq p

, K acts linearly on

p

by the adjoint action

h \to {Ad}_{h} |_{p}, h \in K

, and induces the semi-direct product

G_{0} = p ⋊ K

with the group operation

(A_{1}, h_{1}) (A_{2}, h_{2}) = (A_{1} + {Ad}_{h_{1}} (A_{2}), h_{1} h_{2})

. Then the Lie algebra

g_{0}

of

G_{0}

is equal to

p ⋊ k

with the Lie bracket given by

[(A_{1}, B_{1}), (A_{2}, B_{2})] = ([B_{1}, A_{2}] - [B_{2}, A_{1}], [B_{1}, B_{2}]), (A_{i}, B_{i}) \in p \times k .

We will identify elements

(A, B) \in p \times k

with the sums

A + B

under the identification

(A, B) = (A, 0) + (0, B) = A + B

, in which case the Lie brackets in

g_{0}

are identified with

[A_{1} + B_{1}, A_{2} + B_{2}] = [B_{1}, A_{2}] - [B_{2}, A_{1}] + [B_{1}, B_{2}] .

Thus,

g

as a vector space carries two Lie brackets:

{[A_{1} + B_{1}, A_{2} + B_{2}]}_{s} = [B_{1}, A_{2}] - [B_{2}, A_{1}] + s [A_{1}, A_{2}] + [B_{1}, B_{2}],

defined by a single parameter s:

s = 0

in the semi-direct case, and

s = 1

in the semi-simple case.

It follows that every affine space

Γ = {A + U : U \in k}

that defines an affine left-invariant system on G also defines a corresponding left-invariant affine system on the semi-direct product

G_{0}

. Thus, behind every affine quadratic optimal problem on G there is a corresponding affine-quadratic “shadow” problem on the semi-direct product

G_{0}

.

When K is a compact group with finite centre, then the above optimal problems are well defined in the sense that for any set of boundary points

g_{0}

and

g_{1}

there exists an optimal trajectory that satisfies

g (0) = g_{0}

and

g (T) = g_{1}

for some

T > 0

.

Remarkably, the Hamiltonian associated with the shadow problem is particularly relevant in the theory of mechanical systems (see [8], Ch. 10 for the mechanical problem of Neumann on the sphere [13], Ch. 11 for Jacobi’s problem on the ellipsoid, and Ch. 13 for the elastic problem and the pendulum). This phenomenon raises a natural question: what is the geometric origin behind the affine-quadratic problem that properly accounts for its relevance for the above mentioned problems? This question was partly addressed in the literature on integrable systems where the drift vector was associated with a linear potential V associated to an abstract “rigid body” with a Hamiltonian

H (g, L) = \frac{1}{2} 〈 P^{- 1} (L), L 〉 + V (g)

on the tangent bundle of a Lie group G ([14]) but that association raised its own questions, and at the end proved to be more enigmatic than useful.

In this paper, we will show that the Poisson systems generated by the canonical affine-quadratic problem and the rolling problem provide new and original answers to the above query: we will show that the Poisson system associated with the affine-quadratic problem is an invariant subsystem of the Poisson system generated by the rolling problem on a coadjoint orbit where the drift element A appears as a constant of motion for the rolling problem (Propositions 5 and 6).

With this goal in mind, we will now turn our attention to the quotient space

G / K

and the rolling problem.

1.2. Homogeneous Riemannian Manifolds

We will first need to introduce the Riemannian structure on the homogeneous manifold

M = G / K

defined by G and K. To begin with we will regard G as a semi-Riemannian manifold (in the sense of O-Neill [15]) with the left-invariant metric

{〈 〈 g X, g Y 〉 〉}_{g} = 〈 X, Y 〉, X, Y \in g

induced by a scalar multiple of the Killing form

〈, 〉

that is positive definite on

p

. Such a choice is possible by our assumption. On compact Lie groups G, this multiple will be a negative multiple of the Killing form and then the above metric on G coincides with the canonical bi-invariant metric. However, on non-compact Lir groups, the Killing form is indefinite and the above metric is semi-Riemannian. Here

g X

is a shorthand notation for the left-invariant vector field

X (g) = d_{e} L_{g} (X)

, where

L_{g}

is the left translation

L_{g} (h) = g h

. The same shorthand notation applies to the right-invariant vector fields with

X (g) = X g = d_{e} R_{g} (X), R_{g} h = h g .

We also recall that the Killing form is invariant under any linear automorphism of

g

and hence the quadratic form

〈, 〉

is

A d_{G}

invariant.

In order to make an easy passage to the techniques of optimal control, we will assume that all curves are absolutely continuous, and all differential equations involving such curves will be understood to be true only up to sets of measure zero without explicitly saying so. With that convention in mind any curve

g (t)

in G is a solution of

\frac{d g}{d t} = g (t) U (t)

for some bounded and measurable curve

U (t) \in g

. When

U (t)

takes values in

p

,

g (t)

is called horizontal, and when

U (t)

takes values in

k g (t)

is called vertical. Correspondingly, the left-invariant distributions

H (g) = {g X : X \in p}

and

V (g) = {g X : X \in k}

will be called horizontal and vertical, respectively. Thus, horizontal curves are tangent to

H

in the sense that

\frac{d g}{d t} \in H (g (t))

. Likewise vertical curves are tangent to

V

. It follows that

H (g) \oplus V (g) = T_{g} G, g \in G .

(3)

We shall assume that

G / K

is endowed with a manifold structure so that the natural projection

π (g) = g K

is a smooth surjection (such a structure exists ([15])). Then

G / K

with this manifold structure will be denoted by M and o will denote the point in M such that

π (e) = o

, where e is the group identity in G.

A curve

g (t)

in G is called a lift of a curve

p (t) \in M

if

π (g (t)) = p (t)

. Such a lift is said to be horizontal when

g (t)

is a horizontal curve. The projection

p (t)

of a vertical curve

g (t)

is a single point

π (g (0))

in M because any solution of

\frac{d g}{d t} = g (t) U (t), U (t) \in k

is of the form

g (t) = g (0) h (t), h (t) \in K

.

If

g (t)

is any lift of a curve

p (t)

, then

\frac{d g}{d t} = g (t) U (t) = g (t) (U_{p} (t) + U_{k} (t))

where

U_{p} (t)

and

U_{k} (t)

are the orthogonal projections of

U (t)

on

p

and

k

. Then,

d_{g (t)} π (g (t) U (t)) = d_{g (t)} π (U_{p} (t)) = \frac{d p}{d t} .

The above shows that

\tilde{g} (t)

, the solution of

\frac{d \tilde{g} (t)}{d t} = \tilde{g} (t) U_{p} (t), \tilde{g} (0) = g (0)

, is a horizontal curve that projects on

p (t)

, and secondly, it shows that

d_{g} π (g U (t)) = \frac{d p}{d t}

for any horizontal lift

g (t)

of

p (t)

. The isomorphism

H (g) \to T_{π (g)} M

can then be used to induce a metric on M

{(d_{g} π (g V), d_{g} π (g W))}_{π (g)} = {〈 〈 g V, g W 〉 〉}_{g}, V, W \in p .

(4)

Let now

{τ_{g} : g \in G}

denote the group of diffeomorphisms on M defined by the group action

π (L_{g} (h)) = τ_{g} (π (h)), h \in G, L_{g} (h) = g h .

(5)

Since G acts transitively on M, M can be represented by the orbit

{τ_{g} (o) : g \in G}

. It follows that

π (exp (t U) g) = τ_{exp (t U)} π (g)

for any

U \in g

. Note that

g \to exp (t U) g

is the flow generated by the right-invariant vector field

U_{r} (g) = U g

. The above equality shows that the flow of

U_{r}

is

π

-related to the flow

{τ_{exp (t U)}, t \in R}

in M.

In what follows,

\vec{U}

will denote the infinitesimal generator of the flow

{τ_{exp (t U)}, t \in R}

, and

F

will denote the family of vector fields

{\vec{U} : U \in g}

. The correspondence

U_{r} (g) \to \vec{U} (π (g))

is one to one and onto

T_{π (g)} M

. Since the Lie brackets of vector fields related by a mapping F are also F-related ([16]) , the Lie brackets

[U_{r}, V_{r}]

are

d π

-related to

[\vec{U}, \vec{V}]

. Therefore the correspondence

U_{r} (g) \to \vec{U} (π (g))

is a Lie algebra homomorphism, and hence

F = {\vec{U} : U \in g}

is a finite dimensional Lie algebra of vector fields that satisfies

F (p) = T_{p} M

for each

p \in M

. Elements of

F

are generally known as the vector fields generated by the group action.

Note that

π (exp (t U)) = τ_{e^{(t U)}} (o) = exp (t \vec{U}) (o)

and therefore

d_{e} π (U) = \vec{U} (o)

. Then

π (g) = τ_{g} π (e)

implies that

d_{g} (π (g U)) = d_{o} τ_{g} d_{e} π (U) = d_{o} τ_{g} \vec{U} (o) .

(6)

Furthermore,

{(\vec{U} (o), \vec{V} (o))}_{o} = {〈 〈 U_{r} (e), V_{r} (e) 〉 〉}_{e} = {〈 〈 U_{l} (e), V_{l} (e) 〉 〉}_{e} = 〈 U, V 〉 .

(7)

Hence,

\begin{matrix} {(\vec{U} (o), \vec{V} (o))}_{o} = 〈 U, V 〉 = {〈 〈 g U, g V 〉 〉}_{g} = \\ {(d_{g} π (g U), d_{g} π (g V))}_{π (g)} = {(d_{o} τ_{g} \vec{U} (o), d_{o} τ_{g} \vec{V} (o))}_{π (g)} . \end{matrix}

It follows that

(d_{g} τ_{g} (V (p), d_{g} τ_{g} {(W (p))}_{τ_{g} (p)} = {(V (p), W (p))}_{p},

(8)

for any

g \in G

and any tangent vectors

V (p)

and

W (p)

in

T_{p} M

.

Therefore,

{τ_{g} : g \in G}

acts on M by isometries, and consequently each vector field

\vec{U}

in

F

is a Killing vector field. Recall that the isometry group of M is a subgroup of Diff

(M)

that leaves the metric invariant, also recall that a vector field is a Killing vector field if its flow acts on M by isometries (the flow of

\vec{U}

is given by

τ_{exp (t U)}, t \in R

). See [15] for additional details.

A homogeneous manifold

M = G / K

defined by the above data will be referred to as semi-simple (it is defined by a semi-simple Lie group G, a compact subgroup K, and the metric induced by the Killing form). It can be shown that any symmetric Riemannian space with no Euclidean factors can be reduced to a semi-simple manifold (so that

[p, p] = k

holds). Conversely, if G is simply connected then every semi-simple manifold is symmetric (see [17], Proposition 6.27). In any event, the present exposition makes no use of geodesic symmetry so there is no need to get distracted with the theory of symmetric spaces.

On semi-simple manifolds, parallel transport and covariant derivative are given by nice formulas inherited from G. To elaborate, note that any semi-simple Lie group G with its left-invariant metric a scalar multiple of the Killing form is a semi-Riemannian group in the terminology of O’Neill ([15], p. 305) because the Killing form is

A d_{G}

invariant (it is only in the compact case that this semi-metric is Riemannian, i.e., equal to the canonical bi-invariant metric on G).

Relative to this left-invariant semi-metric,

\nabla_{X} Y = \frac{1}{2} [X, Y]

, X and Y left-invariant, is the (unique) bi-invariant affine connection that preserves the inner product and is torsion free ([15]). The associated covariant derivative

\frac{D_{g (t)}}{d t} V (t)

of a vector field

g (t) V (t)

defined along a curve

g (t)

in G is given by

\frac{D_{g (t)}}{d t} V (t) = g (t) (\frac{d V}{d t} (t) + \frac{1}{2} [V (t), U (t)]), g^{- 1} (t) \frac{d g}{d t} (t) = U (t) .

(9)

Since the metric on M is the pull-back of the metric on G, the covariant derivative and parallel transport in M can be described in terms of the lifted objects in

g

via the following formulas ([9]): any curve of tangent vectors

v (t)

along a curve

p (t)

in M can be represented by

v (t) = d_{g (t)} π (g (t) V (t))

in terms of a unique curve

V (t) \in p

, where

g (t)

denotes a horizontal curve in G that projects onto

p (t)

. It follows that

\frac{d g}{d t} = g (t) U (t), U (t) \in p

and

d_{g (t)} π (g (t) U (t)) = \frac{d p}{d t}

. Then the covariant derivative

\frac{D_{p (t)}}{d t} v (t)

of

v (t)

along

p (t)

is given by

\frac{D_{p (t)}}{d t} v (t) = d_{g (t)} π (g (t) (\frac{d V}{d t} + \frac{1}{2} {[U (t), V (t)]}_{p})) = d_{g (t)} π (g (t) \frac{d V}{d t}) .

(10)

where

{[U (t), V (t)]}_{p}

denotes the orthogonal projection of

[U, V]

on

p

(because of our assumption

[p, p] \subseteq k

, the orthogonal projection of

[U, V]

on

p

is zero). Hence,

v (t)

is parallel along

p (t)

whenever

v (t)

is the projection of a curve

g (t) V (t)

with

V (t)

a constant in

p

. With this background at our disposal we will now come to the rolling problem.

1.3. The Rolling Problem

The most direct route to the rolling problem is via the intrinsic definition of rolling, introduced by R. Bryant and L. Hsu in ( [18]), and later used by A. Agrachev in ( [19]), Y. Chitour in ( [20,21]) and Godoy Molina in ([22]). According to this definition a curve

α (t)

on a Riemannian manifold M rolls on a curve

\hat{α} (t)

on another Riemannian manifold

\hat{M}

if there exists an isometry

A (t) : T_{α (t)} M \to T_{\hat{α} (t)} \hat{M}

that satisfies:

\frac{d \hat{α}}{d t} = A (t) \frac{d α}{d t},

(11)

and also satisfies the condition that

A (t) v (t)

is a parallel vector field in

\hat{M}

along

\hat{α} (t)

for each parallel vector field

v (t)

along

α (t)

in M. The triple

(α (t), \hat{α} (t), A (t))

is called a rolling curve. It is clear that rolling is reflexive in the sense that if

α (t)

is rolled on

\hat{α} (t)

by an isometry

A (t)

then

\hat{α} (t)

is rolled on

α (t)

by the isometry

A^{- 1} (t)

, and therefore

(\hat{α} (t), α (t), A^{- 1} (t))

is also a rolling curve. We will take

\hat{M} = T_{o} M

which we regard as a Euclidean space with its metric

{(u, v)}_{o}

defined by (4) and we address the rolling of curves in M on curves in

\hat{M}

. Recall that in any semi-Euclidean vector space parallel transport

v (t)

along a curve

\hat{α} (t)

in

\hat{M}

is done only by constant vector fields (translations).

Any curve

α (t)

in M is the projection of a horizontal curve

g (t)

, that is,

\frac{d g}{d t} = g (t) U (t), U (t) \in p

and

α (t) = π (g (t)) = τ_{g (t)} (o)

. Then,

\frac{d α (t)}{d t} = d_{g (t)} π (g (t) U (t)) = d_{o} τ_{g (t)} \vec{U} (t) (o),

(12)

where

\vec{U} (t)

denotes the curve of Killing vector fields in

T_{o} M

defined by

U (t)

. If we now let

\hat{α} (t)

be any solution in

\hat{M}

of

\frac{d \hat{α} (t)}{d t} = \vec{U} (t) (o)

and let

A (t) = d_{o} τ_{g (t)}

then

A (t)

is an isometry that rolls

\hat{α} (t)

on

α (t)

since the parallel transport condition is fulfilled (by Equation (10)). Of course, then

A^{- 1} (t)

rolls

α (t)

on

\hat{α} (t)

.

It follows that each horizontal curve

g (t)

in G defines a family of curves

\hat{α} (t)

in

\hat{M}

, each a solution of

\frac{d \hat{α}}{d t} = \vec{U} (t) (o)

associated with

U (t) = g^{- 1} (t) \frac{d g}{d t}

, that roll on

α (t) = π (g (t))

. Conversely, every solution

(g (t), \hat{α} (t))

of the differential system

\frac{d g}{d t} = g (t) U (t), \frac{d \hat{α} (t)}{d t} = \vec{U} (t) (o), U (t) \in p

(13)

defines a curve

α (t) = π (g (t))

in M on which

\hat{α} (t)

in

\hat{M}

is rolled by the isometry

d_{o} τ_{g (t)}

.

The rolling problem will be defined on the configuration space

G = G \times \hat{M}, \hat{M} = T_{o} M

, which will be regarded as a Lie group with the group operation

g h = (g, p) (h, q) = (g h, p + q)

, for all

g = (g, p)

and

h = (h, q)

in

G

. Then the Lie algebra

G

of

G

will be naturally identified with

g \times T_{o} M

with the Lie bracket

[(X, \vec{U} (o)), (Y, \vec{V} (o))] = ([X, Y], 0)

.

Let now

H (g, p)

denote the left invariant distribution defined by

Γ = {(U, \vec{U} (o)) : U \in p}

that is,

H (g, p) = {(g U, \vec{U} (o)) : U \in p}, (g, p) \in G .

(14)

The distribution

H

will be referred to as the rolling distribution and its integral curves will be called rolling motions. Any rolling motion

g (t) = (g (t), p (t))

is a solution of

\frac{d g}{d t} = g (t) U (t), \frac{d p}{d t} = \vec{U} (t) (o),

(15)

and can be associated with the rolling curve

(\hat{α} (t), α (t)), d_{o} τ_{g (t)})

, where

α (t) = τ_{g (t)} (o)

. The reader may want to show that this intrinsic definition of rolling agrees with the extrinsic descriptions [23] based on the formalism in [24].

Since

Γ

is a vector subspace in

G

that satisfies

Γ + [Γ, Γ] + [Γ, [Γ, Γ]] = G,

(16)

the Lie algebra generated by the left-invariant vector fields tangent to

H

is equal to

G

, and therefore, any two points in

G

can be connected by a rolling motion, and each rolling motion inherits a natural length

\int_{0}^{T} \sqrt{〈 U (t), U (t) 〉} d t

from G. To put the matter in a control theoretic context, let

A_{1}, \dots, A_{m}

be an orthonormal basis in

p

so that

(A_{i}, {\vec{A}}_{i} (o))

is an orthonormal basis in

Γ

. Then an absolutely continuous curve

g (t) = (g (t), p (t))

is a rolling motion if and only if

\frac{d g}{d t} = g (t) (\sum_{i = 1}^{m} u_{i} (t) A_{i}), \frac{d p}{d t} = \sum_{i = 1}^{m} u_{i} (t) {\vec{A}}_{i} (o),

(17)

for some bounded and measurable control functions

u_{1} (t), \dots, u_{m} (t)

, in which case the length of

g (t)

is given by

\int_{0}^{T} \sqrt{u_{1}^{2} (t) + \dots + u_{m}^{2} (t)} d t

. It then follows from (16) that the Lie algebra generated by the left-invariant vector fields

X_{i} (g, p) = (g A_{i}, {\vec{A}}_{i} (o)), i = 1, \dots, m

is of full rank in

G

. Since each left-invariant vector field

X_{i}

is complete, any pair of points in

G

can be connected by an integral curve of

H

of minimal length ([19]). An integral curve

g (t)

of

H

is called a rolling geodesic if for any

t_{0}

and

t_{1}

, sufficiently close to each other, the length of

g (t)

in the interval

[t_{0}, t_{1}]

is minimal among all other integral curves of

H

that connect

g (t_{0})

to

g (t_{1})

.

The rolling problem consists of characterizing the rolling geodesics in

G

induced by

H

. Since each rolling geodesic is also a sub-Riemannian geodesic on the configuration space

G

relative to the above length, the rolling problem can be equivalently phrased as a sub-Riemannian problem in

G

where one looks for the solutions

g (t) = (g (t), p (t))

on a fixed time interval

[0, T]

that satisfy the given boundary conditions

g (0) = g_{0}

and

g (T) = g_{1}

along which the energy of transfer

\frac{1}{2} \int_{0}^{T} \sum_{i = 1}^{m} u_{i}^{2} (t) d t

is minimal.

Return now briefly to the affine-quadratic problem introduced earlier with its dynamics

\frac{d g}{d t} = g (t) (A + U (t)), U (t) \in k

(18)

and the energy (sometimes called the cost in the literature on optimal control)

E = \frac{1}{2} \int_{0}^{T} 〈 P (U (t)), U (t) 〉 d t

, induced by a positive definite operator

P

relative to the scalar product

〈, 〉

. Since

P

can be diagonalized by an orthonormal basis

B_{1}, \dots, B_{k}

in

k

, the affine-quadratic problem can be restated as an optimal problem over the system

\frac{d g}{d t} = g (t) (A + \sum_{i = 1}^{k} u_{i} (t) B_{i}) = X_{0} (g) + \sum_{i = 1}^{k} u_{i} (t) X_{i} (g),

(19)

with

X_{0} (g) = g A, X_{i} (g) = g B_{i}, i = 1, \dots k

, and

E = \frac{1}{2} \int_{0}^{T} \sum_{i = 1}^{k} λ_{i} u_{i}^{2} (t) d t

the energy of transfer (

λ_{1}, \dots, λ_{n}

are the eigenvalues of

P

). In the canonical case

E = \frac{1}{2} \int_{0}^{T} \sum_{i = 1}^{k} u_{i}^{2} (t) d t

.

Let us now single out some examples that are relevant for the results that follow.

1.4. Some Notable Examples

1.

G = S L (n), K = S O (n) .

In this situation we will assume that the Lie algebra

g = s l (n)

, that consists of

n \times n

matrices having zero trace, is endowed with the scalar product

〈 X, Y 〉 = \frac{1}{2} T r (X Y)

. Then

k = s o (n)

is the Lie algebra of K, and

p = s y m_{0} (n)

is the space of symmetric matrices in

g

. It is easy to verify that

〈, 〉

is positive on

p

and negative on

k = s o (n)

. Therefore G with its left-invariant metric induced by

〈, 〉

is a semi-Riemannian manifold.

Then the quotient space

M = G / K

will be identified with

P^{n}

, the space of positive-definite matrices of determinant one, through the action

τ_{g} (P) = g P g^{T}, g \in S L (n), P \in P^{n}

, where

g^{T}

is the matrix transpose of g. Since any positive definite matrix P with

D e t (P) = 1

can be written as

P = S S^{T}

for some

S \in S L (n)

the action is transitive, and

P^{n}

can be identified with the orbit through the identity I. Since the identity matrix I is both an element of

P^{n}

and the group identity in G, it is equal to the point o (

π (e) = o

). Horizontal curves are the solutions of

\frac{d g}{d t} = g (t) U (t), U (t) \in s y m_{0} (n)

. Any curve

α (t)

in

P^{n}

is the projection of a horizontal curve

g (t)

and the length of

α (t)

is given by

\int_{0}^{T} \sqrt{〈 U (t), U (t) 〉} d t

. Killing vector fields are given by

\vec{U} (P) = U P + P U^{T}

,

U \in s l (n)

and

P \in M

. The rolling distribution is given by

\frac{d g}{d t} = g (t) U (t), U (t) \in s y m_{0} (n), \frac{d p}{d t} = \vec{U} (t) (o) = 2 U (t)

(20)

The case

n = 2

is somewhat special, for then

P^{2}

is isometrically diffeomorphic to the Poincaré upper half plane

P = {z + i y : y > 0}

with its metric

\frac{1}{y} \sqrt{{\dot{x}}^{2} + {\dot{y}}^{2}}

. To elaborate, note that every

g \in S L (2)

can be written as

g = P R

where P is upper triangular and R is a rotation matrix. In fact, if

g = (\begin{matrix} a & b \\ c & d \end{matrix})

is an element of

S L (2)

then

(\begin{matrix} a & b \\ c & d \end{matrix}) = \frac{1}{\sqrt{c^{2} + d^{2}}} (\begin{matrix} 1 & a c + b d \\ 0 & c^{2} + d^{2} \end{matrix}) \frac{1}{\sqrt{c^{2} + d^{2}}} (\begin{matrix} d & - c \\ c & d \end{matrix}) .

(21)

Let now

F (x + i y) = g g^{T} = P P^{T} = \frac{1}{y} (\begin{matrix} x^{2} + y^{2} & x \\ x & 1 \end{matrix}),

(22)

where

P = (\begin{matrix} \frac{y}{\sqrt{y}} & \frac{x}{\sqrt{y}} \\ 0 & \frac{1}{\sqrt{y}} \end{matrix}) .

We will now show that F is an isometry from

P

with its Poincaré hyperbolic metric onto

P^{2}

with its G-invariant metric. If

\tilde{α} (t) = F (α (t)

then

\begin{matrix} \dot{\tilde{α}} (t) = \dot{P} P^{T} + P {\dot{P}}^{T} = P (P^{- 1} \dot{P} + {\dot{P}}^{T} {(P^{- 1})}^{T}) P^{T}, \end{matrix}

and therefore,

| | \dot{\tilde{α}} (t) | | = | | P^{- 1} \dot{P} + {\dot{P}}^{T} {(P^{- 1})}^{T} | |

. If

Y = \frac{y}{\sqrt{y}}

and

X = \frac{x}{\sqrt{y}}

, then an easy calculation shows that

P^{- 1} \dot{P} + {\dot{P}}^{T} {(P^{T})}^{- 1} = (\begin{matrix} 2 \frac{\dot{Y}}{Y} & \frac{X \dot{Y} + \dot{X} Y}{Y^{2}} \\ \frac{X \dot{Y} + \dot{X} Y}{Y^{2}} & - 2 \frac{\dot{Y}}{Y} \end{matrix}) = \frac{1}{y} (\begin{matrix} \dot{y} & \dot{x} \\ \dot{x} & - \dot{y} \end{matrix}),

and hence

| | \dot{\tilde{α}} (t) | | = \frac{1}{y} \sqrt{{\dot{x}}^{2} + {\dot{y}}^{2}}

. It follows that

| | \dot{α} (t) | | = | | \dot{\tilde{α}} (t) | |

and therefore F is an isometry.

It then follows that the rolling distribution has its isometric analogue on

P

rolling on the tangent space at i. In this scenario

S L (2)

acts on

P

via the Moebius transformations

τ_{g} (z) = \frac{a z + b}{c z + d}, g = (\begin{matrix} a & b \\ c & d \end{matrix})

, and

P

is represented by the orbit

{τ_{g} (i) : g \in S L (2)}

. Horizontal curves are the solutions of

\frac{d g}{d t} = g (t) (\begin{matrix} u_{1} (t) & u_{2} (t) \\ u_{2} (t) & - u_{1} (t) \end{matrix})

and their projections on

P

are given by

z (t) = g (t) (i)

. Then

\frac{d z (t)}{d t} |_{t = 0} = \frac{d}{d t} g (t) (i) {|_{t = 0} = \frac{d}{d t} \frac{1}{c^{2} + d^{2}} (b d + a c + i) |}_{t = 0} = 2 i (u_{1} - i u_{2}) .

Therefore, rolling motions are the solutions of

\frac{d g}{d t} = g (t) (\begin{matrix} u_{1} (t) & u_{2} (t) \\ u_{2} (t) & - u_{1} (t) \end{matrix}), \frac{d w}{d t} = 2 i (u_{1} (t) - i u_{2} (t)) .

(23)

2.

G = S O_{ϵ} (n + 1), K = {1} \times S O (n), ϵ = \pm 1

, where

S O_{ϵ} (1, n)

denotes the connected component of

S O (1, n)

that contains the group identity when

ϵ = - 1

, and

S O_{ϵ} (n + 1) = S O (n + 1)

, when

ϵ = 1

. Both cases can be treated in a uniform manner as follows.

Let

V_{ϵ}

denote

R^{n + 1}

with the scalar product (x,y)_ϵ = x₀y₀ + ϵ

\sum_{i = 1}^{n}

x_iy_i. Each SO_ϵ(n + 1) acts on

V_{ϵ}

by matrix multiplications, and each group is defined as the matrix group whose elements have a positive determinant and preserve the bilinear form

{(,)}_{ϵ}

. It follows that each g ∈ SO_ϵ(n + 1) satisfies g^TDg = D where D is a diagonal matrix with its diagonal entries equal to (1,ϵ,...,ϵ). Therefore, Det(g^T)Det(g)Det(D) = Det(D) which implies that Det(g) = 1. This shows that each of SO_ϵ(n + 1) is a subgroup of SL(n + 1).

We will let

S_{ϵ}^{n}

denote the Euclidean unit sphere when ϵ = 1 and the hyperboloid {x ∈ Rⁿ⁺¹:

x_{0}^{2} = 1 + \sum_{i = 1}^{n} x_{i}^{2}

, x₀ > 0} when ϵ = −1. In each case, SO_ϵ(n + 1) acts on

S_{ϵ}^{n}

by the left matrix multiplications on the points of

S_{ϵ}^{n}

written as column vectors. It can be shown that this action is transitive. When

S_{ϵ}^{n}

is represented by the orbit through

e_{0}

then the isotropy group

K = {g \in S O_{ϵ} (n + 1) : g e_{0} = e_{0}}

is equal to

{1} \times S O (n)

. Therefore,

S_{ϵ}^{n} = S O_{ϵ} (n + 1) / K .

with the natural projection

π

given by

π (g) = g e_{0} = τ_{g} (e_{0})

.

We will regard

G = S O_{ϵ} (n + 1)

as a semi-Riemannian subgroup of

S L (n + 1)

with its left-invariant metric introduced through the bilinear form

{〈 X, Y 〉}_{ϵ} = - \frac{ϵ}{2} T r (X Y)

(this metric is indefinite on

g_{ϵ}

when ϵ = −1 and is positive when ϵ = 1).

The following notations will be useful in describing the Cartan factors

k_{ϵ}

and

p_{ϵ}

. If a and b are any points in Rⁿ⁺¹ then a ⊗_ϵ b will denote the matrix defined by (a ⊗_ϵ b)x = (a,x)_ϵb, x ∈ ℝⁿ⁺¹, and then a ∧_ϵ b will denote the matrix a ⊗_ϵ b − b ⊗_ϵ a. Since ((a ∧_ϵ b)x,y)_ϵ + (x,(a ∧_ϵ b)y)_ϵ = 0, a ∧_ϵ b belongs to 𝔰𝔬_ϵ(^{n + 1}) for any a, b in ℝⁿ⁺¹.

It is easy to show that the Lie algebra

k

of K and its orthogonal complement

p_{ϵ}

are given by the following expressions:

p_{ϵ} = {U = u \land_{ϵ} e_{0} : {(u, e_{0})}_{ϵ} = 0},

(24)

k = {V = v \land_{ϵ} w : {(v, e_{0})}_{ϵ} = {(w, e_{0})}_{ϵ} = 0},

(25)

The preceding matrices can be also written as

U = (\begin{matrix} 0 & - {ϵ u}^{*} \\ u & 0 \end{matrix}), V = (\begin{matrix} 0 & 0 \\ 0 & v \land w \end{matrix}), u, v, w in R^{n} .

Horizontal curves are the solutions of

\frac{d g}{d t} = g (t) U (t), U (t) = u (t) \land_{ϵ} e_{o}, u (t) ⊥ e_{0},

that satisfy

{〈 〈 g (t) U (t), g (t) U (t) 〉 〉}_{ϵ} = {〈 U (t), U (t) 〉}_{ϵ} = \sum_{i = 1}^{n} u_{i}^{2} (t) .

Then

| | \dot{α} {(t) | |}_{ϵ}^{2} = ϵ {(\dot{α}, \dot{α})}_{ϵ} = ϵ {\dot{α}}_{0}^{2} + \sum_{i = 1}^{n} {\dot{α}}_{i}^{2}

is the natural metric on

S_{ϵ}^{n}

. We then have

\begin{matrix} | | \dot{α} {(t) | |}_{ϵ}^{2} = ϵ {(\dot{α} (t), \dot{α} (t))}_{ϵ} = ϵ {(g (t) u (t), g (t) u (t))}_{ϵ} = \sum_{i = 1}^{n} u_{i}^{2} (t) = {〈 U (t), U (t) 〉}_{ϵ}, \end{matrix}

hence the metric is

{S O}_{ϵ} (n + 1)

invariant, and

S_{ϵ}^{n}

with this metric is a semi-simple homogeneous manifold. It follows that the rolling distribution is given by

\frac{d g}{d t} = g (t) u (t) \land_{ϵ} e_{0}), \frac{d p}{d t} = (u (t) \land_{ϵ} e_{o}) e_{0} = u (t),

(26)

which agrees with 2.4 in ([6]).

2. Symplectic Background, Hamiltonian Systems

Let us now turn our attention to the extremal curves associated with our main problems. Because of the constraints present in these problems, the Maximum Principle of optimal control, rooted in the Hamiltonian formalism, is the only tool available for arriving to the appropriate extremal equations. However, in order to make an effective use of the Maximum Principle we will need to work with the symplectic form in a special system of coordinates that is well adapted for left-invariant optimal control problems (described in [3,8]) which calls for a brief review of symplectic geometry. Below is a brief summary of the symplectic material required for the main results.

Recall that a manifold M endowed with a non-degenerate and closed 2-form

ω

is called symplectic. The symplectic form induces a correspondence between functions and vector fields: every function f corresponds to a vector field

\vec{f}

defined by

ω (\vec{f}, X) = d f (X)

. In this context,

\vec{f}

is called the Hamiltonian vector field generated by f. Every symplectic manifold is even dimensional, and at each point of M there exists a neighbourhood with coordinates

(x_{1}, \dots, x_{n}, p_{1}, \dots, p_{n})

such that the Hamiltonian vector fields are given by

\vec{f} = \sum_{i = 1}^{n} \frac{\partial f}{\partial p_{i}} \frac{\partial}{\partial x_{i}} - \frac{\partial f}{\partial x_{i}} \frac{\partial}{\partial p_{i}} .

(25)

This choice of coordinates in which

\vec{f}

is represented by (27) is called symplectic.

Any cotangent bundle

T^{*} M

is a symplectic manifold endowed with its canonical symplectic form, usually written as

ω = d p \land d x

relative to a choice of symplectic coordinates

\sum_{i = 1}^{n} p_{i} d x_{i}

. As a symplectic manifold the cotangent bundle is somewhat special, it is a vector bundle at the same time. For that reason every vector field X on M can be lifted to a unique Hamiltonian vector field

{\vec{f}}_{X}

in

T^{*} M

via the function

f_{X} (ξ) = ξ (X (x))

,

ξ \in T_{x}^{*} M

. Vector field

{\vec{f}}_{X}

is called the Hamiltonian lift of X. The same procedure is applicable to any time varying vector field, and by extension to any differential system on M. Thus, any differential system in M can be lifted to a Hamiltonian system in

T^{*} M

. Then the Maximum Principle singles out the appropriate Hamiltonian lifts that govern the optimal solutions ([8]).

When the base manifold is a Lie group G, and when the underlying differential system is either left or right invariant, then there is privileged system of coordinates based on the realization of

T^{*} G

as

G \times g^{*}

, with

g^{*}

the dual of

g

, that preserves the left (or right) invariant symmetries and elucidates the conservation laws of the associated Hamiltonian system. The passage to these coordinates is explained below.

2.1. Left-Invariant Trivializations and the Symplectic Form

Having in mind applications involving left-invariant variational systems, the cotangent bundle

T^{*} G

and the tangent bundle

T G

will be represented as

G \times g^{*}

and

G \times g

via the left-translations. That is, tangent vectors

v \in T_{g} G

will be identified with the pairs

(g, V) \in G \times g

via the relation

v = d L_{g} V

. Similarly, linear functions

ξ \in T_{g}^{*} G

will be identified with pairs

(g, ℓ) \in G \times g^{*}

via

ξ = {d L_{g}^{- 1}}^{*} ℓ

, i.e.,

ξ (v) = ξ (d L_{g} V) = ℓ (V)

. Then

T (T^{*} G)

is naturally identified with the product

(G \times g) \times (g^{*} \times g^{*}) ≅ (G \times g^{*}) \times (g \times g^{*})

, with the understanding that an element

((g, ℓ), (A, a)) \in (G \times g^{*}) \times (g \times g^{*})

denotes the tangent vector

(A, a)

at the base point

(g, ℓ)

.

Note that

G \times g^{*}

is a Lie group in its own right since

g^{*}

is an abelian Lie group with the group multiplication given by the vector addition. Then left-invariant vector fields in

G \times g^{*}

are the left-translates of the pairs

(A, a)

in the Lie algebra

g \times g^{*}

of

G \times g^{*}

. In this formalism the flow associated with the left-invariant vector field

(g A, a)

in

G \times g^{*}

is given by

(g exp (t A), ℓ + t a)

. In terms of left-invariant vector fields

V_{1} = (A_{1}, a_{1})

and

V_{2} = (A_{2}, a_{2})

, the canonical symplectic form on

T^{*} G

is given by the following formula:

ω_{(g, ℓ)} (V_{1}, V_{2}) = a_{2} (A_{1}) - a_{1} (A_{2}) - ℓ ([A_{1}, A_{2}])

(28)

The above differential form is invariant under left-translations in

G \times g^{*}

, and is particularly revealing for the Hamiltonian vector fields generated by left-invariant functions on

G \times g^{*}

, that is, functions that satisfy

H (h g, ℓ) = H (g, ℓ) = H (e, ℓ)

for all

g, h \in G

and all

ℓ \in g^{*}

. Evidently, left-invariant functions on

G \times g^{*}

are in exact correspondence with functions in

C^{\infty} (g^{*})

.

Each left-invariant vector field

X (g) = d L_{g} X

,

X \in g

, lifts to a linear function

h_{X}

on

g^{*}

because

h_{X} (ξ) = ξ (X (g)) = ξ \circ (d L_{g}) (X) = ℓ (X), ξ \in T_{g}^{*} G

and each function H on

g^{*}

generates a Hamiltonian vector field

\vec{H}

on

G \times g^{*}

whose integral curves are the solutions of

\frac{d g}{d t} (t) = g (t) d H_{ℓ (t)}, \frac{d ℓ}{d t} (t) = - {ad}^{*} d H_{ℓ (t)} (ℓ (t)) .

(29)

Equation (29) can be easily verified by the following argument: when H is a function on

g^{*}

, then its differential at a point ℓ is a linear function on

g^{*}

, hence an element of

g

, because

g^{*}

is a finite dimensional vector space. If

{\vec{H}}_{(g, ℓ)} = (A (g, ℓ), a (g, ℓ))

for some vectors

A (g, ℓ) \in g

and

a (g, ℓ) \in g^{*}

, then

b (d H_{ℓ}) = b (A) - a (B) - ℓ [A, B],

must hold for any tangent vector

(B, b)

at

(g, ℓ)

. This implies that

A (g, ℓ) = d H_{ℓ}

, and

a = - {ad}^{*} d H_{ℓ} (ℓ)

, where

({ad}^{*} A) (B) (ℓ) = ℓ [A, B]

for all

B \in g

. Hence, (29) holds.

In a more general case where H is a function of both g and ℓ, the equations for

\vec{H}

are given by

\frac{d g}{d t} (t) = g (t) d H_{ℓ (t)}, \frac{d ℓ}{d t} (t) = - {ad}^{*} d H_{ℓ (t)} (ℓ (t)) - d H_{g} \circ d L_{g},

(30)

as can be easily verified through the relations

b (d H_{ℓ}) + d H_{g} \circ d L_{g} B = b (A) - a (B) - ℓ [A, B] .

This situation typically occurs in problems of mechanics in the presence of potential functions. For instance, the motion of a three-dimensional rigid body with a potential function

V : S O (3) \to R

is described by the Hamiltonian

H (R, ℓ) = H_{0} (ℓ) + V (α_{1}, α_{2}, α_{3})

where

α_{1}, α_{2}, α_{3}

denote the columns of the matrix transpose

R^{T}

of the rotation R in

S O (3)

. If

R (t) = R e^{t X}

is a curve in

S O (3)

defined by an element

X \in so (3)

, then

α_{i} (t) = R {(t)}^{T} e_{i} = e^{- t X} R^{T} e_{i} = e^{- t X} α_{i}

. Therefore,

d V (R X) = \sum_{i = 1}^{3} (\frac{\partial V}{\partial α_{i}}, \frac{d α_{i}}{d t}) |_{t = 0} = \sum_{i = 1}^{3} (\frac{\partial V}{\partial α_{i}}, - X α_{i}) = \sum_{i = 1}^{3} 〈 \frac{\partial V}{\partial α_{i}} \land α_{i}, X 〉

where

〈, 〉

is the standard inner product

- \frac{1}{2} T r (X Y)

in

so (3)

. Thus,

d H_{g} \circ d L_{g} = \sum_{i = 1}^{3} \frac{\partial V}{\partial α_{i}} \land α_{i}

is the external torque exerted by V. The corresponding equations of motion are given by

\frac{d g}{d t} (t) = g (t) d H_{0} (ℓ (t)), \frac{d ℓ}{d t} (t) = - {ad}^{*} d H_{0} (ℓ (t)) (ℓ (t)) + \sum_{i = 1}^{3} α_{i} \land \frac{\partial V}{\partial α_{i}} .

(31)

These equations extend to an “n-dimensional rigid body”

H (R, ℓ) = H_{0} (ℓ) + V (α_{1}, \dots, α_{n})

) with the external torque

\sum_{i = 1}^{n} α_{i} \land \frac{\partial V}{\partial α_{i}}

. This system of equations is usually written on the tangent bundle of

S O (n)

, represented as the product

S O (n) \times s o (n)

, as

\begin{matrix} \frac{d R}{d t} = R (t) Ω (t), \frac{d M}{d t} = [Ω (t), M (t)] + \sum_{i = 1}^{n} α_{i} \land \frac{\partial V}{\partial α_{i}} \\ P (Ω (t)) = M (t), α_{i} (t) = R^{T} (t) e_{i}, i = 1, \dots, n . \end{matrix}

(32)

In this context,

M (t)

is the generalization of the angular momentum,

Ω (t)

is the generalization of the angular velocity, and

P

is the generalization of the inertia tensor.

2.2. Poisson Manifolds, Coadjoint Orbits

We will now address the Poisson structure on

g^{*}

inherited from the symplectic form

ω

given by (28). Recall that a manifold M together with a bilinear, skew-symmetric form

{,} : C^{\infty} (M) \times C^{\infty} (M) \to C^{\infty} (M)

that satisfies

\begin{matrix} {f g, h} = f {g, h} + g {f, h}, (Leibniz’s rule), and \\ {f, {g, h}} + {h, {f, g}} + {g, {h, f}} = 0, (Jacobi’s identity), \end{matrix}

for all functions

f, g, h

on M, is called a Poisson manifold.

Every symplectic manifold is a Poisson manifold with the Poisson bracket defined by

{f, g} (p) = ω_{p} (\vec{f} (p), \vec{g} (p)), p \in M

. However, a Poisson manifold need not be symplectic, because it may happen that the Poisson bracket is degenerate at some points of M. Nevertheless, each function f on M induces a Poisson vector field

\vec{f}

through the formula

\vec{f} (g) = {f, g}

. It is known that every Poisson manifold is foliated by the orbits of its family of Poisson vector fields, and that each orbit is a symplectic submanifold of M with its symplectic form

ω_{p} (\vec{f}, \vec{h}) = {f, h} (p)

. (This foliation is known as a the symplectic foliation of M).

Proposition 1.

The dual

g^{*}

of a Lie algebra

g

is a Poisson manifold with the Poisson bracket

{f, h} (ℓ) = ℓ ([d h, d f], f, h i n C^{\infty} (g^{*}) .

Proof.

Functions on

g^{*}

coincide with the left-invariant functions on

G \times g^{*}

. Hence,

ω_{(g, ℓ)} (\vec{f}, \vec{h}) = ω_{(g, ℓ)} ((d f, 0), (d h, 0)) = - {ad}^{*} ([d f, d h]) (ℓ) = ℓ ([d h, d f]) .

It follows that the Poisson bracket on

g^{*}

is the restriction of the canonical Poisson bracket on

G \times g^{*}

to the left-invariant functions. As such it automatically satisfies the properties of a Poisson manifold. □

In the literature on integrable systems, Poisson bracket

{f, h} (ℓ) = ℓ ([d f, d h])

is often referred as the Lie-Poisson bracket ([14]). We have taken its negative so that Poisson vector fields agree with the projections of the Hamiltonian vector fields generated by left-invariant functions (and also agree with the sign convention in [7,8]).

It follows that each function H on

g^{*}

defines a Poisson vector field

\vec{H}

on

g^{*}

through the formula

\vec{H} (f) (ℓ) = {H, f} (ℓ) = - ℓ ([d H_{ℓ}, d f]) = - a d^{*} d H_{l} (d f)

. The integral curves of

\vec{H}

are the solutions of

\frac{d ℓ}{d t} (t) = - {ad}^{*} d H_{ℓ (t)} (ℓ (t))

(33)

That is, each function H on

g^{*}

may be considered both as a Hamiltonian on

T^{*} G

, as well as a function on the Poisson space

g^{*}

. It follows that the Poisson equations of the associated Poisson field are the projections of the Hamiltonian Equation (29) on

g^{*}

.

Solutions of Equation (33) are intimately linked with the coadjoint orbits of G. We recall that the coadjoint orbit of G through a point

ℓ \in g^{*}

is given by

{Ad}_{g}^{*} (ℓ) = {ℓ \circ {Ad}_{g^{- 1}}, g \in G} .

The following proposition is a paraphrase of A.A. Kirillov’ fundamental contributions to the Poisson structure of

g^{*}

([25]).

Proposition 2.

Let

F

denote the family of Poisson vector fields on

g^{*}

and let

M = O_{F} (ℓ_{0})

denote the orbit of

F

through a point

ℓ_{0} \in g^{*}

. Then M is equal to the connected component of the coadjoint orbit of G that contains

ℓ_{0}

. Consequently each coadjoint orbit is a symplectic submanifold of

g^{*}

.

The fact that the Poisson equations evolve on coadjoint orbits implies useful reductions in the theory of Hamiltonian systems with symmetries. Our main results will make use of this fact.

2.3. Representation of Coadjoint Orbits on Lie Algebras- Semi-Simple vs. Semi-Direct

On semi-simple Lie groups, the Killing form, or any scalar multiple of it

〈, 〉

, is non-degenerate, and can be used to identify linear functions ℓ on

g

with points

L \in g

via the formula

〈 L, X 〉 = ℓ (X)

,

X \in g

. Then Poisson Equation (33) can be expressed dually on

g

as

\frac{d L}{d t} = [d H, L] .

(34)

The argument is simple:

〈 \frac{d L}{d t}, X 〉 = \frac{d ℓ}{d t} (X) = - ℓ ([d H, X]) = 〈 L, [X, d H] 〉 = 〈 [d H, L], X 〉 .

Since X is arbitrary, Equation (34) follows.

Under the above identification coadjoint orbits are identified with the adjoint orbits

O (L_{0}) = {g L_{0} g^{- 1} : g \in G}

, and the Poisson vector fields

{\vec{f}}_{X} (ℓ) = - {ad}^{*} X (ℓ)

are identified with vector fields

\vec{X} (L) = [X, L]

. Each vector field

[X, L]

is tangent to

O (L_{0})

at L, and

ω_{L} ([X, L], [Y, L]) = 〈 L, [Y, X] 〉

,

X, Y

in

g

is the symplectic form on each orbit

O (L_{0})

.

In a reductive semi-simple Lie group G with a subgroup K there is also the semi-direct product

G_{0} = p ⋊ K

, described earlier in the introduction. Then Poisson equations on

g_{0}^{*} = {(p ⋊ k)}^{*}

can be also represented on

g_{0}

via the quadratic form

〈, 〉

as in the semi-simple case, but the resulting expression takes on a slightly different form. To see the difference, let

d H = d H_{p} + d H_{k}

and

L = L_{p} + L_{k}

denote the decompositions of

d H

and L onto the factors

p

and

k

. On the semi-direct product,

\begin{matrix} 〈 \frac{d L_{p}}{d t}, X_{p} 〉 + 〈 \frac{d L_{k}}{d t}, X_{k} 〉 = 〈 \frac{d L}{d t}, X 〉 = \frac{d ℓ}{d t} (X) = - ℓ ([d H, X]) = \\ - 〈 L, [d H, X] 〉 = - 〈 L, [d H_{p}, X_{k}] + [d H_{k}, X_{p}] + [d H_{k}, X_{k}] 〉 \\ = - 〈 L_{p}, [d H_{p}, X_{k}] + [d H_{p}, X_{k}] 〉 - 〈 L_{k}, [d H_{k}, X_{k}] = \\ 〈 X_{k}, [d H_{k}, L_{k}] + [d H_{p}, L_{p}] 〉 + 〈 X_{p}, [d H_{k}, L_{p}] 〉 . \end{matrix}

Hence, the Poisson equations are given by

\frac{d L_{k}}{d t} = [d H_{k}, L_{k}] + [d H_{p}, L_{p}], \frac{d L_{p}}{d t} = [d H_{k}, L_{p}] .

(35)

This equation can be combined with the equations for the semi-simple case in terms of the parameter s with

\frac{d L_{k}}{d t} = [d H_{k}, L_{k}] + [d H_{p}, L_{p}], \frac{d L_{p}}{d t} = [d H_{k}, L_{p}] + s [d H_{k}, L_{p}], s = 0, 1 .

(36)

One can show that the coadjoint orbit through

P_{0} \in p, Q_{0} \in k

under the action of

G_{0} = p ⋊ K

consists of pairs

(P, Q)

P = A d_{h} (P_{0}), Q = [A d_{h} (P_{0}), X] + A d_{h} (Q_{0}), (X, h) \in G_{0},

(37)

when

ℓ_{0} \in g_{s}^{*}

is identified with

L_{0} = P_{0} + Q_{0}

in

g_{0}

, and when

ℓ = {Ad}_{(X, h)}^{*} (ℓ_{0})

is identified with

L = P + Q

([8]).

The adjoint orbits of a non-compact semi-simple Lie group G are often symplectomorphic with the cotangent bundles of manifolds ([26]). It appears that the same is true for coadjoint orbits under the action of semi-direct products. We will now single out two such situations which are relevant for the connections to mechanical tops.

Return now to

G = S O_{ϵ} (n + 1)

and

K = {1} \times S O (n)

introduced in Example 2.

Proposition 3.

The coadjoint orbit

O (P_{0})

through

P_{0} = p_{0} \land_{ϵ} e_{0}, {(p_{0}, e_{0})}_{ϵ} = 0, Q_{0} = 0

under the action of the semi-direct product

p_{ϵ} ⋊ K

is diffeomorphic to the tangent bundle of the connected component of the “sphere”

S_{ϵ}^{n} = {p \in R^{n + 1} : {(p, p)}_{ϵ} = {(p_{0}, p_{0})}_{ϵ}}

that contains

p_{0}

.

Proof.

Let

h \in K

, and

X = x \land e_{o}, {(x, e_{0})}_{ϵ} = 0

. Then

\begin{matrix} P = A d_{h} (P_{0}) = h (p_{0}) \land_{ϵ} h (e_{0}) = p \land_{ϵ} e_{0}, p = h (p_{0}) \\ Q = [A d_{h} (P_{0}), X] = [p \land_{ϵ} e_{0}, x \land_{ϵ} e_{0}] = p \land_{ϵ} x = p \land_{ϵ} x_{p}^{⊥}, \end{matrix}

where

x_{p}^{⊥}

is the projection of x on the orthogonal complement of p in

R^{n + 1}

. Therefore,

(p, x_{p}^{⊥}) \Rightarrow p \land_{ϵ} e_{0} + p \land x_{p}^{⊥}

is the desired diffeomorphism from the tangent bundle of the connected sphere

S_{ϵ}^{n}

onto the coadjoint orbit

{A d_{h} (P_{0}) + [A d_{h} (P_{0}), X], (X, h) \in p_{ϵ} ⋊ K}

. □

The above diffeomorphism is actually a symplectomorphism from the cotangent bundle of either the Euclidean sphere

S^{n}

when

ϵ = 1

, or the hyperboloid of one sheet when

ϵ = - 1

, to the appropriate coadjoint orbit, but we will not go into these details. ([8]).

We will now turn our attention to the reductive pair

G = S L (n), K = S O (n)

(Example 1) and the coadjoint orbit through a symmetric matrix

P_{0}

with distinctive non-zero eigenvalues

α_{1}, \dots, α_{k}

under the action of

G_{o} = p ⋊ S O (n)

. We recall that

s l (n) = s o (n) \oplus p

where

p

is the space of symmetric

n \times n

matrices of trace zero. Every symmetric

n \times n

matrix S can be written as

S = S_{0} + \frac{T r (S)}{n} I

,

S_{0} \in p

. An easy inspection of (37) shows that the orbit through S differs by a constant factor

\frac{T r (S)}{n} I

from the orbit through

S_{0}

. So the zero-trace requirement is inessential for the structure of coadjoint orbits.

Proposition 4.

The coadjoint orbit through

P_{0}

given by

P = A d_{h} (P_{0}), Q = [A d_{h} (P_{0}), X], (X, h) \in p ⋊ S O (n)

is diffeomorphic to the tangent bundle of the flag manifold

F (1, 2, \dots, k)

consisting of subspaces

V_{1} \subset V_{2} \dots \subset V_{k}

with

d i m V_{i} = i

.

Sketch of the proof: Let

P_{0}

denote a symmetric matrix with distinct non-zero eigenvalues

α_{1} < α_{2} \dots < α_{k}

. Then

P_{0}

can be identified with a point

(V_{1} \subset V_{2} \dots \subset V_{k})

in

F (1, \dots, k)

where each subspace

V_{i}

is equal to the linear span of unit eigenvectors

a_{1}, \dots, a_{i}

of

P_{0}

. If

P_{0}

is represented by the matrix

\sum_{i = 1}^{k} α_{i} (a_{i} \otimes a_{i})

, then

A d_{h} (P_{0})

is represented by the matrix

\sum_{i = 1}^{k} α_{i} (h (a_{i}) \otimes h (a_{i}))

that corresponds to the point

F_{h} = (h V_{1} \subset h V_{2} \dots \subset h V_{k})

in

F (1, \dots, k)

. The correspondence

A d_{h} (P_{0}) \to F_{h}

is a diffeomorphism from the orbit

{A d_{h} (P_{0}), h \in S O (n)}

onto

F (1, \dots, k)

.

Let now

S t_{k}^{n}

denote the Stiefel manifold of k-orthonormal frames

[a_{1}, \dots, a_{k}]

in

R^{n}

. Points of

S t_{k}^{n}

can be represented by

n \times k

matrices M with columns

a_{1}, \dots, a_{k}

that satisfy

M^{T} M = I_{k}

, where

M^{T}

denotes the matrix transpose of M, and where

I_{k}

is the k-dimensional identity matrix. Let

ϕ : S t_{k}^{n} \to F (1, \dots, k)

be the embedding

M = [a_{1}, \dots, a_{k}] \to F_{M} = (V_{1} \subset V_{2} \dots \subset V_{k}), V_{i} = < a_{1}, \dots, a_{i} > .

Then

ϕ^{- 1} (F_{M}) = M D

, where D is a diagonal

k \times k

matrix with its diagonal entries equal to

\pm 1

. Therefore,

F (1, \dots, k)

is a covering space for

S t_{k}^{n}

, and hence

F (1, \dots, k)

and

S t_{k}^{n}

are locally diffeomorphic, that is, every point

M \in S t_{k}^{n}

admits an open neighbourhood U such that the restriction of

ϕ

to U is a diffeomorphism onto

ϕ (U)

. It follows that tangent vectors at a point M can be identified with

n \times k

matrices

\dot{M}

that satisfy

{\dot{M}}^{T} M + M^{T} \dot{M} = 0

.

Let now U be an open set in

S t_{k}^{n}

such that

ϕ

restricted to U is a diffeomorphism onto

ϕ (U)

. For every

F_{h} \in ϕ (U)

,

A d_{h} (P_{0})

is identified with

M = [m_{1}, \dots, m_{k}], m_{i} = h (a_{i}), i = 1, \dots, k

. Then

Q = [{Ad}_{h} (P), X] = \sum_{i = 1}^{k} [α_{i} (m_{i} \otimes m_{i}), X] = \sum_{i = 1}^{k} y_{i} \land m_{i},

with

y_{i} = α_{i} X (m_{i})

. Since X is symmetric,

α_{j} (y_{i}, m_{j}) = α_{i} (m_{i}, y_{j})

. Moreover,

y_{i}

could be replaced by its orthogonal projection on

m_{i}^{⊥}

without altering the value of Q. So we may assume that

(y_{i}, m_{i}) = 0, i = 1, \dots, k .

It follows that

M^{T} Q M + M^{T} Q^{T} M = 0

, hence

\dot{M} = Q^{T} M

is a tangent vector at M. The pairs

(M, Q^{T} M)

are parametrized by the entries of M and the entries of the matrix Y. The columns

y_{i} = α_{i} X m_{i}

of Y satisfy

k (k + 1)

constraints

α_{j} (y_{i}, m_{j}) = α_{i} (y_{j}, m_{i}), i \neq j

, and

(y_{i}, m_{i}) = 0

. This implies that the manifold of pairs of

n \times k

matrices

(M, Y)

subject to the constraints

(m_{i}, m_{j}) = δ_{i j}, α_{j} (y_{i}, m_{j}) = α_{i} (y_{j}, m_{i}), i \neq j, (y_{i}, m_{i}) = 0,

(38)

is of the same dimension as the tangent bundle of

S t_{k}^{n}

. Therefore, the correspondence

\sum_{i = 1}^{k} α_{i} (m_{i} \otimes m_{i}), \sum_{i = 1}^{k} y_{i} \land m_{i} \to (M, Q^{T} M)

is one to one and onto the sub-bundle

T U

over U.

Corollary 1.

If

P_{0}

is the orthogonal projection on a k-dimensional vector space, i.e., if

P_{0} = \sum_{i = 1}^{k} a_{i} \otimes a_{i}

, for some orthonormal vectors

a_{1}, \dots, a_{k}

, then the coadjoint orbit through

P_{0}

under the action of the semi-direct product

p ⋊ S O_{n}

is diffeomorphic to the tangent bundle of the oriented Grassmannian

G r_{k}^{n}

.

Here

P_{0}

is identified with the flag consisting of a single k-dimensional vector space

V_{k}

spanned by

a_{1}, \dots, a_{k}

. Then

{(h V_{k}), h \in S O (n)}

is diffeomorphic to the oriented Grassmannians

G r_{k}^{n}

.

Note 1.

Proposition 4 is a correction to Proposition 10.2 on page 170 in [8] which incorrectly states that the coadjoint orbit through

P_{0}

is the Steifel

S t_{k}^{n}

rather than the flag manifold

F (1, 2, \dots, k)

.

3. Hamiltonian and Poisson Systems: Extremal Curves

We now come to the central part of the paper, the Hamiltonian systems associated with our optimal control problems,

3.1. Rolling Hamiltonians

Recall the rolling problem Equation (17),

\frac{d g}{d t} = g (t) (\sum_{i = 1}^{m} u_{i} (t) A_{i}), \frac{d p}{d t} = \sum_{i = 1}^{m} u_{i} (t) {\vec{A}}_{i} (o),

and the associated optimal control problem of minimizing the energy function

\frac{1}{2} \int_{0}^{T} \sum_{i = 1}^{m} u_{i}^{2} (t) d t

. Our immediate aim is to use the Maximum Principle to obtain the equations for the extremal curves in the cotangent bundle

T^{*} G

of the configuration space

G

. To emphasize the structure of the problem, we will rewrite (17) as

\frac{d g}{d t} = \sum_{i = 1}^{m} u_{i} (t) X_{i} (g),

(39)

where each

X_{i}

a left-invariant vector field

X_{i} (g) = (g A_{i}, {\vec{A}}_{i} (o))

,

g = (g, p)

. If

g (t)

is an optimal trajectory then, according to the Maximum Principle,

g (t)

is the projection of an extremal curve

ξ (t)

in

T^{*} G

along which the cost extended Hamiltonian

- \frac{λ}{2} \sum_{i = 1}^{m} u_{i}^{2} (t) + \sum_{i = 1}^{m} u_{i} (t) H_{i} (ξ (t)), λ = 0, 1

is maximal relative to all competing controls. In this notation, each

H_{i}

is the Hamiltonian lift of

X_{i}

, i.e.,

H_{i} (ξ (t)) = ξ (t) (X_{i} (g (t))

. In the abnormal case,

λ = 0

, the Maximum principle results in the constraints

H_{i} (ξ (t)) = 0, i = 1, \dots, m,

(40)

while in the normal case,

λ = 1

, the maximality condition implies that the optimal controls are of the form

u_{i} (t) = H_{i} (ξ (t))

, in which case the corresponding optimal solutions are the projections of the solution curves of a single Hamiltonian vector field

\vec{H}

generated by the Hamiltonian

H (ξ) = \frac{1}{2} \sum_{i = 1}^{m} H_{i}^{2} (ξ) .

(41)

This Hamiltonian is left-invariant in the representation

T^{*} G = G \times G^{*}

and hence its Hamiltonian equations are given by the Equation (29), that is,

\frac{d g}{d t} = \sum_{i = 1}^{n} H_{i} (ℓ (t)) X_{i} (g (t)), \frac{d ℓ}{d t} = - a d^{*} d H (ℓ (t)) (ℓ (t))

We will now concentrate on the solutions of the associated Poisson equation

\frac{d ℓ}{d t} = - a d^{*} d H (ℓ (t)) (ℓ (t))

(42)

Let us first expand on the structure of the coadjoint orbits in this situation. Since

\hat{M}

is a Euclidean vector space, its tangent space at the origin can be identified with

\hat{M}

. Then the Lie algebra

G

can be identified with

g \times \hat{M}

, and its dual can be identified with

G^{*} = g^{*} \oplus {\hat{M}}^{*}

, where

g^{*} = {ℓ \in G^{*} : ℓ (\dot{p}) = 0, \dot{p} \in \hat{M}}, {\hat{M}}^{*} = {ℓ \in G^{*} : ℓ (g) = 0} .

It then follows that every

ℓ \in G^{*}

can be written as

ℓ = ℓ_{1} + ℓ_{2}

with

ℓ_{1} \in g^{*}

and

ℓ_{2} \in {\hat{M}}^{*}

. Since

\hat{M}

is a vector space, and therefore an abelian algebra, the projection

ℓ_{2}

on

{\hat{M}}^{*}

is constant on each coadjoint orbit of

G

. The argument is straightforward: if

g = (g, p)

, then

A d_{g}^{*} (ℓ) (X + \dot{p}) = ℓ (A d_{g^{- 1}} (X + \dot{p})) = ℓ (A d_{g^{- 1}} (X) + \dot{p}) = ℓ_{1} (A d_{g^{- 1}} (X)) + ℓ_{2} (\dot{p}),

It follows that the coadjoint orbits in

G

are of the form

{A d_{g}^{*} (ℓ_{1}) : g \in G} + ℓ_{2}, for any ℓ = ℓ_{1} + ℓ_{2} .

This fact can be also verified directly from Equation (42): we have

\frac{d ℓ}{d t} V = - ℓ [d H, V], for any V = X + \dot{p} in G,

where

d H = \sum_{i = 1}^{m} H_{i} (ℓ) (A_{i} + {\vec{A}}_{i} (o))

and

H_{i} (ℓ) = ℓ_{1} (A_{i}) + ℓ_{2} ({\vec{A}}_{i} (o))

. Therefore,

\frac{d ℓ_{1}}{d t} (X) + \frac{d ℓ_{2}}{d t} (\dot{p}) = - (ℓ_{1} + ℓ_{2}) ([d H, X + \dot{x}]) = - \sum_{i = 1}^{m} H_{i} (ℓ_{i}) [A_{i}, X] .

from which follows that

\frac{d ℓ_{1}}{d t} (X) = - \sum_{i = 1}^{n} H_{i} (ℓ_{i}) [A_{i}, X], X \in g, \frac{d ℓ_{2}}{d t} (\dot{p}) = 0 .

Since

\dot{p}

is arbitrary

\frac{d ℓ_{2}}{d t} = 0

.

To uncover other constants of motion, identify

G^{*}

with

G

via the natural quadratic forms on each of the factors, and then recast the preceding equations on

G

. More precisely, identify each

ℓ_{2}

in

{\hat{M}}^{*}

with a tangent vector

l = \sum_{i = 1}^{m} l_{i} {\vec{A}}_{i} (o)

via the formula

ℓ_{2} (\dot{p}) = (l, \dot{p}), \dot{p} \in \hat{M}

. Similarly, identify

ℓ_{1} \in g^{*}

with

L \in g

via the formula

ℓ_{1} (X) = 〈 L, X 〉, X \in g

. Then decompose

L \in g

into the sum

L = L_{p} + L_{k}

,

L_{p} \in p

and

L_{k} \in k

. Relative to the basis

A_{1}, \dots, A_{m}

in

p

,

L_{p} = \sum_{i = 1}^{m} P_{i} A_{i}

where

P_{i} = ℓ_{1} (A_{i}) = 〈 L, A_{i} 〉

. It follows that

H_{i} (ξ) = ℓ (A_{i} + {\vec{A}}_{i} (o)) = ℓ_{1} (A_{i}) + ℓ_{2} ({\vec{A}}_{i} (o)) = P_{i} + l_{i},

and

\begin{matrix} \frac{d ℓ_{1}}{d t} (X) = 〈 \frac{d L}{d t}, X 〉 = - 〈 L, [\sum_{i = 1}^{m} (l_{i} + P_{i}) A_{i}, X] 〉 = - 〈 [L, \sum_{i = 1}^{m} (l_{i} + P_{i}) A_{i}], X 〉, \\ (\frac{d l}{d t}, \dot{p}) = \frac{d ℓ_{2}}{d t} (t) (\dot{p}) = 0 \end{matrix}

Since X and

\dot{p}

are arbitrary,

\frac{d L}{d t} = [\sum_{i = 1}^{m} (l_{i} + P_{i}) A_{i}, L] = [A + L_{p}, L], A = \sum_{i = 1}^{m} l_{i} A_{i}, \frac{d l}{d t} = 0 .

(43)

Equation (43) constitutes the Poisson equations on

G

generated by the Hamiltonian

H = \frac{1}{2} \sum_{i = 1}^{m} H_{i}^{2} = \frac{1}{2} \sum_{i = 1}^{m} {(l_{i} + P_{i})}^{2}

. Note that in this identification of the Lie algebras with their duals, coadjoint orbits

{A d_{g}^{*} (ℓ_{1}) + ℓ_{2} : g \in G}

are identified with the affine sets

{A d_{g} (L) + l : g \in G}

. Coupled with

\frac{d g}{d t} = g (t) (A + L_{p}), \frac{d p}{d t} = \sum_{i = 1}^{n} (l_{i} + P_{i}) {\vec{A}}_{i} (o),

(44)

Equation (43) constitutes the extremal equations for the rolling geodesics. Each extremal curve projects onto a geodesic

g (t) = (g (t), p (t))

, and each geodesic further projects onto the pair of curves

α (t) = τ_{g (t)} (o)

in M and

β (t) = p (t)

in

\hat{M}

that are rolled upon each other by

g (t)

.

3.2. Affine-Quadratic Hamiltonian

Similar to the rolling problem, the Maximum Principle reveals that the normal extremals of the affine-quadratic system (18) are the integral curves of the Hamiltonian vector field

\vec{H}

associated with the Hamiltonian function

H (L) = \frac{1}{2} 〈 P^{- 1} L_{k}, L_{k} 〉 + 〈 A, L_{p} 〉,

where as before

L = L_{p} + L_{k}

is the decomposition of

L \in g

onto the factors

p

and

k

. In the canonical case

P = I

, and in the representation

T^{*} G = G \times g^{*}

, the Hamiltonian equations generated by H are then given by

\frac{d g}{d t} = g (t) (A + U (t)), U (t) = L_{k} (t), \frac{d L}{d t} = [d H, L] = [A + L_{k}, L],

(45)

The Poisson equation

\frac{d L}{d t} = [d H, L]

can be written in expanded form as

\frac{d L_{k}}{d t} = [A, L_{p}], \frac{d L_{p}}{d t} = [L_{k}, L_{p}] + [A, L_{k}] = [A - L_{p}, Ł_{k}]

(46)

The “shadow” problem generates an analogous Hamiltonian on the tangent bundle of the semi-direct product

G_{0} = p ⋊ K

with its extremal equations given by:

\frac{d x}{d t} = A d_{R (t)} A, \frac{d R}{d t} = R (t) L_{k} (t), \frac{d L_{k}}{d t} = [A, L_{p}], \frac{d L_{p}}{d t} = [L_{k}, L_{p}] .

(47)

Here

g (t) = (x (t), R (t))

, and

\frac{d g}{d t} = g (t) (A + L_{k} (t))

is the same as

\frac{d x}{d t} = A d_{R (t)} A, \frac{d R}{d t} = R (t) L_{k} (t)

.

The propositions below reveal a remarkable fact that the Poisson equations of a canonical affine-quadratic Hamiltonian can always be regarded as an invariant subsystem of the Poisson equations associated with a rolling Hamiltonian. We will use bold letters when referring to the variables in the rolling Hamiltonian in contrast to the variables in the affine-quadratic Hamiltonian.

Proposition 5.

Let

g (t) = (g (t), p (t)), L_{p} (t), L_{k} (t)

be any integral curve of the rolling Hamiltonian

H = \frac{1}{2} | | A + L_{p} {| |}^{2}

, that is,

\begin{matrix} \frac{d g}{d t} = g (t) (A + L_{p} (t)), \frac{d p}{d t} = \sum_{i = 1}^{m} (l_{i} + P_{i}) {\vec{A}}_{i} (o), \\ \frac{d L_{k}}{d t} = [A, L_{p}], \frac{d L_{p}}{d t} = [A + L_{p}, L_{k}], A = \sum_{i = 1}^{m} l_{i} A_{i} \end{matrix}

Then

\tilde{g} (t) = g (t) h (t), L_{p} (t) = A d_{h^{- 1} (t)} (L_{p} (t)), L_{k} = A d_{h^{- 1} (t)} (L_{k} (t))

(48)

is an integral curve of the affine Hamiltonian

H = \frac{1}{2} 〈 L_{k}, L_{k} 〉 + 〈 A, L_{p} 〉

, where

A = A d_{h^{- 1} (t)} (A + L_{p} (t))

, and

h (t)

is the solution of

\frac{d h}{d t} = L_{k} (t) h (t)

with

h (0) = I

.

Moreover,

\tilde{g} (t) = (x (t), R (t))

in

p ⋊ K

with

R (t) = h (t)

and

x (t)

a solution of

\frac{d x}{d t} = A + L_{p} (t)

is the projection of an extremal curve

L_{k} (t) = A d_{h^{- 1} (t)} L_{k} (t), L_{p} (t) = A d_{h^{- 1} (t)} (L_{p} (t)) - A, A = A d_{h^{- 1} (t)} (A + L_{p} (t))

associated with the shadow Hamiltonian

H = \frac{1}{2} 〈 L_{k}, L_{k} 〉 + 〈 A, L_{p} 〉

.

Proof.

If A is any element in

p

then

\frac{d}{d t} A d_{h (t)} (A) = [A d_{h (t)} (A), L_{k}]

. Since

\frac{d}{d t} (A + L_{p} (t)) = [A + L_{p} (t), L_{k} (t)]

,

A d_{h (t)} (A)

and

A + L_{p} (t)

are the solutions of the same differential equation they will be equal to each other whenever

A d_{h (0)} (A) = A + L_{p} (0)

, that is, when

A = A + L_{p} (0)

.

Assume that

A d_{h (t)} (A) = A + L_{p} (t)

. Then,

\begin{matrix} \frac{d \tilde{g}}{d t} = g (t) (A + L_{p} (t)) h (t) + g (t) L_{k} (t) h (t) = \\ \tilde{g} (t) (A d_{h^{- 1} (t)} (A + L_{p} (t)) + A d_{h^{- 1} (t)} L_{k} (t)) = \tilde{g} (t) (A + L_{k} (t)) . \end{matrix}

Additionally,

\begin{matrix} \frac{d L_{p}}{d t} = \frac{d}{d t} A d_{h^{- 1} (t)} (L_{p} (t)) = A d_{h^{- 1} (t)} ([L_{k}, L_{p}]) + A d_{h^{- 1} (t)} ([A + L_{p} (t), L_{k} (t)]) \\ = A d_{h^{- 1} (t)} [A, L_{k} (t)] = [A d_{h^{- 1} (t)} A, A d_{h^{- 1} (t)} L_{k} (t)] = [A - A d_{h^{- 1} (t)} L_{p} (t), L_{k} (t)] = \\ [A - L_{p} (t), L_{k} (t)], \end{matrix}

and

\begin{matrix} \frac{d L_{k}}{d t} = \frac{d}{d t} A d_{h^{- 1} (t)} (L_{k} (t)) = A d_{h^{- 1} (t)} [A, L_{p} (t)] = [A d_{h^{- 1} (t)} A, A d_{h^{- 1} (t)} (L_{p} (t)] = \\ [A - A d_{h^{- 1} (t)} (L_{p} (t)), A d_{h^{- 1} (t)} (L_{p} (t))] = [A, L_{p} (t)] . \end{matrix}

As to the proof of the second statement, note that

\dot{x} (t) = A d_{R (t)} A = A + L_{p} (t)

and

\frac{d R}{d t} = L_{k} (t) R (t) = R (t) L_{k} (t)

is a solution of

\frac{d \tilde{g}}{d t} = \tilde{g} (t) (A + L_{k} (t))

as remarked in Equation (47). An argument identical to the one above shows that

\frac{d L_{k}}{d t} = [A, L_{p} (t)], \frac{d L_{p}}{d t} = [L_{k} (t), L_{p} (t)] .

□

The converse also holds as this proposition demonstrates.

Proposition 6.

Suppose that

(\tilde{g} (t), L_{p} (t), L_{k} (t))

is an extremal curve of the affine Hamiltonian

H = \frac{1}{2} 〈 L_{k}, L_{k} 〉 + 〈 A, L_{p} 〉

. Then

\begin{matrix} g (t) = ((\tilde{g} (t) h^{- 1} (t), p (t)), \frac{d p}{d t} = \vec{A} (o) + {\vec{L}}_{p} (o), \frac{d h}{d t} = h (t) (L_{k} (t)) \\ L_{p} (t) = A d_{h (t)} (L_{p} (t)), L_{k} (t) = A d_{h (t)} (L_{k} (t)), A = A d_{h (t)} (A - L_{p} (t)) \end{matrix}

is an extremal curve of the rolling Hamiltonian

H = \frac{1}{2} 〈 A + L_{p}, A + L_{p} 〉

.

However, if

\tilde{g} (t) = (x (t), R (t))

,

L_{p} (t)

and

L_{k} (t)

is an extremal curve of the shadow Hamiltonian H, then

\begin{matrix} \frac{d g}{d t} = g (t) A d_{R (t)} (A)), \frac{d p}{d t} = \vec{\frac{d x}{d t}} (o) \\ L_{p} (t) = A d_{R (t)} (A + L_{p} (t)), L_{k} (t) = A d_{R (t)} (L_{k} (t)) \end{matrix}

is an extremal equation of the Hamiltonian

H = \frac{1}{2} 〈 A + L_{p}, A + L_{p} 〉

with

A d_{R (t)} A = A + L_{p} (t)

.

Proof.

The proof of the first part is essentially the same as in the previous proposition.

In the second part, we have

\frac{d x}{d t} = A d_{R} (t) (A), \frac{d R}{d t} = R (t) L_{k} (t), \frac{d L_{p}}{d t} = [L_{k}, L_{p}], \frac{d L_{k}}{d t} = [A, L_{p}] .

Then

\frac{d}{d t} A d_{R (t)} (L_{p} (t)) = A d_{R (t)} ([L_{p}, L_{k}]) + A d_{R (t)} ([L_{k}, L_{p}]) = 0

.

Let

A d_{R (t)} (L_{p} (t)) = - A

so that

A d_{R (t)} (A) = A + L_{p} (t)

. It follows that

\frac{d g}{d t} = g (t) (A + L_{p} (t))

and

\frac{d x}{d t} = A d_{R (t)} (A) = A + L_{p} (t)

. Hence,

\frac{d p}{d t} = \vec{A} (o) + {\vec{L}}_{p} (o) = \vec{\frac{d x}{d t}} (o) .

Additionally,

\begin{matrix} \frac{d L_{p}}{d t} = \frac{d}{d t} A d_{R (t)} (A + L_{p} (t)) = A d_{R (t)} ([A + L_{p}, L_{k}]) + A d_{R (t)} ([L_{k}, L_{p}]) = \\ A d_{R (t)} ([A, L_{k}]) = [A d_{R (t)} (A), A d_{R (t)} (L_{k})] = [A + L_{p} (t), L_{k} (t)], \\ \frac{d L_{k}}{d t} = \frac{d}{d t} A d_{R (t)} (L_{k} (t)) = A d_{R (t)} ([A, L_{p} (t)]) = [A d_{R (t)} (A), A d_{R (t)} (L_{p} (t)]) = \\ [A + L_{p} (t), - A] = [A, L_{p} (t)] . \end{matrix}

□

The above shows that the Poisson systems generated by any affine-quadratic Hamiltonian are invariant subsystems of the rolling Hamiltonians. To summarize, let

L (t) = L_{p} (t) + L_{k} (t)

denote an integral curve of the rolling Hamiltonian

H = \frac{1}{2} 〈 A + L_{p}, A + L_{p} 〉

. If

h (t)

denotes the solution of

\frac{d h}{d t} = L_{k} (t) h (t), h (0) = I

, then define

A \in p

by

A d_{h (t)} (A) = A + L_{p} (t)

. It follows from above that

L_{p} (t) = A d_{h^{- 1} (t)} (L_{p} (t)), L_{k} (t) = A d_{h^{- 1} (t)} (L_{k} (t))

(49)

are integral curves of the affine quadratic Hamiltonian

H = \frac{1}{2} 〈 L_{k}, L_{k} 〉 + 〈 A, L_{p} 〉

. However, when

L_{p} (t) = A d_{h^{- 1} (t)} (L_{p} (t)) - A, L_{k} (t) = A d_{h^{- 1} (t)} (L_{k} (t))

(50)

then

L_{p} (t), L_{k} (t)

are integral curves of the shadow Hamiltonian H.

3.3. Isospectral Representations and Integrability

An

n \times n

matrix equation

\frac{d L}{d t} = [M (t), L (t)]

is called a Lax equation, and

(M, L)

is called Lax pair. If

(M, L)

is a Lax pair, then the spectrum of

L (t)

is constant. The proof is simple:

g (t) L (t) g^{- 1} (t) = Λ

, where

Λ

a constant matrix for any solution

\frac{d g}{d t} = g (t) M (t)

in the general linear group

G l (n)

. Since the spectrum of

Λ

is equal to the spectrum of

L (t)

, the spectrum of

L (t)

must be constant.

It follows that the Poisson equation of any left-invariant Hamiltonian H is a Lax equation on a semi-simple Lie algebra

g

(Equation (34)) and therefore, the eigenvalues of

L (t)

are constants of motion for any left-invariant Hamiltonian on

g

and hence may be regarded as the conservation laws on

g

.

A function h on a Poisson space is said to be invariant if

{h, f} = 0

for any function f. On semi-simple Lie algebras any spectral function is invariant. In particular functions

ϕ_{k} (L) = T r (L^{k}), k = 1, 2, \dots

form a family of invariant functions.

In some situations, a Lax equation

\frac{d L}{d t} = [M (t), L (t)]

extends to a Lax equation

\frac{d L_{λ}}{d t} = [M_{λ} (t), L_{λ} (t)]

with a spectral parameter

λ

. Then a discrete spectrum of L is replaced by a continuous spectrum of

L_{λ}

which results in additional constants of motion. In the case of rolling spheres J. Zimmerman in his PhD thesis (2002, University of Toronto) discovered an extension of the Lax equation which he called isospectral ([6]). Remarkably, Zimmerman’s extension exists for the rolling problem on any semi-simple homogeneous manifold, for the same reasons as in the rolling sphere problem. In fact, if

X_{0} (t) = A + L_{p} (t), X_{1} (t) = L_{k} (t), X_{2} (t) = - A, X_{3} = 0

, then the Poisson equations may be written as

\frac{d X_{i}}{d t} = [X_{0} (t), X_{i + 1} (t)], i = 0, 1, 2 .

(51)

This equation is invariant under a dilational change

X_{i} \to λ^{i - 1} X_{i}

. It then follows that

L_{λ} = \sum_{i = 0}^{3} λ^{i} X_{i} = L_{p} (t) + λ L_{k} (t) + (1 - λ^{2}) A

(52)

satisfies the equation

\frac{d L_{λ}}{d t} = [M_{λ} (t), L_{λ} (t)], M_{λ} (t) = \frac{1}{λ} (A + L_{p} (t)) .

(53)

Therefore, the spectrum of

L_{λ} (t)

is constant. We will refer to

L_{λ}

as the spectral curve for

H

. Of course, the above implies that the Poisson system associated with the affine-quadratic Hamiltonian also admits an isospectral representation. To be specific note that after the substitutions from Equation (49),

\begin{matrix} L_{λ} = A d_{h (t)} L_{p} + λ A d_{h (t)} L_{k} + (1 - λ^{2} (A d_{h (t)} (A - L_{p}) = \\ A d_{h (t)} (λ^{2} L_{p} + λ L_{k} + (1 - λ^{2}) A) = A d_{h (t)} L_{λ} . \end{matrix}

Then

\begin{matrix} \frac{d L_{λ}}{d t} = \frac{d}{d t} (A d_{h (t)} (L_{λ}) = A d_{h (t)} [L_{λ}, L_{k}] + A d_{h (t)} \frac{d L_{λ}}{d t} \\ = [\frac{1}{λ} (A + L_{p}), L_{λ}] = A d_{h (t)} [\frac{1}{λ} A, L_{λ}] . \end{matrix}

Therefore,

\frac{d L_{λ}}{d t} = [L_{k}, L_{λ}] + [\frac{1}{λ} A, L_{λ}] = [\frac{1}{λ} A + L_{k}, L_{λ}] .

To be consistent with my earlier publications, replace

λ

by

- \frac{1}{λ}

to get

\frac{d L_{λ}}{d t} = [M_{λ}, L_{λ}],

(54)

where

M_{λ} = L_{k} - λ A

, and

L_{λ} = L_{p} - λ L_{k} + (λ^{2} - 1) A

. Equation (54) agrees with the isospectral representation in ([8]) (obtained by other means).

To get the spectral curve

L_{λ}

for the shadow Hamiltonian, use Equation (50). In such a case,

L_{k} = A d_{h} (L_{k})

,

L_{p} = A d_{h} (L_{p} + A)

and

A = - A d_{h} L_{p}

yields

L_{λ} = A d_{h} L_{λ}, L_{λ} = λ^{2} L_{p} + λ L_{k} + A .

Then a calculation analogous to the one above gives

\frac{d L_{λ}}{d t} = [\frac{1}{λ} A + L_{k}, L_{λ}]

. After the rescaling

λ \to - \frac{1}{λ}

we get a modified Lax pair

\frac{d L_{λ}}{d t} = [M_{λ}, L_{λ}], M_{λ} = L_{k} - λ A, L_{λ} = L_{p} - λ L_{k} + λ^{2} A .

(55)

Each spectral curve

L_{λ}

defines a family of functions

I = {ϕ_{λ}^{(k)} (L) = T r (L_{λ}^{k}), k = 1, 2, \dots} \cup {f (L) = 〈 L, X 〉 : X \in k, [X, A] = 0} .

Proposition 7.

The family

I

is involutive, that is,

{h, g} = 0

for each g and h in

I

, and in the case that

A

is regular, it is also complete, in the sense that it contains a subfamily

I_{0}

that is Liouville integrable on each coadjoint orbit in

g

([8], pp. 164–165).

See also related papers also [27,28,29]).

Since

H

belongs to

I

, the rolling problem is completely integrable when

A

is regular.

Corollary 2.

Each affine-quadratic Hamiltonian

H = \frac{1}{2} 〈 L_{k}, L_{k} 〉 + 〈 A, L_{k} 〉

is completely integrable on

g

when A is regular.

4. Symmetric Mechanical Tops

We will now relate the “top-like” equations

\begin{matrix} \frac{d R}{d t} = R (t) (P^{- 1} (M (t))), α_{i} (t) = R {(t)}^{T} e_{i}, i = 1, \dots, n \end{matrix}

(56)

\begin{matrix} \frac{d M}{d t} = [P^{- 1} (M (t)), M (t)] + \sum_{i = 1}^{n} α_{i} (t) \land \frac{\partial V}{\partial α_{i}} \end{matrix}

(57)

on the tangent bundle of

S O (n)

, associated with the energy Hamiltonian

H = \frac{1}{2} 〈 P^{- 1} (M), M 〉 + V (α_{1}, \dots, α_{n})

, to the rolling equations. For simplicity of exposition, we will assume that the top is maximally symmetric, that is we will assume that all principal moments of inertia are equal, which is the same as

P = I

. We will first consider the case of linear potentials.

Linear potentials:

V = - \sum_{i = 1}^{n} c_{i} (α_{i}, a)

, where a is a vector in

R^{n}

, and

c_{1}, \dots, c_{n}

are constants. Then Equation (56) can be written as

\frac{d R}{d t} = R (t) Ω (t), \frac{d M}{d t} = a \land p (t), \frac{d p}{d t} = - Ω (t) p (t)

(58)

where

Ω (t) = M (t)

and

p (t) = \sum_{i = 1}^{n} c_{i} α_{i} (t)

. Our proposition below relates Equation (58) to the rolling equations

\begin{matrix} \frac{d g}{d t} = g (t) (A + L_{p} (t)), \frac{d x}{d t} = \vec{A} (o) + {\vec{L}}_{p} (o), \end{matrix}

(59)

\begin{matrix} \frac{d L_{p}}{d t} = [A + L_{p} (t), L_{k} (t)], \frac{d L_{k}}{d t} = [A, L_{p} (t)] . \end{matrix}

(60)

on

S O_{ϵ} (n + 1) \times T_{o} M, ϵ = \pm 1

, where

M = S O_{ϵ} (n + 1) / K, K = {1} \times S O (n)

.

To set the stage for this proposition, we will need to embed Equation (58) in

R^{n + 1}

via the following embeddings. To begin with,

\hat{v} \in R^{n + 1}

will denote the embedding

\hat{v} = 0 e_{0} + \sum_{i = 1}^{n} v_{i} e_{i}

for any

v = \sum_{i = 1}^{n} v_{i} e_{i}

. Then

a \in R^{n}

will be identified with

A = \hat{a} \land_{ϵ} e_{0}

and

p \in R^{n}

will be identified with

L_{p} = p \land_{ϵ} e_{0}

in

p_{ϵ}

. In addition

R \in S O (n)

will be identified with

h = {1} \times R = (\begin{matrix} 1 & 0 \\ 0 & R \end{matrix})

, and Ω will be identified with

L_{k} = (\begin{matrix} 0 & 0 \\ 0 & M \end{matrix})

so that

\frac{d R}{d t} = R (t) Ω (t)

is identified with

\frac{d h}{d t} = h (t) L_{k} (t)

. Then

\begin{matrix} \frac{d L_{p}}{d t} (t) = \frac{d}{d t} (\hat{p} (t) \land_{ϵ} e_{0}) = - Ł_{k} (t) \hat{p} (t) \land_{ϵ} e_{0} = [L_{k} (t), L_{p} (t)] \end{matrix}

is the same as

\frac{d M}{d t} = a \land p (t)

. It follows that Equation (58) can be paraphrased as

\begin{matrix} \frac{d h}{d t} = h (t) L_{k} (t), \frac{d L_{p}}{d t} = [L_{k} (t), L_{p} (t)] . \end{matrix}

(61)

However, then

\frac{d}{d t} A d_{h (t)} L_{p} (t) = A d_{h (t)} [L_{p} (t), L_{k} (t)] + A d_{h (t)} [L_{k} (t), L_{p} (t)] = 0,

and therefore

A d_{h (t)} L_{p} (t)

is constant (same as

\frac{d}{d t} (R (t) p (t)) = R (t) Ω (t) p (t) - R (t) Ω (t) p (t) = 0

).

Proposition 8.

Top-like Equation (58) are isomorphic to the Equations (59) and (60) under the identification

\begin{matrix} A = - A d_{h (t)} L_{p} (t), L_{p} (t) = A d_{h (t)} (A + L_{p} (t)), L_{k} (t) = A d_{h (t)} L_{k} (t)), \\ \frac{d g}{d t} = g (t) A d_{h (t)} A, \frac{d x}{d t} = \vec{A} (o) + {\vec{L}}_{p} (o) . \end{matrix}

Proof.

It follows that

A d_{h (t)} A = A + L_{p} (t)

and

\frac{d g}{d t} = g (t) (A + L_{p} (t))

. Thus, (60) is satisfied. We also have

\begin{matrix} \frac{d L_{p}}{d t} = A d_{h (t)} [A + L_{p} (t), L_{k} (t)] + A d_{h (t)} [L_{k} (t), L_{p} (t)] = \\ A d_{h (t)} [A, L_{k} (t)] = [A + L_{p} (t), L_{k} (t)], \\ \frac{L_{k}}{d t} = A d_{h (t)} [A, L_{p} (t)] = [A d_{h (t)} A, A d_{h (t)} L_{p} (t)] = [A + L_{p} (t), - A] = [A, L_{p} (t)], \end{matrix}

and Equation (59) are also satisfied. □

Corollary 3.

An n-dimensional symmetric top with a linear potential is completely integrable.

Quadratic potentials. We will now show that the rolling geodesic equations on

M = S L (n) / S O (n)

can be identified with movements of the symmetric top under a quadratic potential. For our purposes, an n-dimensional top with quadratic potential is synonymous with the Hamiltonian

H (R, M) = \frac{1}{2} (P^{- 1} (M), M) + \frac{1}{2} \sum_{i = 1}^{n} a_{i} 〈 S α_{i}, α_{i} 〉,

with

R \in S O (n), M \in s o (n)

,

R^{T} e_{i} = α_{i}

, S a symmetric

n \times n

matrix, and

a_{1}, \dots, a_{n}

arbitrary numbers. In accordance with (32) the Hamiltonian equations of

\vec{H}

are given by

\frac{d R}{d t} = R (t) Ω (t), \frac{d M}{d t} = [Ω (t), M (t)] + \sum_{i = 1}^{n} a_{i} α_{i} (t) \land S α_{i} (t),

(62)

Ω (t) = P^{- 1} (M (t))

. In the symmetric case

P = I

and

[Ω (t), M (t)] = 0

and the equations reduce to

\frac{d R}{d t} = R (t) Ω (t), \frac{d M}{d t} = \sum_{i = 1}^{n} a_{i} α_{i} (t) \land S α_{i} (t) .

(63)

To relate these equations to the rolling equations, let

L_{p} (t) = \sum_{i = 1}^{n} a_{i} (α_{i} (t) \otimes α_{i} (t)) - \frac{1}{n} \sum_{i = 1}^{n} a_{i} I .

Recall that

a \otimes a

is a rank one matrix defined by

(a \otimes a) x = (a, x) a

where

(a, x)

is the standard Euclidean inner product in

R^{n}

. Therefore each matrix

α_{i} \otimes α_{i}

is a symmetric matrix with its trace equal to one, and consequently

L_{p}

is a symmetric matrix having zero trace. Along each solution of (63)

\frac{d}{d t} L_{p} (t) = - \sum_{i = 1}^{n} a_{i} (Ω (t) α_{i} (t) \otimes α_{i} (t) + α_{i} (t) \otimes Ω (t) α_{i} (t)) = [Ω (t), L_{p} (t)] .

Additionally,

\frac{d}{d t} A d_{R (t)} L_{p} (t) = A d_{R (t)} [L_{p} (t), Ω (t)] + A d_{R (t)} [Ω (t), L_{p} (t)]) = 0 .

Now let

A = - A d_{R (t)} L_{p} (t), L_{p} (t) = A d_{R (t)} (S + L_{p} (t)), L_{k} (t) = A d_{R (t)} Ω (t) .

(64)

We then have

Proposition 9.

Equation (63) are isomorphic to the Poisson equations of the rolling problem on

g = s l (n)

(Equation (43)) associated with the extremal

\frac{d g}{d t} = A d_{R (t)} S = g (t) (A + L_{p} (t)), \frac{d p}{d t} = \vec{A} (o) + {\vec{L}}_{p} (o) .

Proof.

By a straightforward calculation. □

Corollary 4.

Equations of a symmetric n-dimensional top with quadratic potential are completely integrable.

See also related results in [30,31,32]).

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

No conflict of interest.

References

Jurdjevic, V. The geometry of the Ball-Plate Problem. Arch. Ration. Mech. Anal. 1993, 124, 305–328. [Google Scholar] [CrossRef]
Jurdjevic, V. Non-Euclidean Elasticae. Am. J. Math. 1995, 117, 93–125. [Google Scholar] [CrossRef]
Jurdjevic, V. Geometric Control Theory; Cambridge Studies in Advanced Mathematics; Cambridge University Press: New York, NY, USA, 1997; Volume 52. [Google Scholar]
Jurdjevic, V. Integrable Hamiltonian Systems on Lie groups: Kowalewski type. Ann. Math. 1999, 150, 605–644. [Google Scholar] [CrossRef]
Zimmerman, J.A. Optimal control of the sphere Sⁿ rolling on Eⁿ. Math. Control Signals Syst. 2005, 17, 14–37. [Google Scholar] [CrossRef]
Jurdjevic, V.; Zimmermann, J. Rolling sphere problems on spaces of constant curvature. Math. Proc. Camb. Phil. Soc. 2008, 144, 729–747. [Google Scholar] [CrossRef]
Jurdjevic, V. Integrable Hamiltonian Systems on Complex Lie Groups; American Mathematical Society: Providence, RI, USA, 2005; Volume 178. [Google Scholar]
Jurdjevic, V. Optimal Control and Geometry: Integrable Systems; Cambridge Studies in Advanced Mathematics; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
Jurdjevic, V.; Markina, I.; Silva Leite, F. Symmetric spaces rolling on flat spaces. arXiv 2022, arXiv:2205.14636. [Google Scholar]
Jurdjevic, V. Rolling on Affine Tangent Planes: Parallel Transport and the Associated Sub-Riemannian Problems. In Proceedings of the 14th APCA International Conference on Automatic Control and Soft Computing, Bragança, Portugal, 1–3 July 2020; pp. 136–147. [Google Scholar]
Fomenko, A.T.; Mischenko, A.S. Euler equation on finite-dimensional Lie groups. Izv. Ross. Akad. Nauk. Seriya Mat. 1978, 12, 371–389. [Google Scholar]
Fomenko, A.T.; Trofimov, V.V. Integrability in the sense of Liouville of Hamiltonian systems on Lie algebras. Uspekhi Mat. Nauk 1984, 2, 3–56. (In Russian) [Google Scholar]
Neumann, C. De problemate quodam mechanico, quod ad primam integralium ultraellipticorum classem revocatum. J. Reine Angew. Math. 1859, 56, 46–63. [Google Scholar]
Reyman, A.G.; Semenov Tian-Shansky, M.A. Group theoretic Methods in the Theory of finite dimensional Integrable Systems. In Dynamical Systems VII; Chapter 2, Encyclopaedia of Mathematical Sciences; Springer: Berlin/Heidelberg, Germany, 1994; Volume 16. [Google Scholar]
O’Neill, B. Semi-Riemannian Geometry; Academic Press: Cambridge, MA, USA; Elsevier: Amsterdam, The Netherlands, 1983. [Google Scholar]
Sternberg, S. Lectures on Differential Geometry; Prentice- Hall, Inc.: Englewood Cliffs, NJ, USA, 1964. [Google Scholar]
Ziller, W. Lie Groups. Representation Theory and Symmetric Spaces; University of Pennsylvania: Philadelphia, PA, USA, 2010. [Google Scholar]
Bryant, R.; Hsu, L. Rigidity of integral curves of rank 2 distributions. Invent. Math. 1993, 114, 435–461. [Google Scholar] [CrossRef]
Agrachev, A.; Sachkov, Y. Control Theory from the Geometric Viewpoint; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
Chitour, Y.; Kokkonen, P. Rolling Manifolds: Intrinsic Formulation and Controllability. arXiv 2011, arXiv:1011.2925v2. [Google Scholar]
Chitour, Y.; Godoy Molina, M.; Kokkonen, P. The rolling problem: Overview and challenges. In Geometric Control theory and Sub-Riemannian Geometry; Springer INdAM Series 5; Springer: Berlin/Heidelberg, Germany, 2014; pp. 103–123. [Google Scholar]
Godoy Molina, M.; Grong, E.; Markina, I.; Silva Leite, F. An intrinsic formulation of the problem on rolling manifolds. J. Dyn. Control Syst. 2012, 18, 181–214. [Google Scholar] [CrossRef]
Krakowski, K.A.; Machado, L.; Silva Leite, F. Rolling Symmetric Spaces. In Proceedings of the Second International Conference on Geometric Science of Information, Palaiseau, France, 28–30 October 2015; Springer: Berlin/Heidelberg, Germany, 2015; pp. 550–557. [Google Scholar]
Sharpe, R.W. Differential Geometry; GTM, 166; Springer: New York, NY, USA, 1997. [Google Scholar]
Kirillov, A.A. Lectures on the Orbit Method; Graduate Studies in Mathematics, 64; American Mathematical Society: Providence, RI, USA, 2004. [Google Scholar]
Gasparim, E.; Grama, L.; San Martin, L.A. Adjoint orbits of semi-simple Lie groups and Lagrangian submanifolds. Proc. Edinb. Math. Soc. 2017, 60, 361–385. [Google Scholar] [CrossRef]
Bolsinov, A. A completeness criterion for a family of functions in involution obtained by the shift method. Soviet Math. Dokl. 1989, 38, 161–165. [Google Scholar]
Reyman, A.G. Integrable Hamiltonian Systems connected with graded Lie algebras. J. Sov. Math. 1982, 19, 1507–1545. [Google Scholar] [CrossRef]
Perelomov, A.M. Integrable Systems of Classical Mechanics and Lie Algebras; Birkhauser: Basel, Switzerland, 1990; Volume 1. [Google Scholar]
Bogoyavlenski, O. New Integrable Problem of Classical Mechanics. Comm. Math. Phys. 1984, 94, 255–269. [Google Scholar] [CrossRef]
Jurdjevic, V. Affine quadratic problem on Lie groups. J. Lie Theory 2020, 20, 425–444. [Google Scholar] [CrossRef]
Manakov, S.V. Note on the integration of Euler’s equations of the dynamics of an n dimensional rigid body. Funct. Anal. Appl. 1976, 10, 328–329. [Google Scholar] [CrossRef]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Jurdjevic, V. Rolling Geodesics, Mechanical Systems and Elastic Curves. Mathematics 2022, 10, 4827. https://0-doi-org.brum.beds.ac.uk/10.3390/math10244827

AMA Style

Jurdjevic V. Rolling Geodesics, Mechanical Systems and Elastic Curves. Mathematics. 2022; 10(24):4827. https://0-doi-org.brum.beds.ac.uk/10.3390/math10244827

Chicago/Turabian Style

Jurdjevic, Velimir. 2022. "Rolling Geodesics, Mechanical Systems and Elastic Curves" Mathematics 10, no. 24: 4827. https://0-doi-org.brum.beds.ac.uk/10.3390/math10244827

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Rolling Geodesics, Mechanical Systems and Elastic Curves

Abstract

1. Introduction

1.1. Affine-Quadratic Problem

1.2. Homogeneous Riemannian Manifolds

1.3. The Rolling Problem

1.4. Some Notable Examples

2. Symplectic Background, Hamiltonian Systems

2.1. Left-Invariant Trivializations and the Symplectic Form

2.2. Poisson Manifolds, Coadjoint Orbits

2.3. Representation of Coadjoint Orbits on Lie Algebras- Semi-Simple vs. Semi-Direct

3. Hamiltonian and Poisson Systems: Extremal Curves

3.1. Rolling Hamiltonians

3.2. Affine-Quadratic Hamiltonian

3.3. Isospectral Representations and Integrability

4. Symmetric Mechanical Tops

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI