Sub-Riemannian Geometry and Optimal Transport
Ludovic Rifford October 1, 2012
Preface
The main goal of these lectures is to give an introduction to sub-Riemannian geometry and optimal transport, and to present some of the recent progress in these two fields. This set of notes is divided into three chapters and two appen- dices. Chapter 1 is concerned with the notions of totally nonholonomic distri- butions and sub-Riemannian structures. The concepts of End-Point mappings and singular horizontal paths which play a major role through these lectures are introduced here. Chapter 2 deals with sub-Riemannian geodesics. We study first and second-order variations of the End-Point mapping to derive necessary and sufficient conditions for an horizontal path to be minimizing. We provide several examples, including the Montgomery counter-example of singular min- imizing curve. In Chapter 3, we study the Monge problem for sub-Riemannian quadratic costs. We give a crash-course in optimal transport theory and ex- plain how the sub-TWIST condition together with the Lipschitz regularity of a ”variational” cost implies the well-posedness of Monge’s problem. Then we study the fine regularity properties of sub-Riemannian distances to obtain existence and uniqueness of optimal transport maps in the sub-Riemannian context. We recall basic facts on ordinary differential equations in Appendix 1 and less classical results of differential calculus in normed vector spaces in Appendix 2. The latter plays a key role in Chapter 2.
The reader of these notes should be familiar with the basics in differen- tial geometry and measure theory. Possible references in these fields include the textbooks by Lee [Lee03] and Evans-Gariepy [EG92]. For further reading, we strongly encourage the reader to look at other texts in sub-Riemannian geometry and optimal transport. Multiple viewpoints always lead to deeper understanding and may open new directions for research. Among them, we may suggest the textbooks by Montgomery [Mon02], Agrachev, Barilari and Boscain [ABB12], and Villani [Vil08].
This set of notes grew from a series of lectures that I gave during a CIMPA school in Beyrouth, Lebanon, on the invitation of Fernand Pelletier. I take the opportunity of this preface to warmly thank Ali Fardoun, Mohamad Mehdi and Fernand Pelletier who organized the school, Ahmed El Soufi for his support and friendship, and through him the ”Centre International de Math´ematiques Pures et Appliqu´ees”. My gratitude goes also to all faculties and students who attended this sub-Riemannian CIMPA school in making it a success.
i
Contents
Preface i
1 Sub-Riemannian structures 1
1.1 Totally nonholonomic distributions . . . 1
1.2 Horizontal paths and End-Point mappings . . . 12
1.3 Regular and singular horizontal paths . . . 24
1.4 The Chow-Rashevsky Theorem . . . 37
1.5 Sub-Riemannian structures . . . 41
1.6 Notes and comments . . . 44
2 Sub-Riemannian geodesics 45 2.1 Minimizing horizontal paths and geodesics . . . 45
2.2 The Hamiltonian geodesic equation . . . 51
2.3 The sub-Riemannian exponential map . . . 58
2.4 The Goh condition . . . 65
2.5 Examples of SR geodesics . . . 74
2.6 Notes and comments . . . 80
3 Introduction to optimal transport 83 3.1 The Monge and Kantorovitch problems . . . 83
3.2 Optimal plans and Kantorovitch potentials . . . 86
3.3 A generalized Brenier-McCann Theorem . . . 96
3.4 Optimal transport on ideal and Lipschitz SR structures . . . . 101
3.5 Other examples . . . 113
3.6 Notes and comments . . . 117
A Ordinary differential equations 121 A.1 Preliminaries . . . 121
A.2 Existence and uniqueness results . . . 123
A.3 Linear systems . . . 127
B Elements of differential calculus 129 B.1 First order calculus . . . 129
B.2 Second order study . . . 133
Bibliography 143
iii
Chapter 1
Sub-Riemannian structures
Throughout all the chapter,M denotes a smooth connected manifold without boundary of dimensionn≥2.
1.1 Totally nonholonomic distributions
Distributions
A smoothdistribution∆ of rankm≤n(m≥1) onM is a rankmsubbundle of the tangent bundle T M, that is a smooth map that assigns to each point xofM a linear subspace ∆(x) of the tangent spaceTxM of dimensionm. In other terms, for every x∈M, there are an open neighborhood Vx ofxin M andmsmooth vector fieldsXx1,· · · , Xxmlinearly independent onVxsuch that
∆(y) = Spann
Xx1(y),· · ·, Xxm(y)o
∀y∈ Vx.
Such a family of smooth vector fields is called alocal frameinVxfor the distri- bution ∆. All the distributions which will be considered later will be smooth with constant rankm∈[1, n]. Thus, from now on, ”distribution” always means
”smooth distribution with constant rank”. A co-rankkdistribution onM is a distribution of rankm=n−kand any smooth vector fieldX onM such that X(x)∈∆(x) for anyx∈M is called a section of ∆.
Example 1.1.1. We call trivial distribution on M the rank n distribution
∆ defined by ∆(x) = TxM for all x ∈ M. For topological reasons, such a distribution may not admit non-vanishing sections (for example, by the hairy ball theorem, there is no non-vanishing continuous vector fields on any even dimensional sphere).
Example 1.1.2. InR3 with coordinates(x, y, z), the distribution∆ defined by
∆(x, y, z) =Spann
X(x, y, z), Y(x, y, z)o
∀(x, y, z)∈R3 with
X =∂x−y
2∂z and Y =∂y+x 2∂z, is a rank 2(or co-rank 1) distribution on R3.
1
Example 1.1.3. More generally, if x= (x1, . . . , xn, y1, . . . , yn, z)denotes the coordinates in R2n+1 and the 2n smooth vector fields X1, . . . , Xn, Y1, . . . , Yn are defined by
Xi =∂xi−yi
2∂z, Yi =∂yi+xi
2∂z ∀i= 1, . . . , n, then the distribution∆ defined by
∆(x) =Spann
X1(x), , . . . , Xn(x), Y1(x), . . . , Yn(x)o
∀x∈R2n+1, is a co-rank1 distribution onR2n+1.
Example 1.1.4. Let α be a smooth non-degenerate 1-form on M, that is a 1-form which does not vanish (αx 6= 0 for any x ∈ M). The distribution ∆ defined as
∆(x) =Ker(αx) ∀x∈M, is a co-rank1 distribution onM.
Example 1.1.5. As an example, consider the unit 3-sphere S3 in R4 with coordinates (x1, y1, x2, y2), that is
S3=n
(x1, y1, x2, y2)∈R4|x21+y21+x22+y22= 1o . Let αbe the smooth non-degenerate1-form on S3 defined by
α=
x1dy1−y1dx1+x2dy2−y2dx2
|S3
, then∆ =Ker(α)is a co-rank1 distribution onS3.
We say that a given distribution ∆ onM admits aglobal frame if there are msmooth vector fieldsX1,· · ·, XmonM such that
∆(x) = Spann
X1(x),· · ·, Xm(x)o
∀x∈M.
In general, distributions do not admit global frames (see Example 1.1.1). It is worth noticing that in the particular case ofRn all distributions are trivial.
Proposition 1.1.6. Any distribution inRn admits a global frame.
Proof. Let us first show how to construct a non-vanishing section of a given distribution inRn.
Lemma 1.1.7. Let ∆ be a distribution of rank m in Rn. Then there is a non-vanishing smooth vector fieldX such that X(x)∈∆(x), for anyx∈Rn. Proof of Lemma 1.1.7. Define the multivalued mappingδ:Rn→2Rn by
δ(x) =n
v∈∆(v)| |v|= 1o
∀x∈Rn.
By construction, δ is locally Lipschitz with respect to the Hausdorff distance on compact subsets of Rn. By compactness of ¯B(0n,2), there is ∈ (0,1)
1.1. TOTALLY NONHOLONOMIC DISTRIBUTIONS 3 such that for any x, y ∈ B(0¯ n,2) with |x−y| < , and any v ∈ δ(x), there is w ∈ δ(y) such that |v−w| < 1. Let N ≥ 2 be an integer such that the increasing sequence of ballsB1, . . . ,BN defined by
Bi=B(0n, i) ∀i= 1, . . . , N,
satisfies ¯B(0n,1)⊂ BN. For everyx∈Rn, we denote by Projδ(x)the projection onto the (m−1)- dimensional sphere δ(x). Note that the mapping Projδ(x)is well-defined and ”smooth” on the open set
Ox=n
w∈Rn| hv, wi 6= 0,∀v∈δ(x)o .
For everyi∈ {1, . . . , N−1}, consider a smooth mappingPi :Bi+1 → Bi such that
|Pi(x)−x|< ∀x∈ Bi+1. (1.1) Note that such a smooth function exists becauseBi is a ball and Bi+1 is con- tained in the-neighborhood ofBi. Let ¯w∈δ(0) be fixed. We define the vector fieldX : ¯B(0n,1)→Rn as follows:
We first set
X1(x) = Projδ(x)( ¯w) ∀x∈ B1. Then, givenXi :Bi→Rn, we defineXi+1:Bi+1→Rn as
Xi+1(x) = Projδ(x)
Xi Pi(x)
∀x∈ Bi+1. By construction (by (1.1) and the definition of ), Xi Pi(x)
belongs to Ox for anyx∈ Bi+1. In conclusion, X =XN is smooth on ¯B(0n,1) and satisfies 0n 6=X(x) ∈ δ(x) for any x ∈ B(0n,1). Repeating the construction on the annuli B(0n,2)\B(0n,1), B(0n,3)\B(0n,2), . . ., we obtain a non-vanishing section of ∆ onRn.
We now prove Proposition 1.1.6 by induction onm. Let ∆ be a rank (m+1) distribution onRn. By Lemma 1.1.7, it admits a non-vanishing sectionX on Rn. The multivalued mapping ˜∆ :Rn→2Rn defined by
∆(x) = ∆(x)˜ ∩n
X(x)o⊥
∀x∈Rn,
is a smooth rank m distribution (here {X(x)}⊥ denotes the space which is orthogonal to X(x) with respect to the Euclidean scalar product). Thus by induction, there are smooth vector fieldsX1, . . . , XmonRn such that
∆(x) = Span˜ n
X1(x), . . . , Xm(x)o
∀x∈Rn. The family{X1, . . . , Xm, X} is a global frame for ∆.
A finite family of smooth vector fields{X1, . . . , Xk} is called agenerating family for ∆ onM if there holds
∆(x) = Spann
X1(x),· · · , Xk(x)o
∀x∈M.
Any distribution can be represented by a generating family.
Proposition 1.1.8. Let∆ be a distribution of rankm≤nonM. Then there are k=m(n+ 1) smooth vector fields X1,· · ·, Xk such that {X1,· · ·, Xk} is a generating family for ∆.
Proof. By definition, for everyx∈M, there is an open neighborhoodVx ofx inM andmsmooth vector fieldsXx1,· · · , Xxmlinearly independent onVxsuch that
∆(y) = Spann
Xx1(y),· · ·, Xxm(y)o
∀y∈ Vx.
Since M is paracompact, there is a locally finite coveringV ={Vi}i∈I where each open setVi equals Vxi for somexi∈M.
Lemma 1.1.9. There are a locally finite open covering {Uj}j∈J of M and a partition∪n+1l=1Jl ofJ such that the following properties are satisfied:
(a) for every j∈J, there isi=i(j)∈I such that Uj ⊂ Vi, (b) for everyl∈ {1, . . . , n+ 1}and any j6=j0∈Jl,Uj∩ Uj0=∅.
Proof of Lemma 1.1.9. Recall that every smooth manifold is triangulable. Let T ={Tt}t∈T be a triangulation ofM that refines the covering{Vi}i∈I, in the sense that the closure of each face F of T is a subset of some Vi. For every α∈ {0, . . . , n}, denote byTα={Ttα}t∈Tα the family ofα-dimensional faces in T. For everyα∈ {0, . . . , n},we can construct easily a collection of open sets Wα={Wsα}s∈Sα satisfying the following properties:
- Wα is a refinement of{Vi}i∈I, - ∪t∈TαTtα⊂ ∪s∈SαWsα,
- eachWsα is an open neighborhood of someα-dimensional face ofTα, - for anys6=s0 ∈Sα,Wsα∩ Wsα0 =∅,
- for anys6=s0 ∈S0,Wsα∩ Wsα0 =∅,
- for anyα∈ {1, . . . , n}and anys6=s0 ∈Sα,Wsα∩ Wsα0 ⊂ ∪t∈Tα−1Ttα−1. For that, it suffices to proceed by induction on α and to make use of the properties of a triangulation. We conclude easily.
Let us now show how to construct for every r ∈ {1, . . . , m} a family of sections {X1j, . . . , Xn+1j |1 ≤ j ≤ r} of ∆ such that Span{Xlj(x)|1 ≤ j ≤ r,1≤l≤n+ 1} has dimension≥r for anyx∈M. We proceed by induction onr.
First, for each l ∈ {1, . . . , n+ 1} and each j ∈Jl, there is i =i(j) ∈I such that Uj ⊂ Vi = Vxi. Modifying Xi1 = Xx1i outside Uj if necessary, we may assume thatXi1 is defined onM, does not vanish onUj, and vanishes outside Uj. DefineX11, . . . , Xn+11 by
Xl1= X
j∈Jl
Xi(j)1 ∀l= 1, . . . , n+ 1.
By construction (Lemma 1.1.9 (b)), the interior of the supports of theXi(j)1 ’s are always disjoint. Therefore, each Xl1 is a non-vanishing section of ∆ on
1.1. TOTALLY NONHOLONOMIC DISTRIBUTIONS 5
∪j∈JlUj. This shows that Span{Xl1(x)|1≤l≤n+ 1}has dimension≥1 for anyx∈M.
Assume now that we have constructed a family of smooth vector fields{Xij,|1≤ j≤r,1≤i≤n+ 1}such that
Spann
Xlj(x)|1≤j≤r,1≤l≤n+ 1o
has dimension ≥ r for any x∈ M (with r < m). For every j ∈ J, there is s=s(j)∈ {1, . . . , m}such that
Spann
Xxsi(j)(x), Xlj(x)|1≤j≤r,1≤l≤n+ 1o has dimension≥r+ 1 for anyx∈ Uj. DefineX1r+1, . . . , Xn+1r+1 by
Xlr+1= X
j∈Jl
Xi(j)s(j) ∀l= 1, . . . , n+ 1.
We leave the reader to check that by construction (modifying the Xxs(j)
i(j)’s if necessary as above), the vector space
Spann
Xlj(x)|1≤j ≤r+ 1,1≤l≤n+ 1o has dimension≥r+ 1 for anyx∈M. The proof is complete.
The H¨ormander condition
Recall that for any smooth vector fieldsX, Y onM given by X(x) =
n
X
i=1
ai(x)∂xi, Y(x) =
n
X
i=1
bi(x)∂xi,
in local coordinates x = (x1, . . . , xn), the Lie bracket [X, Y] is the smooth vector field defined as
[X, Y](x) =
n
X
i=1
ci(x)∂xi, wherec1, . . . , cn are the smooth scalar function given by
ci=
n
X
j=1
∂xjbi
aj− ∂xjai
bj ∀i= 1,· · ·, n.
For the upcoming controllability results (like the Chow-Rashesvky Theorem), it is important to keep in mind the following dynamical characterization of the Lie bracket.
Proposition 1.1.10. LetX, Y be two smooth vector fields in an neighborhood ofx∈Rn. Then we have
[X, Y](x) := DxY ·X(x)−DxX·Y(x)
= lim
t→0
e−tY ◦e−tX ◦etY ◦etX(x)−x
t2 , (1.2)
whereetX andetY denote respectively the flows ofX andY.
bx
b
etX(x)
betY ◦etX(x)
b
e−tX◦etY ◦etX(x)
b e−tY ◦e−tX◦etY ◦etX(x)
Proof. All the functions appearing in the proof will be defined locally fortclose to 0 and/or in a neighborhood ofx. Define the smooth functionh4by
h4(t) := e−tY ◦e−tX◦etY ◦etX(x) ∀t.
We haveh04(0) = 0. As a matter of fact, we have for anyt, h04(t) =−Y(h4(t)) + ∂
∂xe−tY
(t,h3(t))
·h03(t) whereh3 is defined byh3(t) := e−tX ◦etY ◦etX(x). Then we have
h03(t) =−X(h3(t)) + ∂
∂xe−tX
(t,h2(t))
·h02(t), whereh2(t) := etY ◦etX
(x) and h02(t) =Y(h2(t)) +
∂
∂xetY
(t,h1(t))
·h01(t),
with h1(t) := etX(x) and h01(t) = X(etX(x)). Since partial derivatives of the form ∂x∂ etX at t = 0 are equal to Id, we get h01(0) = X(x), h02(0) = X(x) +Y(x), h03(0) = Y(x) and h04(0) = 0. Therefore, the left-hand side of (1.2) is equal to 12h004(0). By derivating the above formulas, we get
h001(0) =dX(h1(0))·h01(0) =dX(x)·X(x), and
h002(0) =dY(h2(0))·h02(0) +
"
d dt
"
∂
∂xetY
(t,h1(t))·h01(t)
##
t=0
. ButdY(h2(0))·h02(0) =dY(x)·(X(x) +Y(x)) and
"
d dt
"
∂
∂xetY
(t,h1(t))
·h01(t)
##
t=0
=
"
d dt
∂
∂xetY
(t,h1(t))
#
t=0
·h01(0) + ∂
∂xetY
(0,h1(0))
·h001(0)
=
"
∂2
∂t∂x etY
(0,x)
+ ∂2
∂x2 etY
(0,x)·h01(0)
#
·X(x) +dX(x)·X(x)
= ∂
∂x ∂
∂tetY
(0,x)
·X(x) +dX(x)·X(x)
=dY(x)·X(x) +dX(x)·X(x).
1.1. TOTALLY NONHOLONOMIC DISTRIBUTIONS 7 We infer that h002(0) = dY(x)·(2X(x) +Y(x)) +dX(x)·X(x). In the same way, we have
h003(0) =−dX(h3(0))·h03(0) +
"
d dt
"
∂
∂xe−tX
(t,h2(t))
·h02(t)
##
t=0
,
−dX(h3(0))·h03(0) =−dX(x)·Y(x) and
"
d dt
"
∂
∂xe−tX
(t,h2(t))
·h02(t)
##
t=0
=
"
d dt
∂
∂xe−tX
(t,h2(t))
#
t=0
·h02(0) + ∂
∂xe−tX
(0,h2(0))
·h002(0)
=−dX(x)·(X(x) +Y(x)) +dY(x)·(2X(x) +Y(x)) +dX(x)·X(x)
=−dX(x)·Y(x) +dY(x)·(2X(x) +Y(x)).
Which impliesh003(0) =−2dX(x)·Y(x) +dY(x)·(2X(x) +Y(x)). Finally h004(0) = −dY(h4(0))·h04(0) +
"
d dt
"
∂
∂xe−tY
(t,h3(t))
·h03(t)
##
t=0
=
"
d dt
∂
∂xe−tY
(t,h3(t))
#
t=0
·h03(0) + ∂
∂xe−tY
(0,h3(0))·h003(0)
= −dY(x)·Y(x)−2dX(x)·Y(x) +dY(x)·(2X(x) +Y(x))
= 2(dY(x)·X(x)−dX(x)·Y(x))
= 2[X, Y](x), which concludes the proof.
Remark 1.1.11. We check easily that the following properties are satisfied:
(i) Given smooth vector fieldsX1, X2, Y1, Y2 anda1, a2∈R, we have [a1X1+a2X2, Y1] = a1[X1, Y1] +a2[X2, Y1]
[X1, a1Y1+a2Y2] = a1[X1, Y1] +a2[X1, Y2].
(ii) Given smooth vector fields X andY, we have[X, Y] =−[Y, X].
(iii) Given three smooth vector fieldsX, Y, Z, the Jacobi identity is satisfied:
X,[Y, Z]+
Y,[Z, X]+
Z,[X, Y]= 0.
Remark 1.1.12. Given a smooth diffeomorphism φ from a smooth manifold U to a smooth manifold V andX a smooth vector field on V , we recall that the push-forwardφ∗(X)of X is defined by
φ∗(X)(y) :=Dφ−1(y)φ X(φ−1(y)
∀y∈ V. We have
[φ∗(X), φ∗(Y)] =φ∗([X, Y]).
For any family F of smooth vector fields on an open set O ⊂ M, we denote by Lie(F) the Lie algebra of vector fields generated by F. It is the smallest vector subspace S of X∞(M) (the space of smooth vector fields on M) containing F that also satisfies
[X, Y]∈S ∀X ∈ F, ∀Y ∈S.
It can be constructed as follows: Denote by Lie1(F) the space spanned byF inX∞(M) and define recursively the spaces Liek(F) (k= 1,2, . . .) by
Liek+1(F) = Span
Liek(F)∪n
[X, Y]|X ∈ F, Y ∈Liek(F)o
∀k≥0.
This defines an increasing sequence of vector spaces inX∞(M) satisfying Lie(F) = [
k≥1
Liek(F).
In general, Lie(F) is an infinite-dimensional subspace ofX∞(M).
Example 1.1.13. Let Abe an×nreal matrix, bbe a vector inRn, andX, Y be the smooth vector fields in Rn defined by
X(x) =Ax, Y(x) =b ∀x∈Rn.
The non-zero Lie brackets of X andY are always constant vector fields of the form
ad1X(Y) := [X, Y] =−Ab, ad2X(Y) :=
X,ad1X(Y)=A2b, and
adk+1X (Y) :=h
X,adkX(Y)i
= (−1)k+1Ak+1b ∀k≥0.
By the Cayley-Hamilton Theorem,An can be expressed as a linear combination ofA0, . . . , An−1. Therefore, Lie(X, Y)is the set of vector fieldsZ inRn of the form
Z(x) =λAx+
n−1
X
i=0
λiAib ∀x∈Rn, with λ, λ0, . . . , λn−1∈R. It is a finite-dimensional Lie algebra.
Example 1.1.14. Let X, Y be the two smooth vector fields inR2 (with coor- dinates x= (x1, x2)) defined by
X(x) =∂x1, Y(x) =f(x1)∂x2 ∀x∈R2,
where f is a smooth scalar function. Then, Lie(X, Y) is the space of smooth vector fields spanned byX and
adkY(X) =f(k)∂x2 fork≥0.
Thus, Lie(X, Y) is infinite-dimensional whenever the derivatives off span an infinite-dimensional space of functions.
1.1. TOTALLY NONHOLONOMIC DISTRIBUTIONS 9 For any pointx∈M, Lie(F)(x) denotes the set of all tangent vectorsX(x) withX ∈Lie(F). It follows that Lie(F)(x) is always a linear subspace ofTxM, hence finite-dimensional.
Example 1.1.15. Returning to Example 1.1.14 and denoting by (e1, e2) the canonical basis ofR2, we check that
Lie(X, Y)(x) =Spann
e1, f(k)(x1)e2|k= 0,1,2, . . .o
∀x∈R2. In particular, Lie(X, Y)(x) = Re1 if f(x) and all its derivatives at x vanish and Lie(X, Y)(x) =R2 otherwise.
We say that the smooth vector fields X1, . . . , Xm satisfy theH¨ormander condition on some open setO ⊂M if and only if
Lien
X1,· · · , Xmo
(x) =TxM ∀x∈ O.
A distribution ∆ onM is calledtotally nonholonomiconM if for everyx∈M, there are an open neighborhoodVxofxinM and a local frameXx1,· · ·, Xxmon Vxwhich satisfies the H¨ormander condition on Vx. This definition is intrinsic, it does not depend upon the choice of the local frameXx1, . . . , Xxm. This is a consequence of the following result:
Proposition 1.1.16. Let {X1, . . . , Xm},{Y1, . . . , Ym} be two families of lin- early independent smooth vector fields on an open setO ⊂M such that
Spann
X1(x), . . . , Xm(x)o
=Spann
Y1(x), . . . , Ym(x)o
∀x∈ O. Then there holds for any integerk≥1,
Liekn
X1, . . . , Xmo
(x) =Liekn
Y1, . . . , Ymo
(x) ∀x∈ O.
Proof. It is sufficient to show that the following inclusion holds for any integer k≥2,
Liekn
X1, . . . , Xmo
(x)⊂Liekn
Y1, . . . , Ymo
(x) ∀x∈ O.
Since the Yj(x) are always linearly independent, there are smooth functions αji :O →Rwithi, j= 1, . . . , m,such that
Xi(x) =
m
X
j=1
αji(x)Yj(x) ∀x∈ O,∀i= 1, . . . , m.
Then for everyi= 1, . . . , mand every smooth vector fieldZ, there holds [Xi, Z] =
m
X
j=1
αjiYj, Z
=
m
X
j=1
αji[Yj, Z]−
m
X
j=1
dαji(Z)Yj. Since Span
X1(x), . . . , Xm(x) ⊂ Span
Y1(x), . . . , Ym(x) for any x, this shows that
Lie2n
X1, . . . , Xmo
(x)⊂Lie2n
Y1, . . . , Ymo
(x) ∀x∈ O. We conclude easily by an inductive argument.
We also observe that any generating family for ∆ does satisfy the H¨ormander condition provided ∆ is totally nonholonomic.
Proposition 1.1.17. Let ∆ be a totally nonholonomic distribution on M and{X1, . . . , Xk} be a generating family for ∆. Then X1, . . . , Xk satisfy the H¨ormander condition onM.
Proof. We need to show that Lien
X1,· · ·, Xko
(x) =TxM ∀x∈M.
Letx∈M be fixed. By assumption, there is an open neighborhoodVx and a local frameYx1,· · · , YxmonVx which satisfies the H¨ormander condition onVx. Proceeding as in the proof of Proposition 1.1.16, we show that
Liekn
X1, . . . , Xko
(x)⊂Liekn
Yx1, . . . , Yxmo (x),
for every integerk ≥ 1. This proves that X1, . . . , Xk satisfy the H¨ormander condition onM.
Remark 1.1.18. Since for any smooth vector fieldX, there holds[X, X] = 0, a one dimensional distribution cannot be totally nonholonomic.
Degree of nonholonomy
If ∆ is a rankmtotally nonholonomic distribution onM, then for everyx∈M, there are an open neighborhoodVxofxandmsmooth vector fieldsXx1, . . . , Xxm which satisfy the H¨ormander condition onVx. We calldegree of nonholonomy of ∆ atxthe smallest integerr=r(x)≥1 such that
Liern
X1, . . . , Xmo
(x) =TxM.
Thanks to Proposition 1.1.16, this definition does not depend upon the choice of the local frame. Moreoever, we shall say that ∆ is totally nonholonomic of degreerif the nonholonomy degree of any point inM is≤r.
Example 1.1.19. The distribution given in Example 1.1.2 is totally nonholo- nomic. We check easily that
[X, Y] =∂z ∀i, j= 1, . . . , n, which means that∆ has degree2.
Example 1.1.20. More generally, the distribution given in Example 1.1.3 is totally nonholonomic of degree 2. We check easily that
[Xi, Yj] =δij∂z ∀i, j= 1, . . . , n.
Example 1.1.21. The Martinet distribution inR3 (with coordinates(x, y, z)) is defined as
∆(x, y, z) =Spann
X(x, y, z), Y(x, y, z)o
∀x∈R3,
1.1. TOTALLY NONHOLONOMIC DISTRIBUTIONS 11 where
X=∂x, Y =∂y+x2 2 ∂z. The first Lie bracket ofX, Y is given by
[X, Y] =x∂z. For any(x, y, z)∈R3 with x6= 0, the three vectors
X(x, y, z), Y(x, y, z),[X, Y](x, y, z)
are linearly independent. Hence, ∆ is a totally nonholonomic distribution of degree2 on R3\ {x= 0}. The Lie bracket[[X, Y], Y]is given by
[[X, Y], Y] =∂z.
Then,∆ is a totally nonholonomic distribution of degree 3 onR3. Example 1.1.22. More generally, ifX, Y are given by
X =∂x, Y =∂y+xl∂z,
with l ∈ N∗, we check easily that the distribution spanned by X and Y is a totally nonholonomic distribution of degreel+ 1.
Example 1.1.23. Assume that M has dimensionn = 2p+ 1 and let α be a 1-form on M satisfying
α∧(dα)p6= 0
then the distribution given by∆ =Ker(α)is totally nonholonomic of degree2.
Such a1-form is called a contact form and the associated distribution is called a contact distribution. As a matter of fact, given x¯ ∈M, there is a local set of coordinates(x1, . . . , xn)in an open neighborhood V¯ ofx¯ such thatαhas the form
α=
2p
X
i=1
aidxi
! +dxn,
wherea1, . . . , a2p are smooth scalar function onV¯ such that ai(¯x) = 0 ∀i= 1, . . . ,2p.
Hence, the family of smooth vector fieldsX¯1, . . . ,X¯2p given by X¯i=∂xi−ai∂xn ∀i= 1, . . . ,2p,
defines a local frame for∆ =Ker(α)in V¯. On the one hand, the n= 2p+ 1- formα∧(dα)p atx¯ reads
(α∧(dα)p)x¯= X
σ∈P2p
Y
l=1,...,p
∂ajl
∂xil
−∂ail
∂xjl
dxn∧(dxi1∧dxj1). . .∧ dxip∧dxjp
|x¯, (1.3)
whereP2pdenotes the set ofp-tuples of the formσ= ((i1, j1), . . . ,(ip, jp))with {i1, j1, . . . , ip, jp} ={1, . . . ,2p} and il < jl for all l = 1, . . . , p. On the other hand, we check easily that
X¯i,X¯j(¯x) = ∂xiaj−∂xjai
∂xn(¯x) ∀i, j= 1, . . . ,2p.
Therefore, if there is¯i∈ {1, . . . ,2p} such that[ ¯X¯i,X¯j](¯x) = 0 for anyj 6=i, then all the products appearing in (1.3) vanish, which implies that(α∧(dα)p)x¯= 0, contradiction. We deduce that for everyi∈ {1, . . . , n}, there holds
Spann
X¯1(¯x), . . . ,X¯2p(¯x),X¯i,X¯1(¯x), . . . ,X¯i,X¯2p(¯x)o
=T¯xM. (1.4) This means that ∆ =Ker(α)is totally nonholonomic of degree 2.
Example 1.1.24. As an example, the 1-form given in Example 1.1.5 is a contact form onS3. There holds
α∧dα = (x1dy1−y1dx1+x2dy2−y2dx2)∧(2dx1∧dy1+ 2dx2∧dy2)
= 2x1dy1∧dx2∧dy2−2y1dx1∧dx2∧dy2
+2x2dx1∧dy1∧dy2−2y2dx1∧dy1∧dx2. A basis of the tangent space to S3 at x = (x1, y1, x2, y2) ∈ S3 is given by (V1, V2, V3) with
V1 = −y1e1+x1e2−y2e3+x2e4
V2 = −x2e1+y2e2+x1e3−y1e4
V3 = −y2e1−x2e2+y1e3+x1e4. Then
(α∧dα)x(V1, V2, V3) =
2x21 x21+y21+x22+y22
−2y12 −x21−y12−x22−y22 + 2x22 x21+y21+x22+y22
−2y22 −x21−y12−x22−y22
= 2 x21+y12+x22+y222= 2.
This means that the restriction of the 3-form α∧dα to the tangents spaces to S3 does not vanish.
1.2 Horizontal paths and End-Point mappings
Horizontal paths
Let ∆ be a distribution of rankm≤ninM. A continuous pathγ: [0, T]→Rn is said to be horizontal with respect to ∆ if it is absolutely continuous with square integrable derivative (see Appendix A) and satisfies
˙
γ(t)∈∆ γ(t)
a.e. t∈[0, T].
For every x∈ M and every T > 0, we denote by Ωx,T∆ the set of horizontal paths γ: [0, T] →M starting atx. If ∆ admits a global frameX1, . . . , Xm, then there is a one-to-one correspondence between Ωx,T∆ and an open subset of L2([0, T];Rm).
1.2. HORIZONTAL PATHS AND END-POINT MAPPINGS 13 Proposition 1.2.1. LetF=
X1, . . . , Xm be a global frame for∆. Then for every x∈M and every T >0, there is an open subset UFx,T ofL2([0, T];Rm) such that the mapping
u∈UFx,T 7−→γu∈Ωx,T∆ ,
(whereγu: [0, T]→M is the unique solution to the Cauchy problem
˙ γu(t) =
m
X
i=1
ui(t)Xi(γu(t)) a.e. t∈[0, T], γu(0) =x,) (1.5) is one-to-one.
Proof. The set of controlsu∈L2([0, T];Rm) such that the solutionγuof (1.5) is well-defined on [0, T] is a non-empty open set. Moreover, by construction, any path γu is absolutely continuous with square integrable derivative and almost everywhere tangent to ∆. This proves that the map under study is well-defined. Letγ∈Ω∆,x,T be such that there areu, v∈L2([0, T];Rm) such that
˙ γ(t) =
m
X
i=1
ui(t)Xi(γ(t)) =
m
X
i=1
vi(t)Xi(γ(t)) a.e. t∈[0, T].
Since the tangent vectors X1(γ(t)), . . . , Xm(γ(t)) are always linearly inde- pendent inTγ(t)M, we infer thatu(t) =v(t) for almost everyt∈[0, T], which proves that our map is injective. Furthermore, givenγ∈Ωx,T∆ , for almost every t∈[0, T], the pathγis differentiable attand there is a uniqueu(t)∈Rmsuch that ˙γ(t) =Pm
i=1ui(t)Xi(γ(t)). By construction, the functionu: [0, T]→Rm belongs toL2([0, T];Rm).
As seen before, a general distribution may have no global frame, but it can be represented byk=m(n+ 1) vector fields (see Proposition 1.1.8).
Proposition 1.2.2. Let F =
X1,· · · , Xk be a generating family for∆ on M. Then, for everyx∈M and every T >0, there is an open subset UFx,T of L2 [0, T];Rk
such that the mapping
u∈UFx,T 7−→γu∈Ωx,T∆ ,
(whereγu: [0, T]→M is the unique solution to the Cauchy problem
˙ γu(t) =
k
X
i=1
ui(t)Xi(γu(t)) a.e. t∈[0, T], γu(0) =x,) (1.6) is onto.
Proof. Letγ ∈Ωx,T∆ be fixed. For everyt ∈[0, T], there is an open setOtof γ(t) inM andmintegersit1, . . . , itm∈ {1, . . . , k}such that
Span n
Xit1(x), . . . , Xitm(x)o
= ∆(x) ∀x∈ Ot.
The curveγ([0, T]) is compact and is contained in∪t∈[0,T]Ot. Hence, there are N timest1, . . . , tN ∈[0, T] together with a partition of unity{ψj} such that
[0, T]⊂
N
[
j=1
Otj, Supp (ψj)⊂ Otj,
N
X
j=1
ψj = 1.
For everyj, there is a smooth mappingUj:T M →Rm such that
v=
m
X
l=1
Uj(v)Xitjl (x),
for every (x, v)∈T M withx∈ Otj andv∈∆(x). Then, there holds for almost every t∈[0, T] and any j∈ {1, . . . , N},
γ(t)∈ Otj =⇒ γ(t) =˙
m
X
l=1
Uj( ˙γ(t))Xitjl γ(t) . By the properties satisfied by{ψj}, we infer that
˙ γ(t) =
N
X
j=1
ψj γ(t)
"m X
l=1
Uj( ˙γ(t))Xitjl γ(t)
#
=
N
X
j=1 m
X
l=1
ψj γ(t)
Uj( ˙γ(t))
Xitjl γ(t) , for almost every t ∈ [0, T]. Each mapping t 7→ ψj γ(t)
Uj( ˙γ(t)) belongs to L2 [0, T];R. We infer easily the existence of u ∈ L2 [0, T];Rk such that γ=γu.
Remark 1.2.3. If M is compact, then solutions to (1.5) (resp. (1.6)) are defined for anyu∈L2([0, T];Rm) (resp. u∈L2([0, T];Rk)).
Given a family of smooth vector fields F =
X1,· · ·, Xk on M and x ∈ M, T > 0, a function u ∈ UFx,T ⊂L2 [0, T];Rk is called a control and the corresponding solution of (1.6) is called the trajectory starting at x and associated with the control u. Since any horizontal path can be viewed as a trajectory associated to a control system like (1.6), we restrict in the next paragraph our attention to End-Point mappings associated with finite families of smooth vector fields.
End-Point mappings LetF =
X1, . . . , Xk be a family ofk≥1 smooth vector fields onM. As be- fore, givenxandT >0, there is a maximal open subsetUFx,T ⊂L2 [0, T];Rk such that for everyu∈UFx,T, there is a unique solution to the Cauchy problem
˙ γu(t) =
k
X
i=1
ui(t)Xi(γu(t)) a.e. t∈[0, T], γu(0) =x. (1.7)
1.2. HORIZONTAL PATHS AND END-POINT MAPPINGS 15 The End-Point mapping associated to F at x in time T > 0 is defined as follows,
EFx,T : UFx,T −→ M u 7−→ γu(T).
Givenu∈UFx,T, we denote by XFu the time-dependent vector field defined by XFu(t, x) :=
m
X
i=1
ui(t)Xi(x) a.e. t∈[0, T],∀x∈M.
Its flow ΦuF(t, x) is well-defined and smooth on a neighbourhood ofx; we denote by DxΦuF(t, x) its differential at (t, x) with respect to the x variable. The following result holds. (We refer the reader to Appendix A for reminders in differential equations and to Appendix B for reminders in differential calculus in infinite dimension.)
Proposition 1.2.4. The End-Point mappingEFx,T is of classC1 onUFx,T and for every controlu∈UFx,T, its differentiable atu,
DuEFx,T : L2([0, T];Rk)−→TEx,T
F (u)M
is given by
DuEFx,T(v) =DxΦuF(T, x)· Z T
0
DxΦuF(t, x)−1
·XFv t, EFx,t(u)
dt (1.8) for everyv∈L2([0, T];Rk). Moreover, the mapping
u∈UFx,T 7−→ DuEFx,T (1.9)
is locally Lipschitz.
Proof. Any smooth manifold can be smoothly embedded in an Euclidean space.
Then without loss of generality we can assume thatM is a smooth submanifold of some RN and consequently that the Xi’s are the restrictions of smooth vector fields ˜X1, . . . ,X˜k which are defined in an open neighborhood ofM in RN. Givenu∈UFx,T andv∈L2([0, T];Rk) let us look at
lim→0
1
EFx,T u+v
−EFx,T u . Using the previous notations, we have
γu+v(T) = Z T 0
k
X
i=1
(ui(t) +vi(t))Xi γu+v(t) dt
= Z T 0
k
X
i=1
(ui(t) +vi(t)) ˜Xi γu+v(t)
dt, (1.10) with γu+v(0) = x. For every i = 1, . . . , k and every t ∈ [0, T], the Taylor expansion of each ˜Xi at γu(t) gives
X˜i γu+v(t)
= ˜Xi γu(t)
+Dγu(t)X˜i· γu+v(t)−γu(t)
+|γu+v(t)−γu(t)|o(1). (1.11)
Setting δx(t) :=γu+v(t)−γu(t) for anyt, we may assume thatδx has size, then (1.10) yields formally
δx(T) = Z T 0
k
X
i=1
ui(t)Dγu(t)X˜i · δx(t)dt +
m
X
i=1
vi(t) ˜Xi γu(t) + o().
This suggests that the function t ∈ [0, T] 7→ δx(t) should be solution to the Cauchy problem
δ˙x(t) =
" k X
i=1
ui(t)Dγu(t)X˜i
# δx(t) +
" k X
i=1
vi(t) ˜Xi γu(t)
#
a.e. t∈[0, T], (1.12) with δx(0) = 0. By (1.10)-(1.12) together with Gronwall’s Lemma (see Ap- pendix A) we check easily that for everyv∈L2 [0, T];Rk, the quantity
1
EFx,T u+v
−EFx,T u
−δx(T)
tends to zero as tends to zero. For almost every t∈[0, T], denote by Au(t) the matrix in MN(R) representing the linear operator Pk
i=1ui(t)Dγu(t)X˜i in the canonical basis ofRN and for everyt∈[0, T], denote byBu(t) the matrix in MN,k(R) whose the columns are the ˜Xi(γu(t))’s. Denote bySu : [0, T] → MN(R) the solution to the Cauchy problem
S˙u(t) =Au(t)Su(t) a.e. t∈[0, T], Su(0) =In. Note thatSu(t) is exactly the Jacobian of the flow Φu˜
F(with ˜F ={X˜1, . . . ,X˜k}) at (t, γu(t)) with respect to thexvariable. The solution of (1.12) at timeT is given by (see Appendix A)
δx(T) =DuEFx,T(v) =Su(T)Z T 0
Su(t)−1Bu(t)v(t)dt.
Thus we check that (1.8) is satisfied. Let us now prove the local Lipschitzness ofu7→DuEFx,T and indeed give more details on the estimates that were needed in the above proof. Let ¯ua control be fixed inUFx,T ⊂L2([0, T];Rk). The curve γu¯([0, T])⊂M ⊂RN is compact. Let >0 be fixed, the setV ⊂RN defined by
V :=n
γu¯(t) +z|t∈[0, T], z∈B(0, )o
is relatively compact. Then there isK >0 such that all the ˜Xi’s are bounded byK onV and all the ˜Xi’s areK-Lipschitz onV. Set
δ:=
KT eKTku¯kL2
and pick a control u∈ L2([0, T];Rk) with ku−u¯kL2 < δ. We claim that u belongs to UFx,T and that the trajectory γu : [0, T] → M ⊂ RN (which is
1.2. HORIZONTAL PATHS AND END-POINT MAPPINGS 17 associated withu) is contained inV. Argue by contradiction and assume that there is ¯t∈[0, T] such thatγu(t) is on the boundary ofV. Taking ¯t >0 smaller if necessary, we may assume thatγu(t) belongs toV for anyt∈[0,¯t). Set
f(t) :=|γu(t)−γu¯(t)| ∀t∈[0,¯t].
Then we have for everyt∈[0,¯t), f(t) =
Z t 0
k
X
i=1
ui(s) ˜Xi γu(s)
−
k
X
i=1
¯
ui(s) ˜Xi γu¯(s) ds
≤ Z t
0
k
X
i=1
(ui(s)−u¯i(s)) ˜Xi γu(s)
ds +Z t
0
k
X
i=1
u¯i(s)
X˜i γu(s)
−X˜i γu¯(s)
ds
≤ Kt u−u¯
L2+Z t 0
k
X
i=1
¯ ui(s)
Kf(s)ds.
By Gronwall’s Lemma (see Appendix A) and definition ofδ, we infer that f ¯t
≤KT u−u¯
L2eKTku¯kL2 < .
Thus we get a contradiction and the claimed is proved. Letu, u0 ∈L2([0, T];Rk) withku−u¯kL2,ku0−u¯kL2 < δ, by repeating the same argument we get
|γu0(t)−γu(t)| ≤Kt u0−u
L2eKT(ku¯kL2+δ) ∀t∈[0, T].
(This shows that End-Point mappings are locally Lipschitz.) Denote bySu, Su0 : [0, T]→MN(R) the solutions to the Cauchy problems
S˙u(t) =Au(t)Su(t) a.e. t∈[0, T], Su(0) =In, S˙u0(t) =Au0(t)Su0(t) a.e. t∈[0, T], Su0(0) =In, whereAu, Au0 are defined by
Au(t) :=
k
X
i=1
ui(t)JX˜i γu(t)
, Au(t) :=
k
X
i=1
u0i(t)JX˜i γu0(t) ,
for almost everyt∈[0, T] (JX˜i(γu(t)) (resp. JX˜i(γu0(t)) denotes the Jacobian matrix of ˜Xi at γu(t) (resp. at γu0(t))). Taking K > 0 larger if necessary, we may assume that it is an upper bound for theJX˜i’s onV and a Lipschitz constant for theJX˜i’s onV. Then we have for everyt∈[0, T],
kSu(t)k =
In+Z t 0
Au(s)Su(s)ds
≤ 1 +Z t 0
k
X
i=1
|ui(s)|
JX˜i γu(t)
kSu(s)k ds
≤ 1 +Z t 0
K
k
X
i=1
|ui(s)| kSu(s)kds.