Regularization Effects for Certain Dynamical Systems through Gaussian Noises

(1)

Oussama Amine

Regularization Effects for Certain Dynamical Systems through

Gaussian Noises

Thesis submitted for the degree of Philosophiae Doctor

Department of Mathematics

Faculty of Mathematics and Natural Sciences

2020

(2)

Series of dissertations submitted to the

Faculty of Mathematics and Natural Sciences, University of Oslo No. 2279

ISSN 1501-7710

reproduced or transmitted, in any form or by any means, without permission.

Cover: Hanne Baadsgaard Utigard.

Print production: Reprosentralen, University of Oslo.

(3)

(4)

(5)

Preface

This thesis is submitted in partial fulﬁllment of the requirements for the degree of Philosophiae Doctor at the University of Oslo. The research presented here was conducted at the University of Oslo and at Queen’s University, under the supervision of Professor Frank Proske and associate Professor Abdol-Reza Mansouri. This work was supported by the Faculty of Mathematics and Natural Sciences at the University of Oslo, partially, through the Stochastics of Renewable Energy Markets (STORE) project.

The thesis is a collection of four papers, that span 3 years of work with the aim of studying, using probabilistic as well as analytic tools, the phenomenon of regularization by noise. The common theme thus is that of the effect of noise on dynamical systems generated through ordinary differential equations and more specifically the role that certain perturbations play in improving the behavior of these systems even when the original non-perturbed version is known to be ill-posed. The papers are preceded by an introductory chapter on the background and motivation for our current work as a whole, as well as for each of the papers separately. It also provides a general idea on where the work is situated in the field of regularization by noise. The four papers have resulted from joint work with Professor Proske, while the last one was crystallized, shaped and improved through discussions with Professor Mansouri. The first paper is also joint work with Msc Emmanuel Coffie and Doctor Fabian Harang. The second and third papers are also joint work with Doctor David Banos.

(6)

Acknowledgements

From UiO I would like to thank: Professor Arne B. Huseby for his help during my first year as a PhD student, Professor Tom Lindstrøm who has been a source of inspiration throughout my years at UiO, Professor Snorre H. Christiansen for all the nice discussions over the many cups of coffee, Professor Sergey Neshveyev for his wonderful lectures, associate Professor Ulrik S. Fjordholm for the very informative seminar on the transport and continuity equations and his help with understanding some of the concepts related to them, associate Professor Salvador Ortiz-Latorre, Doctor David R. Banos, Professor Fred E. Benth, Professor Giulia Di Nunno, Martin Helsø for his help with LÂTEX , Professor Anders R. Swensen, Professor Sven O. Samuelsen, Professor Nils L. Hjort and the late Professor Hans P. Langtangen who is deeply missed.

From Queen’s university I would like to thank everybody I met there during my visit, especially associate Professor Bahman Gharesifard, Professor Andrew Lewis, Professor Ram Murty, Professor James A. Mingo, Professor Fady Alajaji, associate Professor Francesco Cellarosi, associate Professor Thomas Barthelmé, Professor Oleg I. Bogoyavlenskij, associate Professor Ivan Dimitrov, Professor Gregory G. Smith, Professor Mike Roth, associate Professor Serdar Yüksel, Professor Atabey Kaygun, Emine Yildrim and Jennifer Read.

From Ukraine I thank my friend Yuriy Prykhodko for his hospitality and generosity. Also Doctor Georgii V. Riabov, Professor Andrey A. Dorogovtsev, Doctor Olga Izyumtseva, Professor Andrey Pilipenko, Professor Oleg Klesov and Katerina for making my research stay in Kiev memorable.

I would also like to thank several other people I met during my PhD years.

Especially Professor Yaozhong Hu, Professor Youssef Ouknine and Professor Shiqi Song for helping me improve my understanding of some aspects of stochastic analysis. Thanks to Professor Nils Berglund for the nice discussions we had in Sarajevo. I thank also Professor Massimiliano Gubinelli for the long discussions on mathematics and other topics as well as for his generosity in answering my questions in L’Aquila. From Germany also, I thank Martin Bauer for all the interesting discussions as well as Doctor Lukas Wresch for his help with understanding the papers of Davie and Shaposhnikov as well as for making several suggestions for improving the introduction to this thesis. I also thank my co-author Emmanuel Coﬃe and wish him good luck in his PhD studies.

I thank as well Nacira Agram for all the nice discussions we had as well as for her support during my years of study.

My supervisors: I thank Professor Bernt Øksendal who was the reason for my interest in the ﬁeld of stochastic analysis after I was fortunate to take a course taught by him. I also thank him for all the discussions we had during the past 3 years.

To Professor Abdol-Reza Mansouri, I can’t express how grateful I am for the opportunity that I had to work with you during my stay at Queen’s. Not only did I change as a mathematician after my stay there, but I have changed and grown as a human being. Thank you for accepting to supervise me as well as for giving me the opportunity to work with you.

(7)

Last, but not least, I thank my main supervisor Professor Frank Norbert Proske who made all of this possible the moment he accepted to take me as his student, and even before that. It is no exaggeration when I say that every word in this work is either directly or indirectly due to you. You have been an inspiration to me not only in relation to mathematics but, most importantly on the human level, your generosity and selﬂessness are truly rare. I want you to know that I will always be grateful to you dear Frank.

My friends: I thank Yassine for making life in Oslo more enjoyable and all the nice moments we spend together discussing about everything.

I thank Lara for the great time we spent learning new languages.

I also would like to thank Ijlal for being available when I needed someone to talk to as well as for sparking my interest in the ﬁeld of security and cryptography.

I am truly happy to have met you!

Teachers: I wouldn’t be here writing these lines without the help, sacriﬁce and generosity of all the teachers that I had throughout my years of study. I take the opportunity to mention two who inﬂuenced me deeply in relation to mathematics. Professor Chebbaki and Professor Farouj, thank you very much for all you have taught me.

Family: A big hug to all of my family, my father and mother as well as my two beautiful brothers. I am grateful to have you in my life.

Oussama Amine Oslo, June 2020

(8)

(9)

List of Papers

Paper I

Amine, Oussama; Coﬃe, Emmanuel; Harang, Fabian Andsem and Proske, Frank Norbert “A Bismut-Elworthy-Li Formula for Singular SDE’s Driven by a Fractional Brownian Motion and Applications to Rough Volatility Modeling”.

To appear in Communications in Mathematical Sciences.

Paper II

Amine, Oussama; Baños, David and Proske, Frank Norbert “Regularity Properties of the Stochastic Flow of a Skew Fractional Brownian Motion”.

To appear in Inﬁnite Dimensional Analysis Quantum Probability and Related Topics.

Paper III

Amine, Oussama; Baños, David and Proske, Frank Norbert “C^∞ regularization by Noise of Singular ODE’s”. Submitted for publication.

Paper IV

Amine, Oussama; Mansouri, Abdol-Reza and Proske, Frank Norbert “Well- posedness of the Deterministic Transport Equation with Singular Velocity Field Perturbed along Fractional Brownian Paths”. Submitted for publication.

(10)

(11)

Chapter 1 Introduction

In the present work, we show how certain types of universal random perturbations aﬀect certain basic dynamical systems that are a priori ill-posed in their original form. More precisely¹ our contribution can be described through the following example. Consider the following ordinary diﬀerential equation

_dX(t,x)

dt =b(t, X(t, x)) X(0, x) =x

or in integral form

X(t, x) =x+ _t

0 b(s, X(s, x))ds when x∈R^d and t∈[0, T].

Let us take the Lagrangian point of view and "follow" a hypothetical particle starting from a spatial position x at time 0. If we think of b(.) as a function outputting an instantaneous velocity as a function of its argument, then when asking a question about the existence of a solution to the above equation we are in fact asking questions about the existence of a path traced by a particle starting at time 0 from x and changing its velocity as function of its position through b. More precisely what we are asking for is a map X :R^d → C

[0, T];R^d that sends each initial position to a trajectory X(., x). The Cauchy-Lipschitz theory tells us that this map exists, locally in time, if bis locally Lipschitz in the spatial argument uniformly in time. Moreover the solution map, called the flow, X is locally Lipschitz in x. The appearance of Lipschitz regularity with respect to bothb and X is not arbitrary and one can show that in general the flow inherits its spatial regularity from b. This Lipschitz regularity, of b at least, is crucial for the uniqueness of the solutions to the equation above. The fact that uniqueness can also follow from the Lipschitz regularity of a candidate flow map² is not obvious at first sight but can be shown as was done in the work of Van Kampen [20].

This is in fact optimal and examples can be constructed to show this. In fact, consider the following initial value problem

_dX(t)

dt = X(t)²³ X(0) = 0

does not admit a solution map sincet→ 0 and t→ t³/27 are solutions to the above equation starting from 0. In fact there exists a whole family of solutions

1For a more concrete enumeration of our contribution see Section 1.4.

2See 1.2.3 for a deﬁnition.

(14)

indexed by c namely

X^c(t) =

0 , if t≤c (t−c)³/27 , if t≥c for any c∈ [0,∞].

The intuition for this phenomenon is that the non-Lipschitzianity of the vector ﬁeld allows particles that can stay at rest for an arbitrary length of time without moving . This can also be explained by saying that the push-forward of a certain initial mass of particles becomes inﬁnite or zero at a latter point in time. The reason for this is, of course, the fact that

t→ t²³

is not locally Lipschitz in any open set containing the point 0. On the other hand, on any open set not containing 0 the behavior described above cannot take place and therefore developing a new, and necessarily relaxed, notion of solution to the diﬀerential equation above that can give a canonical way to select a speciﬁc solution among the possibly many at problematic points like 0 above while leaving the unproblematic ones untouched might be possible.

Thus, if we are willing to relax the notion of a solution to the above equation then there might be hope, and this is in fact the seminal work of DiPerna-Lions [7] on Lagrangian ﬂows. Roughly speaking, the ﬂow map that we seek need not satisfy the equation at every point but only almost everywhere with respect to some measure. The result holds for b such that:

b∈L¹

(0, T) ;W_loc^1,p

and div(b) ∈L¹((0, T) ;L^∞)

for some p ≥1. The previous result was extended in, again a seminal work of, Ambrosio [2].

In contrast to the Lipschitz case, the transfer of regularity is not anymore obvious and the work [10] shows that no transfer of Sobolev diﬀerentiability is to be expected in general.

The intuitive picture that one can draw to summarize the previous discussion is that unless the vector ﬁeld enjoys some sort of spatial diﬀerentiability, the dynamical system given by 1 can exhibit a whole spectrum of “pathologies” e.g.

coalescence, particles splitting or being created from nothing.

Let us try a different approach by adding an additional velocity field that is of a very generic type, take the simplest namely a constant velocity field, then³ for a fixed initial position we have

X(t) =x₀ + _t

0 b(X(s))ds+ _t

0 λds X(t) =x₀ +

_t

0 b(X(s))ds+λt

3This example is taken from [11].

(15)

This in turns implies, supposing in what follows that b is continuous. that Y(t) =x₀ +

_t

0 b(Y(s) +λs)ds, where we have used the very useful transformation

Y(t) =X(t)−λt Deﬁne the function B by

B(s) := d

dtB(s) := b(s) b(s) +λ,

then

Y(t) =x₀+ _t

0 b(Y(s) +λs)ds

=x₀+ _t

0 (λ+b(Y(s) +λs))·(B(Y(s) +λs))ds

=x₀+ _t

0

d

dtB(Y(s) +λs)ds

=x₀−B(x₀) +B(Y(t) +λt).

(1.1)

Now, suppose that there exist two solutions, say X₁ and X₂, then

|Y₁(t)−Y₂(t)|= |B(Y₁(t) +λt)−B(Y₂(t) +λt)|

If b is in addition bounded, then for some very large instantaneous velocity λ we have that B(s)< L << 1. Hence B is Lipschitz with Lipschitz constant strictly less than 1 and

|Y1(t)−Y₂(t)| < L|Y1(t)−Y₂(t)|

This heuristic argument shows that for, a bounded continuous vector ﬁeld, b to give rise to a well-posed ODE, an instantaneous perturbation of the form dλ(t) :=λdt is suﬃcient.

At least two problems can be observed with the previous approach:

1. If b is truly singular e.g. discontinuous, then a constant instantaneous speed does not help with the singularities that b might have and in turn transfer to the dynamical system i.e. the ﬂow. Hence something that has, for a lack of a better description, inﬁnite speed would be ideal for the task at hand. This should happen for both signs as well to avoid a totally nonsensical situation.

2. Although not mentioned explicitly, the argument uses in an essential way the one-dimensional character of the equation to deﬁne B. Hence

(16)

another operator has to play the role that B played above. Moreover, and with the previous point in mind, truly singular vector ﬁelds can exhibit complex types of singularities in the multidimensional setting and thus the perturbation that we should consider ought to have some property of instantaneous change of direction in addition to that of speed.

The story so far leads us to the “best” candidate⁴, namely Brownian motion λ(t) := B(t). A slight inconvenience, at ﬁrst sight at least, is that we cannot write dB(t) = ˙B(t)dt but if we work with dB(t) directly, the equation in integral form can be written as

X(t) =x₀ + _t

0 b(X(s))ds+B(t).

We cannot of course write the operator B as before but if we take one step back from its deﬁnition above we can see that a similar conclusion can be attained with following operator instead

A: [0, T]×R^d → R^d (t, x)→ At(x) :=

_t

0 b(x+B(s))ds

In order to conclude we have to show that, for any two solutions X₁ and X₂, we have

The task is now related to the operator A and its properties e.g. regularity, and how it interacts with solutions to the equation at hand.⁵

In fact defining new functionals through the perturbation (t→ B(t)) that enjoy better regularity than that enjoyed by b in the original problem is a recurring theme. We make a small digression to see how different variations of the above idea, namely the use of the regularizing effect resulting from the addition of (t→B(t)), results in well-posedness through auxiliary operators.

4When restricting the discussion to continuous perturbations. If one is willing to sacriﬁce continuity then there exists results in the setting when the perturbation is given by the paths of a Levy process (see for example [14]).

5In fact we show in Paper III that the operator Aas well as higher order operators that generalize it "encode" the regularizing eﬀect thatB, or in the case of Paper IV a generalization ofB, generate as a function of the strength of their erraticity as measured by the property oftwo sided strong local non-determinism. This brings to mind the result of [19] where the regularization eﬀect of the operatorx→

f(x−γ(s))a(s)ds, where γ is a smooth curve and a is a smooth cutoff function in a neighbourhood of 0, were linked to geometric properties of the curve namely the degree of curvature, in a specific sense, that γ possess’. A geometric approach, as in [19], to the study of the regularization effect of rough perturbations, through the operatorA, on singular vector fieldsb, seems to be, though very interesting, beyond reach at the moment.

(17)

The classical approach: Zvonkin-Veretennikov and the PDE approach way

1.1 The classical approach: Zvonkin-Veretennikov and the PDE approach way

Note that there is a subtle issue that we have not addressed yet, namely the fact that B is a stochastic process, (t, ω) → B(t, ω) a function of time and randomness, and thus must be deﬁned on some space that can accommodate the two terms “stochastic” and “process”, namely a ﬁltered probability space.

Moreover, common wisdom tells us that in order to say anything useful about its properties, the law of B must enter the picture one way or another. This raises the question of whether or not we have the same notion of solution, as the one we had prior to the addition of B. As a consequence the meaning of

"well-posedness" risks being changed in the presence of all these extra ingredients that come with the introduction of B.

The traditional approach, and probably most natural one given the historical context, is to think of the solution of the B-perturbed equation as a stochastic differential equation i.e. the solution, if it exists of course, is again itself a stochastic process that is compatible with the filtered probability space that came with B. This is called in the literature a strong notion of solution. The other notion being that of weak solution and here it is allowed that the solution be defined on a new filtered probability space with a new Brownian motion, say B defined on this new space. Thus, if we think of the perturbation B as on of the inputs to the problem and the solution X as its output, the notion of causality B →X is lost in the weak formulation.

Similarly, a weak and strong notions of uniqueness exist and we can say that we have weak uniqueness if for any two (weak) solutions (X₁, B₁) and (X₂, B₂) the law of X₁ and X₂ are the same, while we have strong uniqueness if for any two solutions (X₁, B) and (X₂, B) we have X₁ = X₂ almost surely.

These notions are related through the Yamada-Watanabe theorem

Theorem 1.1.1. Weak existence + path-wise uniqueness =⇒ Strong uniqueness.

Now with the lingo in place we can state the ﬁrst result in the area of regularization by additive perturbation. Note that this is just part of the result of Zvonkin-Veretennikov as the original results and proofs yield existence and uniqueness.

Theorem 1.1.2([22], [21]). If b is bounded measurable then the equation X(t) =x₀+

_t

0 b(X(s))ds+B(t)

interpreted as a stochastic diﬀerential equation, has a unique strong solution.

Note on the proof strategy: Since we have changed the character of the original equation, we might as well use the tools that come with this new notion of solution. In our context it is Itô’s lemma for stochastic processes which says

(18)

that for any function u(t, x) in C^1,2 we have u(t, X(t)) =u(0, x₀)+

_t

0 ∂_tu(s, X(s))ds+

_t

0 ∂_xu(s, X(s))dX_s+1 2

_t

0 ∂_xxu(s, X(s))ds i.e.

u(t, X(t)) =u(0, x₀) + _t

0 ∂_tu(s, X(s))ds+ _t

0 ∂_xu(s, X(s))b(X(s))ds+

_t

0 ∂_xu(s, X(s))dB(s) + 1 2

_t

0 ∂_xxu(s, X(s))ds where, for any functiong with some appropriate conditions, the term

_t

0 g(s)dB(s)

is the Itô integral, deﬁned as the L²-limit of ﬁnite linear combinations with respect to the increments of the Brownian motions. Note that this is just the integral form of the classical chain rule with the exception of the additional term

1 2

_t

0 ∂_xxu(s, X(s))ds,

which can be interpreted as a correction due to the “inﬁnite” speed that B has.

Another useful tool that we get in this framework is the Itô isometry E |

_T

0 g(s)dB(s)|²

= E ^T

0 |g(s)|²ds

The last ingredient is a result from the theory of parabolic PDE’s:

Theorem 1.1.3. [22, Theorem 2] Consider the following Cauchy problem

∂

∂tu(t, x) =b(x)u_x(t, x) + 1

2u_xx(t, x) u(T, x) =x

(1.2) Then there exists T > 0 such that the above problem admits a unique (weak) solution and u∈ W_p^1,2([0, T]×U) for any bounded domain U and for all p >1.

Moreover there exists a constant ρ >0 such that

|u(t, x)−u(t, y)| ≥ρ|x−y|.

With the necessary ingredients in place, we can now describe the idea behind the Zvonkin-Veretennikov approach. The strategy, based on the Yamada- Watanabe theorem, yields the existence of a unique strong solution from the existence of a weak solution and the property of path-wise uniqueness. The

(19)

The classical approach: Zvonkin-Veretennikov and the PDE approach way solution u to (1.1.3) is going to play the role of the regularizing transformation here and its regularity, related to the Brownian perturbation, is the key ingredient for the proof of well-posedness. We sketch a proof for this point namely that of path-wise uniqueness and in doing so we suppose that b, in addition to being bounded, is Hölder continuous with a certain Hölder exponent θ. This simpliﬁes some parts of the proof since it implies that the solution is twice continuously diﬀerentiable in x with a second derivative that is Hölder continuous (see [8]).

(Sketch) proof of path-wise uniqueness: Indeed, let ube the solution to the previous Cauchy problem, then by applying Itô’s formula we get

u(t, X(t)) =u(0, x₀) + _t

0 u_x(s, X(s))dB(t)

for any solutionX to the SDE. Taking two arbitrary solutions and applying the Itô isometry we get that

E

(u(t, X₁(t))−u(t, X₂(t)))²

=E _t

0 (u_x(s, X₁(s))−u_x(s, X₂(s)))²ds

On the other hand, by Hadamard formula we have that u_x(s, X₁(s))−u_x(s, X₂(s))=

₁

0 u_xx(s, ξX₁(s) + (1−ξ)X₂(s))dξ

(X₁(s)−X₂(s))

≤C(X₁(s)−X₂(s))

by Theorem 1.1.3 and the Hölder continuity assumption on b.

Thus we get E

(u(t, X₁(t))−u(t, X₂(t)))²

≤CE _t

0 (X₁(s)−X₂(s))²ds

On the other hand, the second part of Theorem 1.1.3 gives us that

|u(t, x)−u(t, y)| ≥ρ|x−y| for some ρ >0.

Hence E

(X₁(t)−X₂(t))²

≤ 1 ρ²E

(u(t, X₁(t))−u(t, X₂(t)))²

≤ C ρ²

_t

0 E

(X₁(s)−X₂(s))² ds

Gronwall’s lemma implies that E

(X₁(t)−X₂(t))²

= 0

and hence X₁(t) =X₂(t) a.s. for allt∈[0, T], and by continuity we get that the

two processes X₁ and X₂ are equal a.s.

(20)

A common characteristic of the diﬀerent approaches that are based on the Zvonkin-Veretennikov approach is the change of focus from the term involving the singular term b to something related to the perturbed equation but that is of better regularity e.g. the solution u of the above Cauchy problem. Thus the problem is reduced to one of studying the properties of solutions to certain types of PDE’s as in Theorem 1.1.3. This strategy is very general and can be used to not only show the well-posedness of the SDE but also to show the regularity of the solution with respect to the initial condition. The PDE approach, however, comes at a cost and that is the dependence of the whole method on the Markovianity of the perturbation.

1.2 The path-by-path approach: Davie’s way

The blessing as well as the curse of the previous approach is in the modiﬁcation that happened to the notion of solution of our original ODE. Suppose we really insist on solving our perturbed equation in the space of continuous functions C

[0, T];R^d

without any mention of “stochastic process”. Then we can formulate the following question:

Question: Let b be a bounded measurable drift and choose a realization of B i.e. ﬁx an ω and consider the continuous function (t→ B(t) :=B(t, ω)) for that ω, does the following ODE

X(t) =x₀+ _t

0 b(X(s))ds+B(t)

have a unique solution? This question was posed by N. V. Krylov and was communicated⁶, through I. Gyongy, to A. M. Davie who answered it positively in his seminal work [5]. In this work he showed that

Theorem 1.2.1. For any bounded Borel measurable b : [0, T]×R^d → R^d and x₀ ∈R^d for almost all Brownian paths, the equation

X(t) =x₀ + _t

0 b(s, X(s))ds+B(t) has exactly one solution in the space of continuous paths.

The previous result has been further improved in the crucial work of [17].

Namely, the author shows

Theorem 1.2.2. For any bounded Borel measurable b: [0, T]×R^d →R^d and for almost all Brownian paths, the following problem

X(t, x) =x+_t

0 b(s, X(s, x))ds+B(t) X(0, x) =x

has a unique solution ((t, x)→ X(t, x)).

6The ﬁrst written statement we found of this problem is in [15] page 29 by N. V. Krylov himself.

(21)

The path-by-path approach: Davie’s way The proof of this beautiful result relies on an, almost forgotten, result of Egbert van Kampen [20]. We reproduce the proof of the original result of Van Kampen using the exposition in [9] then we sketch a slight extension immediate from [17] in order to show the principle at play.

Theorem 1.2.3(Van Kampen 1936). Consider the following ODE _dX(t)

dt =b(t, X(t))

X(t₀) =x₀, (1.3)

where b is a continuous function on [0, T]×R^d on R:= [t₀, t₀+a]×[x₀− b, x₀ +b]. Suppose further that there exists a function η : [t₀, t₀+a]×[t₀, t₀+ a]×[x₀−β, x₀+β] with β < b such that

1.2.3.1. For every ﬁxed (t₁, x₁), X(t) :=η(t, t₁, x₁) is a solution of _dX(t)

dt = b(t, X(t)) X(t₁) =x₁

1.2.3.2. η is uniformly Lipschitz in x₁.

1.2.3.3. η(t₃, t₁, x_t₁) = η(t₃, t₂, η(t₂, t₁, x_t₁)) whenever the expressions are deﬁned.

Then η(t, t₀, x₀) deﬁnes the unique solution of the 1.3 on R.

Proof. (Sketch)[9] Pick a solution X(t) of 1.3, we shall show that on a small time intervalX(t) =η(t, t₀, x₀).

Using the continuity of b on R, we can ﬁnd a small γ such that for all t₀ ≤s, t₁, t≤t₀ +γ we have, by Lipschitz regularity of η,

|η(t, t₁, X(t₁))−η(t, s, X(s))| ≤L|X(t₁)−η(t₁, s, X(s))| for some positive constant L.

Now ﬁx t∈ [t₀, t₀+γ] and deﬁne, for s∈[t₀, t₀ +γ]

σ(s) :=η(t, t₀, x₀)−η(t, s, X(s)) Then to conclude, we have to show that

σ(t) :=η(t, t₀, x₀)−X(t) = 0.

Indeed, we have

|σ(t₁)−σ(s)| =|η(t, t₁, X(t₁))−η(t, s, X(s))| ≤L|X(t₁)−η(t₁, s, X(s))|. On the other hand, since both η(t₁, s, X(s)) and X(t₁) are solutions, we have

η(t₁, s, X(s)) =X(s) + _t₁

s

b(u, η(u, s, X(s)))du

(22)

and

X(t₁) =X(s) + _t₁

s

b(u, X(u))du.

This yields , when t₁ → s, that

|X(t₁)−η(t₁, s, X(s))|= o(1)|t₁ −s|. Hence

|σ(t₁)−σ(s)| =Lo(1)|t₁−s|.

This means that (s→ σ(s)) is diﬀerentiable and, since σ(t₀) = 0, it is identically 0.

If the the "flow"⁷ η is not Lipschitz regular then by inspecting the above proof we can conjecture that a vector field with better regularity than just continuity might compensate for it. This is in fact the case and the following argument in [17] shows that if the vector field is β-Hölder regular and the flow* is α-Hölder regular such that α(β + 1) > 1 then we can recover a similar result to Van Kampen’s.

Letb now be uniformly β-Hölder and η satisﬁes the conditions in 1.2.3 except for the the Lipschitz regularity which we substitute now with a uniformα-Hölder regularity. Suppose further that α(β+ 1)> 1 then, using the same notation of the previous proof, we have

|σ(t₁)−σ(s)| =|η(t, t₁, y(t₁))−η(t, s, y(s))| ≤M|y(t₁)−η(t₁, s, y(s))|^α for some constant M. On the other hand, by the boundedness of

|y(t₁)−η(t₁, s, y(s))|= _t₁

s

b(u, y(u))du− _t₁

s

b(u, η(u, s, y(s)))du

≤ _t₁

s

|b(u, y(u))−b(u, η(u, s, y(s)))|du

≤ C|t1 −s|.

(1.4)

Using the same reasoning but with the Hölder continuity of b instead, we have

7We will abuse the use of the word ﬂow to denote a family of solutions enjoying the properties of 1.2.3. We will denote such occurrences by a ﬂow*.

(23)

The path-by-path approach: Davie’s way

|y(t₁)−η(t₁, s, y(s))|= _t₁

s

b(u, y(u))du− _t₁

s

b(u, η(u, s, y(s)))du

≤ _t₁

s

|b(u, y(u))−b(u, η(u, s, y(s)))|du

≤ _t₁

s

|y(u)−η(u, s, y(s))|^βdu

≤ C|t₁−s|^1+β.

(1.5)

Hence

|σ(t₁)−σ(s)|=Co(1)|t₁ −s|^1+δ, where δ := α(β + 1)−1> 0.

This means that (s→σ(s)) is diﬀerentiable and, since σ(t₀) = 0, it is identically 0.

The previous manipulations show that even if the vector field does not fall into the framework of the Cauchy-Lipschitz regularity, a regular enough flow* might compensate for it. This suggests an intuitive principle which says that uniqueness can happen anywhere on the interpolating spectrum between a Lipschitz vector field and, automatically, continuous flow on the one hand and a continuous vector field plus a Lipschitz flow* on the other, with anything in the middle as well, provided the two regularities add up in an appropriate manner. However, the previous discussion should be put together with the following two points:

1. A continuous vector ﬁeld is very far from the result of Davie and the arguments above, if they are to work, should be modiﬁed in order to accommodate this. A manipulation similar to the example at the beginning of the introduction might be necessary in order to move the discussion from b to some operator based on it.

2. If b is discontinuous, then it is diﬃcult to imagine that there might exist a ﬂow* with any kind of regularity if there is no "external" mechanism for regularity transfer.

The above two points are in fact the key to the method used in [17] in the context of

X(t) =x+ _t

0 b(s, X(s))ds+B(t) with b bounded and Borel measurable.

Now looking back at (1.4) and (1.5) we see, supposing η exists, that

(24)

|X(t₁)−η(t₁, s, X(s))|= _t₁

s

b

u, X(s) +B(u)−B(s) + _u

s

b(r, X(r))dr

du

− _t₁

s

b

u, X(s) +B(u)−B(s) + _u

s

b(r, η(r, s, X(s)))dr

du

= _t₁

s

b(u, B(u) +ζ₁(u))du− _t₁

s

b(u, B(u) +ζ₂(u))du, (1.6) where

ζ₁(u) :=X(s)−B(s) +_u

s b(r, X(r))dr ζ₂(u) :=X(s)−B(s) +_u

s η(r, s, X(s))dr are two Lipschitz functions.

The following two remarks allow us to conclude uniqueness:

1. The averaging operator is almost Lipschitz (Lemma 3.6 [17]).

2. The equation above, when interpreted as a SDE, admits a stochastic ﬂow that is α-regular for any α∈ (0,1) (Proposition 2.3 [18]).⁸

These are essentially the two ingredients which make up the proof in [17]

modulo some complications related to "almost Lipschitz" in point 1 above.

The original proof in [5] used in an essential way the regularizing eﬀect of the averaging operator A deﬁned above as

A: [0, T]×R^d →R^d (t, x) →At(x) :=

_t

0 b(s, x+B(s))ds

but not the existence of a stochastic flow solving the SDE interpretation of the perturbed equation. This made the proof much more difficult since the regularizing effect of the averaging operator by itself is not sufficient to yield the result, and one needs better estimates of this regularization effect along solutions of the equation. This is essentially the passage from Lemma 3.1 and Lemma 3.2 to Lemma 3.5 and its almost sure version Lemma 3.6 in [5].

It is worth mentioning that a purely deterministic path-by-path approach is possible and this is the work of [4] where only the the regularizing eﬀect of the averaging operator is used in combination with a non-linear Young integration theory with respect to increments of the averaging operator. This comes, however, at the cost of very stringent requirements at the level of the vector ﬁeldb, namely b is in the Besov space B_∞,∞^α which for α ∈ (0,1) is just the space of Hölder

8This holds only for b∈ L^q

[0, T], L^p

R^d

with ^d_p + ²_q <1 but the boundedness of b permits the use of such a result.

(25)

An alternative approach: On Compactness and Flows continuous functions. The reasons for such a requirement on b should be clear from the discussion above.

It is also worth mentioning here as well the remarkable work [12] which abstracts the idea of using the non-linear Young integral while at the same time takes into account the probabilistic side which did not enter in the approach of [4]. The many examples shown in [12] are witness to the strength of this point of view, however, it is not clear how this can be applied to the study of the finer properties of the stochastic flow and their relation to higher regularization effects one obtains when using more regularizing perturbations than justB.

1.3 An alternative approach: On Compactness and Flows

There seems to be a discrepancy between the purely deterministic approach and the one using probabilistic tools. We will try to give an explanation for this and in doing so we will introduce the method that the current thesis is based on.

If we look back at the proof of [5]’s result using [17]’s insight, we see that the regularizing effect of the averaging operator is only sufficient to conclude if it is combined with the regularity of the stochastic flow. Indeed if we look at the following expression

_t₁

s

b

u, X^x(s) +B(u)−B(s) + _u

s

b(r, X^x(r))dr

du with X^x a solution to our diﬀerential equation. This is trivially equal to

_t₁

s

b

t₂, X^x(s) +B(t₂)−B(s) + _t₂

s

b(r, X^x(r))dr

dt₂. Iterating once more

_t₁

s

b

t₂, X^x(s) +B(t₂)−B(s)+

_t₂

s

b

t₃, X^x(s) +B(t₃)−B(s) + _t₃

s

b(r, X^x(r))dr

dt_t₃

dt₂. (1.7) Thus we see that there is a compounding regularization eﬀect of Brownian perturbation through the averaging operator. Moreover there is a temporal structure reminiscent of the chronological calculus of A. Agrachev and R.

Gamkrelidze [1], and an approach that takes this temporal structure into account might result in a quantification of "regularization effects of higher order". The issue is that any attempt to study the effect of the perturbation B on the above expression as time progresses is hindered by the highly non-linear dependence on B. If only there was a way to linearize the above expression to untangle the effect ofB through all of the above nonlinearities.

(26)

Indeed there is one, and it is achieved by diﬀerentiating along the perturbations B using the tools of Malliavin calculus. Taking this point of view, it can be shown using, the equivalent of the chain rule for this new calculus, that when b is smooth and compactly supported, the following holds

D_θX^x(t) = _t

θ

b(u, X^x(u))D_θX^x(u)du+I,

where b is the spatial derivative of b and I is the identitiy matrix. Here X is (unique strong) solution, by the smoothness of b, of

X^x(t) =x+ _t

0 b(u, X^x(u))du+B(t),

where we have used the property that the Malliavin derivative of B(t) is the identity matrix I when θ < t. Iterating the above expression, we get the representation

D_θX^x(t) = ∞ k=0

θ<u1<···<uk<t

b(u₁, X^x(u₁))· · ·b(u_k, X^x(u_k))du₁· · ·du_k. A similar manipulation yields, for the derivative of the ﬂow which exists by standard arguments in the smooth and compactly supported b case, that

∂

∂xX^x(t) = ∞ k=0

0<u1<···<uk<t

b(u₁, X^x(u₁))· · ·b(u_k, X^x(u_k))du₁· · ·du_k. As both, D_θX(t) and _∂x^∂ X^x(t) are random variables, controlling their moments is a logical next step to seek, and indeed we get

E

| ∂

∂xX^x(t)|^p

≤C(r,b∞)×

∞ k=0

0<u1<···<uk<t

b(u₁, x+B(u₁))· · ·b(u_k, x+B(u_k))du₁· · ·du_k^p

L^ps(Ω,R^d×d)

(1.8) for r, s∈ [0,∞) such that sp= 2^q for some integer q and ¹_r + ¹_s = 1, where we used the Cameron-Martin-Girsanov theorem to change X_t^x to x+B(t) under the expectation. Decomposing the norm of the sum in terms of the norm of its one dimensional components, we see that the task of controlling the moments is reduced to establishing an estimate of the type

E

t0<u1<···<un<t

_n

i=1

D^αⁱb_i(u_i, x+B(u_i))

du₁· · ·du_n

≤ C(n, d,b_i)|t−t₀|^ξ

(27)

An alternative approach: On Compactness and Flows for some ξ ∈ (0,1), where b_i are smooth compactly supported real valued functions on [0, T]×R^d, α_i ∈ {0,1}^d is a multi-index such that |α_i|= 1, D^αⁱ is the derivative in the spatial argument with respect to the index α_i.

Indeed the following key estimate was established in [5] (see also [13] and [16] when b is in a larger class)

E

t0<u1<···<un<t

_n

i=1

D^αⁱb_i(u_i, x+B(u_i))

du₁· · ·du_n

≤ Cⁿ_n

i=1b_i∞(t−t₀)^n/2 Γ(ⁿ₂ + 1) , where Γ is the Gamma function.

Thus we recover the higher order equivalent of the averaging operator, only this time the higher order temporal regularization eﬀect is taken into account.

This is a key estimate that contains the true regularization power of theB paths and it is a key tool in Davie’s original work in order to establish the almost Lipschitz regularity of the averaging operator even when b is merely bounded.

Of course, the above reasoning is only valid when b is smooth but a strength of the above estimate is its dependence on b through only its b∞ norm. Thus, if we can show that the sequence Xⁿ, of solutions to the SDE with vector ﬁelds bⁿ, converges toX, and the same for the derivatives in x, then we might be able to close the loop.

Indeed, and using a compactness result developed in [6], one can show that the sequence Xⁿ is relatively compact in L²(Ω), and even converges in L²(Ω) strongly to X. This is achieved using the estimate above but applied in the control of the Malliavin derivative of the approximating solution i.e. D_tXⁿ⁹. Moreover, using the control on the moments of _∂x^∂ X_t^x,n we get that the limit above is Sobolev diﬀerentiable.

Since we have an almost Lipschitz ﬂow, we can use the methodology in [17]

to recover Davie’s original result i.e. uniqueness in the path-by-path sense.

Note that there is no mention of any auxiliary results from the theory of PDEs and all of the analysis is at the level of the ﬂow. This is in fact one of the strengths of this method as the above method can be applied in situations where the approach based on PDE theory is inapplicable.

A weakness, at ﬁrst sight at least, however, of the above approach is its reliance on the probabilistic properties of the perturbationB, namely through the above key estimate as well as the compactness criterion in L²(Ω). This should be contrasted with the purely deterministic approach mentioned in the previous section. This objection is a valid one, but only in theory, in practice, however, the following two points tell us something diﬀerent

1. Although the results in [4] are formulated using the notion of an irregular path which is a deterministic notion indeed, the ﬁnal result is stated with respect to fractional Brownian motion (fBm) which is a Gaussian process that generalizes Brownian motion to the case when the increments are not independent. This is due to the fact that, although the notion of an

9To be precise, we need to controlDt₂X_tⁿ−Dt₁X_tⁿ.

(28)

irregular path is simple to state, giving examples of such paths is diﬃcult and is only possible in fBm case through recourse to its law.

2. For vector ﬁelds b which are not distributions, the approach in [4]

necessitates the continuity of b. The discussion above shows, clearly we hope, that the truly problematic regime is the one of discontinuous b e.g. b∈L^q

[0, T], L^p R^d

with ^d_p + ²_q ≤ 1.

This begs the following question: Can we establish a similar result to [5] for truly singular vector ﬁelds, in the case when the perturbation is given by fBm B^H? The main contribution of this thesis is a positive answer to this question through the Van Kampen principle as was done in the Brownian motion case in [17]. We summarize our contributions in the next section.

1.4 Summary of Papers

Paper I The ﬁrst paper is the genesis for an improved estimate in relation to the derivative(s) of the ﬂow of the following SDE

X(t) =x+ _t

0 b(u, X(u))ds+B^H(t),0≤t≤ T, (1.9) where B^H(t),0 ≤ t≤ T is a d−dimensional fractional Brownian motion with Hurst parameter H ∈(0, ¹₂) and where the vector ﬁeld b issingular in the sense that

b∈L^1,∞_∞,∞ :=L¹(R^d;L^∞([0, T];R^d))∩L^∞(R^d;L^∞([0, T];R^d)).

Equation (1.9) was the main object of study in [3], the main result being Theorem 1.4.1. Consider the following SDE

X^s,x(t) =x+

_t

s

b(u, X^s,x(u))du+B^H(t)−B^H(s), X^s,x(s) =x,0≤ s≤ t≤T (1.10) Let s∈[0, T], b∈ L^1,∞_∞,∞ and k≥1. Then if H < C(d, k)< ¹₂ there exists a unique (global) strong solution X_·^s,x of the SDE (1.10). Moreover, for every x∈ R^d, t∈ [s, T] X_t^s,x is Malliavin diﬀerentiable in the direction of the Brownian motion B in the representation

B^H(t) = _t

0 K_H(t, s)I_d×ddB(s). (1.11) Further, X^s,·(t) is locally Sobolev diﬀerentiable μ − a.e. That is, more precisely,

X_t^s,· ∈

p≥2

L²(Ω;W^k,p(U)) for bounded and open sets U ⊂ R^d.

(29)

Summary of Papers Our main contribution is an improvement of the constantC(d, k), appearing in the previous theorem from

2(3d−1)1 ∧ _d(2k+1)¹

to _2(d−1+2k)¹ . This is a signiﬁcant improvement, from multiplicative to linear dependence on d, of previous results.

The above estimate has also inﬂuenced as well as aﬀected the key result in [3] namely Proposition 3.2, which is the equivalent estimate, in the case of fBm perturbations, of the estimate above in [5].

We then use our result to prove a Bismut-Elworthy-Li formula with respect to the unique strong solution of (1.10) (Theorem I.2.6). We then use this formula to solve the problem of of computing Greeks, the Δ to be precise, of ﬁnancial claims in a Black-Scholes model with stochastic volatility who’s dynamics are given by (1.10) (Theorem I.3.1).

Paper II In this paper we prove the higher order differentiability of a stochastic flow associated with a stochastic differential equation driven by an additive multi-dimensional fractional Brownian noise, where the bounded variation part is given by the local time of the unknown solution process, that is

X^x(t) =x+αL_t(X^x)·1_d+B^H(t), 0≤t≤ T (1.12) where α∈R, 1_d is the vector with entries 1 and L_t(X^x), 0≤ t≤T is the local time of the unknown solution process, which one can deﬁne as

L_t(X^x) = lim

ε0

_t

0 ϕ_ε(X_s^x)ds,

where the limit is in probability and ϕ_ε approximates, in distribution, the Dirac delta function δ₀ in zero. Here a commonly used approximationϕ_ε is given by

ϕ_ε(x) =ε⁻^d²ϕ(ε⁻¹²x), ε > 0, (1.13) where ϕ is a d−dimensional Gaussian probability density.

More precisely, our key contribution is showing that solutions X^x(t), to the stochastic diﬀerential equation (1.12), for all 0≤ t≤ T,

(x−→X^x(t)) ∈

p≥2

L²(Ω;W^k,p(U)).

whenever H < _2(d−1+2k)¹ .

Paper III In this paper we ask a question in the opposite direction, namely: Can we construct a noise that has inﬁnite, in the sense of restoring classical diﬀerentiability, regularization power on

X^x(t) =x+ _t

0 b(u, X^x(u))du+B(t), X^x(0) = x,0 ≤t≤T, (1.14)

Regularization Effects for Certain Dynamical Systems through Gaussian Noises

Oussama Amine