L ´EVY MODELS
FRED ESPEN BENTH AND MAREN DIANE SCHMECK
Abstract. Merton’s classical portfolio optimisation problem for an investor, who can trade in a risk-free bond and a stock, can be extended to the case where the driving noise of the log-returns is a pure jump process instead of a Brownian motion. Benth et al. [5], [6] solved the problem and found in the HARA-utility case the optimal control implicitly given by an integral equation. There are several ways to approximate a Levy process with infinite activity:
by neglecting the small jumps or approximating them with a Brownian motion, as discussed in Asmussen and Rosinski [2]. In this setting, we study stability of the corresponding optimal investment problems. The optimal controls are solutions of integral equations, for which we study convergence. We are able to characterize the rate of convergence in terms of the variance of the small jumps. Additionally, we prove convergence of the corresponding wealth processes and indirect utilities (value functions).
1. Introduction
In Merton’s [17] seminal paper on optimal portfolio management under uncertainty, it is proved that a risk averse investor will place a constant proportion of her total wealth in risky assets.
Optimality is measured as the expected utility of terminal wealth, with a power or HARA utility function measuring the risk preferences of the investor. Moreover, the dynamics of the risky assets follow a geometric Brownian motion. The optimal proportion is given explicitly as the ratio of the excess return over the risk free, normalized by the volatility of the risky asset and the risk aversion of the investor.
The constant proportion rule is among the popular strategies for portfolio management in practice. Merton’s portfolio selection problem has also gained a lot of attention in the scientific literature over the years, with generalizations in various directions. Recent extensions of the origi- nal Merton problem include the case of stochastic coefficients in Delong and Kl¨uppelberg [10], and bounded downside risk through restrictions on Value-at-Risk and Expected Shortfall in Kl¨uppelberg and Pergamenchtchikov [15]. For a general treatment and discussion, we refer to Øksendal and Sulem [20].
One stream in the literature relevant for our considerations focuses on analysing the effects of more realistic models for the risky asset price dynamics on the optimal portfolio management problem. For example, Benth, Karlsen and Reikvam [5] examined Merton’s problem when the risky asset price dynamics is given by an exponential pure-jump Levy-process. Also in this case it is optimal to invest a constant proportion of the investor’s wealth in the risky asset, however, the proportion is given in terms of a solution of an integral equation involving the excess return of the asset and the characteristics of the jump processes. One may easily include consumption into the portfolio problem, regaining qualitatively similar solutions as in the classical Merton’s problem.
Emmer and Kl¨uppelberg [12] examine constraints of an upper bound for the risk when stock prices follow an exponential L´evy process. There is also work on optimal portfolios with HARA-utility in multidimensional cases, see for example Calleg and Vargiolu [8], where assets are driven by a multidimensional Poisson process, or Pasin and Vargiolu [21] in the case of exponential additive processes.
Date: January 26, 2011.
Key words and phrases. Stochastic control, portfolio optimization, L´evy processes, utility optimization, integral equations, approximations.
1
L´evy processes are popular in financial modelling since they are able to explain many of the stylized facts of asset prices (see Cont and Tankov [9] for a discussion of L´evy processes in finance).
In particular, some processes like the normal inverse Gaussian (NIG) or the hyperbolic L´evy process have become particularly relevant since they are able to capture the return distribution of most asset prices (see e.g. Barndorff-Nielsen [3] and Eberlein and Keller [11]). These L´evy processes are pure-jump, and therefore give distinctively different paths of the asset prices compared to a Brownian motion with continuous paths. In empirical analysis of financial price data, one may detect big jumps, however, the small jumps are very hard to separate from the observations of a Brownian motion. Thus, it is not a simple task to decide whether a L´evy process with jumps or a Brownian motion is governing the small variations in a stock price, say.
In this paper we focus on the stability of Merton’s problem with respect to model choice. In particular, we analyse what happens when the small jumps of the L´evy process driving the asset price dynamics is approximated by a Brownian motion. This would mimic a situation where we have two investors, one believing in a pure-jump L´evy process, and another which thinks the small variations in prices come from a Brownian motion. Asmussen and Rosinski [2] show that in fact the small jumps of a L´evy process has a central limit type behaviour towards a Brownian motion, which tells that one may empirically not be able to distinguish between two such models. The question is then to what extent this transfer over the the optimal portfolio selection problem. We pose the problem as an approximation of asset price models, where we either ignore or subsitute jumps in the L´evy process smaller than a threshold . To substitute, we use a Brownian motion.
Indeed, our analysis shows that the optimal investment in the risky asset is stable with respect to the different approximations. We are able to classify the convergence rate as being proportional to the variance of the small jump part of the L´evy process.
A general approach to stability of stochastic control problems are provided by Larsen and Zitkovi´ˇ c [16]. They investigate the influence of estimation errors in the parameters of the under- lying financial assets. Jakobsen, Karlsen and La Chioma [14] are deriving stability results for the Hamilton-Jacobi-Bellman equation for stochastic control problems, and derive error estimates for approximative viscosity solutions. In a paper by Benth, Di Nunno and Khedher [4], stability for option pricing and hedging have been considered based on similar L´evy approximations as in the present paper. Here, the authors prove that prices and hedges converge at a rate given by the variance of the small jumps of the L´evy process, similar to our findings.
Our results are presented as follows. In Section 2 we state the control problems and recall some results on these. Afterwards we discuss in Section 3 how the approximation of the L´evy process influences the integral equation which gives implicitly the solution of the control problem.
In Section 4 we study the convergence of the controls and derive convergence rates, which is illustrated by some numerical examples. The convergence of the value functions is treated in Section 5, and in Section 6 we analyse the wealth processes.
2. A review of Merton’s portfolio optimization problem
We recall the Merton’s portfolio optimization problem in the L´evy case with and without consumption, and review some relevant results from Benth et al. [5, 6].
Let (Ω,F,P) be a complete probability space and {Ft}t≥0 a given filtration satisfying the usual conditions. We consider a financial market consisting of a stock and a bond. Let the bond dynamics be given by
dB(t) =rB(t)dt ,
wherer >0 is the constant interest rate. The value of the stock follows a process given by S(t) =S(0)eξt+L(t)
whereξis a constant andL(t) a pure jump L´evy process with L´evy-Khintchine decomposition L(t) =
Z t
0
Z
|z|<1
zN˜(ds, dz) + Z t
0
Z
|z|≥1
zN(dt, dz).
Here,N(dt, dz) is a Poisson random measure onR+×R\ {0} with intensity measuredt×ν(dz), andν(dz) being the L´evy measure, that is, aσ-finite Borel measure onR\ {0} with
Z
R\{0}
min(1, z2)ν(dz)<∞.
We denote by ˜N(ds, dz) =N(dt, dz)−dtν(dz) the compensated Poisson random measure ofN. In the sequel, we suppose that the L´evy process is exponentially integrable, that is, its L´evy measure satisfies the condition
(1)
Z ∞
1
e2zν(dz)<∞.
This will ensure that the stock price dynamics has finite expectation, but also it will be necessary for the analysis to come in order to derive convergence rates for approximative portfolio strategies.
Consider an investor who puts her money in the stock and bond to optimize her utility. Let π(t) denote the fraction of her wealth invested in the stock and c(t) her rate of consumption at timet. The dynamics of the wealth X(π,c)(t) becomes (see Benth et al. [5])
(2) dX(π,c)(t) =X(π,c)(t)(r+ (ˆµ−r)π(t))dt−c(t)dt+X(π,c)(t−)π(t−) Z
R\{0}
ez−1 ˜N(dt, dz), where
ˆ µ=ξ+
Z
R\{0}
ez−1−z1|z|<1ν(dz)
is the drift of the stock price dynamics andX(t−) denotes the left-limit of a processX(t). We denote byX(π,c)(0) =xthe initial wealth of the investor, and assumer <µ. The last conditionsˆ ensures that the stock gives a higher average return than the bond.
We define the set ofadmissible controls Ax to consist of those investment-consumption plans (π, c) such that
(1) π(t) is progressively measurable with values in [0,1], (2) c(t) is a positive and adapted process such thatRt
0E[c(s)]ds <∞for allt≥0, (3) c(t) is such thatX(π,c)(t)≥0 almost everywhere for allt≥0.
We will restrict our attention to admissible controls, (π, c)∈ Ax. Observe that we constrain the invested fraction of wealth in the stock to be between 0 and 1, meaning that we cannot short sell stocks or borrow money to invest more that our wealth in stocks. One may extend the theory in Benth et al. [5] to π ∈[π, π], for π < 0 and π > 0. In the considerations to come, we can also include such a case with some additional effort.
The utility derived by the investor comes from consumption, and we suppose that she has a power utility function of HARA type, that is,U(x) =xγ/γfor a risk aversion parameterγ∈(0,1).
Lettingδ >0 be a constant discount rate, the value function is defined by V(x) = sup
c,π∈Ax
Ex hZ ∞
0
e−δtcγt γ
dti . (3)
By dynamic programming, the Hamilton-Jacobi-Bellman (HJB) equation takes the form max
c≥0,π∈[0,1]
(r+ (ˆµ−r)πt)xv0(x)−cv0(x)−δv(x) +cγ γ
+ Z
R\{0}
v(x+πx(ez−1))−v(x)−πxv0(x)(ez−1) ν(dz)
= 0. (4)
Benth et al. [5] show thatV is a viscosity solution of the HJB-equation. Moreover, the optimal investment strategy turns out to be a constantπ∗solving implicitly the integral equation
Z
R\{0}
(1 +π∗(ez−1))γ−1(ez−1)−(ez−1)ν(dz) =r−µ .ˆ (5)
The optimal consumption is given as a constant rate of the wealth, c∗(t) =X(π,c)(t) 1−γ
δ−k(γ) (6)
where
k(γ) =γ(r+ (ˆµ−r)π∗) + Z
R\{0}
(1 +π∗(ez−1))γ−1−γπ(ez−1)ν(dz).
We remark in passing that one may consider the simplified problem of maximizing terminal wealth only, and not consume anything from the portfolio. The value function becomes in this case
V(x) = sup
π∈Ax
Ex 1
γXπ(T)γ
, (7)
where we use the obvious definition of the set of admissible controls and the wealth processXπ(t) (the latter is given byX(π,0)(t)). As it turns out, the optimal investment strategy is still a constant fraction of wealth placed in the stock, solving the integral equation (5).
We end this section with a discussion on conditions ensuring the existence and uniqueness of an optimal portfolio investment strategyπ∗∈[0,1]. For this purpose, define
(8) F(π) =
Z
R\{0}
(1 +π(ez−1))γ−1(ez−1)−(ez−1)ν(dz) + (ˆµ−r),
which is a continuous function on [0,1] under our exponential integrability hypothesis on ν(dz).
It holds
F(0) = ˆµ−r , which is positive by assumption on ˆµandr. We have:
Lemma 1. Assume that the L´evy measure and γsatisfy (9)
Z
R\{0}
(ez−1)(1−e−(1−γ)z)ν(dz)>µˆ−r . Then there exists a uniqueπ∗∈(0,1) solving (5).
Proof. By commuting differentiation and integration (see Folland [13]), we find forF in (8) that F0(π) =−(1−γ)
Z
R\{0}
(ez−1)2(1 +π(ez−1))γ−2ν(dz).
Since (exp(z)−1)2 and (1 +π(exp(z)−1))γ−2 are both positive as long as π ∈ [0,1], we find that F0(π) <0. Hence, F is strictly decreasing on [0,1]. Therefore, we have a unique solution π∗∈(0,1) of (5) as long asF(1)<0. But this is ensured by the condition in the Lemma.
Note that (exp(z)−1)(1−exp(−(1−γ)z)) is positive for allz∈R. Hence, the left-hand side of the condition (9) is positive. Hence, the condition therefore gives a relation between the L´evy measure and the risk aversion on one hand, and the excess return ˆµ−r on the other. Given the optimal controlπ∗, we have the optimal consumption processc∗(t) as well.
For the analysis to come, it is convenient to introduce a functionf(π, z) defined as (10) f(π, z) = (1 +π(ez−1))γ−1(ez−1)−(ez−1).
Furthermore, letg(π) be
(11) g(π) =
Z
R\{0}
f(π, z)ν(dz).
Then, from the definition of F(π) we see that the integral equation (5) may be formulated com- pactly as
g(π) =r−µ.ˆ (12)
We shall make use of these two functions when we move on in the next Section to consider approximations of the control problem of Merton.
3. The control problem with approximated driving process
In this section we examine the convergence properties of Merton’s portfolio problem when we approximate the L´evy process L(t) in the stock price dynamics. In particular, we consider two approximations, one where the small jumps of Lare neglected, and another where we substitute the small jumps by a scaled Brownian motion. These two approximations will lead to different HJB-equations, and thus to different controls and value functions. We analyse the convergence to the original portfolio problem, and establish rates.
3.1. ApproximatingL by neglecting the small jumps. By appealing to the L´evy-Kintchine representation ofL(t), we can write for a given 0< <1,
L(t) = Z t
0
Z
|z|<1
zN˜(ds, dz) + Z t
0
Z
|z|≥1
zN(ds, dz) (13)
= Z t
0
Z
|z|<
zN˜(ds, dz) + Z t
0
Z
<|z|<1
zN˜(ds, dz) + Z t
0
Z
|z|≥1
zN(dt, dz).
(14)
Introduce an approximation ofL(t) which neglects jumps smaller than: LN,(t) =
Z t
0
Z
<|z|<1
zN˜(ds, dz) + Z t
0
Z
|z|≥1
zN(dt, dz).
Then, the Levy measureνN, ofLN,(t) is νN,(dz) :=
(ν(dz), |z|>
0, otherwise.
The neglection of the small jumps influences the Levy measure and so indirectly also all terms which include a L´evy integral, as the drift of the stock price dynamics, the optimal controls and eventually the value function.
With obvious definition, we denote by XN,(π,c)(t) the wealth process for admissible controls (π, c)∈ Ax,N,. Furthermore,VN,(x) denotes the value function.
Tracing through the derivation of Benth et al. [5] usingLN, in the stock price dynamics, leads to the following integral equation for the optimal controlπ∗N,
Z
R\{0}
f(πN,∗ , z)νN,(dz) =r−µˆN,
(15) with
ˆ
µN,=ξ+ Z
|z|>
ez−1−z1|z|<1ν(dz)
= ˆµ− Z
|z|<
ez−1−z ν(dz).
Furthermore, the optimal consumption process, denotedc∗N,(t) will be the same as for the non- approximated case, except that we insert the optimal controlπN,∗ in (6) and useνN,as the L´evy measure in the definition ofk.
Let us investigate conditions for the existence of a unique solution to (15). Introduce the functionFN,(π) as
(16) FN,(π) = Z
R\{0}
(1 +π(ez−1))γ−1(ez−1)−(ez−1)νN,(dz) + (ˆµN,−r).
The optimal investment strategy π∗N, is given as a root of the functionFN,. Observe that the derivative ofFN, is negative, similar as to the case of= 0. Thus, FN, is a continuous function which is strictly decreasing. It will have a root in the interval [0,1] if and only ifFN,(0)>0 and FN,(1)<0. But this is equivalent to
(17) µˆ−r >
Z
|z|<
ez−1−z ν(dz)
and (18)
Z
R\{0}
(ez−1)(1−e−(1−γ)z)νN,(dz)<µˆ−r− Z
|z|<
ez−1−z ν(dz). We note that by Taylor expansion, the integral
Z
|z|<
ez−1−z ν(dz),
will be approximately equal to σ2() which tends to zero as → 0. Recalling that ˆµ > r, we are ensured the existence and uniqueness of a solutionπ∗N, by choosingsufficiently small if the condition (9) in Lemma 1 holds (that is, the condition for existence and uniqueness ofπ∗∈[0,1]).
To emphazise the difference of (15) from the original equation (5), reorganize to show that (15) is equivalent to
Z
R\{0}
f(π∗N,, z)ν(dz)− Z
|z|<
f(πN,∗ , z)ν(dz) =r−(ˆµ− Z
|z|<
ez−1−z ν(dz)). Or, using the functiong in (11), we have
g(πN,∗ ) =r−µˆ+ Z
|z|<
1 +πN,∗ (ez−1)γ−1
(ez−1)−z ν(dz).
Introduce the functionh(π, z) for|z|<1 by
h(π, z) := (1 +π(ez−1))γ−1(ez−1)−z . (19)
Then, it finally follows thatπN,∗ is the solution of the integral equation g(π∗N,) =r−µˆ+
Z
|z|<
h(πN,∗ , z)ν(dz).
(20)
In the analysis of convergence ofπ∗N,to π∗as→0, this representation of the optimal control is attractive.
3.2. ApproximatingLby substituting small jumps by Brownian motion. An alternative to truncating off the small jumps, is to approximate them by an appropriately scaled Brownian motion as discussed Asmussen and Rosinski [2]. More precisely, we introduce the process
LW,(t) =σ()W(t) +LN,(t), (21)
whereW(t) is a Brownian motion (independent ofL(t)) and σ2() :=
Z
|z|<
z2ν(dz), (22)
is the variance of the small jumps (at least for symmetric L´evy processes). Note thatσ2() is finite sinceν(dz) integratesz2 around the origin by definition. Moreover, by monotone convergence, it holds that
→0limσ2() = 0.
It will be clear later thatσ2() gives the rate of convergence in the approximations of the original portfolio optimization problem.
As in Benth et al. [6], an additional Brownian component does not change the general form of the solution of the control problem. We denote the wealth equation by XW,(π,c) for admissible controls (π, c) ∈ Ax,W,, with an obvious definition of these. The value function in this case is denotedVW,(x).
We can derive an integral equation for the optimal investment strategy, still being a constant π∗W,, but now solving the integral equation
Z
R\{0}
f(π∗W,, z)νN,(dz) =r−µˆW,+ (1−γ)σ2()πW,∗ ,
with
ˆ
µW,=ξ+ Z
|z|>
ez−1−z1|z|<1ν(dz) +1 2σ2()
= ˆµ− Z
|z|<
ez−1−z ν(dz) +1 2σ2().
The optimal consumptionc∗W,(t) is given by
c∗W,(t) =X(π∗W,,c)(t) 1−γ δ−kW,(γ) (23)
where
kW,(γ) =γ(r+ (ˆµW,−r)πW,∗ )−1
2σ2()(π∗W,)2γ(γ−1) +
Z
R\{0}
(1 +πW,∗ (ez−1))γ−1−γπW,∗ (ez−1)νN,(dz).
Again, we reformulate the equation for the optimal investment strategy in terms ofg, in order to find
g(πW,∗ ) =r−µˆ+σ2()
(1−γ)πW,∗ −1 2
+
Z
|z|<
h(πW,∗ , z)ν(dz). (24)
Additionally to R
|z|<h(π∗W,, z)ν(dz), that also appeared in (20), we have a term with σ2() on the right hand side of (24).
We state conditions for the existence and uniqueness of an optimal strategyπW,∗ ∈[0,1]. Define the functionFW,(π) as
(25) FW,(π) = Z
R\{0}
(1 +π(ez−1))γ−1(ez−1)−(ez−1)νN,(dz) + (ˆµW,−r)−(1−γ)σ2()π . Similar to the case of neglecting the small jumps, FW,(π) is a strictly decreasing continuous function, which has a root in the interval [0,1] if and only ifFW,(0)>0 and FW,(1)<0. This is equivalent to
(26) µˆ−r >
Z
|z|<
ez−1−z ν(dz)−1 2σ2() and
(27) Z
R\{0}
(ez−1)(1−e−(1−γ)z)νN,(dz)<µˆ−r− Z
|z|<
ez−1−z ν(dz)−(1
2 −γ)σ2(). For the same reasons as before, we are ensured the existence and uniqueness of a solutionπ∗W, by choosingsufficiently small if the condition (9) in Lemma 1 holds.
Let us discuss an example. In finance, the normal inverse Gaussian (NIG) distribution turns out to model the (log-)returns of financial asset prices very well. There exist several empirical studies where this distribution was applied, but we refer to Bølviken and Benth [7] which studied Norwegian stock prices. The NIG L´evy process is a pure-jump process, where the L´evy measure is explicitly known as
ν(dz) = αδ
π|z|K1(α|z|)eβzdz
for a NIG(µ, β, α, δ) process. Here,K1 is the modified Bessel function of the third kind of index 1. In Rydberg [18] it was suggested to approximate the small jumps of the NIG process by a Brownian motion scaled byσ(), as discussed above. In Asmussen and Rosinski [2] this example was further elaborated, and they show that
σ2()∼2δ π × .
We remark that the NIG L´evy process was also used in Benth et al. [5] as a motivation for their studies of the Merton portfolio optimization problem for pure-jump L´evy processes.
4. Convergence rates for the optimal investment strategy
In this Section we prove that the approximative investment strategies πN,∗ and π∗W, both converge to π∗ as →0. Moreover, we derive rates of convergence for both approximations in terms of the variance of the small jumpsσ2().
4.1. Approximation of L by neglecting small jumps. Consider the case where we derive the optimal portfolio strategyπ∗N, based on an approxmation where the small jumps are simply neglected. We have the following result.
Proposition 1. The controlπN,∗ solving (15)converges to the control π∗ derived from (5) when →0. In particular, it holds
|π∗N,−π∗| ≤CNσ2(), for a constantCN >0 independent of .
Proof. Recall the definition of the functionf in (10) to see that
∂
∂πf(π, z) :=f0(π, z) = (γ−1)(ez−1)2(1 +π(ez−1))γ−2
is negative for allz ∈R. Hence, f is a continuous and strictly decreasing function ofπ∈[0,1].
Moreover, it is negative for all π ∈ (0,1), z ∈R sincef(0) = 0. Then, it follows that g(π) = R
R\{0}f(π, z)ν(dz) in (11) is also strictly decreasing and negative. As
|f(π, z)| ≤ |f(1, z)|and Z
R\{0}
|f(1, z)|ν(dz)<∞,
the parameter-dependent integral defining g(π) is continuous by Theorem 11.4 in Schilling [19].
Therefore, the inverse g−1 exists and is continuous on the image g([0,1]), and we can write the optimal control as
π∗=g−1(r−µ).ˆ
Moreover, from elementary calculus the derivative of the inverse of a function g(π) = y can be written as
(g−1)0(y) = 1
g0(g−1(y))= 1 g0(π)
= 1
∂
∂π
R f(π, z)ν(dz) = 1 R ∂
∂πf(π, z)ν(dz)
where we are allowed to commute integration and differentiation using Theorem 11.5 in Schilling [19]
as long as|∂π∂ f(π, z)| ≤w(z) forw(z) being an integrable function. But forz >0 we find that
|∂f
∂π| ≤(1−γ)(ez−1)2 whereas forz <0 we find
|∂f
∂π| ≤(1−γ)(ez−1)2e−(2−γ)z.
By the exponential integrability hypothesis on ν(dz), this defines an integrable function w(z) verifying the commuting of integration of differentiation.
We continue with the proof of convergence. For z > 0, ∂f /∂π is monotonely increasing in π∈[0,1] and forz <0 it is monotonely decreasing. Additionally,∂f /∂πis negative for allz∈R. So,
∂
∂πf(π, z)≤ ∂
∂πf(0, z), z <0
∂
∂πf(π, z)≤ ∂
∂πf(1, z), z >0.
Next we apply this to find an approximation for (g−1)0(y):
Z
R\{0}
∂f
∂π(π, z)ν(dz) = Z
R+
∂f
∂π(π, z)ν(dz) + Z
R−
∂f
∂π(π, z)ν(dz)
≤ Z
R+
∂f
∂π(1, z)ν(dz) + Z
R−
∂f
∂π(0, z)ν(dz) where we have
∂f
∂π(0, z) = (γ−1)(ez−1)2
∂f
∂π(1, z) = (γ−1)(ez−1)2ez(γ−2). Hence,
Z
R\{0}
f0(π, z)ν(dz) ≥
Z
R+
f0(1, z)ν(dz) + Z
R−
f0(0, z)ν(dz)
= (1−γ) Z ∞
0
(ez−1)2e−(2−γ)zν(dz) + Z 0
−∞
(ez−1)2ν(dz)
=:L−1. This implies that
|(g−1)0(y)|= 1
R
R\{0}f0(π, z)ν(dz)
≤L .
With this bound on the derivative of the inverse function, we move on to estimate the error:
By applying the Mean Value Theorem in calculus to the equations (12) and (20), we find
|πN,∗ −π∗|=
g−1 r−µˆ+ Z
|z|<
h(πN,∗ , z)ν(dz)
!
−g−1(r−µ)ˆ
=|(g−1)0(θ)|
r−µˆ+ Z
|z|<
h(π∗N,, z)ν(dz)−(r−µ)ˆ
≤L Z
|z|<
h(πN,∗ , z)ν(dz)
for some
θ∈[min{r−µ, rˆ −µˆ+ Z
|z|<
h(πN,∗ , z)ν(dz)},max{r−µ, rˆ −µˆ+ Z
|z|<
h(πN,∗ , z)ν(dz)}].
Next, let us estimate the term involvingh. Sinceh(π, z) is decreasing inπ∈[0,1], we have h(0, z)≥h(π, z)≥h(1, z).
Note that h can be both positive and negative, which makes it difficult for estimations of the absolute value of the integral ofh. Divide the domain of definition ofhinzinto those parts where his positive and negative for fixedπandγ:
Aπ:={z∈R:h(π, z)≥0}, Bπ:={z∈R:h(π, z)<0}. Then, forz∈Aπ, we find from Taylor expansions
|h(π, z)| ≤ |h(0, z)|=|ez−1−z| ≤
∞
X
n=2
|z|n
n! ≤z2e|z|,
and forz∈Bπ we have
|h(π, z)| ≤ |h(1, z)|=|ez(γ−1)(ez−1)−z|=|eγz−ez(γ−1)−z|
≤
∞
X
n=2
|γn−(γ−1)n|
n! |z|n≤z2
∞
X
n=0
|γn+2−(γ−1)n+2|
n! |z|n≤z2e|z|, as|γn+2−(γ−1)n+2| ≤1. It follows
|π∗N,−π∗| ≤L Z
|z|<
h(π∗N,, z)ν(dz)
≤L Z
{|z|<}∩Aπ∗ N,
h(πN,∗ , z)ν(dz) +
Z
{|z|<}∩Bπ∗ N,
h(πN,∗ , z)ν(dz)
≤LZ
{|z|<}∩Aπ∗ N,
|h(πN,∗ , z)|ν(dz) + Z
{|z|<}∩Bπ∗ N,
|h(πN,∗ , z)|ν(dz)
≤LeZ
{|z|<}∩Aπ∗ N,
z2ν(dz) + Z
{|z|<}∩Bπ∗ N,
z2ν(dz)
≤Leσ2().
This completes the proof.
Remark that sinceh(1, z) is increasing inz∈[−1,1], we find that h(1, z)≥h(1,−1) =e−γ(1−e) + 1. It follows thath(π, z) is positive for allz∈Randπ∈[0,1] if
γ >ln(e−1)≈0.541.
(28)
Thus, ifγ≥ln(e−1), we have
|h(π, z)| ≤ |h(0, z)|=ez−1−z . This implies that
| Z
R0
h(π, z)ν(dz)| ≤eσ2() which simplies the proof above.
Going back to the case of an NIG L´evy processL(t), then by neglecting the small jumps and solving the portfolio optimization problem would yield an error which could be bounded as
|πN,∗ −π∗| ≤C× .
Thus, the speed of convergence is of order 1 with respect to the truncation error.
4.2. Approximation of L by substituting small jumps by Brownian motion. We move on showing that the approximation using a Brownian motion leads to a convergence ofπ∗W, toπ∗ with the same rate as for the case where small jumps are neglected. We formulate the result as a proposition:
Proposition 2. The controlπ∗W, solving (24)converges to the controlπ∗ derived from(12)when →0. In particular, it holds
|πW,∗ −π∗| ≤CWσ2(), for a constantCW >0independent of .
Proof. The proof follows the same line of arguments as in the proof of Prop. 1.
|π∗W,−π∗|=
g−1(r−µˆ+σ2() (1−γ)πW,∗ −1 2
+ Z
|z|<
h(π∗W,, z)ν(dz))−g−1(r−µ)ˆ
≤L
r−µˆ+σ2() (1−γ)π∗W,−1 2
−(r−µ)ˆ +
Z
|z|<
h(πW,∗ , z)ν(dz)
≤L σ2() 3
2 −γ +
Z
|z|<
h(πW,, z)ν(dz)
.
Invoking the estimations onhfrom the proof of Prop. 1 gives the result.
Inspecting the proofs of Prop. 1 and 2 shows that the constant in the convergence rate when neglecting the small jumps is given byCN =Le, whereas for the Brownian motion approximation it isCW =L(3/2−γ) +CN > CN. Thus, the error estimate is in fact slightly worse when we use an approximation which gives a L´evy process with approximately the same variance, compared to an approximation where some of the noise is removed.
4.3. Examples. First, let us assume the driving processL(t) is a Poisson processN(t), compen- sated by its jump intensityλ,
L(t) =N(t)−λt .
This is admittedly not a process which has ”small jumps”, since all jumps are of constant size 1.
However, we would like to look at an example where we perturb this process by adding a Brownian motion component, that is
L(t) =L(t) +W(t).
This is not fitting into our analysis above, but the case here is to see the effect of perturbing the processL(t) in a simple setting where much is known analytically. It serves as a ”non-example”
which is still relevant for our considerations.
We find the optimal controlπ∗ by solving (12), which in this case becomes ((1 +π∗(e−1))γ−1(e−1)−(e−1))λ=r−((γ−λ) + (e−1)λ). This yields the solution
π∗=
r−γ+λ (e−1)λ
γ−11
−1
! 1 e−1.
The corresponding equation for solving π∗, the optimal fraction to invest in the stock for the driving processL(t), is
((1 +π∗(e−1))γ−1(e−1)−(e−1))λ=r−((γ−λ) + (e−1)λ) + ((1−γ)π∗−1 2) , which we solve numerically.
To be concrete, we suppose an annual interest rate r = 4.5% and a jump intensity given by λ= 0.5, corresponding to an asset that jumps on average every second day. Furthermore, the risk aversion is set equal to γ = 0.5. This yields an optimal investment in the stock of π∗ = 0.7367, that is, 73.67% of the wealth should go into the stock. In Figure 1(a) we plot the ”exact solution”
π∗ against the approximative π∗ for ranging from 0.01 to 1. The approximative investment strategy is found by solving the equation using the built-in Matlab routine fzero. Since the approximative model has more noise/uncertainty, it naturally leads to an investment strategy less thanπ∗. Noteworthy is that the difference is rather big even for small’s. For example, if= 0.1, we have a relative error of approximately 5.3%, whereas for= 0.01 it is 0.7%.
We see from the figure where we plot the error on log-scale against a log-epsilon that it corre- sponds very good to the line−1 + 0.93×log(), which means that the error goes approximately asC×0.93, slightly worse than a linear convergence in.
We now move on to consider the more interesting case of an NIG L´evy process and the approx- imation of such. We restrict our attention to the situation where we neglect the small jumps, and investigate the deviation between the ”correct” portfolio strategyπ∗ and the approximativeπN,∗ .
0 0.1 0.2 0.3 0.4 0.5 0.6
0.62 0.64 0.66 0.68 0.7 0.72 0.74 0.76 0.78 0.8
!
optimal investment "*
(a) The exact optimal investmentπ∗(solid line) and the approximated oneπ∗N,(dashed line).
−5.5 −5 −4.5 −4 −3.5 −3 −2.5 −2 −1.5 −1 −0.5
−5
−4
−3
−2
−1 0 1
log(!)
log(error)
(b) The error on a log-scale.
Figure 1. Error whenL(t) follows a Poisson processN(t)
Let the parameters beα= 50, andδ= 0.03 on a daily scale (this seems to be natural estimates of stocks, see Bølviken and Benth [7]). Furthermore we supposer= 0.04/250 and ξ= 0.02/250.
The interest rate is therefore 4% while logreturns have a mean of 2%, measured annually, when we assume there are 250 trading days in a year. The compound interest rate ˆµ on the average stock price becomes 0.093, or 9.3%, annually. This is clearly abover, implying that the condition ˆ
µ−r >0 is satisfied.
We computed the optimal π∗ using = 10−10 to avoid the singularity at the center of the L´evy measure of the NIG. The resulting optimal investment strategy becameπ∗= 70.59%. Next, by starting with = 0.0001 and stepping down 0.00005, 0.00001, 0.000005, 0.000001,0.0000005, and 0.0000001, we get the errors as depicted in Figure 2. We remark that the truncation of the infinite integral limits goes at 1, where we observed that the tails of the NIG L´evy measure gave values which were of the magnitude 10−20, where as in the center around 0 it reached values of the magnitude 10−2. Strictly speaking, in this example we do not investigate the approximation of a
0 0.2 0.4 0.6 0.8 1 1.2 x 10−4
−14
−12
−10
−8
−6
−4
−2 0 2x 10−4
!
error
Figure 2. Error in the optimal control with a NIG L´evy process.
NIG L´evy process, but a L´evy process withtruncatedNIG L´evy measure at|z|<1. Nevertheless, it reflects what to be expected for the true NIG case.
5. Convergence of the value functions
We have seen that the controls πN,∗ and πW,∗ converge to the control π∗ as → 0. We next investigate the convergence of the corresponding value functions. We first discuss maximization of terminal wealth, and then move on to analyse the convergence of the optimal consumption and the value functions.
5.1. Maximising expected utility of terminal wealth. We consider the problem of optimizing terminal wealth, which means there is no consumption involved in the control problem. We find the following for the case of truncation of the small jumps:
Proposition 3. It holds for everyx∈R+,
→0limVN,(x) =V(x).
Proof. Recalling the wealth process Xπ(t) with no consumption, we find that it is a geometric jump diffusion process with constant coefficients and with solution
Xπ∗(t) =xexp{t(r+ (ˆµ−r)π) +t Z
R\{0}
ln(1 +π∗(ez−1))−π∗(ez−1)1|z|<1ν(dz) +
Z t
0
Z
R\{0}
ln(1 +π∗(ez−1)) ˜N(ds, dz)}.
Applying the formula in Ex. 1.6 in Øksendal and Sulem [20] , we find V(x) =E[U(Xπ∗(T))] =E
1
γ(Xπ∗(T))γ
= 1
γxγexp{γT(r+ (ˆµ−r)π∗).
+T Z
R\{0}
(1 +π∗(ez−1))γ−1−γπ∗(ez−1)1|z|<1ν(dz)}
If small jumps are neglected, we find similarly VN,(x) =E[U(XN,∗ (T))]
= 1
γxγexp{γT(r+ (ˆµN,−r)πN,∗ ) +T
Z
R\{0}
(1 +π∗N,(ez−1))γ−1−γπ∗N,(ez−1)1|z|<1νN,(dz)}.
(29)
We have for|z|<1
|(1 +π∗N,(ez−1))γ−1−γπ∗N,(ez−1)| ≤ |eγz−1−γ(ez−1)|, (30)
and for|z|>1,
|(1 +π∗N,(ez−1))γ−1| ≤ |eγz−1|.
Both estimates are integrable on their definition area. Asπ∗N, converges toπ∗ and ˆµN, to ˆµ, the
proposition follows with Lebesgues convergence theorem.
For the case of a Brownian approximation of the truncated small jumps we have:
Proposition 4. It holds for everyx∈R+,
→0limVW,(x) =V(x).
Proof. A Brownian motion approximation of the small jumps as in (3.2) results in an additional Brownian component andσ()-terms in the wealth process:
XW,∗ (t) =xexpn t
r+ (ˆµW,−r)πW,−1
2σ2()π2W,) +1
2σ()πW,Bt
+t Z
R\{0}
ln(1 +πW,∗ (ez−1))−πW,∗ (ez−1)1|z|<1νN,(dz) +
Z t
0
Z
R\{0}
ln(1 +πW,∗ (ez−1)) ˜N(ds, dz)}.
The value function becomes then VW,(x) =E[U(XW,∗ (T))]
= 1 γxγexp
γT(r+ (ˆµW,−r)π∗W,−1
2σ2()πW,2 ) +T
Z
R\{0}
(1 +πW,∗ (ez−1))γ−1−γπW,∗ (ez−1)1|z|<1νN,(dz) )
.
Hence, as for the case ofVN, above,VW,(x) converges toV(x).
5.2. Maximising exptected utility of consumption. As we noted in Section 2, maximising wealth over optimal investment and consumption pairs (π, c) results in the same optimal strategy π∗as when maximising expected utility over terminal wealth. As our results show, approximations of these control problems leads to convergence of the optimal investment stategies, as well as the value functions for maximization of the utility of terminal wealth. We show next that including consumption does not alter these conclusions.
Proposition 5. Let the value function be of the form (3) and suppose that the discount factorδ satisfies
δ > γµˆ+ Z ∞
1
eγz−1ν(dz). Then it holds for everyx∈R+,
→0limVN,(x) =V(x).
Proof. As the optimal consumption is a constant fraction of wealth, the wealth process is again a geometric jump diffusion process with constant coefficients and with solution
Xπ∗(t) =xexp{t(r+ (ˆµ−r)π∗−c∗) +t Z
R\{0}
ln(1 +π∗(ez−1))−π∗(ez−1)1|z|<1ν(dz) +
Z t
0
Z
R\{0}
ln(1 +π∗(ez−1)) ˜N(ds, dz)}
where
c∗= 1−γ δ−k(γ). Then the value function takes the form
V(x) =E[ Z ∞
0
e−δt(c∗(t))γ γ dt]
= 1 γ(c∗)γxγ
Z ∞
0
exp{t[−δ+γ(r+ (ˆµ−r)π∗−c∗) +
Z
R\{0}
(1 +π∗(ez−1))γ−1−γπ∗(ez−1)1|z|<1ν(dz)]}dt
where we have exchanged integration and expectation by appealing to Fubini’s theorem and for- mula in Ex. 1.6 in Øksendal and Sulem [20]. For the value function to be finite, we must require that,
−δ+γ(r+ (ˆµ−r)π∗−c∗) + Z
R\{0}
(1 +π∗(ez−1))γ−1−γπ∗(ez−1)1|z|<1ν(dz)<0 (31)
Condition (31) depends onπ∗ andc∗. With π∗bounded in [0,1] andc∗ positive we find γ(r+ (ˆµ−r)π∗−c∗)≤γµ.ˆ
Furthermore, the integrand in (31) is positive and increasing in π for z > 1 and negative and decreasing inπforz <1. Then
Z
R\{0}
(1 +π∗(ez−1))γ−1−γπ∗(ez−1)1|z|<1ν(dz)<
Z ∞
1
eγz−1ν(dz) and condition (31) is fullfilled by the assumption onδ. Thus,
V(x) =−1
γ(c∗)γxγ[−δ+γ(r+ (ˆµ−r)π∗−c∗) +
Z
R\{0}
(1 +π∗(ez−1))γ−1−γπ∗(ez−1)1|z|<1ν(dz)]−1. Neglecting small jumps, we find similarly
VN,(x) = 1 γ(c∗N,)γ
Z ∞
0
xγexp{t[−δ+γ(r+ (ˆµN,−r)πN,∗ −c∗N,) +
Z
R\{0}
(1 +π∗N,(ez−1))γ−1−γπ∗N,(ez−1)1|z|<1νN,(dz)]}dt
=−1
γ(c∗N,)γxγ[−δ+γ(r+ (ˆµN,−r)πN,∗ −c∗N,) +
Z
R\{0}
(1 +π∗N,(ez−1))γ−1−γπ∗N,(ez−1)1|z|<1νN,(dz)]−1 With
c∗N,= 1−γ δ−kN,(γ)
and
kN,(γ) =γ(r+ (ˆµN,−r)π∗N,) + Z
R\{0}
(1 +πN,∗ (ez−1))γ−1−γπN,∗ (ez−1)νN,(dz).
kN,(γ) converges tok(γ) as ˆµN, andπ∗N,converges and by appealing to Lebesgue’s convergence theorem using estimate (30). The integral
Z
R\{0}
(1 +π∗N,(ez−1))γ−1−γπ∗N,(ez−1)1|z|<1νN,(dz)
did already appear in (29) in the value function connected to maximising terminal wealth, where
its convergence was discussed. Then the proposition follows.
Approximating the truncated jumps by a Brownian motion preserves convergence of the value function also in the consumption case, as the next result shows:
Proposition 6. Let the value function be of the form (3). Then it holds for everyx∈R+,
→0limVW,(x) =V(x).
Proof. In this case the value function has the form VW,(x) = 1
γ(c∗W,)γ Z ∞
0
xγexp{t[−δ+γ(r+ (ˆµW,−r)π∗W,−1
2σ2()(πW,∗ )2−c∗W,) +
Z
R\{0}
(1 +πW,∗ (ez−1))γ−1−γπW,∗ (ez−1)1|z|<1νN,(dz)]}dt with
c∗W,= 1−γ δ−kW,(γ)
A Brownian approximation as in (3.2) results in an additionalσ2()-term also inkW,(γ):
kW,(γ) =γ(r+ (ˆµW,−r)π∗W,)−1
2σ2()(π∗W,)2γ(1−γ) +
Z
R\{0}
(1 +πW,∗ (ez−1))γ−1−γπW,∗ (ez−1)νN,(dz).
The convergence of the corresponding value functionVW, follows as for the caseVN, above.
6. Convergence rate for the wealth process
Still remaining is the convergence of the wealth processes. Convergence in probability of the wealth processes is clear. For convergence in L2 we can derive a rate which is, not surprisingly, proportional toσ2().
Proposition 7. The wealth process XN,(t) converges to the original process X(t) in L2. For every T <∞ it is furthermore for the case of no consumption
sup
t∈[0,T]E[|X(t)−XN,(t)|2]≤Kσ2() whereK depends onT.
Proof. We can write the difference between the wealth processes as
X(t)−XN,(t) = Z t
0
(r+ (ˆµ−r)π)X(s)−(r+ (ˆµN,−r)πN,)XN,(s)ds+ Z t
0
cX(s)−cN,XN,(s)ds +
Z t
0
Z
R\{0}
πX(s−)−πN,XN,(s−)1|z|>
(ez−1) ˜N(ds, dz)
= Z t
0
(r+ (ˆµ−r)π)(X(s)−XN,(s))ds +
Z t
0
XN,(s) (r+ (ˆµ−r)π)−(r+ (ˆµN,−r)πN,))ds
− Z t
0
c X(s)−XN,(s) ds−
Z t
0
(c−cN,)XN,(s)ds +
Z t
0
π X(s−)−XN,(s−) Z
R\{0}
(ez−1) ˜N(ds, dz) +
Z t
0
Z
R\{0}
XN,(s−)(π−πN,1|z|>)(ez−1) ˜N(ds, dz)
Then, it follows:
E[|X(t)−XN,(t)|2]≤c1E hZ t
0
X(s)−XN,(s)ds2i +c2E
hZ t
0
XN,(s)ds2in
(ˆµ−r)π−(ˆµN,−r)πN,
2o
+c3E hZ t
0
Z
R\{0}
(X(s−)−XN,(s−))(ez−1) ˜N(ds, dz)2i +c4E
hZ t
0
Z
R\{0}
XN,(s−)(π−πN,1|z|>)(ez−1) ˜N(ds, dz)2i
for constantsc1, ..., c4. The constants denoted byc1, c2 are going to vary from step to step during this proof. Fort≤T we find by Cauchy-Schwarz
E hZ t
0
X(s)−XN,(s)ds2i
≤T Z t
0
E h
X(s)−XN,(s)2i ds
E hZ t
0
XN,(s)ds2i
≤T Z t
0
E h
XN,2 i ds
Furthermore, we find
E hZ t
0
Z
R\{0}
(π(X(s−)−XN,(s−))(ez−1)ν(dz)ds)2i
=E hZ t
0
Z
R\{0}
X(s−)−XN,(s−)2
(ez−1)ν(dz)dsi
= Z
R\{0}
(ez−1)2ν(dz) Z t
0
E h
(X(s)−XN,(s))2i ds (32)
E hZ t
0
Z
R\{0}
XN,(s−)(π−πN,)1|z|>(ez−1) ˜N(ds, dz)2i
=E hZ t
0
Z
R\{0}
XN,2 (s−)(π−πN,1|z|>)2(ez−1)2ν(dz)dsi
= Z t
0
E h
XN,2 (s)i ds
Z
R\{0}
(π−πN,1|z|>)2(ez−1)2)ν(dz) (33)
Hence,
E[|X(t)−XN,(t)|2]≤c1
Z t
0
E h
X(s)−XN,(s)2i ds
+c2 Z t
0
E
XN,2 (s) dsn
(ˆµ−r)π−(ˆµN,−r)πN,2
+ Z
R\{0}
(π−πN,1|z|>)2(ez−1)2ν(dz)o Also it follows that
Z
R\{0}
(π−πN,1|z|>)2(ez−1)2ν(dz) = Z
R\{0}
(π−πN,+πN,1|z|<)2(ez−1)2ν(dz)}
≤c1
Z
R\{0}
(ez−1)2ν(dz)(π−πN,)2+c2
Z
|z|<
(ez−1)2ν(dz)
≤c1|π−πN,|2+c2σ2() (34)
and
(ˆµ−r)π−(ˆµN,−r)πN,2
= (ˆµ−r)(π−πN,) +πN,(ˆµ−µˆN,)2
≤c1|π−πN,|2+c2|ˆµ−µˆN,|2. (35)
Therefore,
E[|X(t)−XN,(t)|2]≤˜c1
Z t
0
E[ X(s)−XN,(s)2
]ds+ ˜c2 (|π−πN,|2+σ2() Z t
0
E[XN,2 (s)]ds.
Then it follows by Gronwall’s inequality:
E[|X(t)−XN,(t)|2]≤˜c2 Z t
0
e˜c1(t−s)E[XN 2 ]ds
σ2() + (π−πN )2
≤c1σ2() +c2σ4()
≤c1σ2() Hence, for everyT <∞we conclude
sup
t∈[0,T]
E[|X(t)−XN,(t)|2]≤Kσ2() whereK depends onT, and the Proposition follows.
The convergence and convergence rates in this paper are analysed for the specific case of power utility in a Merton framework. The proofs, especially for the convergence rate of the optimal control, depend on features of the concrete form of the solutions in this specific setting. In a more general setting a concrete solution is not available. Additionally it is not clear if the optimal control and the consumption rate are constant in time.