Optimal stopping with delayed information

(1)

Dept. of Math. University of Oslo Pure Mathematics No. 23 ISSN 0806–2439 June 2004

Optimal stopping with delayed information

Bernt Øksendal

^1,2

Revised in May 2005

1 Center of Mathematics for Applications (CMA) Department of Mathematics, University of Oslo P.O. Box 1053 Blindern , N–0316, Oslo, Norway and

2 Norwegian School of Economics and Business Administration, Helleveien 30, N–5045, Bergen, Norway

Abstract

We study a general optimal stopping problem for a strong Markov process in the case when there is a time lagδ >0 from the timeτ when the decision to stop is taken (a stopping time) to the time τ +δ when the system actually stops. Equivalently, we impose the constraint that the admissible times for stopping are stopping (Markov) times with respect to thedelayed flow of information. It is shown that such a problem can be reduced to a classical optimal stopping problem by a simple transformation.

The results are applied

(i) to find the optimal time to sell an asset

(ii) to solve an optimal resource extraction problem, in both cases under delayed information flow.

MSC (200): 93C41, 93E20, 60J25, 91B28

Key words: Optimal stopping, delayed information flow, strong Markov processes, optimal time to sell, optimal time to stop resource extraction.

1 Introduction

Let Y(t) be a strong Markov process in R^k on a filtered probability space (Ω,F,{F_t}t≥0, {P^y}y∈Rⁿ), where P^y is the probability measure giving the law of {Y(t)}t≥0 when Y(0) = y∈R^k. Letδ ≥0 be a fixed constant. In this paper we consider optimal stopping problems of the form

(1.1) Φ_δ(y) := sup

α∈T_δ

E^yhZ α 0

f(Y(t))dt+g(Y(α))i

(2)

where E^y denotes expectation with respect to P^y and f : R^k → R, g : R^k → R are given continuous functions such that

(1.2) E^yhZ α

0

|f(Y(t))|dt+|g(Y(α))|i

<∞ for all α∈ T_δ

where we interpret g(Y(α)) as 0 if α = ∞. Here T_δ is the set of δ-delayed stopping times, defined as follows

Definition 1.1 A function α: Ω→[δ,∞] is called a δ-delayed stopping time if (1.3) {ω;α(ω)≤t} ∈ F_t−δ for all t ≥δ

or, equivalently,

(1.4) {ω;α(ω)≤s+δ} ∈ Fs for all s≥0 The set of all δ-delayed stopping times is denoted by T_δ.

In other words, if we interpretα(ω) as the time to stop, thenα∈ T_δif the decision whether or not to stop at or before timetis based on the information represented byFt−δ. In particular, if δ= 0 thenTδ =T0 is the family of classical stopping times and (1.1) becomes the classical optimal stopping problem, discussed in many texts (see e.g. [Ø, Ch. 10]).

In the delayed case problem (1.1) models the situation when there is a delay δ > 0 in the flow of information available to the agent searching for the optimal time to stop. An alternative way of stating this, is that there is a delay δ > 0 from the decided stopping time τ ∈ T₀ (based on the complete current information available from the system) to the time α =τ+δ ∈ Tδ when the system actually stops. This new formulation is based on the following simple observation:

Lemma 1.2 (i) τ ∈ T₀ ⇐⇒α:=τ+δ∈ T_δ (ii) α∈ T_δ ⇐⇒τ :=α−δ∈ T₀

Proof. It suffices to prove (i).

First, assume τ ∈ T₀. Then, for t≥δ,

{ω;τ(ω) +δ≤t}={ω;τ(ω)≤t−δ} ∈ F_t−δ, and hence α:=τ+δ∈ T_δ.

Conversely, if α:=τ +δ∈ Tδ then

{ω;τ(ω)≤t}={ω;τ(ω) +δ ≤t+δ}={ω;α(ω)≤t+δ} ∈ F_(t+δ)−δ =F_t,

and hence τ ∈ T₀.

(3)

Remark 1.3 In view of this result we see that it is possible to give another interpretation of problem (1.1), namely

(1.5) Φδ(y) = sup

τ∈T0

E^y

hZ τ+δ 0

f(Y(t))dt+g(Y(τ+δ)) i

In this formulation the problem appears as an optimal stopping problem over classical stopping times τ ∈ T₀, but with delayed effect of the stopping: If the stopping time τ ∈ T₀ is chosen, then the system itself is stopped at time τ +δ, i.e. after a delay δ >0.

Note that T_δ⊂ T₀ for δ >0 and hence

Φ_δ(y)≤Φ₀(y)

and we can interpret Φ₀(y)−Φ_δ(y) as the loss of value due to the delay of information.

In this paper we show that the delayed optimal stopping problem (1.1) can be reduced to a classical optimal stopping problem by a simple transformation (Theorem 2.1).

We call α^∗ ∈ T_δ an optimal stopping time for the problem (1.1) if (1.6) Φ_δ(y) = E^yhZ α^∗

0

f(Y(t))dt+g(Y(α^∗))i .

This paper may be regarded as a partial extension of [AK2], where the geometric Brow- nian motion case is studied and solved (see Example 3.1), with a more general (Markovian) delay δ(X) ≤ 0. See also [AK1]. For a related type of problem involving impulse control with delivery lags see [BS].

2 Optimal stopping with δ-delayed information

We are now ready to state and prove the main result of this paper:

Theorem 2.1 a) Consider the two optimal stopping problems:

Φ_δ(y) := sup

α∈T_δ

E^yhZ α 0

f(Y(t))dt+g(Y(α))i (2.1)

Φ(y) := sup˜

τ∈T₀

E^yhZ τ 0

f(Y(t))dt+ ˜g_δ(Y(τ))i (2.2)

where

(2.3) g˜_δ(y) = E^yhZ δ 0

f(Y(t))dt+g(Y(δ))i . Then we have

Φδ(y) = ˜Φ(y) for all y∈R^k, δ≥0.

(4)

b) Moreover, α^∗ ∈ T_δ is an optimal stopping time for the delayed problem (2.1) if and only if

(2.4) α^∗ :=τ^∗+δ

where τ^∗ ∈ T₀ is an optimal stopping time for the non-delayed problem (2.2).

Proof. a) Define

(2.5) J^(α)(y) =E^yhZ α 0

f(Y(t))dt+g(Y(α))i

; α∈ T_δ, and

(2.6) J˜^(τ)(y) = E^yhZ τ 0

f(Y(t))dt+ ˜g_δ(Y(τ))i

; τ ∈ T₀. Choose α∈ T_δ and put

τ =α−δ∈ T0

Then α=τ+δ and hence J^(α)(y) =E^yhZ α

0

f(Y(t))dt+g(Y(α))i

=E^yhZ τ+δ 0

f(Y(t))dt+g(Y(τ +δ))i

=E^yhZ τ 0

f(Y(t))dt+ Z τ+δ

τ

f(Y(t))dt+g(Y(τ +δ))i

=E^yhZ τ 0

f(Y(t))dt+E^yh

θ_τnZ δ 0

f(Y(t))dt+g(Y(δ))oii , (2.7)

where θ_τ is theshift operator, defined by

θ_τ{h(Y(s))}=h(Y(τ +s)) for s≥0, for all measurable h:R^k −→R, and we have used that

θτ

Z δ 0

f(Y(t))dt

= Z τ+δ

τ

f(Y(t))dt.

We refer to [BG] for more information about Markov processes. By the strong Markov property we now get from (2.7) that

J^(α)(y) =E^yhZ τ 0

f(Y(t))dt+E^yh

θ_τnZ τ 0

f(Y(t))dt+g(Y(δ))o F_τii

=E^yhZ τ 0

f(Y(t))dt+E^Y^(τ)hZ δ 0

f(Y(t))dt+g(Y(δ))ii

=E^yhZ τ 0

f(Y(t))dt+ ˜g_δ(Y(τ))i

= ˜J^(τ)(y).

(2.8)

(5)

Hence, by Lemma 1.2 (ii),

Φδ(y) = sup

α∈T_δ

J^(α)(y) = sup

α−δ∈T0

J^(α)(y)

= sup

α−δ∈T0

J˜^(α−δ)(y) = sup

τ∈T0

J˜^(τ)(y) = ˜Φ(y), as claimed.

b) Supposeτ^∗ ∈ T₀ is optimal for (2.2). Define α^∗ :=τ^∗+δ.

Then α^∗ ∈ Tδ by Lemma 1.2 and by (2.8) combined with a) we have J^(α^∗⁾(y) = ˜J^(τ^∗⁾(y) = ˜Φ(y) = Φ_δ(y).

Hence α^∗ is optimal for (2.1).

Conversely, if α^∗ ∈ T_δ is optimal for (2.1) a similar argument gives that τ^∗ :=α^∗−δ is

optimal for (2.2).

3 Application 1: The optimal time to sell an asset

In this section we illustrate Theorem 2.1 by solving the following problem:

Example 3.1 (The optimal time to sell an asset)

This case (without the jump part) was first solved by [AK1], with a more general (Markovian) delay δ(X)≥0.

Suppose the value X(t) of an asset at time t is modelled by a geometric L´evy process of the form

(3.1) dX(t) = X(t⁻)h

µ dt+σ dB(t) + Z

R

zN˜(dt, dz)i

, X(0) =x >0, whereµ, σandxare constants. HereB(t) andη(t) := Rt

0

R

RzN˜(ds, dz) is a Brownian motion and an independent pure jump L´evy process, respectively, where

N˜(dt, dz) =N(dt, dz)−ν(dz)dt

is the compensated Poisson random measure ofη(·),N(dt, dz) is the Poisson random measure of η(·) and ν(dz) is the L´evy measure of η(·). We assume that

(3.2) 0≥z ≥ −1 a.s. ν.

(6)

This guarantees that X(t) never jumps down to a negative value. For convenience, we also assume that

(3.3) E[η²(t)]<∞ for all t≥0.

Then by the Itˆo formula for L´evy processes (see e.g. [ØS]) the solution of equation (3.1) is X(t) =xexph

(µ− ¹₂σ²)t+σB(t) +

Z t 0

Z

R

{ln(1 +z)−z}ν(dz)ds+ Z t

0

Z

R

ln(1 +z) ˜N(ds, dz)i

; t ≥0.

(3.4)

We now study the following problem

(3.5) Φδ(s, x) = sup

α∈T_δ

E^s,x[e^−ρ(s+α)(X(α)−q)],

where E^s,x denotes expectation with respect to the probability law P^s,x of the time-space process

dY(t) = dt

dX(t)

; Y(0) = s

x

and ρ >0,q >0 are constants. We assume that

(3.6) ρ > µ.

One possible interpretation of this problem is that Φ_δ(s, x) represents the maximal expected discounted net payment obtained by selling the asset at a δ-delayed stopping time (ρ is the discounting exponent and q is the transaction cost).

It is well-known that in the no delay case (δ= 0) the solution of the problem (3.5) is the following (under some additional assumptions on the L´evy measure ν):

(3.7) Φ0(s, x) = e^−ρsΨ0(x)

where

(3.8) Ψ₀(x) =

(x−q ; x≥x^∗₀ C₀x^λ ; 0< x < x^∗₀ Hereλ >1 is uniquely determined by the equation

(3.9) −ρ+µλ+¹₂σ²λ(λ−1) + Z

R

{(1 +z)^λ−1−λ z}ν(dz) = 0, and x^∗₀ and C₀ are given by

x^∗₀ = λ q λ−1 (3.10)

C₀ = 1

λ(x^∗₀)^1−λ. (3.11)

(7)

The corresponding optimal stopping time τ^∗ ∈ T₀ is

(3.12) τ^∗ = inf{t >0;X(t)≥x^∗₀}

Thus it is optimal to sell at the first time the priceX(t) equals or exceeds the value x^∗₀. We refer to [ØS, Example 2.5] for details.

To find the solution in the delay case (δ >0) we note that we have f = 0 and g(y) =g(s, x) =e^−ρs(x−q)

Hence, by (2.2),

˜

g_δ(y) =E^y[g(Y(δ))] =E^s,x[e^−ρ(s+δ)(X(δ)−q)]

=e^−ρ(s+δ)(E^x[X(δ)]−q) = e^−ρ(s+δ)(x e^µδ−q)

=e^{−ρs+δ(µ−ρ)}(x−q e^−µδ) =K e^−ρs(x−q),˜ (3.13)

where

(3.14) K =e^δ(µ−ρ) and q˜=q e^−µδ.

Thus ˜g_δ has the same form as g, so we can apply the results (3.7)–(3.12) to find ˜Φ(y) and the corresponding optimalτ^∗:

(3.15) Φ(y) = ˜˜ Φ(s, x) =e^−ρsΨ(x)˜ where

(3.16) Ψ(x) =˜

(K(x−q)˜ ; x≥x˜^∗ C x˜ ^λ ; 0< x <x˜^∗, with λ as in (3.9). Here ˜x^∗ and ˜C are given by

˜

x^∗ = λq˜ λ−1 (3.17)

C˜ = 1

λ(˜x^∗)^1−λ. (3.18)

The corresponding optimal stopping time for problem (2.2) and (2.1), respectively, is

˜

τ^∗ = inf{t >0;X(t)≥x˜^∗} (3.19)

α^∗ = ˜τ^∗+δ.

(3.20)

Using Theorem 2.1 we conclude the following:

(8)

Theorem 3.2 The value function Φ_δ(y) for the delayed optimal stopping problem (3.5) is given by

Φ_δ(y) = ˜Φ(y),

where Φ˜ is as in (3.15)–(3.18). The corresponding optimal stopping time α^∗ ∈ T_δ is α^∗ = inf{t >0;X(t)≥x˜^∗}+δ.

Remark 3.3 Assume for example that

µ >0.

Then comparing (3.17) with the non-delayed case (3.10) we see that ˜q > q and hence

˜ x^∗ < x^∗₀

Thus, in terms of thedelayed effect of the stopping time formulation (see (1.5)), it is optimal to stop at the first timet= ˜τ^∗ whenX(t)≥x˜^∗. This is sooner than in the non-delayed case, because of the anticipation that during the delay time interval [τ^∗, τ^∗+δ] X(t) is likely to increase (since µ >0). See Figure 1.

t τ₀^∗

z }| {

δ

α^∗= ˜τ^∗+δ

˜ τ^∗

˜

x^∗=^λqe_λ−1^−µδ (delay case)

˜ x^∗

x^∗=_λ−1^λq x^∗₀ (δ= 0 case)

X(t)

Figure 1. The optimal stopping times for Example 3.1 (µ >0)

(9)

4 Application 2: An optimal resource extraction prob- lem

In the no delay case the following example was discussed in [Ø] (continuous case) and [ØS]

(jump diffusion case). Our example models the situation when there is a time lag δ > 0 between the decided stopping time τ ∈ T0 and the time α =τ +δ ∈ Tδ when the result of the stopping decision comes into effect.

Example 4.1 (Optimal time to stop resource extraction) Suppose the price P(t) at time t per unit of a resource (oil, gas, . . . ) is given by

(4.1) dP(t) = P(t⁻)h

µdt+σdB(t) + Z

R

zN(dt, dz)˜ i

; P(0) =p >0

where, as in Example 3.1, µand σ are given constants and we assume that z ≥0 a.s. with respect to ν.

Let Q(t) denote the amount of remaining resources at time t. As long as the extraction field is open, we assume that the extraction rate is proportional to the remaining amount, i.e.

(4.2) dQ(t) = −λQ(t)dt; Q(0) =q >0

where λ >0 is a known constant.

If we decide to stop the extraction and close the field at a (delayed) stopping timeα ∈ T_δ, then the expected total discounted net profit J^α(s, p, q) is assumed to have the form

(4.3) J^α(s, p, q) =E^(s,p,q)h

α

Z

0

e^−ρ(s+t)(λP(t)Q(t)−K)dt+θe^−ρ(s+α)P(α)Q(α)i

where K >0 is the (constant) running cost rate and ρ >0, θ >0 are other constants. The expectationE^(s,p,q) is taken with respect to the probability law P^(s,p,q) of the strong Markov process

(4.4) Y(t) :=



 s+t P(t) Q(t)



, which starts at y=



 s p q



 at time t= 0.

The explanation of the quantity J^α(s, p, q) in (4.3) is the following:

As long as the field is open (i.e. as long as t < α) the gross income rate from the production is price times production rate, i.e. P(t)λQ(t). Subtracting the running cost rate K we get the net profit rate

λP(t)Q(t)−K for 0≤t < α.

(10)

If the field is closed at time α the net value of the remaining resources is estimated to be θP(α)Q(α). Discounting and integrating/adding these quantities and taking expectation we get (4.3).

We want to find the value function Φ_δ(s, p, q) and the corresponding optimal delayed stopping time α^∗ ∈ Tδ such that

(4.5) Φ_δ(y) = Φ_δ(s, p, q) = sup

α∈T_δ

J^α(s, p, q) =J^α^∗(s, p, q)

In the case of no delay (δ = 0) it is shown in [ØS, p. 158–162] that if the following relations between the parameters hold:

(4.6) 0< θ(λ+ρ−µ)< λ

then the optimal stopping time τ₀^∗ ∈ T0 is

(4.7) τ₀^∗ = inf{t >0;P(t)Q(t)≤w^∗₀}, where

(4.8) w^∗₀ = (−r2)K(λ+ρ−µ)

(1−r₂)ρ(λ−θ(λ+ρ−µ)), r₂ <0 being the negative solution of the equation

(4.9) h(r) := −ρ+ (µ−λ)r+ ¹₂σ²r(r−1) + Z

R

{(1 +z)^r−1−rz}ν(dz) = 0

In this case we have

f(y) = f(s, p, q) = e^−ρs(λpq−K) and

g(y) =g(s, p, q) =θe^−ρspq Thus

˜

g_δ(y) =E^yh

δ

Z

0

e^−ρ(s+t)(λP(t)Q(t)−K)dti

+E^y[θe^−ρ(s+δ)P(δ)Q(δ)]

=

δ

Z

0

e^−ρ(s+t)(λE[P(t)Q(t)]−K)dt+θe^−ρ(s+δ)E^y[P(δ)Q(δ)]

=

δ

Z

0

e^−ρ(s+t)(λpqe^(µ−λ)t−K)dt+θe^−ρ(s+δ)pqe^(µ−λ)δ

(11)

=e^−ρsh

{(λ+ρ−µ)⁻¹λ(1−e^{−(λ+ρ−µ)δ}) +θe^{−(λ+ρ−µ)δ}}pq− K

ρ (1−e^−ρδ)i

=e^−ρs[F₁pq−F₂], where (4.10)

F₁ = (λ+ρ−µ)⁻¹λ(1−e^{−(λ+ρ−µ)δ}) +θe^{−(λ+ρ−µ)δ} (4.11)

and F₂ = K

ρ (1−e^−ρδ) (4.12)

Therefore, according to Theorem 2.1 we have (4.13) Φ_δ(y) = sup

τ∈T0

E^yh

τ

Z

0

e^−ρ(s+t)(λP(t)Q(t)−K)dti

+E^y[e^−ρ(s+τ)(F₁P(τ)Q(τ) +F₂)]

The method used in [ØS] to provide the solution (4.7)–4.9) in the no delay case can easily be modified to find the optimal stopping time τ^∗ for the problem (4.13). The result is (4.14) w^∗_δ = (−r₂)K(λ+ρ−µ)e^(λ−µ)δ

(1−r)ρ[λ−θ(λ+ρ−µ)] =w₀^∗e^(λ−µ)δ. We have proved:

Theorem 4.2 The optimal stopping time α^∗ ∈ T_δ for the delayed optimal stopping problem is

(4.15) α^∗ =τ_δ^∗+δ,

where

(4.16) τ_δ^∗ = inf{t >0;P(t)Q(t)≤w^∗_δ}, with w^∗_δ given by (4.14).

Remark 4.3 Note that the thresholdw_δ^∗ for the decision to close down in the case of a time lag in the action only differs from the corresponding threshold w₀^∗ in the no delay case by the factore^(λ−µ)δ.

Assume, for example, thatλ > µ. Then we should decide to stopsoonerin the delay case than in the no delay case, because of the anticipation that P(t)Q(t) will probably decrease during the extra time δ it takes before the closing down actually takes place.

References

[AK1] L. H. R. Alvarez and J. Keppo: The impact of delivery lags on irreversible investment demand under uncertainty. Manuscript March 1998.

(12)

[AK2] L. H. R. Alvarez and J. Keppo: The impact of delivery lags on irreversible investment under uncertainty. European J. Operational Research 136 (2002), 173–180.

[BG] R. M. Blumenthal and R. K. Getoor: Markov Processes and Potential Theory. Aca- demic Press 1968.

[BS] A. Bar-Ilan and A. Sulem: Explicit solution of inventory problems with delivery lags.

Math. Operations Research 20 (1995), 709–720.

[Ø] B. Øksendal: Stochastic Differential Equations. 6th edition. Springer-Verlag 2003.

[ØS] B. Øksendal and A. Sulem: Applied Stochastic Control of Jump Diffusions. Springer- Verlag 2004.

[S] A: Shiryaev: Optimal Stopping Rules. Springer-Verlag 1978.