Optimal control with partial information for stochastic Volterra equations

(1)

Volume 2010, Article ID 329185,25pages doi:10.1155/2010/329185

Research Article

Optimal Control with Partial Information for Stochastic Volterra Equations

Bernt Øksendal

^{1, 2}

and Tusheng Zhang

³

1CMA and Department of Mathematics, University of Oslo, P.O. Box 1053 Blindern, 0316 Oslo, Norway

2Norwegian School of Economics and Business Administration (NHH), Helleveien 30, 5045 Bergen, Norway

3School of Mathematics, University of Manchester, Oxford Road, Manchester M13 9PL, UK

Correspondence should be addressed to Bernt Øksendal,oksendal@math.uio.no Received 26 October 2009; Revised 26 February 2010; Accepted 9 March 2010 Academic Editor: Agn`es Sulem

Copyrightq2010 B. Øksendal and T. Zhang. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

In the first part of the paper we obtain existence and characterizations of an optimal control for a linear quadratic control problem of linear stochastic Volterra equations. In the second part, using the Malliavin calculus approach, we deduce a general maximum principle for optimal control of general stochastic Volterra equations. The result is applied to solve some stochastic control problem for some stochastic delay equations.

1. Introduction

Let Ω,F,Ft, Pbe a filtered probability space and Bt, t ≥ 0 a Ft-real valued Brownian motion. LetR₀ R\ {0}andνdzaσ-finite measure onR0,BR0. LetNdt, dzdenote a stationary Poisson random measure onR×R0with intensity measuredtνdz. Denote by Ndt, dz Ndt, dz−dtνdzthe compensated Poisson measure. Suppose that we have a cash flow where the amountXtat timetis modelled by a stochastic delay equation of the form:

dXt

A₁tXt A₂tXt−h _t

t−hA₀t, sXsds

dt C1tdBt

R0

C2t, zNdt, dz; t≥0,

Xt ηt; t∈−h,0. 1.1

(2)

Hereh >0 is a fixed delay andA1t, A2t, A0t, s,C1t, C2t, z,andηare given bounded deterministic functions.

Suppose that we consume at the rate ut at time tfrom this wealth Xt, and that this consumption rate influences the growth rate ofXtboth through its valueutat time tand through its former value ut−h, because of some delay mechanisms in the system determining the dynamics ofXt.

With such a consumption rateutthe dynamics of the corresponding cash flowX^ut is given by

dX^ut

A1tX^ut A2tX^ut−h _t

t−hA0t, sX^usds B1tut B2tut−h

dtC1tdBt

R0

C2t, zNdt, dz; t∈−h,0, X^ut ηt; t≤0,

1.2

whereB1tand B2tare deterministic bounded functions.

Suppose that the consumer wants to maximize the combined utility of the consumption up to the terminal timeTand the terminal wealth. Then the problem is to findu·such that

Ju:E _T

0

U₁t, utdtU₂X^uT

1.3

is maximal. Here Ut,· and U2· are given utility functions, possibly stochastic. See Section 4.

This is an example of a stochastic control problem with delay. Such problems have been studied by many authors. See, for example,1–5 and the references therein. The methods used in these papers, however, do not apply to the cases studied here. Moreover, these papers do not consider partial information controlsee below.

It was shown in 6 that the system1.2 is equivalent to the following controlled stochastic Volterra equation:

X^ut _t

0

Kt, susds _t

0

Φt, sCsdBs _t

0

R0

Φt, sC2s, zNds, dz

Φt,0η0 ₀

−hΦt, shA2shηsds

₀

−h

h

0 Φt, τA0τ, sdτ ηsds,

1.4

(3)

where

Kt, s Φt, sB1s Φt, shB2sh, 1.5

andΦis the transition function satisfying

∂Φ

∂t A₁tΦt, s A₂tΦt−h, s _t

t−hA₀t, τΦτ, sdτ, Φs, s I; Φt, s 0 fort < s.

1.6

So the control of the system1.2reduces to the control of the system1.4. For more information about stochastic control of delay equations we refer to 6 and the references therein.

Stochastic Volterra equations are interesting on their own right, also for applications, for example, to economics or population dynamics. See, for example, Example 1.1 in7and the references therein.

In the first part of this paper, we study a linear quadratic control problem for the following controlled stochastic Volterra equation:

X^ut ξt _t

0

K1t, sX^us D1t, sus K2t, sdBs

_t

0

R0

K₄t, s, zX^usNds, dz _t

0

D₂t, sX^usds

_t

0

R0

D3t, s, zusNds, dz _t

0

R0

K5t, s, zNds, dz

_t

0

K₃t, susds,

1.7

whereutis our control process andξtis a given predictable process withEξ²t<∞for allt≥0, whileKi, Diare bounded deterministic functions. In reality one often does not have the complete information when performing a control to a system. This means that the control processes is required to be predictable with respect to a subfiltration{Gt}withGt⊂ Ft. So the space of controls will be

U

us;usisGt-predictable and such thatE _T

0

|us|²ds

<∞

. 1.8

Uis a Hilbert space equipped with the inner product

u1, u2E T

0

u1su2sds

. 1.9

(4)

|| · ||will denote the norm inU. LetAG be a closed, convex subset ofU, which will be the space of admissible controls. Consider the linear quadratic cost functional

Ju E

_T

0

Q1su²sds _T

0

Q2sX^us²ds _T

0

Q3susds

_T

0

Q4sXûsdsa1XûT²a2XûT

1.10

and the value function

J inf

u∈AGJu. 1.11

InSection 2, we prove the existence of an optimal control and provide some characterizations for the control.

In the second part of the paperfromSection 3, we consider the following general controlled stochastic Volterra equation:

X^ut ξt _t

0

bt, s, X^us, us, ωds _t

0

σt, s, X^us, us, ωdBs

_t

0

R0

θt, s, X^us, us, z, ωNds, dz,

1.12

whereξtis a given predictable process withEξ²t < ∞for allt ≥ 0. The performance functional is of the following form:

Ju E

_T

0

ft, X^ut, ut, ωdtgX^uT, ω

, 1.13

whereb:0, T×0, T×R×R×Ω → R,σ:0, T×0, T×R×R×Ω → R,θ:0, T×0, T× R×R×R₀×Ω → Randf:0, T×R×R×Ω → RareFt-predictable andg:R×Ω → Ris FTmeasurable and such that

E _T

0

ft, X^ut, utdtgX^uT

<∞, 1.14

for anyu∈ AG, the space of admissible controls. The problem is to findu∈ AGsuch that Φ: sup

u∈AG

Ju Ju. 1.15

Using the Malliavin calculus, inspired by the method in8, we will deduce a general maximum principle for the above control problem.

(5)

Remark 1.1. Note that we are oﬀthe Markovian setting because the solution of the Volterra equation is not Markovian. Therefore the classical method of dynamic programming and the Hamilton-Jacobi-Bellman equation cannot be used here.

Remark 1.2. We emphasize that partial information is diﬀerent from partial observation, where the control is based on noisy observations of the currentstate. For example, our discussion includes the case Gt Ft−δ δ > 0 constant, which corresponds to delayed information flow. This case is not covered by partial observation models. For a comprehensive presentation of the linear quadratic control problem in the classical case with partial observation, see9, with partial information see10.

2. Linear Quadratic Control

Consider the controlled stochastic Volterra equation 1.7 and the control problem 1.10, 1.11. We have the following Theorem.

Theorem 2.1. Suppose that

R0K₄²t, s, zνdzis bounded andQ₂s≥0,a₁ ≥0 andQ₁s≥δ for someδ >0. Then there exists a unique elementu∈ AGsuch that

J Ju inf

v∈AGJv. 2.1

Proof. For simplicity, we assumeD₃t, s, z 0 andK₅t, s, z 0 in this proof because these terms can be similarly estimated as the corresponding terms for Brownian motionB·. By 1.7we have

E X^ut²

≤7E ξt²

7E

⎡

⎣_t

0

K₁t, sX^usdBs

2⎤

⎦7E

⎡

⎣_t

0

D₁t, susdBs

2⎤

⎦

7E

⎡

⎣_t

0

K₂t, sdBs

2⎤

⎦7E

⎡

⎣_t

0

K₃t, susds

2⎤

⎦

7E

⎡

⎣_t

0

D₂t, sX^usds

2⎤

⎦7E

⎡

⎣_t

0

R0

K₄t, s, zX^usNds, dz

2⎤

⎦

≤7E ξt²

7E _t

0

K₁²t, sX^us²ds

7E _t

0

D₁²t, sus²ds

7 _t

0

K²₂t, sds7 _t

0

K²₃t, sdsE t

0

u²sds

7tE t

0

D²₂t, sX^us²ds

7E t

0

R0

K²₄t, s, zνdz X^us²ds

. 2.2

(6)

Applying Gronwall’s inequality, there exists a constantC1such that

E X^ut²

≤

C₁E _t

0

u²sds

C₁ e^C¹^T. 2.3

Similar arguments also lead to

E

X^u¹t−X^u²t²

≤C2e^C²^T

⎛

⎝E

⎡

⎣t 0

K3t, su2s−u1sds

2⎤

⎦

E _t

0

D₁t, s²u2s−u₁s²ds

2.4

for some constantC2. Now, letun ∈ AGbe a minimizing sequence for the value function, that is, lim_n_{→ ∞}Jun J. From the estimate2.3we see that there exists a constantcsuch that

E _T

0

Q₃susds _T

0

Q₄sX^usdsa₂X^uT

≤cuc. 2.5

Thus, by virtue of the assumption onQ1, we have, for some constantM,

M≥Jun≥δun²−cun −c. 2.6

This implies that {un} is bounded in U, hence weakly compact. Let u_n_k, k ≥ 1 be a subsequence that converges weakly to some elementu0inU. SinceAGis closed and convex, the Banach-Sack Theorem impliesu₀ ∈ A_G. From2.4we see thatu_n → uinUimplies that Xûⁿt → XûtinL²Ωfor everyt≥0 andXûⁿ· → Xû·inU. The same conclusion holds also forZût : Xût−X⁰t. SinceZû is linear inu, we conclude that equipped with the weak topology both onUandL²Ω,Zût:U → L²Ωis continuous for everyt≥ 0 and Zû·:U → Uis continuous. Thus,

X^ut:U−→L²Ω, X^u·:U−→U 2.7

are continuous with respect to the weak topology ofUandL²Ω. Since the functionals ofX^u involved in the definition ofJuin1.10are lower semicontinuous with respect to the weak

(7)

topology, it follows that

klim→ ∞Junk lim

k→ ∞E _T

0

Q₁su²_n_ksds _T

0

Q₂sX^u^nks²ds _T

0

Q₃sunksds

_T

0

Q₄sXû^nksdsa₁Xû^nkT²a₂Xû^nkT

≥E _T

0

Q1su²₀sds _T

0

Q2sX^u⁰s²ds _T

0

Q3su0sds

_T

0

Q₄sXû⁰sdsa₁Xû⁰T²a₂Xû⁰T

Ju0,

2.8

which implies thatu0is an optimal control.

The uniqueness is a consequence of the fact thatJuis strictly convex inuwhich is due to the fact thatX^uis aﬃne inuandx²is a strictly convex function. The proof is complete.

To characterize the optimal control, we assumeD1t, s 0 andD3t, s, z 0; that is, consider the controlled system:

X^ut ξt _t

0

K1t, sX^us K₂t, sdBs _t

0

K₃t, susds

_t

0

R0

K4t, s, zX^usNds, dz _t

0

D2t, sX^usds

_t

0

R0

K₅t, s, zNds, dz

2.9

Set

dFt, s:d_sFt, s K1t, sdBs

R0

K4t, s, zNds, dz D2t, sds. 2.10

For a predictable processhs, we have _t

0

hsdFt, s: _t

0

K1t, shsdBs _t

0

R0

K4t, s, zhsNds, dz _t

0

D2t, shsds.

2.11

(8)

Introduce

M₁t ξt ^∞

n1

_t

0

dFt, s1 _s₁

0

dFs1, s₂

· · · _s_n−1

0

ξsndFsn−1, sn,

M₂t _t

0

K₂t, s1dBs1

∞ n1

_t

0

dFt, s1 _s₁

0

dFs1, s₂

· · · _s_n−2

0

dFsn−2, s_n−1 _s_n−1

0

K2sn−1, sndBsn,

M₃t _t

0

R0

K₅t, s1, zdNds 1, dz ^∞

n1

_t

0

dFt, s1 _s₁

0

dFs1, s₂

· · · _s_n−2

0

dFs_n−2, s_n−1 _s_n−1

0

K5s_n−1, sn, zdNds n, dz,

Lt, s K₃t, s ^∞

n1

_t

s

dFt, s1 _s₁

s

dFs1, s₂

· · · _s_n−1

s

K₃sn, sdFs_n−1, s_n.

2.12

Lemma 2.2. Under our assumptions, the above series converges at least inL¹Ω. ThusM_i, i1,2,3 andLare well-defined.

Proof. We first note that

E

⎡

⎣_t

0

hsdFt, s

2⎤

⎦E _t

0

K²₁t, sh²sds

E _t

0

R0

K²₄t, s, zh²sνdzds

E

⎡

⎣_t

0

D2t, shsds

2⎤

⎦≤CTE _t

0

gt, sh²sds

2.13

fort≤T, where

gt, s K₁²t, s

R0

K₄²t, s, zνdz D²₂t, s 2.14

(9)

is a bounded deterministic function. Because of the similarity, let us prove only thatM1 is well-defined. Repeatedly using2.13, we have

E

⎡

⎣_t

0

dFt, s1 _s₁

0

dFs1, s₂· · · _s_n−1

0

ξsndFs_n−1, s_n

2⎤

⎦

≤C_T _t

0

ds₁gt, s1Es1

0

dFs1, s₂· · · _s_n−1

0

ξsndFs_n−1, s_n 2

≤ · · ·

≤Cⁿ⁻¹_T _t

0

ds₁gt, s1 _s₁

0

ds₂gs1, s₂· · · _s_n−1

0

ds_ngsn−1, s_nE ξ²sn

≤Rⁿ⁻¹_T E _T

0

ξ²sds tⁿ⁻¹

n−1!

2.15

for some constantR_T. This implies that

E

_t

0

dFt, s1 _s₁

0

dFs1, s₂· · · _s_n−1

0

ξsndFs_n−1, s_n

≤R^n−1/2_T

E _T

0

ξ²sds

1/2 t^n−1/2 n−1!.

2.16

Thus, we have

E|M1t|≤E|ξt| ^∞

n1

R^n−1/2_T

E _T

0

ξ²sds

1/2 t^n−1/2

n−1! <∞. 2.17

The following theorem is a characterization of the optimal control.

Theorem 2.3. Assume that

R0K₄²t, s, zνdz and

R0K²₅t, s, zνdz are bounded and E_T

0 ξ²sds < ∞. SupposeAG U. Letube the unique optimal control given inTheorem 2.1.

Thenuis determined by the following equation:

2Q1sus 2E _T

0

ut _T

s∨tQ2lLl, tLl, sdl dt| Gs

2a₁E _T

0

utLT, tLT, sdt| Gs

Q₃s E _T

s

Q₄lLl, sdl| Gs

2E T

s

Q2lM1l M2l M3lLl, sdl| Gs

a2ELT, s| Gs 2a₁EM1T M₂T M₃TLT, s| Gs 0,

2.18

almost everywhere with respect tomds, dω:ds×Pdω.

(10)

Proof. For anyw∈U, sinceuis the optimal control, we have

Juw d

dεJuεw

ε00. 2.19 This leads to

E

2 _T

0

Q1suswsds2 _T

0

Q2sX^usd

dεX^uεws

ε0

ds

_T

0

Q₃swsds _T

0

Q₄sd

dεX^uεws

ε0

ds

2a1X^uTd

dεX^uεwT ε0a2

d

dεX^uεwT ε0

0

2.20

for allw∈U. By virtue of2.9, it is easy to see that

Y^wt: d

dεX^uεwt

ε0 2.21 satisfies the following equation:

Y^wt _t

0

K₁t, sY^wsdBs _t

0

K₃t, swsds

_t

0

E

K₄t, s, zY^wsNds, dz _t

0

D₂t, sY^wsds.

2.22

Remark that Y^w is independent of u. Next we will find an explicit expression for X^u. Let dFt, sbe defined as in2.10. Repeatedly using2.9we have

X^ut ξt _t

0

K1t, s1X^us1 K₂t, s1dBs1

_t

0

K₃t, s1us1ds1

_t

0

R0

K₄t, s1, zX^us1Nds 1, dz _t

0

D₂t, s1X^us1ds

_t

0

R0

K5t, s1, zNds 1, dz

(11)

ξt _t

0

K1t, s1

ξs1

_s₁

0

K1s1, s2X^us2 K2s1, s2dBs2

_s₁

0

R0

K4s1, s2, zX^us2Nds 2, dz _s₁

0

K3s1, s2us2ds2

_s₁

0

D2s1, s2X^us2ds2 _s₁

0

R0

K5s1, s2, zNds 2, dz

dBs1

_t

0

R0

K₄t, s1, z

ξs1

_s₁

0

K1s1, s₂X^us2 K₂s1, s₂dBs2

_s₁

0

R0

K4s1, s2, zX^us2Nds 2, dz _s₁

0

K3s1, s2us2ds2

_s₁

0

D₂s1, s₂X^us2ds2 _s₁

0

R0

K₅s1, s₂, zNds 2,dz

Nds 1, dz

_t

0

R0

D2t, s1, z

ξs1

_s₁

0

K1s1, s2X^us2 K2s1, s2dBs2

_s₁

0

R0

K₄s1, s₂, zX^us2Nds 2, dz _s₁

0

K₃s1, s₂us2ds2

_s₁

0

D2s1, s2X^us2ds2 _s₁

0

R0

K5s1, s2, zNds 2, dz

ds1

_t

0

K2t, s1dBs1

_t

0

K3t, s1us1ds1 _t

0

R0

K5t, s1, zNds 1, dz · · ·

ξt ^∞

n1

_t

0

dFt, s1 _s₁

0

dFs1, s2· · · _s_n−1

0

ξsndFs_n−1, sn

^∞

n1

_t

0

dFt, s1 _s₁

0

dFs1, s2

· · · _s_n−2

0

dFsn−2, s_n−1 _s_n−1

0

K₂sn−1, s_ndBsn ^∞

n1

_t

0

dFt, s1 _s₁

0

dFs1, s2

· · · _s_n−2

0

dFsn−2, s_n−1 _s_n−1

0

K₃sn−1, s_nusndsn

^∞

n1

_t

0

dFt, s1 _s₁

0

dFs1, s2

· · · _s_n−2

0

dFsn−2, s_n−1 _s_n−1

0

R0

K₅sn−1, s_n, zNds n, dz

_t

0

K₂t, s1dBs1

_t

0

K₃t, s1us1ds1

_t

0

R0

K5t, s1, zNds 1, dz.

2.23

(12)

Similarly, we have the following expansion forY^w:

Y^wt _t

0

K₃t, swsds^∞

n1

_t

0

dFt, s1 _s₁

0

dFs1, s₂

· · · _s_n−2

0

dFsn−2, s_n−1 _s_n−1

0

K3sn−1, snwsndsn.

2.24

Interchanging the order of integration,

Y^wt _t

0

ws

K₃t, s ^∞

n1

_t

s

dFt, s1 _s₁

s

dFs1, s₂· · · _s_n−1

s

K₃sn, sdFs_n−1, s_n

ds

_t

0

Lt, swsds. 2.25

Now substitutingY^winto2.20we obtain that

E

2 _T

0

Q1suswsds2 _T

0

Q2sX^us _s

0

Ls, lwldl

ds

E _T

0

Q₃swsds _T

0

Q₄s _s

0

Ls, lwldl

ds

2a1E _T

0

X^uTLT, swsdsa2

_T

0

LT, swsds

0

2.26

for allw∈U. Interchanging the order of integration and conditioning onGswe see that2.26 is equivalent to

E

2 _T

0

Q1suswsds2 _T

0

wsE _T

s

Q2lX^ulLl, sdl| Gs

ds

E _T

0

Q3swsds _T

0

wsE _T

s

Q4lLl, sdl| Gs

ds

2a1E T

0

EX^uTLT, s| Gswsds

a2E T

0

ELT, s| Gswsds

0.

2.27

(13)

Since this holds for allw∈U, we conclude that

2Q₁sus 2E _T

s

Q₂lX^ulLl, sdl| Gs

Q₃s E _T

s

Q₄lLl, sdl| Gs

2a₁EX^uTLT, s| Gs a₂ELT, s| Gs 0,

2.28

m-a.e. Note thatX^utcan be written as

X^ut M1t M2t M3t _t

0

usLt, sds. 2.29

SubstitutingX^utinto2.28, we get2.18, completing the proof.

Example 2.4. Consider the controlled system

X^ut ξt _t

0

K₂t, sdBs _t

0

K₃t, susds 2.30

and the performance functional

Ju E _T

0

Q1su²sds _T

0

Q3susds _T

0

Q4sXûsdsa1XûT²a2XûT

. 2.31

Suppose Gt {Ω,∅}, meaning that the control is deterministic. In this case, we can find the unique optimal control explicitly. Noting that the conditional expectation reduces to expectation, the2.18for the optimal controlubecomes

2Q₁sus 2a₁ _T

0

utK3T, tdt K₃T, s

Q₃s _T

s

Q₄lK3l, sdla₂K₃T, s 2a₁gTK3T, s 0 ds-a.e.,

2.32

where we have used the fact thatEM2t 0,M1t ξt, Lt, s K3t, sin this special case. Put

b _T

0

utK3T, tdt. 2.33

Then2.33yields

us −a1bK3T, s

Q₁s hs, ds-a.e., 2.34

(14)

where

hs −Q3s _T

s Q4lK3l, sdl

2Q₁s −a₂K₃T, s 2a₁gTK3T, s

2Q₁s . 2.35

Substitute the expression ofuinto2.34to get

−a1b _T

0

K₃T, t² Q₁t dt

_T

0

htK3T, tdtb. 2.36

Consequently,

b 1

1a1

_T

0

K3T, t²/Q1t dt

_T

0

htK3T, tdt. 2.37

Together with2.35we arrive at

us −a1

⎛

⎜⎝ 1 1a1

_T

0

K3T, t²/Q1t dt

_T

0

htK3T, tdt

⎞

⎟⎠K₃T, s

Q₁s hs, 2.38

ds-a.e.

3. A General Maximum Principle

In this section, we consider the following general controlled stochastic Volterra equation:

X^ut ξt _t

0

bt, s, X^us, us, ωds _t

0

σt, s, X^us, us, ωdBs

_t

0

R0

θt, s, X^us, us, z, ωNds, dz,

3.1

whereutis our control process taking values in R andξtis as in1.7. More precisely, u∈ A_G, whereA_Gis a family ofGt-predictable controls. HereGt⊂ Ftis a given subfiltration andb:0, T×0, T×R×R×Ω → R,σ:0, T×0, T×R×R×Ω → Randθ:0, T×0, T× R×R×R0×Ω → Rare given measurable,Ft-predictable functions. Consider a performance functional of the following form:

Ju E

T 0

ft, X^ut, ut, ωdtgX^uT, ω

, 3.2

wheref:0, T×R×D×Ω → RisFtpredictable andg :R×Ω → RisFTmeasurable and such that

E T

0

ft, X^ut, ut, ωdtgX^uT, ω

<∞, ∀u∈ AG. 3.3

(15)

The purpose of this section is to give a characterization for the critical point ofJu. First, in the following two subsections we recall briefly some basic properties of Malliavin calculus forB·andN·, ·which will be used in the sequel. For more information we refer to11 and12.

3.1. Integration by Parts Formula forB·

In this subsection, FT σBs,0 ≤ s ≤ T. Recall that the Wiener-Ito chaos expansion theorem states that anyF∈L²FT, Padmits the representation

F^∞

n0

In

fn 3.4

for a unique sequence of symmetric deterministic functionfn∈L²0, T^×nand

In

fn n!

_T

0

_t_n

0

· · · _t₂

0

fnt1, . . . , tndBt1dBt2· · ·dBtn. 3.5

Moreover, the following isometry holds:

E F²

^∞

n0

n!!!fn!!²

L²0,T^×n. 3.6

LetD1,2be the space of allF ∈L²FT, Psuch that its chaos expansion3.4satisfies

F²_D_1,2:^∞

n0

nn!!!f_n!!²

L²0,T^×n<∞. 3.7

ForF∈D_1,2andt∈0, T, the Malliavin derivative ofF,D_tF, is defined by

DtF ^∞

n0

nI_n−1

fn·, t , 3.8

whereI_n−1fn·, tis then−1 times iterated integral to the firstn−1 variables off_nkeeping the last variabletntas a parameter. We need the following result.

Theorem AIntegration by parts formuladuality formulaforB·. Suppose thathtisFt- adapted withE_T

0 h²tdt<∞and letF∈D1,2. Then

E

F _T

0

htdBt

E T

0

htDtF dt

. 3.9

(16)

3.2. Integration by Parts Formula forN In this sectionFT σηs,0≤s≤T, whereηs _s

0

R0zNdr, dz. Recall that the Wiener- Ito chaos expansion theorem states that anyF ∈L²FT, Padmits the representation

F^∞

n0

In

fn 3.10

for a unique sequence of functionsfn ∈ L²dt×νⁿ, where L²dt×νⁿ is the space of functions fnt1, z1, . . . , tn, zn; ti ∈ 0, T, zi ∈ R0 such that fn ∈ L²dt×νⁿ and fn is symmetric with respect to the pairs of variablest1, z₁,t2, z₂, . . . ,tn, z_n. HereI_nfnis the iterated integral:

In

fn n!

_T

0

R0

_t_n

0

R0

· · · _t₂

0

R0

fnt1, z1, . . . , tn, znNdt 1, dz1· · ·Ndt n, dzn. 3.11

Moreover, the following isometry holds:

E F²

^∞

n0

n!!!f_n!!²

L²dt×νⁿ. 3.12

LetD"_1,2be the space of allF ∈L²FT, Psuch that its chaos expansion3.18satisfies

F²_D_"

1,2:^∞

n0

nn!!!fn!!²

L²dt×νⁿ<∞. 3.13

ForF∈D"1,2andt∈0, T, the Malliavin derivative ofF,Dt,zF, is defined by

Dt,zF ^∞

n0

nI_n−1

fn·, t, z , 3.14

whereI_n−1fn·, t, zis the n−1 times iterated integral with respect to the firstn−1 pairs of variables offnkeeping the last pairtn, zn t, zas a parameter. We need the following result

Theorem B Integration by parts formula duality formula for N. Suppose ht, z is Ft- predictable withE_T

0

R0h²t, zdtνdz<∞and letF∈D"_1,2. Then

E

F _T

0

R0

ht, zNdt, dz

E T

0

R0

ht, zDt,zF dtνdz

. 3.15

(17)

3.3. Maximum Principles

Consider3.1. We will make the following assumptions throughout this subsection.

H.1The functionsb:0, T×0, T×R×R×Ω → R,σ:0, T×0, T×R×R×Ω → R, θ:0, T×0, T×R×R×R₀×Ω → R,f:0, T×R×R×Ω → R,andg :R×Ω → R are continuously diﬀerentiable with respect tox∈Randu∈R.

H.2 For all t ∈ 0, T and all bounded Gt-measurable random variables α the control

β_αs αχ_t,Ts 3.16

belongs toA_G.

H.3For allu, β∈ A_Gwithβbounded, there existsδ >0 such that

uyβ∈ A_G ∀y∈−δ, δ. 3.17

H.4For allu, β ∈ A_G withβbounded, the processY^βt d/dyX^uyβt|_y0 exists and satisfies the following equation:

Y^βt _t

0

∂b

∂xt, s, X^us, usY^βsds _t

0

∂b

∂ut, s, X^us, usβsds

_t

0

∂σ

∂xt, s, X^us, usY^βsdBs _t

0

∂σ

∂ut, s, X^us, usβsdBs

_t

0

R0

∂θ

∂xt, s, X^us, us, zY^βsNds, dz

_t

0

R0

∂θ

∂ut, s, X^us, us, zβsNds, dz.

3.18

H.5For allu ∈ AG, the Malliavin derivativesDtgX^uTandDt,zgX^uT exist.

In the sequel, we omit the random parameterωfor simplicity. LetJube defined as in3.2.

H.6 The functions ∂b/∂ut, s, x, u², ∂b/∂xt, s, x, u², ∂σ/∂ut, s, x, u²,

∂σ/∂xt, s, x, u², and

R0∂θ/∂ut, s, x, u,z²νdz,

R0∂θ/∂xt, s, x, u, z²νdz are bounded on0, T×0, T×R×R×Ω.

(18)

Theorem 3.1Maximum principle I for optimal control of stochastic Volterra equations. (1) Suppose thatuis a critical point forJuin the sense thatd/dyJuyβ|_y00 for all bounded β∈ AG. Then

E _T

t

∂f

∂x

s,Xs, us

Λs, tds _T

t

∂f

∂u

s,Xs, us ds

_T

t

∂b

∂x

T, s,Xs, us

Λs, tg XT

ds

_T

t

∂b

∂u

T, s,Xs,us g

XT ds

_T

t

∂σ

∂x

T, s,Xs, us

Λs, tDs

g

XT ds

_T

t

∂σ

∂u

T, s,Xs, us Ds

g

XT ds

_T

t

R0

∂θ

∂x

T, s,Xs,us, z

Λs, tDs,z

g

XT

νdz ds

_T

t

R0

∂θ

∂u

T, s,Xs, us, z D_s,z

g

XT

νdz ds

| Gt

0, 3.19

whereΛs, tis defined in3.29below andX X^u.

(2) Conversely, supposeu∈ A_Gsuch that3.19holds. Thenuis a critical point forJ·.

Proof. 1Suppose thatuis a critical point forJu. Letβ ∈ AGbe bounded. WriteX X^u. Then

0 d dyJ

uyβ

y0E _T

0

#∂f

∂x

t,Xt, ut

Y^βt∂f

∂u

t,Xt, ut βt

$ dtg

XT Y^βT

, 3.20 where

Y^βt d

dyX^uyβt y0

_t

0

∂b

∂x

t, s,Xs,us

Y^βsds _t

0

∂b

∂u

t, s,Xs, us βsds

_t

0

∂σ

∂x

t, s,Xs, us

Y^βsdBs _t

0

∂σ

∂u

t, s,Xs, us

βsdBs

_t

0

R0

∂θ

∂x

t, s,Xs, us, z

Y^βsNds, dz

_t

0

R0

∂θ

∂u

t, s,Xs,us, z

βsNds, dz. 3.21