Informs
Informs
Informs
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact [email protected].
INFORMS is collaborating with JSTOR to digitize, preserve and extend access to Mathematics of Operations
Research.
http://www.jstor.org
MATHEMATICS OF OPERATIONS RESEARCH
Vol. 13, No. 4, November 1988
Printed in U.S.A.
We have modeled the problem as an impulse control problem, thus only piece-
wise constant portfolios are permissible.This allows us to draw from the work of
Bensoussanand Lions [3] and others (e.g. [4], [9, [[11] and [15]) to characterizethe
optimal value functionas the solutionto the versionof the quasi-variational inequali-
ties that is appropriateto this problem.The optimaltimes of interventionare then the
times of entrance to the "action set", determinedby the value function, and the
optimal portfolios are maximizingargumentsof a function dependenton the utility
and the optimal value. In fact, Bensoussanand Lions [2] suggestan impulseportfolio
control model without explicitlylisting constraintsor giving solutions. Their model
differsfrom ours in some ways,chieflythe following.The idea that we are maximizing
an expected total utility of consumptionwith an interventioncost raises the question
of divergenceof the interventiontimes, which is usually easily dismissedin impulse
control problems in which one simply minimizesa total runningcost plus impulse
costs. Basicallythe issue can be resolvedby forbiddingthe explosionof total wealth.
In ?1 we formulate the problem rigorously.?2 introduces the quasi-variational
inequalitiesand the associatedimpulse control. Some attentionis paid to regularity
conditions of the control.In particularit is shown that underhypothesesof finiteness
and quasi-left-continuity(see [5]) of the price process,the interventiontimes cannot
accumulatebefore the investmentperiod ends. The proof of the optimalityof the
Q.V.I. control makes up ?3; it mostly follows the well-knownmethods.Thus a very
generalimpulse portfolioproblemis reducedto solvingthe quasi-variational inequali-
ties, which can, of course,be difficult.In ?4 we solve the problemin the specialcase of
linear coefficients,no jump structure,one risky asset, proportionatevariabletransac-
tions costs, and risk-neutralutility. Numericaltechniquesfor more generalsituations
are the object of futurestudy.
where the initial condition is Po(t) = Po > 0. Also, I0 > 0, so that POis positive and
increasing.The vector price process P is now (n + 1) - dimensional,and its initial
component Po has a0 = yo = 0.
590 JEROMEF. EASTHAM& KEVIN J. HASTINGS
This indicates that there is no inherent bound on the available nonrisky shares. Define
the feasible set K(p, s) corresponding to price vector p and portfolio vector s to be
In (1.3), C(p,s,s') is the amount the investor consumes if the price vector is p, the
portfolio is changed from s to s', and a fixed cost c plus a nonnegative variable cost '
is paid for transacting. Thus, (1.3) implies that money cannot be borrowed. There are
states for which the feasible set is empty. Denote K = {(p,s): K(p,s) # 0 }. We
assume from now on that ' (and hence C) is continuous, and that
Under (1.4), K is closed, for if (Pk,Sk) is a sequence in K converging to (p, s), then
the continuity of C shows that 0 must be feasible for (p,s), hence (p,s) E K. We
remark that (1.4) holds under some very sensible conditions. For instance, if
where, for each i and each fixed Pi, si, the function p,(si - s') - *i(pi, si, s[) has its
maximum at s[ = 0, then C(p,s,0) > C(p,s,s') for all feasible s', and (1.4) results.
Once such I is
n
if t < r <
0< ;
(1.5)s
(1.5) if Oi< r < 0k where k = inf{ m: 0, > '
S(r) = \Sk_1 }
The extra complication in the second half of the definition is forced on us because we
are allowing 0k = k+ 1. The optimal control that we will construct may demand
immediate rebalancing for certain utility functions.
OPTIMAL IMPULSE CONTROL OF PORTFOLIOS 591
Note that the only admissibleaction at the terminaltime T is to sell off all holdings,
despite the fact that CT may be negative.For this reason we assume that U(z, t) is
positive or negativeaccordingto whetherz is positive or negative.The problemis to
find an admissible v* such that J(p,s, t, v*) > J(p,s, t, v) for each p ER-++', s E B,
t e [0, T] and each admissiblev.
n n
+ E u,j(p,s, r)A j(p, r)
i=1 j=l
(2.3-b) Lu < 0,
(2.3-c) (u - Mu) Lu = 0,
Define the continuation set C = (u > Mu) and the action set A = CC.Note that by
(2.3-a), GC x [0, T) c A. Now it is clear that for (p,s) E K, the feasible set K(p,s) is
nonempty and compact. We will show in Lemma (2.7) below that K is upper-semicon-
tinuous (u.s.c.) as a multifunction. Since the function to be maximized in (2.2) is
continuous, and B is locally compact, a measurable selection theorem of Schail [14,
Corollary 4] implies that there is a Borel measurable function f: K x [0, T) -* B such
that
= Mu(p,s, r)
U(C(p,s, (p,s, r)), r) + u(p, 0 (p,s, r), r)
for all (p, s, r) E K X [0, T). Note that if K(p, s) is empty, then Mu(p, s, r)= - c
and (p, s, r) E C.
This permits us to define the Q.V.I. impulse control. Again, let the initial data be
(p, s, t) and write 80* = t and So* = s for convenience. Define
(2.5-a) 0k = inf{ r > 0* : (P(r)S, r) E A}),
In (2.5-a) we adopt the convention that, if the process never enters A before T, then
0* = oo. In Lemma (2.8) we will show that Mu is u.s.c., which, together with the
closure of K, implies that A is closed. The right continuity of P yields that
(P(k ), Sk*1,I ) e A and hence (P(Ok*),Sk_1) K, so that S is well defined. It
may happen that (P(k ), Sk , Ok) is again in A, in which case k+ 1 = 09*.Despite this,
the times ,"*are stopping times of (X,), since q4is a measurable function. In fact, one
can do an easy induction argument to prove that for all k,
The remainder of this section checks that v* = {(*, ,S)} is admissible. The
construction of v* and the remarks above imply that (1.6-a, b, d, e, f) hold. It remains
to prove the two semicontinuity lemmas on which the remarks were based, and the
divergence of the intervention times (1.6-c).
where (pk, Sk) (p,s) in K. (Note that when the latter is true, U lK(pk, sk) is
bounded.) Given any subsequence (ki) and s' e K(pk,,Sk,) converging to some s' as
OPTIMAL IMPULSE CONTROL OF PORTFOLIOS 593
i -o oo, it suffices to show that s' E K(p, s). Since B is closed, it is clear that s' E B.
Moreover, since C(pk,,ski, s) > 0 for all i, C is continuous, and (Pk, Ski,s ) - (p, s, s'),
E
we have that C(p, s, s') > 0. Hence s' K(p, s), which completes the proof.
(2.9) K(pk,s) C
N(limsupK(pj, sj); 8) c N(K(p,s); 8)
j-- oo
where N(B; 8) denotes the closed 8-neighborhood of a set B. Then any s$ E K(pk, sk)
has a counterpart s' E K(p, s) within a distance of 8, as long as k > N. Denote
for large enough k and all s$ E K(pk, sk). Let E - 0, and conclude that
limsupk,o Mu(pk,sk, rk) < Mu(p, s, r), from which the lemma follows.
(REMARK.Note that the proof does not depend on the specific nature of P, only
these two properties. Our P has the jump times of a Poisson process, and hence is q.l.c.,
and it is also a.s. finite in the case of linearly bounded coefficients.)
D = lim
0m < T},
k- -oo
and also drop the superscript "*" from the impulse control notation. The finiteness of
P and that of Sk,0 (shown below) guarantee that except for outcomes w in a set of
measure zero, Wk < oo. We assert first that
C
(2.13) D c lim sup Wk(Ok+l) - Wk(k) > almost surely.
k-o oo
594 JEROMEF. EASTHAM& KEVIN J. HASTINGS
By contradiction,suppose that w E D and that there is I = I(o) such that for all
k > I, Wk(Gk+l) - Wk(Ok) < c/2. Now in order for an intervention to occur at a time
0i, admissibility requires that Wi(0i) - Wi_l(0i) < -c. If j > I, then
Pnp(4l0) Pi(Ok)
(2.14) Sk,o<So + Si S
P(1)s
P(01)
O pi(
PO(Ok) k
k
k-'pi(0+) i,(Oi) c
+ E
Po(Oj+x)- Po() S.i - Ec
j=l , .JP0(0) j=- Po(i)
M 'k-1
J np( -
kc
< so + si + E IP,(j +) - P,(0j)[-
nPo(T)
S
c limsup [Po(0k+) -
)] + E PO Si
k ^ . i=l Po
Mn 'k-1 kc
+ [PO(k+l) - - -
PO(O
k) Po i=l r,j=l Pi(j Pi(0 kl nPO(T)
n
+ E [Pi(0k+l) - P,(Ok)] 'Sk,i > n D
i=1 +?[^^i)- (M -^ ic}
OPTIMAL IMPULSE CONTROL OF PORTFOLIOS 595
cn
2(2n+ 1) } D
U U limsup ([PO(k+1)
[P PO(k)]
i=l1M k-1
i(+) -p
PPo
j=1
kc c
> 2(2n + 1) n D
nPO(T)
n
U Ulimsup1 [P(k+) - + .
P,()]S, >i 2(2n 1) n
c limsup Pi(Ok+i) - n D.
Pi(Ok) > 2(2n + 1) M
Poc Poc nD
> MnP (T) +kM 2(2n + 1)(Po(T) - po)
C -
{(Pi(Oj+l) Pi(j)l 0 as j -, o) n D.
596 JEROMEF. EASTHAM& KEVIN J. HASTINGS
Fix p, s, and r; since the time r does not changethroughoutthe followingwe suppress
it in the notation. Denote 4) = 4(p,s). The strict inequality0k*+ > 90 for all k will
follow if we can show that thereis 8 = 8(p,s, r) > 0 such that for all s' E K(p, 4),
In light of (2.19), to show (2.17) it sufficesto show that, for all s' e K(p, 4),
But (2.20) follows from (2.18) and (2.16-b), so that we have proved that under
assumption(2.16), 0k*+l > 06, for everyk.
3. Optimalityof the Q.V.I.control. As in the previoussection,let u be a smooth
solutionof the quasi-variational inequalities.We have seen that undersome reasonable
model conditions, u gives rise to an admissibleimpulsecontrol v*. It is clearfrom the
descriptionof P that undersome otherreasonableconditionson the growthof u, and
on the coefficientsju,a, and y, we can use Ito's Lemmato conclude:
E f Lu(P(r),Sk1_, r) dr,
k- 1
OPTIMAL IMPULSE CONTROL OF PORTFOLIOS 597
PROOF. The notation will be complicated by the fact that successive intervention
times could be equal. Given w, let m0 = 0, ml = inf{k: Ok(W) > t},..., m,
inf{ k: Ok(O) > 0,,,l(w)). Again denote O"= Ok A T. It can easily be checkedthat
?I . ui)S)-
u( -
o )
M,
+1 --1f
(P(o,), Smi , - u(P(i-1) , Smi i' 8i)
=
U(P(om,
+Ii< ),Smikmi-ti-
T) Om, F) - u(P(Om,i %Smi
[U(P(k),SkIOm) ) - U(P(O)mi,Sk-Om)I
1,O, 1)
Mi1-
+
I(Gi<T} E - U(PCk k, Ok) UP(0Sk1OJ
k=mi
by (2.3-a). The rest of the proof is now standard, and will therefore be done rapidly.
Sum inequality(3.3) from i = 1 to j, take expectation,use Ito's Lemmaand (2.3-b)on
the first difference, and we have
-
Ep, S, I uP(o), S(Oj),
O u(p, S,t)I
M+1-1
d Ep, s,IEU(Ck,Ok)IJOk<T)
k=1
= J(p,s, t, V)
by the boundary condition (2.3-d). For the Q.V.I. control v", the inequalities are
replaced by equalities, which proves the theorem.
598 JEROMEF. EASTHAM& KEVIN J. HASTINGS
Equation (3.1) follows from Ito's lemma if several expectations are finite:
0
Ok'
k,
(E3) E /
6k-1 W
flu(P(r) + Y,Sk_, r)- u(P(r),Skl, r)
k'
One way to guarantee this is to put growth conditions on u and its partials, linear
growth on the coefficients p, o, and y, and finiteness up to a certain order of the
moment functions mk(r) = E[IP(r)lk]. Recalling inequality (2.4) for Sk,0, and the fact
that the available shares in risky assets are bounded by M, it can be shown that the
following conditions together are sufficient for the finiteness of the expectations
(E1)-(E4):
u and its partials ur u,, uj satisfy
Of course in the absence of any jump structure on P, (E3)-(E4) do not arise, and (3.6)
is not needed. In this case the well-known bounds (e.g. Gikhman and Skorohod [7,
Lemma 3.8]) on the moments of a diffusion process satisfying (3.5) show that condition
(3.7) is automatic, and (3.4) and (3.5) alone are sufficient.
investmentwhich appreciatedeterministically:
where Z is a one-dimensional standard Weiner process, and ,LO, ,U, a are positive
constants. Thus the partial differential operator associated with the optimal value
function is:
(4.3) Lu u + ioPoUp, +
Plpoup + 2pUplpl.
For simplicity we take the laissez-faire domain G to be R3+x [0, B1] and choose the
risk-neutral utility function U(x) - x. If the fixed cost is c > 0 and the variable costs
are proportional to the value of risky assets traded, then the feasible set will be
where 0 < c < 1. For fixed (p, s), K is a closed, bounded subset of R x [0, B1]. Hence
the sup-operator M is well defined.
Previously we showed that the optimal expected utility of consumption is character-
ized as a sufficiently regular solution of the time-dependent implicit obstacle problem:
Lu < 0,
(Lu)(u - Mu) = 0
The usual technique for solving implicit obstacle problems such as (4.5)-(4.6)
involves an 'iteration upon the obstacle' (8). Beginning with the choice of suitable u0,
one defines ui, i = 1 2, 3,..., as the solutions of the problems:
Lui < 0,
(4.7) Ui> 1,
(Lu,)(ui - Mu,_i) = 0
(4.8) Luo =0
satisfying the terminal condition (4.6). This u0 can be interpreted as the expected
utility of consumption if no transactions are permitted before t = T. By induction one
can interpret ui as the optimal expected utility of consumption if at most i transac-
600 JEROMEF. EASTHAM& KEVIN J. HASTINGS
If t,1 = Po, then uo > Muo everywhere, thus u = uo. There is no benefit to transac-
tions prior to t = T, hence in that case the policy of noninterventionis optimal.
Now considerthe case , < g.o.It is helpfulto considerthe function:
Lu< 0,
u > Mui,
This function can be interpretedas the value of a policy for a slightly different
problem.If the fixed cost c is paid only at time T, then by selling all risky assets at
time t < T and investingthe proceedsin risk-freeassets, one has an expected total
consumption(4.11).
Now u > u since the firstpartof the proof of (3.2) appliesto uias well as to u. Then
ui(p,s, t) > J(p, s, t, v) for any admissiblev. In particular,
(4.12) u >~u,,
calculations(illustratedshortly)show that
Straightforward
(4.14) Mui = Mu
+l
To= T - ( -
Ao) -ln( 1- C,
c
TIMAL IMPULSE CONTROL OF PORTFOLIOS 601
SI
B
8
A
0 sO
- c
[poSo - (1 + cl)pl(Bl - sl)]e?(T-) + (1 - cl)plBlel(
D = (, s, + (1 + c p)-p
-(poSO- c)),
E= (0, s),
0= (0,0).
Since to > O, it can be shown that s' = A gives a larger value than s' = 0 and that
s'= B is better than s' - E. Hence we can reduceto consideringA, B, and D. These
are such that C(p,s,s') = 0, so we actuallyare to comparethe values of uo at these
602 JEROMEF. EASTHAM& KEVIN J. HASTINGS
points:
UOls-A - UOs-B =
=(eo(- t) _ el(T-t))pl(1 - Cl)s,
eeo(T1-' ) - -
UOls-B
-
Uols-D ( i )e
-e
t(p oSo c).
(Low)(w - Mow) = 0,
(4.16) Wlt-T = 0.
Some work and the relation Mul = Muo are needed to obtain the following
simplifiedexpressions:
Low - w + 'LplWp
t +4- 2ap ,pl
(4.17)
_
Mow (eMo(T-t)_ eli(T-t))(l C)P1S1 -Ce-o(T-t)
After taking the difference Mow = M[w + Uo] - Uo, the variable s0 no longer appears
in the problem,whichimpliesthat the optimalstrategywhen ,ul < to does not require
knowledge of s0. The form of the differentialoperator Lo is unchangedby the
substitution:
(4.18) x = (1 - cl)p1s.
(Low)(w - Mow) = 0,
(4.20)
w(x, T) = 0,
where all direct referencesto p, s are eliminated.Note that x is the effectivewealthin
risky assets.
OPTIMAL IMPULSE CONTROL OF PORTFOLIOS 603
at time t = T - jk. For simplicity we take k = T and consider only j = 1, i.e. a single
'backward' time-step from w() = 0 at time t = T to w(l) at time t = 0. Then provided
that ,1T< 1, the solutionis:
r if 0 < x < x ,
w)(x) x)
= X 423/ i x
(4.23) ( ) \otherwise,
where r is the positive root of the initial equationfor AT, and y is determinedby
requiringx * to be a point of osculationbetween w() and (I(). That is:
(4.24)
X* [ LOT
x(r - er '
e eiOT_
OSC.
A- ptosc.
pts.
t_-.I t .1 ";U ;2 C.
,} U - ,4 -
E I
FIGURE2. One-Step Approximations w(1) for o2 = 0.01, 0.03, 0.05 (resp. x.* 55.15, 58.06, 60.17).
604 JEROMEF. EASTHAM& KEVIN J. HASTINGS
/
/
OSC.
pts.
In Figure2 we show the effectof the parametera2 on the solution w(). Increasing
the value of a2 increasesboth the value of w(1) and the position of the contact point
x,. In the case illustrated, c = 1, io0= 0.05, /L = 0.03, and a2 = 0.01, 0.03, 0.05.
Recall that an optimal control is to interveneonly when the portfolio holdings
belong to the action set A. By (4.18) all dependenceof the policy on cl, pl, s1 has
been subsumedby x. Now x*, the point of contact between w?() and (P), is the
semidiscretizedapproximationto the boundaryof the action set, dA. The approxima-
tion of x * to dA may be worsethan the approximationof w ) to wIt,.-k. Figure3
shows both a one-step and a two-step approximationto w at time t = 0, with
AO= 0.05, ~t = 0.03, a2 = 0.01, c = 1.
One can treatthe case of discontinuouspriceprocessesby the same techniquesif the
jump sizes have a specialform. In this case a modifiedtrend pt4is obtained,and the
analysisagainproceedsaccordingto whethert0 <.< [ or t0o> p. Details arepresented
in a forthcomingpaper.
References
[1] Bensoussan, A. and Lions, J. L. (1982). Applications of VariationalInequalities in Stochastic Control.
North Holland, Amsterdam.
[2] and . (1984). Impulse Control and Quasi-Variational Inequalities. Gauthier-Villars,
Paris.
[3] and . (1975). Nouvelles methodes en contrble impulsionnel. Appl. Math. Optim. 1, 4
289-312.
[4] and Tapiero, C. S. (1982). Impulse Control in Management. J. Optim. Theory Appl. 37, 4
419-442.
[5] Blumenthal, R. M. and Getoor, R. K. (1968). Markov Processes and Potential Theory.Academic Press,
N.Y.
[6] Doshi, B. T. Optimal Control of a Diffusion Process with Reflecting Boundaries and Both Continuous
and Lump Costs. Proc. Internat. Conf. Dynamic Programming, Academic Press, Vancouver (to
appear).
[7] Gikhman, I. I. and Skorohod, A. V. (1979). ControlledStochastic Processes. Springer-Verlag,N.Y.
[8] Glowinski, R., Lions, J. L. and Tr6molihres,A. (1981). Numerical Analysis of VariationalInequalities.
North Holland, New York.
[9] Kushner, H. (1977). Approximations and Computational Methods for Optimal Stopping and Stochas-
tic Impulsive Control Problems. Appl. Math. Optim, 3, 2/3 81-99,
OPTIMALIMPULSECONTROLOF PORTFOLIOS 605