• No results found

Polyhedra and optimization in connection with a weak majorization ordering

N/A
N/A
Protected

Academic year: 2022

Share "Polyhedra and optimization in connection with a weak majorization ordering"

Copied!
27
0
0

Laster.... (Se fulltekst nå)

Fulltekst

(1)

UNIVERSITY OF OSLO Department of informatics

Polyhedra and Optimization in Connection with a Weak Majorization Ordering

Geir Dahl

Preprint 10

December 12, 1994

(2)
(3)

Polyhedra and Optimization in Connection with a Weak Majorization Ordering

Geir Dahl

December 12, 1994

Abstract

We introduce the concept of weakk-majorization extending the clas- sical notion of weak sub-majorization. For integers kandn withkn a vectorx2Rn is weakly k-majorized by a vectorq 2Rk if the sum of ther largest components of xdoes not exceed the sum of ther largest components ofq, forr= 1;:::;k. For a givenqthe set of vectors weakly k-majorized byq denes a polyhedronP(q;k), and we determine all its vertices. We also determine the vertices and a complete and nonredun- dant linear description of the integer hull ofP(q;k). The results are used to give simple and ecient (polynomial time) algorithms for associated linear and integer linear programming problems.

Keywords:Majorization; polyhedral combinatorics.

1 Introduction

The concept of majorization has been investigated in a number ofbranches of mathematics and statistics. The notion reects to what extent compo- nents of vectors are spread out. Forp;q2Rnone says thatpis weakly sub-majorizedbyqifPrj=1p[j]Pr

j=1q[j]forr= 1;:::;n. Herep[j]de- notes thej'th largest component ofp. If alsoPnj=1pj=Pnj=1qj holds, p is majorized by q and we write p q. Several equivalent conditions for (weak sub-) majorization are known (see [8]); for instance, pq i there is a doubly stochastic matrix M 2 Rn;n (i.e. M has nonnegative elements and all row and column sums are 1) with p=Mq. This result can be combined with a theorem of Birkho and von Neumann to give the following geometric interpretation of majorization: pq if and only ifp lies in the convex hull of the set of vectors obtained by permuting the components ofq. The Birkho-von Neumann theorem says that the set of doubly stochastic matrices in Rn;n is the convex hull of the set of

Institute of Informatics, University of Oslo, P.O.Box 1080, Blindern, 0316 Oslo, Norway (Email:[email protected])

(4)

all permutation matrices, i.e. nn matrices where each row and each column consists of all zeros except for one entry which is 1. A similar characterization holds for weak submajorization, see [8].

An extensive treatment of the theory of majorization as well as its applications is given in the book by Marshall and Olkin [8]. An example of majorization in combinatorial analysis is the theoremofGale and Ryser.

It characterizes those vectorsrandcfor which there is a 0/1 matrix with row sumsr(i.e.,i'th row sum isri) and column sumsc; the condition is a simple majorization condition in terms ofcandr. A relation between edge coloring and majorization is discussed in [4]. Majorization also arises in e.g. matrix theory (Hadamard type inequalities, singular values etc.), numerical analysis (condition numbers, matrix approximation etc.) as well as in probability theory and statistics (reliability, stochastic ordering etc.), see [8] .

For generalizations of majorization within a measure theoretical frame- work as well as statistical interpretations, see the extensive treatment in [11]. In [2] approximate majorization is studied.

A central concept in the theory of majorization is that of a Schur- convex function. A function:Rn!Rthat preserves the ordering given by majorization is called Schur-convex, thus(x)(q) wheneverxq. Thereforeqmaximizes(x) over the setxq, so maximization of Schur- convex functions is easy. The following general method has produced interesting inequalities in the areas mentioned above, see [8]: nd some (simple) underlying majorization which combined with a suitable Schur- convex function leads to the inequality. A simple example, which is useful in this work, is the rearrangement inequality due to Hardy, Littlewood and Polya, see [7], [8]. Leta1;:::;an and b1;:::;bn be real numbers. Then

we have: n

X

i=1a[i]b[n,i+1]Xn

i=1aibiXn

i=1a[i]b[i]: (1) In this paper we study a majorization concept derived from weak sub- majorization by requiring that the ordering of the rlargest components of vectors only hold for r k wherek is some number not exceeding n. This concept, called weakk-majorization, is therefore a weaker notion than weak sub-majorization and it measures the concentration of sums of subvectors up to a given size. Unlike for Schur-convex functions, opti- mization with a weakk-majorization constraint is not a trivial task, and a main purpose of this paper is to study such optimization problems.

This paper is organized as follows. In section 2 we introduce weakk- majorization and describe some ofits basicproperties. Weintroduce linear programming (LP) problems with a weak k-majorization constraint and consider the special casek=nin some detail. In section 3 we study, for k < n,k-majorization polyhedra being the feasible set of these LP prob- lems, and determine all the vertices of these. This also leads to a simple (polynomial) algorithm for solving such LP problems. The remaining part of the paper treats integer linear programming (ILP) problems over the mentioned polyhedra, and vertices as well as complete linear descriptions

(5)

are given for the integer hull ofk-majorization polyhedra. Also a linear time algorithm for solving integer linear programming problems with a weakk-majorization constraint is given.

Notation.

R,ZandQdenote the set of real, integral and rational numbers, respectively, and Rm;n is the set of realmn matrices. For q 2 Rk we dene the tail average qs:k := (1=(k,s+ 1))Pkj=sqj. For each positive integer t we let Nt := f1;:::;tg, and for x 2 Rn, x[j] is the j'th largest component ofx. When S Nn we let jSjdenote the cardinality ofS, and ifx2Rnwe denex(S) :=Pj2Sxj. For concepts and results concerning polyhedra and linear inequalities, see [10]. The characteristic cone of a polyhedron P is denoted by char.cone(P). A pointed polyhedron is integral if all its vertices are integral. The integer hullPI of a (rational) polyhedronP is the polyhedron being the convex hull of all the integral points inP. A setARnis symmetric ifx2A whenever x 2 A and is a permutation onNn (i.e. a bijection) and where we denex= (x(1);:::;x(n)). For integersr;s withrs we letCr;s2Rs;sdenote the 0/1-circulant matrix with elements 0 and 1 and where the ones in rowiare in the positionsj i;i+ 1;:::;i+ (r,1) mod(s) (sorconsecutive ones are shifted one position to the right for each row). We letei2Rn be thei'th unit (coordinate) vector inRn, i.e., the i'th component ofeiis 1 and all other components are 0.

2 Weak

k

-majorization and optimization

We study basic properties of weakk-majorization. Associated optimiza- tion problems and polyhedra are also introduced, and the special case k=nis treated in some detail.

Let, throughout this section,k andnbe two given integers such that kn, and letq2Rk be a given vector called the majorant. We extend the concept of weak sub-majorization intok-majorizationas follows. We say that x 2 Rn is weakly k-majorized by q and write p k q if the following conditions hold:

Pr

j=1x[j]Pr

j=1q[j] for allr2Nk: (2) The order of the components in the vectors xandqis irrelevant in this denition, so ifxkq, then alsox0kq0wherex0andq0are permutations ofxandq, respectively. We see thatx1q i maxjxj maxjqj, and that x n q means that x is weakly sub-majorized by q. We also see that, for general k,xk q ix[k]is weakly sub-majorized byq[k]where x[k]= (x[1];:::;x[k]) andq[k]= (q[1];:::;q[k]). Thus weakk-majorization corresponds to weak sub-majorization on the vectors consisting of the k largest components. Therefore one can formulate equivalent conditions forxk q in terms ofx[k]andq[k]. For instance,xk q if and only if there is a doubly stochastic matrixM2Rk;k such thatx[k]=q[k]M.

For z 2 Rn we dene the real-valued function Lz on the interval [0;1] by (i) Lz(r=k) = Prj=1z[j] for r = 0;:::;k, and (ii) Lz is linear

(6)

on each subinterval [r=k;(r+ 1)=k], for r = 0;:::;k,1. This function is piecewise linear, continuous and concave, and it satises Lz(0) = 0, Lz(1) = Pkj=1z[j]. We call Lz the L-function associated with z. The dependency onk is suppressed in the notationLz ask is clear from the context. We mention that in [8] Lorentz-curves are discussed, they are similar to ourL-curves dened below but with opposite ordering of the components. TheL-curveassociated withzis the graph ofLz. Majoriza- tion concepts may be described in terms ofL-functions as follows:xkq if and only ifLx Lq (with componentwise ordering), i.e. theL-curve associated withxlies below theL-curve associated withq.

Our main interest is to study optimization problems with a majoriza- tion constraint and with a linear (and nonsymmetric) objective function.

The objective function in our dierent problems will becTx=Pnj=1cjxj

which is to be maximized over all vectorsx2Rn(orx2Zn) that satisfy ak-majorization constraint. Thus, for instance, we shall study the set of points inRnthat are weaklyk-majorized by some vectorq.

We assume hereafter thatq satisesq1:::qk 0. Letc 2Rn be a nonnegative objective function. Consider the following optimization problem:

maxfcTxjxkqg: (3)

A possible interpretation of this problem isas follows. Considernprojects, numbered from 1 ton, and assume thatcjis an expected value or prot associated with projectj. Let the variablexjrepresent the investment in project j. To keep down the overall risk of investing in failure projects , one may require that the investments are suitably spread out. This requirement may be implemented by saying that the investment vector x= (x1;:::;xn) is weaklyk-majorized by a given vectorq. This prevents investing too much in a single project, too much in any pair of projects etc. The risk aversion may be reected in the choice ofk andq.

Let P(q;k) denote the set of feasible solutions in the optimization problem (3), soP(q;k) =fx2 Rn jxk qg. Note that xk qif and only ifx(S)q(Nr) for each subsetSofNnwithjSj=rkbecause the maximum value ofx(S) taken over all such subsets isPrj=1x[j](confer the rearrangement inequality). It follows that the setP(q;k) is a polyhedron, in fact

P(q;k) =fx2Rnjx(S)q(Nr) for allS Nnwithr=jSjkg: (4) The polyhedronP(q;k), called ak-majorization polyhedron, is unbounded, its characteristic cone is,Rnand it is pointed, i.e. its minimal faces are vertices. Furthermore,P(q;k) is symmetric.

Eachn-majorization polyhedron may be viewed as a polymatroid (see e.g. [5], [6]). Dene the set functionf(S) =Prj=1qj for eachS Nn

wherer :=jSj. Then it follows from the assumption q1:::qk 0 that: (i) f is monotone, i.e., f(S) f(T) for S T Nn, (ii) f is submodular, i.e.,f(S[T) +f(S\T)f(S) +f(T) for allS;T Nn, and (iii) f is nonnegative. From (4) we see that P(q;n) = fx 2 Rn j

(7)

x(S)f(S) for allSNng, so this is the polymatroid associated with the submodular function f. Edmonds (see [3]) showed that maximizing a nonnegative linear objective function over a polymatroid can be done using the greedy algorithm. Thus, in the case k = n, we can solve the problem (3) by the following greedy algorithm (assume for simplicity that c1 ::: cn 0): x1 = f(f1g) = q1, x2 = f(f1;2g),f(f1g) = (q1+q2),q1=q2,:::,xn=f(Nn),f(Nn,1) =qn. The optimality of this solution could also be seen as a direct consequence of the rearrangement inequality (1). For k < n, however,P(q;k) may not be a polymatroid and therefore the greedy solution which isxj=qj forjk andxj=qk

for j > kmay not be optimal in (3). As a simple example, let n = 3, k = 2, q = (2;1) andc = (1;1;1). The greedy algorithm produces the nonoptimal solution (2;1;1) while the optimal solution is (3=2;3=2;3=2).

In the next section we shall explain how to repair this by giving an algorithm that solves (3) for generalk. This development will be done by studying properties of thek-majorization polyhedra.

It turns out thatn-majorization polyhedra have a simple, but interest- ing combinatorial structure as briey discussed next. Consider the face M(q;n) ofP(q;n) induced by the valid inequality x(Nn) q(Nn), i.e.

M(q;n) =fx2P(q;n)jx(Nn) =q(Nn)g. ThenM(q;n) consists of all vectors xthat are majorized by q and it is the base polyhedron of the polymatroid P(q;n). M(q;k) is bounded polyhedron, i.e. a polytope, and its extreme points are all the n! permutations ofq. Assuming that q1> ::: > qn >0, it can be shown that dim(M(q;k)) =n,1 and that the linear system x(Nn) = Pnj=1qj, x(S) Prj=1qj for all S Nn, r:=jSjgives a complete and nonredundant linear description ofM(q;n).

Two vertices are adjacent on M(q;n) (i.e., there is a 1-dimensional face containing both vertices) if and only if the permutations (ofq) dening the two vertices dier by an adjacent transposition. Finally, M(q;n) is simple, meaning that each vertex is contained in (the minimum number of) n facets. An interesting special case is when q = (n;n,1;:::;1).

ThenM(q;n) is called the permutahedron, see [1], [12]. All the properties described above forM(q;n) are well known to hold for the permutahe- dron. In fact, one can show that (forq1> ::: > qn>0)M(q;n) and the permutahedron are combinatorially equivalent.

The remaining part of this paper is devoted to the study of P(q;k) wheneverk < n.

3 Vertices of

k

-majorization polyhedra

In this section we nd all the vertices of the k-majorization polyhedron P(q;k). Throughout this section we assume that k < n and also (to avoid degenerate situations) that the given majorant q 2 Rk satises q1> ::: > qk>0. We deneQ=Pkj=1qj.

Dene, fors= 0;:::;k,1, the vectorws, called thes'thq-averageby ws:= (q1;:::;qs;qs+1:k;:::;qs+1:k) (5)

(8)

Note the special cases w0 = ( q1:k;:::;q1:k) and wk,1 = (q1;q2;:::;qk; :::;qk) and thatPkj=1wsj = Qas we have Psj=1qj+ (k,s) qs+1:k =

Pk

j=1qj =Q. We also observe that ws k q. In fact, theL-functions Lws andLq (associated withwsandq, respectively) coincide on [0;s=k] andLws(1) =Lq(1). SinceLws is linear on [s=k;1] andLq is concave, it follows thatLws Lq and thereforews k q. Note that, among the vectors that are weaklyk-majorized byq, theq-averagewsis maximally spread out in its rstkcomponents and minimally spread out (constant) in its remaining components.

The main result of this section says that the vertices ofP(q;k) are the permutations of theq-averages. In order to prove this result we shall need a lemma on tail averages which will be of use later as well.

Lemma 1

Letc= (c1;:::;ck)2Rk satisfyc1:::ck,1(note thatck

may be arbitrary). Then there is anm2f1;:::;kgsuch that

c1:k:::cm:kcm+1:k:::ck:k=ck: (6)

Proof.

Fort= 1;:::;k,1 the following relation holds between consecu- tive tail averages: ct:k=ct=(k,t+1)+ ct+1:k(k,t)=(k,t+1). Thus ct:k

is a convex combination ofctand ct+1:k which leads to, fort= 1;:::;k, (i) ctct+1:k ) ctct:kct+1:k;

(ii) ctct+1:k ) ctct:kct+1:k: (7) Let t= k,1. While ct ct+1:k andt 1 sett := t,1 and repeat this procedure. Assume that we here terminate with t=t0. If t0 = 1 , (7)(i) gives c1:k :::ck:k, so (6) holds withm= 1. Ift0>1, we have (again by (7)(i))ct0 >ct0+1:k:::ck:k, and then (7)(ii)) implies that ct0:kct0+1:k. Sincecis nonincreasing, we can repeatedly use (7)(ii) and thereby obtain (6) withm=t0+ 1.

Theorem 2

The vertex set ofP(q;k) is the setW(q;k) of vectors that can be obtained as permutations of somews fors2f0;:::;k,1g.

Proof.

We rst show that each vertex ofP(q;k) lies inW(q;k). by prov- ing that each bounded LP problem overP(q;k) has an optimal solution lying inW(q;k). Letd2Rn be an objective function and consider the LP problem maxfdTxjx2P(q;k)g. If somedj <0, then the problem is unbounded asej2P(q;k) for each0. We may therefore assume thatd1:::dn0. From the rearrangement inequality (1) and the fact that only thek largest components ofxdetermine the feasibility of a point, we may assume that x1 :::xk = :::=xn. Thus our LP

(9)

problem reduces another LP problem max Pkj=1cjxj

subject to

(i) x(Nr)q(Nr) forr= 1;:::;k; (ii) xj+1,xj0 forj= 1;:::;k,1;

(8)

where we dene cj =dj forj = 1;:::;k,1 and ck = Pnj=kdj. Note that we here have c1 ::: ck,1 whileck can have any value (not exceeding (n,k+1)dk). The LP dual of this problem is

min Pki=1yiPi j=1qj

subject to

(i) Pki=ryi+zr,1,zr=cr forr= 1;:::;k; (ii) y1;:::;yk;z1;:::;zk,10 ( andz0=zk= 0):

(9) The variablesz0andzkare used for notational simplicity only; these are not variables in the dual problem. We now apply Lemma 1 to the objective functioncand letmbe such that the inequalities (6) hold. Lets=m,1, and observe thats2f0;:::;k,1g. Let x2Rkand (y;z)2RkRk,1 be given by

(i) xj=

qj forj= 1;:::;s, qs+1:k forj=s+ 1;:::;k; (ii) yj=

8

>

<

>

:

cj,cj+1 forj= 1;:::;s,1, cs,cs+1:k forj=s,

0 forj=s+1;:::;k,1, cs+1:k forj=k;

(iii) zj=

0 forj= 1;:::;s,

(k,j)( cj+1:k,cs+1:k) forj=s+1;:::;k,1.

The solution xcan be viewed as the projection ofws into the space of the rstkvariables, which implies that xis feasible in (8). We next show that (y;z) is feasible in the dual problem (9). Clearly, yj 0 forj < s since cj cj+1. Also, ys =cs,cs+1:k 0 according to our choice of s; see Lemma 1 and (7). The remaining components of y are trivially nonnegative, and therefore y0. It follows from Lemma 1 that z0.

It remains to verify that (y;z) satises constraint (9)(i). Ifrs+ 2, we havePki=ryi+ zr,1,zr= cs+1:k+ (k,r+ 1)( cr:k,cs+1:k),(k, r)( cr+1:k,cs+1:k) = (k,r+1) cr:k,(k,r) cr+1:k=cr. Next, ifr=s+1, we obtainPki=s+1yi+ zs,zs+1= cs+1:k,(k,s,1)( cs+2:k,cs+1:k) = (k,s) cs+1:k,(k,s,1) cs+2:k = cs+1. Finally, when r s we have

Pk

i=ryi+ zr,1,zr = Psi=,1r(ci,ci,1) +cs,cs+1:k+ cs+1:k = (cr,

cr+1)+(cr+1,cr+2)+:::+(cs,1,cs)+cs=cr. This proves that (y;z) is dual feasible.

(10)

The value of the (primal) objective function of (8) in the point xis

Pk

j=1cjxj=Psi=1ciqi+(1=(k,s))Pki=s+1qiPk

j=s+1cj=Psi=1ciqi+ cs+1:kPk

j=s+1qj. The value of the (dual) objective function of (9) in the point (y;z) isPki=1yiPi

j=1qj = Psi=1,1(ci,ci+1)Pij=1qj+ (cs,

cs+1:k)Psj=1qj+ cs+1:kPk

j=1qj=Psj=1cjqj+ cs+1:kPk

j=s+1qj. We see that these two values coincide. Therefore, by linear programming duality, it follows that xis optimal in the primal problem (8) and (y;z) is optimal in (9). It follows from the initial problem reduction thatwsis an optimal solution of maxfdTx j x 2 P(q;k)g, which proves that each vertex of P(q;k) lies inW(q;k).

To complete the proof we must show that each point in W(q;k) is indeed a vertex ofP(q;k). To do this, assume that somewsis a convex combination with strictly positive weights of other points zm 2W(q;k), m = 1;:::;N, i.e. ws = PNm=1mzm with m > 0 and Pmm = 1.

Since all these points satisfyx1q1andws1=q1, we must havez1r=q1 for all r. Using a similar argument we can prove thatzrj =qj forj s. All the points zm lie in P(q;k) so they satisfy the valid inequality x(Nk) Q, andws satises this inequality with equality. This implies thatzr(Nk) =Qand therefore by the concavity of theL-functionsLzr

Lws. If one of these inequalities were strict, this would contradict that ws = PNm=1mzm. Thus, Lzr = Lws for all m, from which we get zm=ws form= 1;:::;N. This proves thatws is an extreme point of P(q;k), and it is therefore also a vertex of this polyhedron. Similarly, we can prove that each permutation ofwsis a vertex ofP(q;k) and the proof is complete.

An interesting consequence of Theorem 2 is a simple algorithm for solving max fcTx jx k qg. Initially the components ofc are ordered decreasingly; this can be done in timeO(n2). Assume for simplicity that c is nonincreasing. Next we determine an index m as in Lemma 1 and lets=m,1. Thenws is an optimal solution of the problem! Here the calculation of m is done in linear time by recursive calculations of tail averages. Thus the main work in the algorithm is simply the ordering of the objective function.

We remark that Theorem2 can also be proved by a more direct method without using LP duality. In fact, one can show that ifx is an optimal solution of (8) satisfyingx(Ns) =q(Ns) for somes2f0;:::;k,1g, then we can nd an optimal solution xsatisfying xj=qjforrs. By choos- ings maximal with this property, one can nd another optimal solution x satisfying x(Nk) = Pkj=1qj and with all the remaining components xs+1;:::;xnequal; this solution xis precisely theq-averagews.

From Theorem 2 we also obtain an inner description of weakk-major- ization polyhedra by using the fact that the characteristic cone ofP(q;k) is,Rn.

Corollary 3

P(q;k) = conv(W(q;k)),Rn, i.e., xk q if and only if x z for some z 2 Rn which is a convex combination of permuted q-

(11)

averages.

In Fig.1 we illstrate the intersection betweenP(q;k) and the nonneg- ative orthant fork = 2, n = 3 and q = (2;1). The dierent permuted q-averages are shown.

(2,1,1)

(1,2,1) (3/2,3/2,3/2) (1,1,2)

Figure1: Example,P(q ;k )\Rn.

4 Weak

k

-majorization and integrality

In this section we study integer linear programming problems with a weak k-majorization constraint. These problems may be motivated by applica- tions concerning e.g. the distribution of indivisible units to locations or projects where it may be natural to impose a majorization constraint to assure a certain level of diversication. Our approach is to study the integer hull of the polyhedronP(q;k) and determine its vertices.

Hereafterk and n are integers satisfyingk < n and the majorant q satises q1 > ::: > qk > 0. We are interested in the following integer linear programming problem

maxfcTxjxkq;xis integralg: (10) Note that the feasible set is hereP(q;k)\Zn. If somecjis negative, then (10) is unbounded, so we hereafter assume thatc1:::cn. Note that the LP relaxation maxfcTxjx2P(q;k)gmay have (a unique) optimal solution that is fractional. For instance, consider again the examplen= 3, k = 2,q = (2;1) andcT = (1;1;1). The optimal solution in P(q;k) is (3=2;3=2;3=2), while the optimal integral solution is (2;1;1).

We observe that the feasible set of (10) is unchanged if we perform integer round-down on each component of the majorantq. Thus we shall hereafter assume thatqis integral (this also strengthen the LP relaxation).

(12)

The integer hullQ(q;k) ofP(q;k) is the convex hull of the feasible set in (10), i.e.,

Q(q;k) = conv(fx2Rnjxkq;xis integralg): (11) From general polyhedral theory (see [10]) we know thatQ(q;k)is a polyhe- dron, i.e., it is the solution set of a nite set of linear inequalities. Q(q;k) is pointed and its characteristic cone is,Rn. Furthermore it is easy to see thatQ(q;k) is symmetric. Note that the problem (10) may be viewed as maximizingcTxoverQ(q;k).

Some more notation is needed. Let mmax =bq1:kc =bQ=kc (recall that Q = Pkj=1qj). For each nonnegative m mmax we dene the following magnitudes

sm= maxfs < kjqs+1:kmg;

m=Pkj=sm+1qj,(k,sm,1)m;

qm= (q1;:::;qsm;m;m;:::;m)2Rn.

We call qm a rounded q-averagedue to its near resemblance to the q- averagesws(seesection 3). The rstsmcomponentsaremaximally spread out (w.r.t. q) while the remaining components are minimally spread out:

they are all equal except for the (sm+ 1)'th which may be viewed as an integrality correction. Observe that if m qk, then sm = k,1 andqm= (q1;:::;qk;m;:::;m). Finally, we note that ifm := qs+1:k is integral, thensm=sandqm=ws. Some basic properties ofqmare given next.

Lemma 4

Let mbe a nonnegative integer withmmmax. Thenqmis integral, nonincreasing and qmkq. Furthermoreqm(Nk) =Q.

Proof.

It is clear that qm is integral. To prove that the components of qm are nonincreasing we shall bound m. Let s = sm. Then we havem =Pkj=s+1qj,(k,s,1)m = (k,s)( qs+1:k,m) +mm because k,s 1 and qs+1:k m (due to the denition of s = sm).

Furthermorem =Pkj=s+1qj,(k,s,1)m Pkj=s+1qj,(k,s, 1) qs+2:k = Pkj=s+1qj,Pk

j=s+2qj = qs+1 < qs. Thusqs > m m andqmis nonincreasing. To prove the majorization, we rst observe that whenj s we haveqm(Nj) =q(Nj) and by the rst part of the proof qm(Ns+1) =q(Ns) +m < q(Ns) +qs+1 =q(Ns+1). This shows that LqmLqon [0;(s+1)k]. Next,qm(Nk) =q(Ns)+m+(k,s,1)m= q(Ns) +Pkj=s+1qj = q(Nk) =Q, so Lqm(k) = Lq(k). This combined with the concavity ofLq and the fact thatLqm is linear on [(s+ 1)=k;1]

implies that Lq Lqm on [(s+ 1)=k;1]. ThusLq Lqm on [0;1] and thereforeqmkq.

Note that qm0 qm form0 :=qk > m, so we havecTqm0 cTqm as the objective functionc is nonnegative. We shall therefore hereafter restrict the attention tom2M :=fqk;:::;mmaxg. Sinceq is strictly de- creasing, the tail averages are ordered qk:k<qk,1:k< ::: <q1:k. There- fore the closed intervals Is := [ qs+1:k;qs:k] for s = 1;:::;k,1 are all

(13)

nonempty and consecutive intervals intersect in one point, namely a tail average ofq. We say thatm2M isq-extremeifmis either the smallest or largest integer in one of the intervalsIs,s= 1;:::;k,1. Some prop- erties of theq-extreme integers are collected in the next lemma; they are all easy consequences of the denition ofsm. We letMex denote the set ofq-extreme integers.

Lemma 5

Letm2M. Then we have (i) m2Ism+1;

(ii) ifm2Mex, thenm equals eitherbqsm+1:kcordqsm+2:ke; (iii) m2Mexif and only ifm's is obtained from some tail

averageqs:k by integer rounding either up or down.

We are now in a position to give the rst main result of this section.

Proposition 6

Each vertex ofQ(q;k) is a permutation of some rounded q-averageqm withmbeingq-extreme.

Proof.

We consider the ILP problem (10) and may assume that c1 ::: cn 0 (otherwise a suitable permutation would be introduced below). We shall prove that some roundedq-average is an optimal solution of this problem. From the rearrangement inequality (1) it follows that there exists an optimal solution x satisfyingx1 ::: xn 0, so we may consider the ILP problem

maxfcTxjxkq;x1:::xn0;xis integralg: (12) Note that when xis nonincreasing,xk q holds if and only ifx(Nr) q(Nr) forr= 1;:::;k. Thus the variablesxk+1;:::;xndo not enter any of these constraints and we may assume thatxk=:::=xn(asc0). For eachm2M, we consider the problem obtained from (12) by adding the constraintsxk=:::=xn=m. We shall solve this problem directly using previous results, and rst we note that the problem may be considered as one inkvariables:

maxfdTxjxkq;x1:::xk=m; xis integralg: (13) where the objective function d is given by dj = cj for j k,1 and dk=Pnj=kcj.

We claim that there exists an optimal solution x of (13) satisfying x(Nk) =q(Nk). To prove this, consider an optimal solution withx(Nk) largest possible. Assume thatx(Nk)< q(Nk); we shall deduce a contra- diction. Observe thatx(Nk)q(Nk),1 asxis integral. Ifxs > xs+1= ::: =xk for somes < k, we let x be the solution obtained from x by increasing the (s+ 1)'th component by 1. Clearly xis decreasing. Fur- thermore, asq is strictly decreasing andxs+1 =:::= xk, the L-curves LxandLq can only intersect on the interval [s=k;1] in one point, namely forj=s=k. ThusLx is strictly belowLq in [(s+ 1)=k;1], and therefore Lx Lq in this interval. Also, in [0;s=k] Lx = Lx, so x k q which implies that x is also optimal. In addition x(Nk) = x(Nk) + 1, which contradicts the maximality ofx(Nk). It remains to discuss the case when

(14)

x1=:::=xk. Asq is strictly decreasing, we must havex(Nj)< q(Nj) for all j. Consequently, if we dene the new solution x from xby in- creasing the rst variable by 1, we obtain, as in the rst case discussed, a new optimal solution with x(Nk) =x(Nk) + 1 again contradicting the maximality ofx(Nk). This proves the claim.

Consider (as in the claim above) an optimal solution x2Rk of (13) with x(Nk) = q(Nk). The feasibility of x implies that Lx Lq and LxL0 on [0;1] whereL0() :=Q,km(1,) for 2[0;1]. Therefore L L00 := minfLq;L0g. It is easy to check thatL00 =Lqm, so we get xkPk(qm) wherePk(qm) is the vector (projection) containing the rst k components ofqm. (Remark: Lq is a function on [0;1], althoughqm2

Rn). From the discussion in section 2 on optimizing overP(q;n) using the greedy algorithm, it is clear that Pk(qm) maximizesdTz forz 2Rk satisfying z k Pk(qm). ThusdTPk(qm) dTx, and the optimality of xin problem (13) gives that Pk(qm) is also and optimal solution to that problem.

Finally, by taking the maximum optimal value of the problems (13) over allmwe solve the original problem (10). The optimal value of (13) is (we lets=sm)Pkj=1cjPk(qm)j=Psj=1cjqj+cs+1fPk

j=s+1qj,(k, s,1)mg+Pnj=s+2cjm=Psj=1cjqj+cs+1Pk

j=s+1qj+mfPnj=s+2cj,

(k,s,1)cs+1g = (s) +m(s). where (s) and(s) are dened in the obvious way. Observe thatand depends onmbut only through s = sm. Thus the optimal value for thosem with sm = s0 (for some s0) is achieved by choosingm either maximal (if 0) or minimal (if <0) withsm=s0, i.e. mis extreme. Based on the initial discussions in this proof, we can conclude that for someq-extremem2M the rounded q-averageqmis an optimal solution of (10) which proves that each vertex ofQ(q;k) must be a roundedq-average as desired.

The previous proposition shows the form of each vertex, but it remains to decide if all the solutionsqm withm extreme are vertices ofQ(q;k).

The answer turns out to be armative, and to prove this we shall rst give an optimization result which is useful later as well. We study the problem maxfcTx jx 2 Q(q;k)g for a special stair-case objective function. For each real number 1 and each pair of integerss;twith 0s < k <

tn, we denecs;t()2Rnby cs;t() =

( forj= 1;:::;s, 1 forj=s+1;:::;t,

0 forj=t+1;:::;n, (14) and we consider the problem

maxf(cs;t()Txjx2Q(q;k)g: (15)

Lemma 7

Let s and t be as above. Denem =bqs+1:kc,m = m+ 1, = (k,s)m,Pkj=s+1qj,1= (t,s)=(k,s) and2= 1 +(t,k)=. Finally let M() denote the set ofm 2M such thatqm is an optimal solution of (15).

(15)

Then1 and21>1; in fact2> 1holds if and only ifqs+1:k

is fractional. FurthermoreM() is given by the following expressions:

(i) If1 < 1, then M() =fmmaxg. (ii) If=1, then M() =fm;:::;m maxg. (iii) If1< < 2, then M() =fmg.

(iv) If=2, then M() =fm;mg. (v) If > 2, then M() =fmg.

Proof.

Letc=cs;t(). From the denition of mwe get m+1>qs+1:k= (1=(k,s))Pkj=s+1qj and thus = (k,s)( m+ 1),Pkj=s+1qj > 0.

As is integral, we get 1. Note that 1 is well-dened as s < k and also1 = (t,s)=(k,s)>1 becauset > k. Furthermore, we have m+ 1qs+1:k+ 1, which impliesk,sand the inequality is strict if qs+1:k is fractional. Thus2 = 1 + (t,k)= 1 + (t,k)=(k,s) = (t,s)=(k,s) =1and here we see that2> 1if qs+1:kis fractional.

Consider the optimization problem (15). From Proposition 6, we know that there is an optimal solution of the formqmfor somem2Mex. We have cTqm= (,1)Xs

j=1qmj+Q+ (t,k)m: (16) asqmj=mforj k andqm(Nk) =Q. This implies that the following set of inequalities hold

cTqqk< ::: < cTqm: (17) In fact, formmwe getsmsand thereforeqmj=qj forj= 1;:::;s. Thus from (16) we getcTqm= (,1)Psj=1qj+ (t,k)m+Qwhich is strictly increasing as a function ofm(due tot > k), and all the inequalities in (17) follow.

Next, let m > m, so sm < s. Then qms+1 = ::: = qmk = m and

Ps

j=1qmj =Pkj=1qmj,Pk

j=s+1qmj =Q,(k,s)m. By inserting this in (16) we get (,1)(Q,(k,s)m) + (t,k)m+Q= m+Qwhere

=t,k,(,1)(k,s)). Therefore is positive (resp. zero, negative) i is smaller than (resp. equal to, larger than)1 = (t,s)=(k,s).

Thus if 1 < 1, then >0 and thereforecTqm < ::: < cTqmmax. Next, if=1, then = 0 andcTqm =:::=cTqmmax. And, nally, if > 1, then <0 andcTqm > ::: > cTqmmax.

It remains to compareqm and qm. From (16) we obtain after some calculations that cTqm,cTqm = (,1),(t,k). Thus (i) if >

1+(t,k)==2, thencTqm> cTqm, (ii) if=2, thencTqm=cTqm; and (iii) if < 2, thencTqm< cTqm.

The lemma now follows by putting all these inequalities together for varying.

The complete characterization of the vertices of Q(q;k) can now be given.

Theorem 8

The vertex set ofQ(q;k) consists of the vectors that can be obtained as permutations of someqm withmbeingq-extreme.

Referanser

RELATERTE DOKUMENTER