• No results found

Majorization, polyhedra and statistical testing problems

N/A
N/A
Protected

Academic year: 2022

Share "Majorization, polyhedra and statistical testing problems"

Copied!
22
0
0

Laster.... (Se fulltekst nå)

Fulltekst

(1)

UNIVERSITY OF OSLO Department of Informatics

Majorization, polyhedra and statistical testing problems.

Geir Dahl

Report 240,

ISBN 82-7368-155-6

February 1997

(2)
(3)

Majorization, polyhedra and statistical testing problems.

Geir Dahl

February 1997

Abstract

There are important connections between majorization and convex polyhedra. Both weak majorization and majorization are preorders related to certain simple convex cones. We investigate the facial struc- ture of a polyhedral cone C associated with a layered directed graph.

A generalization of weak majorization based on C is introduced. It denes a preorder of matrices. An application in statistical testing theory is discussed in some detail.

Keywords: Majorization, polyhedra, cone ordering, statistical testing.

1 Introduction

In this paper we study some problems related to majorization. For z IRn we let z[j] denote the jth largest number among the components of z. If

x,y IRn one says that x is weakly majorized by y, denoted by x w y, provided that

Xk j=1

x[j] Xk

j=1

y[j] fork = 1, . . . , n.

If, in addition, equality holds for k = n, then x is majorized by y and we write xy. Both and w are preorderings that reect how spread out the components of the vectors are. These concepts play an important role in dierent areas of mathematics and statistics, see [10], [1] and other papers

Institute of Informatics, University of Oslo, P.O.Box 1080, Blindern, 0316 Oslo, Nor- way (Email:[email protected])

1

(4)

in the special issue of Linear Algebra and Its Appl. (Volume 199, 1994) in honor of I. Olkin.

There are interesting (convex) polyhedra that are related to (weak) ma- jorization. This applies to the polytopen ofn×ndoubly-stochastic matri- ces as one has the well-known characterization of Hardy-Littlewood-Polya:

x y if and only if there exists an S n with x = Sy. For a given majorization, say x y, the polytope Ω(x y) consisting of all S n satisfying x=Sywas studied in [5]. Several combinatorial properties of this polytope were established. As another example, [6] and [7] contains a study of the concept of weakk-majorization from a polyhedral point of view. Both

w and are cone orderings corresponding to certain simple polyhedral cones (see [10]). For instance, consider the convex cone Dn+ of nonincreasing and nonnegative vectors

Dn+={xIRn:x1 . . .xn0}.

Assume that both the vectors xand y are nonincreasing. Then xis weakly majorized by y if and only if yx lies the polar cone of D+n (consisting of the vectorszsatisfyingwTz0for allw∈ Dn+). Now, due to the simplicity of the cone Dn+ one can explicitly determine a nite set of generators (a frame) so that Dn+ is the set of nonnegative linear combinations of these generators. From this fact one may derive nice characterizations of Schur- convex functions and also dierent characterizations of weak majorization.

In this paper we study from a polyhedral point of view a class of convex cones that contains Dn+ as a special case. The motivation comes from sta- tistical testing theory, and this application is discussed in some detail. The paper is organized as follows. In section 2 we introduce a polyhedral cone

C associated with layered directed graphs. In a special case C consists of nonnegative m×n matrices where no entry is smaller than any entry in a preceding row. Thus, for n= 1 we have C =Dm+. The faces of C are stud- ied (in the general case) and characterized by means of certain partitions in the graph. In section 3 we relate C to weak majorization and introduce a new preorder based on C. The application in statistical testing theory is presented in section 4. It concerns optimal tests for certain testing problems in discrete experiments.

We describe our notation. Sn denotes the group of permutations on n elements and Kn is the standard simplex in IRn, i.e., Kn = {x IRn+ :

Pn

j=1xj = 1}. For a nite set V we let IRV denote the vector space of real valued functions from V to IR. 0 denotes the vector with all components being zero. If S V the vectorχS is the incidence vector ofS (so χSv equals 1 ifvS and 0 otherwise) and we also denex(S) =PvSxv forxIRV. If

2

(5)

A IRV the convex hull (conical hull) ofAis denoted by conv(A)(cone(A)).

The relative interior of a convex set C is denoted by rint(C). For polyhedral theory, we refer to [4], [12], [14]. Some graph terminology is used, but is is fairly standard.

2 A cone of row-ordered vectors

Let ni for i= 1, . . . , m be given positive integers. Let Ri ={(i, j) : 1 j

ni}fori= 1, . . . , mand dene the index set (or node set)V =R1. . .Rm. Each set Ri is called a row. We let D = (V, E) denote the directed graph with node set V and with an arc from each node in the rowRi to each node in the next row Ri+1 for i = 1, . . . , m1. Thus, E ={(u, v) : u Ri, v

Ri+1 for some im1}. Dis a layered digraph. In the special case where

n1 = . . . = nm = n, the nodes correspond to the entries (or indices) of an

m×n-matrix. We shall study certain problems in the vector space IRV of real valued functions from V to the reals. In the matrix special case above this vector space may be identied with IRm,n.

We are interested in the polyhedral coneC IRV consisting of the vectors

xIRV that satisfy the following set of homogeneous linear inequalities (i) xu xv for all(u, v)E;

(ii) xv 0 for allvV . (1)

Thus a nonnegative vector x 0 lies in C i no component of x is smaller than a component in the next row. The cone C is full dimensional. IfxC we say that x is row-ordered.

Our main task in this section is to study the facial structure of C. The faces and, in particular, the extreme rays of C are of interest in two dierent contexts in the subsequent sections; majorization and statistical testing.

The faces of C are related to partitions of V as discussed in the fol- lowing. Consider a partition N = {N0, N1, . . . , Np} of V where the sets

N1, . . . , Np are nonempty while N0 may be empty. (A more concise no- tation would be N = (N0,{N1, . . . , Np})). We say that partitions N =

{N0, N1, . . . , Np} and M= {M0, M1, . . . , Mq} are equal and write N = M if p=q,N0 =M0 and{N1, . . . , Np}={M1, . . . , Mq}, so N1, . . . , Np is just a renumberingof the setsM1, . . . , Mp. (This is consistent with(N0,{N1, . . . , Np}) = (M0,{M1, . . . , Mq})). The partition N induces an equivalence relation N on N in the usual way, that is,iN j if and only ifi, j Nkfor somek p. We write uN 0 if i N0. If no confusion should arise, we may write in stead of N.

3

(6)

Partitions with a certain relation to the rows R1, . . . , Rm are of interest below, and to dene these some more terminology is useful. For integers

l and r with l r we call the set I = {l, l + 1, . . . , r 1, r} an interval and dene l(I) := l, r(I) := r. A family of intervals It, t = 0, . . . , p is called cross-free if there is a permutationπ Sp+1 with π(0) = 0 such that

r(Iπ(t+1)) l(Iπ(t)) for t = 0,1, . . . , p1. Let N = {N0, N1, . . . , Np} be a partition and dene the associated sets (projections)

I(Nt) ={i m :RiNt6=∅}

for t = 0, . . . , p. We say that N is cross-free if I(N0), I(N1), . . . , I(Np) is a family of cross-free intervals. It is not dicult to verify that N is cross-free if and only if the following conditions hold

CF(i) if uv whereuRi1, vRi2 and i1 < i < i2 then u w for each wRi;

CF(ii) for each i < m there is at most one k such that Nk intersects both row Ri and Ri+1;

CF(iii) RkN0 6= implies that Rt N0 for allt > k.

Roughly, this means, e.g. in the matrix case, that the sets N0, N1, . . . , Np

are stacked on top of each other with N0 at the bottom of the matrix, see Figure 1. Let Pbe the set of all cross-free partitions (with equality as dened above).

N0

N1 N2

N3 N4

Figure 1: A cross-free partition in the matrix case, m= 3and n= 4. We dene a polyhedral cone C(N) associated with the partition N:

C(N) ={xC :xu =xv when uv; xv = 0 when v0}. (2) A crucial property of C(N) holds whenN is a cross-free partition.

4

(7)

Lemma 2.1

Let N ∈ P. Then there is a point xC(N) such that for each

u, vV one has that xu =xv if and only if u v.

Proof.

Assume that the partition N = {N0, N1, . . . , Np} is cross-free.

Then the setsI(N0), I(N1), . . . , I(Np)are intervals and for some permutation

π Sp+1 with π(0) = 0 we haver(Iπ(t+1))l(Iπ(t)) for t= 0, . . . , p1. Let

x IRV denote the vector with xv = t when v Nπ(t), t = 0, . . . , p. Then, due to the construction, x C, and furthermore xu = xv if and only if

uv.

There is a bijection betweenP and the set of faces ofC. This is described in the following proposition.

Proposition 2.2

(i) For each N ∈ P the cone C(N) is a face of C. (ii) If F is a nonempty face of C, there is an N ∈ P withF =C(N). (iii) Let N,M ∈ P. Then C(N) =C(M) if and only if N =M.

(iv) If N = {N0, N1, . . . , Np} is a cross-free partition the face C(N) has dimension p.

Proof.

(i). Let N = {N0, N1, . . . , Np} be a cross-free partition of N. We prove that C(N) (as dened in (2)) is a face of C by nding an equivalent system describing C(N) and consisting of valid inequalities for C set to equality.

From property CF(i) it follows that if (i1, j1) = u v = (i2, j2) and

i1 < i2 then there is a path from u to v in D, say u = u0, u1, . . . , ut = v, where each ui u for each i. This means that each equalityxu =xv in (2) is equivalent to the equalities xui = xui+1 for i = 0, . . . , t1. Similarly, by CF(iii), if u 0 there is a path in D consisting of nodes u =u0, u1, . . . , ut

with ut Rm and ui u for all i. Thus, xu = 0 is equivalent to xu =

xu1, . . . , xut1 = xut, xut = 0. We may therefore replace all the equalities in (2)) by an equivalent system of valid inequalities for C set to equality, and therefore C(N) is a face ofC.

(ii). Observe that all the inequalities xv 0 for v V \Rm may be removedin the denition ofC; they are impliedby the remaining inequalities.

Let F be a face of C. Then

F ={xIRV :xu =xv when (u, v)E0, xv = 0 when vV0}

for suitableE0 EandV0 Rm. Dene for eachvV the setN(v)V by

N(v) = {u V : xu = xv for all xF}. Since v N(v) all these sets are nonempty. Moreover, it is easy to see any two setsN(v1)andN(v2)are either

5

(8)

equal or disjoint. Consider the setT ={uV :xu= 0 for allxF}. This set may be empty, but if it is nonempty, sayvT, thenT =N(v). It follows that the setsN(v),vV constitute a partitionN ={N0, N1, . . . , Np}where

N0 = T is possibly empty. As usual, let denote the equivalence relation induced by this partition. We prove that

F =C(N).

Let x F. If u v, then u N(v) and therefore, due to the denition of

N(v), xu = xv holds. Also, if x F and v T, then xv = 0. It follows that F C(N). To prove the converse inclusion, let (u, v) E0. Then each x F satises xu = xv, so u N(v), i.e., u v. Thus each arc in

E0 has both endnodes in the same equivalence class. Consequently all the equalities dening F are also valid equalities for C(N) (because we clearly have V0 T). This proves that F =C(N), as desired.

It remains to prove that the partitionN is cross-free. Assume rst that

i1 < i < i2 and u Ri1, v Ri2,w Ri with u v. Then, by denition of

N, we have thatxu = xv for all x F (and F is assumed nonempty). But for x F C, we have xu xw xv, and therefore xu =xw for all xF so uw. This proves that CF(i) holds. Next, assume (u1, v1),(u2, v2)E0 whereu1, u2 Ri. LetxF. Thenxu1 xv2 =xu2 xv1 =xu1 so all these numbers are equal and u1 u2. This proves CF(ii). Finally, let uRkN0 and let x F. Thus, xu = 0 and therefore, if v Rt with t > k, we get

0 = xu xv 0 and vN0. This proves that N is cross-free.

(iii). Consider two cross-free partitions N ={N0, N1, . . . , Np} and M=

{M0, M1, . . . , Mq}such thatC(N) = C(M). Assume that there are nodes u and vwithuN vandu6≡M v. Using Lemma 2.1 we can nd an xC(M) withxu 6=xv. ButC(N) =C(M)soxC(N)which gives (asuN v) that

xu = xv; a contradiction. It follows that the equivalence classes induced by

N and those induced by Mcoincide. Similar arguments give thatN0=M0. Therefore N = M. (The converse, that C(N) = C(M) when N = M is trivial).

(iv). LetN ={N0, N1, . . . , Np}be a cross-free partition and consider the face C(N). It follows from Lemma 2.1 that no inequality xu xv where

u 6≡ v is an implicit equality for C(N). Thus the dimension of C(N) may be found from the rank r of the equalities in (2). It is easy to check that

r =|N0|+Pp

t=1(|Nt|−1) and from the dimension formula for polyhedra (see [12]) we get dim(C(N)) =|V| −r=p.

The cone C(N) for the cross-free partition N shown in Figure 1 has dimension 4.

Note that there are many subsystems of (1) that induce the same face of C. In fact two such subsystems induce the same face if and only if they

6

(9)

dene the same cross-free partition according to the procedure described in the proof of property (ii) of Proposition 2.2.

We present some further relations between cross-free partitions and the faces of C. Let N = {N0, N1, . . . , Np} and M= {M0, M1, . . . , Mq} be two cross-free partitions. We say that N is ner than M, and write N ⊆ M, if N0 M0 and each set Nt is contained in some set Ms. It is easy to see

(P,) is a partially ordered set.

Remark 2.3

It is useful to see what this partial ordering corresponds to in the digraph D. ForN ⊆ P dene the arc set

E(N) ={(u, v)E :uN v}.

consisting of arcs with both endnodes in the same equivalence class. Con- versely, for any subset E0 of E the connected components of the subgraph

(V, E0) (ignoring arc directions) gives rise to a partition of V, although it may not be cross-free. We observe that N ⊆ M if and only ifN0 M0 and

E(N)E(M).

The following result is a strengthening of Proposition 2.2 (iii).

Proposition 2.4

Let N and M be cross-free partitions. Then N ⊆ M if and only if C(N)C(M).

Proof.

Assume that N ⊆ M and let xC(M). If uN v, then uM v and therefore xu = xv. If u N 0, then u M 0 and xu = 0. This proves that xC(N), and we conclude that C(N)C(M).

Conversely, assume that C(N) C(M). Assume that there are nodes

u and v with u N v and u 6≡M v. Due to Lemma 2.1) we may nd an

x C(M) such that xu 6= xv. This implies that x 6∈ C(N) (for u N v) which contradicts that C(N) C(M). It follows that whenever u N v we also have u 6≡M v. Furthermore, if uN 0, the eachx C(N) satises

xu= 0and, in particular, this holds for eachxC(M)(as C(N)C(M)).

This proves that N ⊆ M.

Let FC denote the set of all faces of the cone C. It is well known that

(FC,)is a lattice, called the face-lattice ofC(the partial ordering is setwise containment), see [4], [14]. We let, in any lattice,F G(F G) denote the smallest upper bound or join (greatest lower bound or meet) of the elements

F and G.

Corollary 2.5

(P,)is a lattice which is anti-isomorphic to the face lattice

(FC,).

7

(10)

Proof.

This may be proved directly, but it also follows from Proposition 2.5 as follows. Due to Proposition 2.2 the function f : P → FC given by

f(N) =C(N) is a bijection. From Proposition 2.5 it follows that N ∨ U =

f1(C(N)C(M)) and N ∧ U = f1(C(N)C(M)). This proves that both meet and join exist in P so this is a lattice and, in addition, f is a lattice anti-isomorphism from P intoFC.

We may also give an explicit description of how the lattice operations act on P. Let N = {N0, N1, . . . , Np} and M ={M0, M1, . . . , Mq} be cross-free partitions. We rst determine N ∧ M. The sets Ni Mj for 1 i p,

i j q dene a partitionU = {U0, U1, . . . , Ur} where U0 = N0M0 and

U1, . . . , Urare the remaining nonempty setsNiMj. It is easy to see thatU is cross-free (all the properties CF(i)(iii) hold) and that it must be the greatest lower bound ofN andM, i.e.,U =N ∧M. It is somewhat more complicated to determineN ∨M. LetU ={U0, U1, . . . , Ur}be an upper bound (inP) for two cross-free partitionsN andM. Thus, due to Remark 2.3,N0M0 U0 andE(N)E(M)E(U). LetW ={W0, W1, . . . , Wt}denote the partition induced by the connected components in the subgraph (V, E(N)E(U)) of

D, and let W0 be the union of the (one or two) components intersecting eitherN0 or M0. Observe thatW ⊆ U. W may not be cross-free, but we can modify it into a cross-free partition as follows. If we can nd arcs (u1, v1) and (u2, v2) with u1 W v1, u2 W v2 and u1 6≡W u2, then we replace these two equivalence classes (containing u1 resp. u2) by their union. We repeat this procedure until there is no arc pair left with the mentioned properties.

Next, if there are two equivalence classes, say W1 and W2, with all the sets

W1Ri1,W1Ri+1 and W2Ri nonempty, then we replaceW1 and W2 by their union W1 W2. This is repeated until there is no pair of equivalence classes with these properties. Let W0 denote the new partition obtained by this procedure. It is not dicult to check that W0 is cross-free and that

W0 ⊆ U. Thus, we must have W0 =N ∨ M.

Observe that the construction of the meet and join in the lattice P also translates (via the bijectionf) to nding the meet and join for a pair of faces of the cone C.

Let k ∈ {1, . . . , m 1}. We call a subset S of V a k-block in D if

S = R1∪ · · · ∪Rk S0 for some nonempty subset S0 of Rk+1. A block is a

k-block for some k.

Corollary 2.6

The extreme rays of C are the faces C(N) constructed from cross-free partitions N = {N0, N1}, i.e. p = 1. Moreover, C is generated by the incidence vectors χS where S is either a block in D or S consist of a single node in R1, i.e., each vector in C may be written as a nonnegative linear combination of the mentioned vectors.

8

(11)

Proof.

The extreme rays have dimension 1 so the rst part follows from Proposition 2.2. The second form follows from the cross-free property of the partition N.

Each of the inequalities xv 0 for v Rm and xu xv for (u, v) E denes a facet of C. This is easy to prove directly, and it also follows from Proposition 2.2. Let e and f denote the number of extreme rays and facets of C, respectively. We see that

e=n1+ Xm

i=2

(2ni 1), f =

mX1 i=1

nini+1+nm.

In the matrix case with ni =nfor all i, we obtain e =n+ (m1)(2n1) and f = (m 1)n2 +n. In particular, the number of extreme rays grows exponentially in n except whenn= 1.

The generators of C determined in Corollary 2.6 may be used to give a parametric form of each face ofC. Consider a faceF ofC, so (by Proposition 2.2) F =C(P)for some cross-free partitionN ={N0, N1, . . . , Np}. The sets

I(N0), I(N1), . . . , I(Np) are intervals and there is a permutation π Sp+1 with π(0) = 0 and r(Iπ(t+1)) l(Iπ(t)) for t = 0, . . . , p 1. Dene, for

t = 0, . . . , p the node set Gt = pk=tNπ(k). Then each Gt is either a block or consist of a single node in R1. Moreover, χGt, t = 0, . . . , p are anely independent and they generate the face F, i.e.,

F =cone({χGt :t = 1, . . . , p}).

We omit the proof of these facts.

Finally, let us consider the two-dimensional faces of C. Each such face is spanned by two generators of C. We say that that two distinct generatorsz andwareadjacentifF =cone({z,w})is a face ofC (and thendim(F) = 2).

One may derive from Corollary 2.6 that (i) χv for eachvR1 is adjacent to all other generators, and (ii) for distinct blocks S and T the generators χS and χT are adjacent if and only if S T or T S. This implies that the diameter of C is two: any two generators are either adjacent or they are both adjacent to some other generator.

As an application of the results above we consider a polytope obtained by intersectingC with a certain hyperplane. Letp IRV be a vector satisfying

pv > 0 for v V and PvV pv = 1. Later, p will be viewed as a discrete probability distribution on the node set V. Consider the polyhedron

C(p) ={xC :X

vV

pvxv = 1} (3)

9

(12)

Observe that C(p)is bounded asC IRn+and eachpv is positive. Therefore,

C(p)is a polytope. The faces of C(p)are the intersection between the faces of C and the hyperplane {x IRV : PvV pvxv = 1}. In particular, we may determine the vertices of C(p) from Corollary 2.6. Recall the notation

p(S) = P

vSpv for each subset S of V.

Corollary 2.7

The vertices of C(p) are the points (1/p(S))χS where S is either a block in D or S consist of a single node in R1.

3 Row-ordered majorization

In this section we introduce a vector ordering which may be seen as a gen- eralization of weak majorization. Some properties of the new ordering are given.

We consider again the digraphDintroduced in section 2, but restrict the attention to the matrix case where |Ri|=n fori= 1, . . . , m. Thus each x

IRV may be viewed as a realm×n-matrix with (i, j)th entryxi,j, and this is done throughout the present section. The results below also hold for a general node set V, but the matrix case is of special interest. We identify the vector spaces IRV and IRm,n. This space is equipped with the usual inner product for vectors which in matrix form is hx,yi= Pi,jxi,jyi,j = Trace(xTy). We let 0denote the matrix (suitably dimensioned) with all zeros.

LetK denote the the set of generators for the cone C, that is (see Corol- lary 2.6) K contains χv forv R1 and the incidence vectors of blocks inD. Thus we have

C=cone(K).

The polar cone (sometimes called the dual cone) of C is the convex cone

C ={xIRm,n:hy,xi ≥0 for all yC}.

Consider an m ×n-matrix x. We say that x C-majorizes 0 and write

x C 0, or 0 C x, provided that x C. Since K generates C, x C 0 holds if and only if

hg,xi ≥0 for all gK. (4)

If yIRm,n we say that xC-majorizes by yand write xC y or yC x if

xyC 0.

We may write the concept ofC-majorization in a more transparent form.

For a real numbera we writea :=max{−a,0}.

10

Referanser

RELATERTE DOKUMENTER

http://www.tabnak.ir/pages/?cid=42. As there is a steady, very important stream of illegal smuggling of fuel out of Iran, where the price is among the world’s lowest, the claim

While we managed to test and evaluate the MARVEL tool, we were not able to solve the analysis problem for the Future Land Power project, and we did not provide an answer to

The system can be implemented as follows: A web-service client runs on the user device, collecting sensor data from the device and input data from the user. The client compiles

Based on the above-mentioned tensions, a recommendation for further research is to examine whether young people who have participated in the TP influence their parents and peers in

An abstract characterisation of reduction operators Intuitively a reduction operation, in the sense intended in the present paper, is an operation that can be applied to inter-

However, a shift in research and policy focus on the European Arctic from state security to human and regional security, as well as an increased attention towards non-military

In this section we give an overview of different refinement oracles used in the hierarchical radiosity setting 1 , and also some information theory tools for scene discretisation

In this paper we discuss how Interval Analysis can be used to solve some problems in Computer Vision, namely autocalibration and triangulation.. The crucial property of