• No results found

Constructing panel data estimators by aggregation: A general moment estimator and a suggested synthesis

N/A
N/A
Protected

Academic year: 2022

Share "Constructing panel data estimators by aggregation: A general moment estimator and a suggested synthesis"

Copied!
30
0
0

Laster.... (Se fulltekst nå)

Fulltekst

(1)

Discussion Papers No. 420, May 2005 Statistics Norway, Research Department

Erik Biørn

Constructing Panel Data Estimators by Aggregation:

A General Moment Estimator and a Suggested Synthesis

Abstract:

A regression equation for panel data with two-way random or fixed effects and a set of individual specific and period specific `within individual' and `within period', estimators of its slope coefficients are considered. They can be given Ordinary Least Squares (OLS) or Instrumental Variables (IV) interpretations. A class of estimators, obtained as an arbitrary linear combination of these

`disaggregate' estimators, is defined and an expression for its variance-covariance matrix is derived.

Nine familiar `aggregate' estimators which utilize the entire data set, including two between, three within, three GLS, as well as the standard OLS, emerge by specific choices of the weights. Other estimators in this class which are more robust to simultaneity and measurement error bias than the standard aggregate estimators and more efficient than the `disaggregate' estimators, are also considered. An empirical illustration of robustness and efficiency, relating to manufacturing productivity, is given.

Keywords: Panel data. Aggregation. Simultaneity. Measurement error. Method of moments. Factor productivity

JEL classification: C13, C23, C43.

Address: Erik Biørn, University of Oslo and Statistics Norway,Address for correspondence:

Department of Economics, P.O. Box 1095 Blindern, 0317 Oslo, Norway. E-mail:

erik.biorn@econ.uio.no

(2)

Discussion Papers comprise research papers intended for international journals or books. A preprint of a Discussion Paper may be longer and more elaborate than a standard journal article, as it may include intermediate calculations and background material etc.

Abstracts with downloadable Discussion Papers in PDF are available on the Internet:

http://www.ssb.no

http://ideas.repec.org/s/ssb/dispap.html

For printed Discussion Papers contact:

Statistics Norway

Sales- and subscription service NO-2225 Kongsvinger

Telephone: +47 62 88 55 00 Telefax: +47 62 88 55 95

E-mail: Salg-abonnement@ssb.no

(3)

1 Introduction

A primary reason for the substantial growth in the availability and use of panel data in econometrics during the last thirty years is the opportunity that such data give for identifying and controlling for unobserved heterogeneity which may affect the estima- tion of slope coefficients and other parameters of interest from cross-section data, time- series data, or repeated (non-overlapping) cross-sections. It is well known [seee.g., Bal- tagi (2001, chapters 2 and 3), Balestra (1996), and M´aty´as (1996)] (i) that the potential nuisance created by fixed (additive) individual heterogeneity in OLS estimation can be eliminated by measuring all variables from their individual means or taking individual differences over time, (ii) that the potential nuisance created by fixed (additive) time specific heterogeneity in OLS estimation can be eliminated by measuring all variables from their time specific means or taking time specific differences over individuals, and (iii) that efficient estimation in the presence of suitably structuredrandom individual or time specific heterogeneity, can be performed by (Feasible) Generalized Least Squares.

It is, however, possible to construct such aggregate estimators from disaggregate building-blocks. Approaching estimation in this way, is far from an algebraic exercise. It is illuminating primarily because we can utilize the fact that regression coefficients can be estimated consistently from parts of a panel data set in a large number of ways and that some disaggregate estimators are more robust to bias than others. We can, for instance apply all observations from one or two individuals or from one or two periods only. By combining an increasing number of individual specific or period specific estimators, we can include an increasing part of the observations until, at the limit, we utilize the full panel data set. Such an investigation is interesting on the one hand because several familiar estimators (within, between, generalized least squares etc.) for coefficients in panel data models can be interpreted as known linear combinations of elementary estimators, on the other hand because we get suggestions of other estimators along the way.

The paper is structured as follows. After describing the model and some ways of transforming it (Section 2), we first, in Section 3, define ‘disaggregate’ ‘within individ- ual’ and ‘within period’ estimators, each of which can be given either an OLS or an Instrumental Variables (IV) interpretation. In Section 4, a more general moment esti- mator, obtained by an arbitrary weighting of these elementary estimators as well as its variance-covariance matrix, is constructed. We next reconsider nine familiar estimators of a slope coefficient vector in a linear regression equation for panel data with two-way random or fixed effects, three of which are ‘within (group)’, two are ‘between (group)’

estimators related to individual or time variation, one is the standard OLS (Ordinary Least Squares) and three are Generalized Least Squares (GLS) estimators. We show that these‘aggregate’ estimators can all belong to this class and demonstrate that our

(4)

general estimator contains not only the nine estimators mentioned above, but also several others which are more robust to violation of the standard assumptions in random coeffi- cient models. In this process, some textbook results [Maddala (1977, section 14–2) and Hsiao (2003, section 2.2)] and some results in Biørn (1994, 1996) are generalized. Both a standard regression framework and situations with simultaneity (correlation between individual effects, period effects, and/or disturbances on the one hand and the regressor vector on the other) and situations with random measurement errors in the regressor vector are considered. Among the latter estimators we select estimators which are more robust to simultaneity and measurement errors and more efficient than the ‘disaggregate’

estimators. Finally, an empirical illustration of robustness and efficiency loss, relating to manufacturing productivity, is given.

2 Model, notation, and transformations

Consider a linear regression equation relatingy to a vector of K (stochastic) regressors x, with data set from a panel of N ( 2) individuals observed in T (2) periods:

yit=k+xitβ+²it, ²it =αi+γt+uit, i= 1, . . . , N; t= 1, . . . , T, (2.1)

whereyit and xit = (x1it, . . . , xKit) are the values of y and x for individual i in period t, β = (β1, . . . , βK)0 is the coefficient vector, αi and γt are random effects specific to individualiand periodt, respectively,uit is a genuine disturbance, andkis an intercept term. It is, however, possible to interpretαi andγtas fixed effects, see Section 5. At the moment, we make the standard assumptions for two-way random effects models,

uit IID(0, σ2), αi IID(0, σ2α), γtIID(0, σγ2), i= 1, . . . , N; t= 1, . . . , T, (2.2)

uit, αi, γt, xit are independently distributed for all iand t, (2.3)

which imply

E(²it|X) = 0, E(²it²js|X) =δijσα2 +δtsσγ2+δijδtsσ2, i, j= 1, . . . , N, t, s= 1, . . . , T, (2.4)

whereδij = 1 for i=j and = 0 fori6=j, and δts = 1 fort=sand = 0 fort6=s, andX is the (N T ×K) matrix containing all the xit’s. Some of the assumptions in (2.2) and (2.3) will be relaxed later on.

The individual specific vectors and matrices, of dimension (T ×1) and (T ×K), respectively, and the period specific vectors and matrices, of dimension (N ×1) and (N×K), respectively, are

yi·=

yi1

... yiT

, Xi·=

xi1

... xiT

, t=

y1t

... yN t

, t=

x1t

... xN t

,

(5)

which are stacked into y=

y1·

... yN·

, X =

X1·

... XN·

, y =

1

... T

, X=

1

... T

.

Further, let eH be the (H×1) vector of ones, IH the H-dimensional identity matrix, AH = eHeH0 /H, BH = IH −AH and let α = (α1, . . . , αN)0 and γ = (γ1, . . . , γT)0. Alternative ways of writing (2.1) are then

yi·=eTk+Xi·β+²i·, ²i·=eTαi+γ+ui·, i= 1, . . . , N, (2.5)

t=eNk+tβ+²·t, ²·t=α+eNγt+t, t= 1, . . . , T, (2.6)

where²i·,ui·,²·t,tare defined in similar way asyi·andt, and after deducting global means we obtain

yi·−y¯= (Xi·−X)β¯ +²i·−¯², ²i·−¯²=eTi−α) +¯ BTγ+ui·−u,¯ (2.7)

t−y¯= (X·t−X¯)β+²·t−¯², ²·t−¯² =BNα+eNt−¯γ) +u·t−u¯, (2.8)

where ¯α = (1/N)Piαi, ¯γ = (1/T)Ptγt, X¯ = (1/N)PiXi·, X¯ = (1/T)Ptt,

¯

y = (1/N)Piyi·, y¯ = (1/T)Ptt, , etc. Premultiplying (2.5) by BT, (2.7) by AT, (2.6) byBN and (2.8) byAN, give, respectively,

BTyi· = BTXi·β+BT²i·,

AT(yi· −y) =¯ AT(Xi· −X)β¯ +ATi· −¯²), (2.9)

BNt = BNtβ+BN²·t,

AN(y·t−y¯) = AN(X·t−X¯)β+AN·t¯²).

(2.10)

We letW,V,B, andC, with appropriate subscripts, symbolize matrices containing within individual, within period, between individual, and between period (co)variation, respectively. Define individual specific and period specific cross-product matrices as follows:

WXXij =Xi0·BT Xj·=PTt=1(xit−x¯i·)0(xjt−x¯j·),

WXγi =Xi0·BT γ=PTt=1(xit−x¯i·)0t¯γ), i, j= 1, . . . , N, (2.11)

VXXts =0tBNs=PNi=1(xit−x¯·t)0(xis−x¯·s),

VXαt =0tBNα=PNi=1(xit−x¯·t)0i−α),¯ t, s= 1, . . . , T, (2.12)

BXXii = (Xi· −X)¯ 0AT(Xi· −X) =¯ Txi· −x)¯ 0(x¯i· −x),¯

BXαii= (Xi· −X)¯ 0eTi−α) =¯ T(x¯i· −x)¯ 0i−α),¯ i= 1, . . . , N, (2.13)

CXXtt= (X·t−X¯)0AN(X·t−X¯) =Nt−x)¯ 0(x¯·t−x),¯

CXγtt= (X·t−X¯)0eNt¯γ) =N(x¯·t−x)¯ 0t−γ),¯ t= 1, . . . , T, (2.14)

etc., wherex¯i·= (eT0/T)Xi·,x¯·t= (eN0 /N)X·t,x¯= (eN T0 /(N T))X= (eT N0 /(T N))X. These matrices have the following properties:

(6)

WXXij, which has full rank K if xit contains no individual specific variables, is the (K×K) matrix ofwithin individual covariation in thex’s of individuals iand j, and VXXts, which has full rankK ifxit contains no period specific variables, is the (K×K) matrix of within period covariation in the x’s of periods t and s. If individual specific regressors occur,WXXij has one zero column (and row) for each such variable, and if period specific regressors occur, VXXts has one zero column (and row) for each such variable.

BXXii and CXXtt, which have rank 1, are the (K×K) matrices of between indi- vidual cross-products and between period cross-products of the x’s of individual i and period t, respectively.

WXγi is the (K×1) vector of within covariation of thex’s of individual iand the period specific effects, VXαt is the (K×1) vector of within covariation of thex’s of period tand the individual specific effects,BXαi is the (K×1) vector ofbetween cross-products of thex’s of individualiand its individual specific effects, andCXγt is the (K×1) vector ofbetween cross-products of thex’s of periodtand its period specific effects.

Premultiplying the two equations in (2.9) by Xi0·BT and (Xi· −X)¯ 0AT, respec- tively, and premultiplying the two equations in (2.10) by 0tBN and (X·t−X¯)0AN, respectively, while using (2.11)–(2.14), we get

WXY ij =WXXijβ+WX²ij, WX²ij =WXγi+WXU ij, i, j= 1, . . . , N, (2.15)

BXY ii =BXXiiβ+BX²ii, BX²ii =BXαii+BXU ii, i= 1, . . . , N, (2.16)

VXY ts=VXXtsβ+VX²ts, VX²ts=VXαt+VXU ts, t, s= 1, . . . , T, (2.17)

CXY tt=CXXttβ+CX²tt, CX²tt=CXγtt+CXU tt, t= 1, . . . , T.

(2.18)

These can be considered ‘moment versions’ of Eq. (2.1),

3 Base estimators and their properties

The fact that WX²ij and have zero expectations when (2.2) and (2.3) are satisfied, in combination with (2.15) and (2.17) motivate the followingN2 individual specific and T2 period specific estimators ofβ:

βbW ij =W−1XXijWXY ij = (Xi0·BTXj·)−1(Xi0·BTyj·), i, j= 1, . . . , N, (3.1)

βbV ts=V−1XXtsVXY ts= (X·0tBNs)−1(X·0tBNs), t, s= 1, . . . , T.

(3.2)

We denote them asbase estimators, ordisaggregate estimators, ofβ. They can be given the following interpretations:

(7)

(i) βbW ii is the OLS estimator based on observations from individual i, and βbW ij, for j 6= i, is the IV estimator based on the ‘within variation’ of individual j, BTXj·, using the ‘within variation’ of individual iin Xi·, BTXi·, as IV matrix.

(ii) βbV tt is the OLS estimator based on observations from periodt, andβbV ts, fors6=t, is the IV estimator based on the ‘within variation’ of period s, BNs, using the

‘within variation’ of period tin t, BNt, as IV matrix.

All theseN2+T2estimators exist if all elements ofxitvary across individuals and periods, since this usually ensures thatWXXij and VXXts have rankK.

If individual specific variables occur, so that WXXij contains one or more zero rows and columns, their coefficients cannot be estimated from (3.1), but estimators for the coefficients of the other, i.e., the two-dimensional or period specific variables, can be solved fromWXXijβbW ij =WXY ij. Likewise, if period specific variables occur, so that VXXtscontains one or more zero rows and columns, their coefficients cannot be estimated from (3.2), but estimators for the coefficients of the other, i.e., the two-dimensional or individual specific variables, can be solved fromVXXtsβbV ts=VXY ts.

Since inserting for WXY ij from (2.15) and for VXY ts from (2.17) in (3.1) and (3.2) gives, respectively,

βbW ij−β=W−1XXijWX²ij=W−1XXij(WXγi+WXU ij), i, j= 1, . . . , N, (3.3)

βbV ts−β=V−1XXtsVX²ts=V−1XXts(VXαt+VXU ts), t, s= 1, . . . , T, (3.4)

and (2.2) and (2.3) imply

E(WXU ij|X) =E(WXγi|X) =0K1, i, j= 1, . . . , N, (3.5)

E(VXU ts|X) =E(VXαt|X) =0K1, t, s= 1, . . . , T, (3.6)

we know that βbW ij and βbV ts are unbiased estimators for β. Furthermore, βbW ij is consistent when T → ∞ (T-consistent, for short), since then plim(WX²ij/T) = 0K1, provided that plim(WXXij/T) is non-singular, andβbV tsis consistent whenN → ∞(N- consistent, for short), since then plim(VX²ts/N) = 0K1, provided that plim(VXXts/N) is non-singular.

However, some of the base estimators may be consistent even if conditions (2.2)–(2.3) are weakened. The followingrobustness results hold:

[1 ]Since (3.3) does not contain α, all βbW ij are T-consistent even if αi is treated as fixed or allowed to be correlated with x¯i·, but if γt is correlated with x¯·t, all βbW ij are inconsistent. Symmetrically, since (3.4) does not contain γ, all βbV ts are N- consistent even ifγt is treated as fixed or allowed to be correlated withx¯·t, but ifαi is correlated with x¯i·, all βbV ts are inconsistent.

(8)

[2 ]Endogeneity of or random measurement error in (some components of) xit may cause E(xit0uit) 6=0K1. Then plim(WXU ii/T)6=0K1, so that the OLS estimators βbW ii are inconsistent, but the IV estimators βbW ij (j 6= i) remain T-consistent.

Symmetrically, we then also have plim(VXU tt/N)6=0K1, so that the OLS estima- tors βbV tt are inconsistent, but the IV estimatorsβbV ts (s6=t)remainN-consistent.

In Appendix A it is shown that when (2.2)–(2.3) hold, the matrices of covariances be- tween the individual specific and the period specific base estimators, respectively, can be expressed as

C(βbW ijbW kl|X) =E[(βbW ij−β)(βbW kl−β)0|X]

(3.7)

= (σγ2+δjlσ2)W−1XXijWXXikW−1XXlk, C(βbV tsbV pq|X) =E[(βbV ts−β)(βbV pq−β)0|X]

(3.8)

= (σα2 +δsqσ2)V−1XXtsVXXtpV−1XXqp, C(βbW ijbV pq|X) =E[(βbW ij−β)(βbV pq−β)0|X]

(3.9)

=σ2W−1XXij(xiq−x¯i·)0(xjp−x¯·p)V−1XXqp, i, j, k, l= 1, . . . , N, t, s, p, q= 1, . . . , T.

Eq. (3.7) for (k, l) = (i, j) and (3.8) for (p, q) = (t, s) give in particular the variance- covariance matrices

V(βbW ij|X) = E[(βbW ij−β)(βbW ij−β)0|X]

(3.10)

= (σ2γ+σ2)W−1XXijWXXiiW−1XXji, i, j= 1, . . . , N, V(βbV ts|X) = E[(βbV ts−β)(βbV ts−β)0|X]

(3.11)

= (σ2α+σ2)V−1XXtsVXXttV−1XXst, t, s= 1, . . . , T.

When (2.2)–(2.3) hold,βbW jj and βbV ss are always more efficient than βbW ij (j 6=i) and βbV ts (s 6= t), respectively, i.e., V(βbW ij|X)−V(βbW jj|X) for i 6= j and V(βbV ts|X)− V(βbV ss|X) for t6=sare positive (semi)definite matrices. The formal proof of this is

V(βbW ij|X)−V(βbW jj|X) = (σγ2+σ2)(W−1XXijWXXiiW−1XXji−W−1XXjj)

= (σ2γ+σ2)(A−1W XijA−1W Xji−IK)W−1XXjj, V(βbV ts|X)−V(βbV ss|X) = (σ2α+σ2)(V−1XXtsVXXttV−1XXst−V−1XXss)

= (σα2 +σ2)(A−1V XtsA−1V Xst−IK)V−1XXss, where

AW Xij =W−1XXiiWXXij, AV Xts=V−1XXttVXXts.

The latter are the (K×K) matrix of (sample) regression coefficients when regressing the block ofX relating toj,i.e.,Xj·, on the block of X relating to individual i, i.e., Xi·,

(9)

and when regressing the block of X relating to period s, i.e., s, on the block of X relating to periodt, i.e., t, respectively. Here all (A−1W XijA−1W Xji−IK), j 6= i, and all (A−1V XtsA−1V Xst −IK), s 6= t, are positive (semi)definite matrices, provided that all variables inxit are two-dimensional.

The structure of the variance-covariance matrices of the estimators is transparent in theone-regressor case, K= 1. Then (3.7) and (3.10) read

C(βbW ijbW kl|X) = (σ2γjlσ2) WXXik

WXXijWXXkl, V(βbW ij|X) = (σγ22)WXXii WXXij2 , (3.12)

where WXXik, βbW ij, etc. denote the scalar counterparts to WXXik, βbW ij, etc. The coefficient of correlation between two arbitrary individual specific base estimators for the slope coefficient can therefore be written as

ρ(βbW ijbW kl|X) = C(βbW ijbW kl|X) [V(βbW ij|X)V(βbW kl|X)]1/2 (3.13)

= σ2γ+δjlσ2 σ2γ+σ2

WXXik

(WXXiiWXXkk)1/2 = ρ(²jt, ²lt)RW Xik, where RW Xik = WXXik/(WXXiiWXXkk)1/2 is the sample coefficient of correlation be- tween thex’s of individualsiandkandρ(²jt, ²lt) = (σγ2jlσ2)/(σγ22) is the coefficient of correlation between²jt and ²lt. If we therefore consider (2.5) as an N-equation model with one equation for each individual and with common slope coefficient,ρ(βbW ijbW kl|X) is simply the product of the coefficient of correlation between two²disturbances from in- dividuals (equations)jandlin the same period, and the coefficient of correlation between the values of the regressor (instrument) for individuals (equations)iand k. This means thatρ(βbW ijbW kl|X) has one equation specific component (j vs. l) and one instrument specific component (i vs.k). Forj=land for i=k(3.13) gives, respectively,

ρ(βbW ijbW kj|X) =RW Xik, for allj;i6=k [same equation (individual), different IV], ρ(βbW ijbW il|X) = σ2γ

σγ22, for alli;j6=l [different equations (individuals), same IV].

Symmetrically, (3.8) and (3.11) for K= 1 give C(βbV tsbV pq|X) = (σα2 +δsqσ2) VXXtp

VXXtsVXXpq, V(βbV ts|X) = (σα2 +σ2)VXXtt VXXts2 . (3.14)

The coefficients of correlation can therefore be written as ρ(βbV tsbV pq|X) = C(βbV tsbV pq|X)

[V(βbV ts|X)V(βbV pq|X)]1/2 (3.15)

= σ2α+δsqσ2 σ2α+σ2

VXXtp

(VXXttVXXpp)1/2 = ρ(²is, ²iq)RV Xtp,

(10)

whereRV Xtp =VXXtp/(VXXttVXXpp)1/2 is the coefficient of correlation between thex’s in periodstand pandρ(²is, ²iq) = (σ2α+δsqσ2)/(σα2+σ2) is the coefficient of correlation between²isand²iq. If we therefore consider (2.6) as aT-equation modelwith one equation for each period and with common slope coefficient,ρ(βbV tsbV pq|X) is simply the product of the coefficient of correlation between two ² disturbances from periods (equations) s and q for the same individual, and the coefficient of correlation between the values of the regressor (instrument) in periodst and p. This means thatρ(βbV tsbV pq|X) has one equation specific component (svs. q) and one instrument specific component (t vs. p).

Fors=q and t=p (3.15) gives, respectively,

ρ(βbV tsbV ps|X) =RV Xtp, for all s;t6=p [same equation (period), different IV], ρ(βbV tsbV tq|X) = σα2

σ2α2, for allt;s6=q [different equations (periods), same IV].

From (3.12) and (3.14) we find that theinefficiency when using the (within) variation of individualias IV for the (within) variation of individualj relative to performing OLS on the observations from individualjand when using the (within) variation of periodtas IV for the (within) variation of periodsrelative to performing OLS on the observations from periods, can be expressed simply as, respectively,

V(βbW ij|X)

V(βbW jj|X) = 1

AW XijAW Xji = 1 R2W Xij, (3.16)

V(βbV ts|X)

V(βbV ss|X) = 1

AV XtsAV Xst = 1 R2V Xts. (3.17)

Hence,R−2W Xij (≥1) andR−2V Xts (≥1) measure, respectively, the loss of efficiency when using estimators which are robust to inconsistency caused by simultaneity or random measurement error in the regressor, (i) by estimating a relationship for individual j by using as IV observations from another individual,i, rather than using OLS, and (ii) by estimating a relationship for period sby using as IV observations from another period, t, rather than using OLS.

4 A class of moment estimators

Since each of theN2+T2base estimators of β,βbW ij andβbV ts, only uses a minor part of the panel data set, they may not be considered real competitors to estimators constructed from the complete data set, when (2.2)–(2.3) are valid. And even if these assumptions are violated by correlation between xit and uit, between x¯i· and αi, and/or between

¯

t and γt, aggregate estimators which are more efficient than any of the IV estimators βbW ij (j 6= i) and βbV ts (s 6= t) may exist. Yet, the insight provided by examining

(11)

these base estimators as we have done in th previous section is useful when constructing composite estimators ofβ, of which they can serve as building-blocks.

This motivates the construction ofa class of estimators of β by weighting the indi- vidual specific or period specific (co)variation inX andy, as defined in (2.11)–(2.12), as follows: Letθ = (θts) be a (T×T) matrix andτ = (τij) an (N×N) matrix of (positive, zero or negative) weights and define ageneral moment estimator as

b=b(θ,τ) =³PTt=1PTs=1θtsVXXts+PNi=1PNj=1τijWXXij´−1 (4.1)

× ³PTt=1PTs=1θtsVXY ts+PNi=1PNj=1τijWXY ij´. Using (3.1)–(3.2) it can be written as a weighted average of the base estimators:

b=³PTt=1PTs=1θtsVXXts+PNi=1PNj=1τijWXXij´−1 (4.2)

× ³PTt=1PTs=1θtsVXXtsβbV ts+PNi=1PNj=1τijWXXijβbW ij´, or, in simplified notation,

b=PTt=1PTs=1GV tsβbV ts+PNi=1PNj=1GW ijβbW ij, (4.3)

whereGV tsandGW ij are (K×K) weighting matrices,PtPsGV ts+PiPjGW ij =IK, given by

GV ts=Q−1θtsVXXts, t, s= 1, . . . , T, GW ij =Q−1τijWXXij, i, j= 1, . . . , N, Q=Q(θ,τ) =PTt=1PTs=1θtsVXXts+PNi=1PNj=1τijWXXij.

(4.4)

None of the latter matrices are symmetric in general. If, however, θts = θst for all t, s andτij =τji for alli, j, thenQ0=Q.

The estimator b is unbiased for any θ and τ when (2.2) and (2.3) hold, and in Appendix B it is shown that its variance-covariance matrix is (This formula in the special case whereK = 1 and σγ2 = 0 is derived in Biørn (1994, Appendix A).)

V(b|X) =Q−1P (Q−1)0=Q(θ,τ)−1P(θ,τ, σ2, σ2α, σγ2)(Q(θ,τ)−1)0, (4.5)

where

P =P(θ,τ, σ2, σ2α, σγ2) =σ2(SV +SW +SV W) +σα2ZV +σγ2ZW, (4.6)

(12)

SV =SV(θ) = XT

t=1

XT

p=1

VXXtp à T

X

s=1

θtsθps

! ,

SW =SW(τ) = XN

i=1

XN

k=1

WXXik

XN

j=1

τijτkj

,

SV W =SV W(θ,τ) = XT

t=1

XT

s=1

XN

i=1

XN

j=1

θtsτij(xis−x¯i·)0(xjt−x¯·t), ZV =ZV(θ) =

XT

t=1

XT

p=1

VXXtp à T

X

s=1

θts

! Ã T X

r=1

θpr

! ,

ZW =ZW(τ) = XN

i=1

XN

k=1

WXXik

XN

j=1

τij

ÃN

X

l=1

τkl

! . (4.7)

It is easily seen that SW V = 0 if either θts = θ for all t, s or τij = τ for all i, j, that ZV =0ifPTs=1θts= 0 for allt, and thatZW =0ifPTj=1τij = 0 for alli. The standard estimators in fixed and random effects models have at least one of these properties, which will be shown in the next section.

By utilizing (4.5)–(4.7), we can estimate V(b|X) consistently from a panel data set for any weighting matrices θ and τ we may choose when consistent estimators of the variancesσ2,σα2, and σ2γ have been obtained.

5 Specific aggregate estimators

In this section, we consider specific members of the class of estimators described by (4.1).

Some of these are familiar, others less familiar.

Aggregate within and between estimators

The estimatorbcontains several familiar estimators for fixed effects models as particular members. We first establish the weighting system (θ,τ) for six such estimators and comment on other, less familiar estimators which are more robust to violation of the basic assumptions. The results below generalize those in Biørn (1994, section 3), where only one regressor is included (K= 1) and period specific effects are disregarded (γt= 0).

We define, in the usual way [see, e.g., Greene (2003, section 13.3.2)], the (K×K)- matrices ofoverall (aggregate)within individual and within period, (co)variation as

WXX =PNi=1WXXii =PNi=1PTt=1(xit−x¯i·)0(xit−x¯i·), (5.1)

VXX =PTt=1VXXtt =PTt=1PNi=1(xit−x¯·t)0(xit−x¯·t), (5.2)

etc. The correspondingoverall between individual, andbetween period (co)variation are BXX =PNi=1BXXii =TPNi=1xi· −x)¯ 0(x¯i· −x) = (1/T¯ )PTt=1PTs=1VXXts, (5.3)

CXX =PTt=1CXXtt =NPTt=1(x¯·t−x)¯ 0(x¯·t−x) = (1/N¯ )PNi=1PNj=1WXXij, (5.4)

Referanser

RELATERTE DOKUMENTER

The first is to estimate, from unbalanced household panel data, within a static two-equation model framework, income and price responses (including cross-price responses) for

This paper analyzes tobacco demand within a discrete choice framework. Using binomial and multinomial logit models with random effects, and an unbalanced panel data set of

Taken together, they provide a population panel data set with information for every individual and household about their demographic characteristics (including age, gender, and

Sorption of Cu, Sb and Pb (%) as a function a function of the total concentration of elements in the pond with charcoal and iron hydroxide as sorbents in two

3.1 Evolution of costs of defence 3.1.1 Measurement unit 3.1.2 Base price index 3.2 Defence inflation and investment cost escalation 3.3 Intra- and intergenerational DSI

Thus, results from accurate CFD models are used to obtain realistic local wind conditions in urban environments, which in turn are used for dispersion predictions via

Chapter 6.5.2 contained a characterization of measurements and basic models in a statistical sense, indicating that there is reason to include terrain elevation and diffraction

As with other devices, the host uses the information retrieved from the device to find a matching driver for each active interface in the configuration. The device is now ready