Large Sample Properties of Generalized Method of Moments Estimators
Econometrica (1982)
- ISSN: 00129682
- DOI: 10.2307/1912775
- PubMed: 100
Available from www.jstor.org
or
Abstract
This paper studies estimators that make sample analogues of population orthogonality conditions close to zero. Strong consistency and asymptotic normality of such estimators is established under the assumption that the observable variables are stationary and ergodic. Since many linear and nonlinear econometric estimators reside within the class of estimators studied in this paper, a convenient summary of the large sample properties of these estimators, including some whose large sample properties have not heretofore been discussed, is provided.
Available from www.jstor.org
Page 1
Large Sample Properties of Genera...
Large Sample Properties of Generalized Method of Moments Estimators Lars Peter Hansen Econometrica, Vol. 50, No. 4. (Jul., 1982), pp. 1029-1054. Stable URL: http://links.jstor.org/sici?sici=0012-9682%28198207%2950%3A4%3C1029%3ALSPOGM%3E2.0.CO%3B2-O Econometrica is currently published by The Econometric Society. Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at http://www.jstor.org/journals/econosoc.html. Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. JSTOR is an independent not-for-profit organization dedicated to and preserving a digital archive of scholarly journals. For more information regarding JSTOR, please contact support@jstor.org. http://www.jstor.org Mon Jun 4 12:31:21 2007
Page 2
Econometrics, Vol. 50, No. 4 (July, 1982) LARGE SAMPLE PROPERTIES O F GENERALIZED METHOD O F MOMENTS ESTIMATORS' This paper studies estimators that make sample analogues of population orthogonality conditions close to zero. Strong consistency and asymptotic normality of such estimators is established under the assumption that the observable variables are stationary and ergodic. Since many linear and nonlinear econometric estimators reside within the class of estima- tors studied in this paper. a convenient summary of the large sample properties of these estimators, including some whose large sample properties have not heretofore been discussed. is provided. 1. INTRODUCTION IN THIS PAPER we study the large sample properties of a class of generalized method of moments (GMM) estimators which subsumes many standard econo- metric estimators. To motivate this class, consider an econometric model whose parameter vector we wish to estimate. The model implies a family of orthogonal- ity conditions that embed any economic theoretical restrictions that we wish to impose or test. For example, assumptions that certain equations define projec- tions or that particular variables are predetermined give rise to orthogonality conditions in which expected cross products of unobservable disturbances and functions of observable variables are equated to zero. Heuristically, identification requires at least as many orthogonality conditions as there are coordinates in the parameter vector to be estimated. The unobservable disturbances in the orthogo- nality conditions can be replaced by an equivalent expression involving the true parameter vector and the observed variables. Using the method of moments, sample estimates of the expected cross products can be computed for any element in an admissible parameter space. A GMM estimator of the true parameter vector is obtained by finding the element of the parameter space that sets linear combinations of the sample cross products as close to zero as possible. In studying strong consistency of G M M estimators, we show how to construct a class of criterion functions with minimizers that converge almost surely to the true parameter vector. The resulting estimators have the interpretation of making the sample versions of the population orthogonality conditions as close as possible to zero according to some metric or measure of distance. We use the metric to index the alternative estimators. This class of estimators includes the nonlinear instrumental Cariables estimators considered by, among others, Amemiya [I, 21, Jorgenson and Laffont [24], and Gallant [ 1 1 1 . ~ There the 'The author acknowledges helpful comments by Robert Avery, Robert Hodrick, V. Joseph Hotz, Dan Peled, Thomas Sargent, Katherine Schipper, Kenneth Singleton, Kenneth Wallis, Halbert White, and an anonymous referee. Special thanks are given to Christopher Sims who played a prominent role in the formulation of this paper. 2 ~ e two- and three-stage least squares under the heading of instrumental include versions of variables procedures.
Page 3
1030 LARS PETER HANSEN population orthogonality conditions equate expected cross products of instru- ments and serially independent disturbances to zero. In our treatment we work directly with expressions for the population orthogonality conditions and implic- itly permit the disturbance terms used in construction of the orthogonality conditions to be both serially correlated and conditionally hetero~kedastic.~ We allow ourselves flexibility in choosing the distance measure because it permits choosing measures that are computationally convenient and because the choice of distance measure influences the asymptotic distribution of the resulting esti- mator. In studying asymptotic normality, we view estimation in a different but closely related fashion. We follow Sargan [29, 301 and consider estimators that have the interpretation of setting linear combinations of the sample orthogonality condi- tions to zero, at least asymptotically, where the number of linear combinations that are set to zero is equal to the number of coordinates in the parameter vector to be estimated. We index alternative estimators by an associated weighting matrix that selects the particular linear combinations of orthogonality conditions that are used in estimation. Since alternative weighting matrices give rise to estimators with alternative asymptotic covariance matrices, we describe how to obtain an asymptotically optimal weighting matrix. The estimators considered in our treatment of consistency are shown to reside in the class of estimators considered in our treatment of asymptotic normality by examining the first-order conditions of minimization problems used to construct the class of consistent estimators. It turns out, however, that our discussion of asymptotic normality is sufficiently general to include other consistent estimators that are obtained from minimizing or maximizing other criterion functions which have first-order condi- tions that satisfy the specification of our generic GMM estimator, e.g., least squares or quasi-maximum likelihood estimators. Again our discussion of large sample properties permits the disturbances implicitly used in the orthogonality conditions to be both conditionally heteroskedastic and serially ~orrelated.~ There are a variety of applications in which it is important to possess an asymptotic theory which accommodates these features. In testing market effi- ciency and the rationality of observed forecasts using least squares procedures, one oftentimes encounters situations in which the implied forecast interval 3Sargan [30] treats the case in which disturbances can follow a low-order autoregression and can be filtered to remove serial correlation prior to the construction of the orthogonality conditions. White [34] discusses linear instrumental variables estimation in which observation vectors are independent but not necessarily identically distributed. White allows heteroskedasticity to exist both conditionally and unconditionally, but places restrictions on higher moments of observable and unobservable variables that are not needed in this paper. Here we think of heteroskedasticity emerging because of some implicit conditioning, do not impose independence, but maintain a stationarity assumption. 4Engle [9] allows for conditional heteroskedasticity in regression models with serially uncorrelated disturbances. He proposes a maximum likelihood procedure for estimating such models when the form of the heteroskedasticity is specified a priori. White [32, 33, 341 has studied the asymptotic distribution of a variety of estimators for cross-sectional models which allow for both conditional and unconditional forms of heteroskedasticity. See Footnote 3.
Page 4
1031 LARGE SAMPLE PROPERTIES exceeds the sampling interval giving rise to a serially correlated forecast error [4, 14, 171. Least squares procedures can be used since the hypothetical forecast error should be orthogonal to the observed forecast and to any other variables in the information set of economic agents when the forecast is made. On the other hand, generalized least squares procedures can result in inconsistent parameter estimators (see Sims [31] and Hansen and Hodrick [17]). Brown and Maital [4], Hansen and Hodrick [17], and Hakkio [14] rely on the asymptotic distribution theory in this paper to carry out least squares estimation and inference for such models. Hansen and Sargent [18, 191 have considered linear rational expectations models in which economic agents are assumed to forecast infinite geometrically- declining sums of forcing variables and the econometrician employs only a subset of the variables in the information set of economic agents. The distur- bance terms in these models are serially correlated but orthogonal to current and past values of a subset of variables which are not strictly exogenous. Hansen and Sargent [18, 191 discuss how to apply the techniques developed in this paper to those rational expectations models. McCallum [28] has shown how other types of linear rational expectations models with disturbance terms that have low-order autoregressive representations lead to equations that can be estimated consis- tently using standard instrumental variables procedures. He notes, however, that the associated asymptotic distribution of the estimations has to be modified in the manner suggested in this paper to allow the disturbances to be serially correlated. In considering models like those studied by McCallum [28], Cumby, Huizinga, and Obstfeld [5] propose a two-step, two-stage least squares estimator that resides within the class of estimators examined in this paper.5 Hansen and Singleton [20] have studied how to test restrictions and estimate parameters in a class of nonlinear rational expectations models. They construct generalized instrumental variables estimators from nonlinear stochastic Euler equations and note that the implied disturbance terms in these models are conditionally heteroskedastic and in many circumstances serially correlated. Their estimators are special cases of the generic G M M estimator of this paper. Finally, Avery, Hansen, and Hotz [3] describe how to use methods in this paper to obtain computationally convenient procedures for estimating multiperiod probit models. The vector disturbance term implicit in their orthogonality condi- tions also is conditionally heteroskedastic. In the examples described above, application of the techniques in this paper will not result in asymptotically efficient estimators. However, in these and other examples, a researcher may be willing to sacrifice asymptotic efficiency in exchange for not having to specify completely the nature of the serial correlation and/or heteroskedasticity or in exchange for computationally simpler estimation strategies. As noted above, we do provide a more limited optimality discussion 'cumby, Huizinga, and Obstfeld [S] proposed their estimator independently of this paper. However, their discussion of its asymptotic distribution exploited results in a precursor to this paper written by the author.
Page 5
1032 LARS PETER HANSEN that is patterned after an approach taken by Sargan [29, 301 and can be easily exploited in practice. The organization of the paper is as follows. The second section provides some consistency results for the GMM estimator under various assumptions about the form of the econometric model. The third section discusses the asymptotic distribution of the GMM estimator and considers the construction of an asymp- totically optimal estimator among the class of estimators that exploit the same orthogonality conditions. The fourth section examines procedures for testing overidentifying restrictions using GMM estimation. Finally, the fifth section contains some concluding remarks. 2. CONSISTENCY OF THE GMM ESTIMATOR In this section we specify our first form of the GMM estimator and provide some sufficient conditions that insure its almost sure convergence to the parame- ter vector that is being estimated. Let 52 denote the set of sample points in the underlying probability space used in our estimation problem, and let E denote the associated expectations operator. We will be working with a p component stochastic process {x,:n 2 1)defined on this probability space. A finite segment of one realization of this process, i.e., {x,(w,)1 : 5 n 5 N ) for sample size N and for some w, E 52, can be thought of as the observable data series that the econometrician employs. ASSUMPTION 2.1: {x, : 1 5 n ) is stationary and ergodic. We introduce a parameter space S that is a subset of R4 (or its compactifica- tion) and let Do be the element of S that we wish to estimate. ASSUMPTION a) is a separable metric space. 2.2: (S, One possibility is to use the standard absolute value norm on R4 to define o. It is well known that since S is a subset of Rq the resulting metric space is separable. We do not restrict ourselves to this metric in order to allow for S to be a subset of a compactification of R4. We consider a function f:RP x S +Rrwhere R is the real line and r is greater than or equal to q. ASSUMPTION 2.3:f(., ,8) is Bore1 measurable for each /3 in S and f(x, .) is continuous on S for each x in RP. The function f provides an expression for the r orthogonality conditions that emerge from the econometric model in the sense indicated by Assumption 2.4. ASSUMPTION 2.4: Ef(x,, /3) exists and is finite for all /3 E S and Ef(x,,Do) = 0.
Readership Statistics
152 Readers on Mendeley
by Discipline
61% Economics
9% Mathematics
by Academic Status
41% Ph.D. Student
15% Assistant Professor
7% Post Doc
by Country
30% United States
12% Germany
7% United Kingdom
Sign up today - FREE
Mendeley saves you time finding and organizing research. Learn more
- All your research in one place
- Add and import papers easily
- Access it anywhere, anytime


