Inference for the parametric distribution of a response given covariates is considered under informative selection of a sample from a finite population. Under this selection, the conditional distribution of a response in the sample, given the covariates and given that it was selected for observation, is not the same as the conditional distribution of the response in the finite population, given only the covariates. It is instead a weighted version of the conditional distribution of interest. Inference must be modified to account for this informative selection. An established approach in this context is maximum "sample likelihood", developing a weight function that reflects the informative sampling design, then treating the observations as if they were independently distributed according to the weighted distribution. While the sample likelihood methodology has been widely applied, its theoretical foundation has been less developed. A precise asymptotic description of a wide range of informative selection mechanisms is proposed. Under this framework, consistency and asymptotic normality of the maximum sample likelihood estimators are established. The theory allows for the possibility of nuisance parameters that describe the selection mechanism. The proposed regularity conditions are verifiable for various sample schemes, motivated by real problems in surveys. Simulation results for these examples illustrate the quality of the asymptotic approximations, and demonstrate a practical approach to variance estimation that combines aspects of model-based information theory and design-based variance estimation.
CITATION STYLE
Bonnéry, D., Breidt, F. J., & Coquet, F. (2018). Asymptotics for the maximum sample likelihood estimator under informative selection from a finite population. Bernoulli, 24(2), 929–955. https://doi.org/10.3150/16-BEJ809
Mendeley helps you to discover research relevant for your work.