Multiple system estimation using covariates having missing values and measurement error: Estimating the size of the Māori population in New Zealand

Publication date

2022-01

Authors

van der Heijden, PeterISNI 0000000067738801
Cruyff, M.J.L.F.ISNI 0000000419421817
Smith, P.A.
Bycroft, C.
Graham, P.
Matheson-Dunning, N.

Editors

Advisors

Supervisors

Document Type

Article
Open Access logo

License

cc_by_nc

Abstract

We investigate the use of two or more linked lists, for both population size estimation and the relationship between variables appearing on all or only some lists. This relationship is usually not fully known because some individuals appear in only some lists, and some are not in any list. These two problems have been solved simultaneously using the EM algorithm. We extend this approach to estimate the size of the indigenous Māori population in New Zealand, leading to several innovations: (1) the approach is extended to four lists (including the population census), where the reporting of Māori status differs between registers; (2) some individuals in one or more lists have missing ethnicity, and we adapt the approach to handle this additional missingness; (3) some lists cover subsets of the population by design. We discuss under which assumptions such structural undercoverage can be ignored and provide a general result; (4) we treat the Māori indicator in each list as a variable measured with error, and embed a latent class model in the multiple system estimation to estimate the population size of a latent variable, interpreted as the true Māori status. Finally, we discuss estimating the Māori population size from administrative data only. Supplementary materials for our article are available online.

Keywords

administrative data, capture-recapture, latent class model, list coverage, population size estimation, Economics and Econometrics, Statistics and Probability, Social Sciences (miscellaneous), Statistics, Probability and Uncertainty

Citation

van der Heijden, P G M, Cruyff, M, Smith, P A, Bycroft, C, Graham, P & Matheson-Dunning, N 2022, 'Multiple system estimation using covariates having missing values and measurement error : Estimating the size of the Māori population in New Zealand', Journal of the Royal Statistical Society, Series A: Statistics in Society, vol. 185, no. 1, pp. 156-177. https://doi.org/10.1111/rssa.12731