Reconciliation of inconsistent data sources by correction for measurement error: The feasibility of parameter re-use

Publication date

2018-01-01

Authors

Pankowska, Paulina
Bakker, Bart
Oberski, DanielORCID 0000-0001-7467-2297ISNI 0000000396652603
Pavlopoulos, Dimitris

Editors

Advisors

Supervisors

Document Type

Article
Open Access logo

License

taverne

Abstract

National Statistical Institutes (NSIs) often obtain information about a single variable from separate data sources. Administrative registers and surveys, in particular, often provide overlapping information on a range of phenomena of interest to official statistics. However, even though the two sources overlap, they both contain measurement error that prevents identical units from yielding identical values. Reconciling such separate data sources and providing accurate statistics, which is an important challenge for NSIs, is typically achieved through macro-integration. In this study we investigate the feasibility of an alternative method based on the application of previously obtained results from a recently introduced extension of the Hidden Markov Model (HMM) to newer data. The method allows a reconciliation of separate error-prone data sources without having to repeat the full HMM analysis, provided the estimated measurement error processes are stable over time. As we find that these processes are indeed stable over time, the proposed method can be used effectively for macro-integration, to reconciliate both first-order statistics-e.g. the size of temporary employment in the Netherlands-and second-order statistics-e.g. the amount of mobility from temporary to permanent employment.

Keywords

administrative data, data quality, Hidden Markov Model, labour market transitions, measurement error, register data, survey data, Taverne, Management Information Systems, Economics and Econometrics, Statistics, Probability and Uncertainty, SDG 8 - Decent Work and Economic Growth

Citation

Pankowska, P, Bakker, B, Oberski, D L & Pavlopoulos, D 2018, 'Reconciliation of inconsistent data sources by correction for measurement error : The feasibility of parameter re-use', Statistical Journal of the IAOS, vol. 34, no. 3, pp. 317-329. https://doi.org/10.3233/SJI-170368