A review of regression procedures for randomized response data, including univariate and multivariate logistic regression, the proportional odds model and item response models

Publication date

2016-04-20

Authors

Cruyff, Maarten
Böckenholt, Ulf
Van der Heijden, P.G.M.
Frank, Laurence E.ISNI 0000000392814177

Editors

Chaudhuri, A.
Christofides, C.T.
Rao, C.R.

Advisors

Supervisors

Document Type

Part of book

License

No license information available

Abstract

In survey research, it is often problematic to ask people sensitive questions because they may refuse to answer or they may provide a socially desirable answer that does not reveal their true status on the sensitive question. To solve this problem Warner (1965) proposed randomized response (RR). Here, a chance mechanism hides why respondents say yes or no to the question being asked. Thus far RR has been mainly used in research to estimate the prevalence of sensitive characteristics. It is not uncom- mon that researchers wrongly believe that the RR procedure has the drawback that it is not possible to relate the sensitive characteristics to explanatory variables. Here, we pro- vide a review of the literature of regression procedures for dichotomous RR data. Uni- variate RR data can be analyzed with a version of logistic regression that is adapted so that it can handle data collected by RR. Subsequently the manuscript presents extensions towards repeated cross-sectional data that allowed for a change in the design with which the RR data are collected. We also review regression procedures for multivariate dichotomous RR data, such as the model by Glonek and McCullagh (1995), a model for the sum of a set of dichotomous RR data, and a model from item response theory that assumes a latent variable that explains the answers on the RR variables. We end with a discussion of a recent development in the analysis of multivariate RR data, namely models that take into account that there may be respondents that do not follow the instructions of the RR design by answering no whatever the sensitive question asked. These are coined self-protective responses.

Keywords

Randomized response, Logistic regression, Multivariate, Randomized response data, Proportional odds model, Item response model, Self-protective responses

Citation

Cruyff, M, Böckenholt, U, Van der Heijden, P G M & Frank, L 2016, A review of regression procedures for randomized response data, including univariate and multivariate logistic regression, the proportional odds model and item response models. in A Chaudhuri, C T Christofides & C R Rao (eds), Handbook of Statistics : Data Gathering, Analysis and Protection of Privacy Through Randomized Response Techniques: Qualitative and Quantitative Human Traits. vol. 34, Elsevier, Amsterdam, pp. 287-315. https://doi.org/10.1016/bs.host.2016.01.016