Adjusting for population stratification in polygenic risk score analyses: a guide for model specifications in the UK Biobank

Publication date

2023-09

Authors

Lin, Bochao D.
Pries, Lotta-Katrin
van Os, JimORCID 0000-0002-7245-1586ISNI 0000000116319073
Luykx, Jurjen J
Rutten, Bart P F
Guloksuz, Sinan

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

taverne

Abstract

The current study was conducted to provide a general guidance for model specifications in polygenic risk score (PRS) analyses of the UK Biobank, such as adjusting for covariates (i.e. age, sex, recruitment centers, and genetic batch) and the number of principal components (PCs) that need to be included. To cover behavioral, physical and mental health outcomes, we evaluated three continuous outcomes (BMI, smoking, drinking) and two binary outcomes (Major Depressive Disorder and educational attainment). We applied 3280 (656 per phenotype) different models including different sets of covariates. We evaluated these different model specifications by comparing regression parameters such as R2, coefficients, and P values, as well as ANOVA tests. Findings suggest that only up to three PCs appears to be sufficient for controlling population stratification for most outcomes, whereas including other covariates (particularly age and sex) appears to be more essential for model performance.

Keywords

Taverne, Genetics(clinical), Genetics, Journal Article

Citation

Lin, B D, Pries, L-K, van Os, J, Luykx, J J, Rutten, B P F & Guloksuz, S 2023, 'Adjusting for population stratification in polygenic risk score analyses : a guide for model specifications in the UK Biobank', Journal of Human Genetics, vol. 68, no. 9, pp. 653-656. https://doi.org/10.1038/s10038-023-01161-1