Integrating omics datasets with the OmicsPLS package

Publication date

2018-10-11

Authors

El Bouhaddani, SaidORCID 0000-0002-2279-4337
Uh, Hae-WonORCID 0000-0003-4195-7872
Jongbloed, Geurt
Hayward, Caroline
Klarić, Lucija
Kiełbasa, Szymon M
Houwing-Duistermaat, JeanineORCID 0000-0002-4505-7137

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

Abstract

BACKGROUND: With the exponential growth in available biomedical data, there is a need for data integration methods that can extract information about relationships between the data sets. However, these data sets might have very different characteristics. For interpretable results, data-specific variation needs to be quantified. For this task, Two-way Orthogonal Partial Least Squares (O2PLS) has been proposed. To facilitate application and development of the methodology, free and open-source software is required. However, this is not the case with O2PLS. RESULTS: We introduce OmicsPLS, an open-source implementation of the O2PLS method in R. It can handle both low- and high-dimensional datasets efficiently. Generic methods for inspecting and visualizing results are implemented. Both a standard and faster alternative cross-validation methods are available to determine the number of components. A simulation study shows good performance of OmicsPLS compared to alternatives, in terms of accuracy and CPU runtime. We demonstrate OmicsPLS by integrating genetic and glycomic data. CONCLUSIONS: We propose the OmicsPLS R package: a free and open-source implementation of O2PLS for statistical data integration. OmicsPLS is available at https://cran.r-project.org/package=OmicsPLS and can be installed in R via install.packages("OmicsPLS").

Keywords

Data-specific variation, Joint principal components, O2PLS, Omics data integration, R package

Citation

Bouhaddani, S E, Uh, H-W, Jongbloed, G, Hayward, C, Klarić, L, Kiełbasa, S M & Houwing-Duistermaat, J 2018, 'Integrating omics datasets with the OmicsPLS package', BMC Bioinformatics, vol. 19, no. 1, 371. https://doi.org/10.1186/s12859-018-2371-3