Transformation of multicolour flow cytometry data with OTflow prevents misleading multivariate analysis results and incorrect immunological conclusions

Publication date

2022-01

Authors

Folcarelli, Rita
van Staveren, Selma
Tinnevelt, Gerjen
Cadot, Emily
Vrisekoop, Nienke
Buydens, Lutgarde
Koenderman, LORCID 0000-0002-5636-6453ISNI 0000000398375208
Jansen, Jeroen
van den Brink, Oscar F.

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by_nc

Abstract

The rapid evolution of the flow cytometry field, currently allowing the measurement of 30-50 parameters per cell, has led to a marked increase in deep multivariate information. Manual gating is insufficient to extract all this information. Therefore, multivariate analysis (MVA) methods have been developed to extract information and efficiently analyze the high-density multicolour flow cytometry (MFC) data. To aid interpretation, MFC data are often logarithmically transformed before MVA. We studied the consequences of different transformations of flow cytometry data in datasets containing negative intensities caused by background subtractions and spreading error, as logarithmic transformation of negative data is impossible. Transformations such as logicle or hyperbolic arcsine transformations allow linearity around zero, whereas higher (positive and negative) intensities are logarithmically transformed. To define the linear range, a parameter (or cofactor) must be chosen. We show how the chosen transformation parameter has great impact on the MVA results. In some cases, peak splitting is observed, producing two distributions around zero in an actual homogeneous population. This may be misinterpreted as the presence of multiple cell populations. Moreover, when performing arbitrary transformation before MVA analysis, biologically relevant and statistically significant information might be missed. We present a new algorithm, Optimal Transformation for flow cytometry data (OTflow), which uses various statistical methods to optimally choose the parameter of the transformation and prevent artifacts such as peak splitting. Arbitrary or unconsidered transformation can lead to wrong conclusions for the MVA cluster methods, dimensionality reduction methods, and classification methods. We recommend transformation of flow cytometry data by using OTflow-defined parameters estimated per channel, in order to prevent peak splitting and other artifacts in the data.

Keywords

arcsinh | cofactor, flow cytometry, flowVS, inverse hyperbolic sine, logicle transformation, multivariate analysis, OTflow, preprocessing, transformation, Pathology and Forensic Medicine, Histology, Cell Biology, Journal Article

Citation

Folcarelli, R, van Staveren, S, Tinnevelt, G, Cadot, E, Vrisekoop, N, Buydens, L, Koenderman, L, Jansen, J & van den Brink, O F 2022, 'Transformation of multicolour flow cytometry data with OTflow prevents misleading multivariate analysis results and incorrect immunological conclusions', Cytometry Part A, vol. 101, no. 1, pp. 72-85. https://doi.org/10.1002/cyto.a.24491