Machine Learning Identifies Stemness Features Associated with Oncogenic Dedifferentiation

Publication date

2018-04-05

Authors

Malta, Tathiane M.
Sokolov, Artem
Gentles, Andrew J.
Burzykowski, Tomasz
Poisson, Laila
Weinstein, John N.
Kamińska, Bożena
Huelsken, Joerg
Omberg, Larsson
Gevaert, Olivier

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by_nc_nd

Abstract

Cancer progression involves the gradual loss of a differentiated phenotype and acquisition of progenitor and stem-cell-like features. Here, we provide novel stemness indices for assessing the degree of oncogenic dedifferentiation. We used an innovative one-class logistic regression (OCLR) machine-learning algorithm to extract transcriptomic and epigenetic feature sets derived from non-transformed pluripotent stem cells and their differentiated progeny. Using OCLR, we were able to identify previously undiscovered biological mechanisms associated with the dedifferentiated oncogenic state. Analyses of the tumor microenvironment revealed unanticipated correlation of cancer stemness with immune checkpoint expression and infiltrating immune cells. We found that the dedifferentiated oncogenic phenotype was generally most prominent in metastatic tumors. Application of our stemness indices to single-cell data revealed patterns of intra-tumor molecular heterogeneity. Finally, the indices allowed for the identification of novel targets and possible targeted therapies aimed at tumor differentiation. Stemness features extracted from transcriptomic and epigenetic data from TCGA tumors reveal novel biological and clinical insight, as well as potential drug targets for anti-cancer therapies.

Keywords

cancer stem cells, dedifferentiation, epigenomic, genomic, machine learning, pan-cancer, stemness, The Cancer Genome Atlas, General Biochemistry,Genetics and Molecular Biology

Citation

Malta, T M, Sokolov, A, Gentles, A J, Burzykowski, T, Poisson, L, Weinstein, J N, Kamińska, B, Huelsken, J, Omberg, L, Gevaert, O, Colaprico, A, Czerwińska, P, Mazurek, S, Mishra, L, Heyn, H, Krasnitz, A, Godwin, A K, Lazar, A J, Caesar-Johnson, S J, Demchok, J A, Felau, I, Kasapi, M, Ferguson, M L, Hutter, C M, Sofia, H J, Tarnuzzer, R, Wang, Z, Yang, L, Zenklusen, J C, Zhang, J, Chudamani, S, Liu, J, Lolla, L, Naresh, R, Pihl, T, Sun, Q, Wan, Y, Wu, Y, Cho, J, DeFreitas, T, Frazer, S, Gehlenborg, N, Getz, G, Heiman, D I, Kim, J, Lawrence, M S, Lin, P, Meier, S, Noble, M S, de Krijger, R & The Cancer Genome Atlas Research Network 2018, 'Machine Learning Identifies Stemness Features Associated with Oncogenic Dedifferentiation', Cell, vol. 173, no. 2, pp. 338-354.e15. https://doi.org/10.1016/j.cell.2018.03.034