Gene and protein sequence features augment HLA class I ligand predictions

Publication date

2024-06-11

Authors

Bresser, Kaspar
Nicolet, Benoit P
Jeko, AnitaISNI 0000000506769491
Wu, WeiISNI 0000000107490485
Loayza-Puch, Fabricio
Agami, Reuven
Heck, Albert J.R.ORCID 0000-0002-2405-4404ISNI 0000000393921118
Wolkers, Monika C
Schumacher, Ton N

Editors

Advisors

Supervisors

Document Type

Article

Collections

Open Access logo

License

cc_by

Abstract

The sensitivity of malignant tissues to T cell-based immunotherapies depends on the presence of targetable human leukocyte antigen (HLA) class I ligands. Peptide-intrinsic factors, such as HLA class I affinity and proteasomal processing, have been established as determinants of HLA ligand presentation. However, the role of gene and protein sequence features as determinants of epitope presentation has not been systematically evaluated. We perform HLA ligandome mass spectrometry to evaluate the contribution of 7,135 gene and protein sequence features to HLA sampling. This analysis reveals that a number of predicted modifiers of mRNA and protein abundance and turnover, including predicted mRNA methylation and protein ubiquitination sites, inform on the presence of HLA ligands. Importantly, integration of such "hard-coded" sequence features into a machine learning approach augments HLA ligand predictions to a comparable degree as experimental measures of gene expression. Our study highlights the value of gene and protein features for HLA ligand predictions.

Keywords

CP: Immunology, HLA class I, HLA ligand predictions, HLA ligandome, XGBoost, antigen presentation, epitope prediction, epitopes, machine learning, General Biochemistry,Genetics and Molecular Biology

Citation

Bresser, K, Nicolet, B P, Jeko, A, Wu, W, Loayza-Puch, F, Agami, R, Heck, A J R, Wolkers, M C & Schumacher, T N 2024, 'Gene and protein sequence features augment HLA class I ligand predictions', Cell Reports, vol. 43, no. 6, 114325. https://doi.org/10.1016/j.celrep.2024.114325