Development and Application of the Branched and Isoprenoid GDGT Machine Learning Classification Algorithm (BIGMaC) for Paleoenvironmental Reconstruction

Publication date

2023-07

Authors

Martínez-Sosa, PabloISNI 0000000524167961
Tierney, Jessica E.
Pérez-Angel, Lina C.
Stefanescu, Ioana C.
Guo, JingjingISNI 0000000492831169
Kirkels, Frédérique M.S.A.ISNI 0000000492905674
Sepúlveda, Julio
Peterse, FrancienORCID 0000-0001-8781-2826ISNI 0000000492917456
Shuman, Bryan N.
Reyes, Alberto V.

Editors

Advisors

Supervisors

Document Type

Article
Open Access logo

License

taverne

Abstract

Glycerol dialkyl glycerol tetraethers (GDGTs), both archaeal isoprenoid GDGTs (isoGDGTs) and bacterial branched GDGTs (brGDGTs), have been used in paleoclimate studies to reconstruct environmental conditions. Since GDGTs are produced in many types of environments, their relative abundances also depend on the depositional setting. This suggests that the distribution of GDGTs also preserves useful information that can be used more broadly to infer these depositional environments in the geological past. Here, we combined existing iso- and brGDGT relative abundance data with newly analyzed samples to generate a database of 1,153 samples from several modern sedimentary settings. We observed a robust relationship between the depositional environment and the relative abundances of GDGTs in our samples. This data set was used to train and test the Branched and isoGDGT Machine learning Classification (BIGMaC) algorithm, which identifies the environment a sample comes from based on the distribution of GDGTs with high precision and recall (F1 = 0.95). We tested the model on the sedimentary record from the Giraffe kimberlite pipe, an Eocene maar in subantarctic Canada, and found that the BIGMaC reconstruction agrees with independent stratigraphic and palynological information, provides new information about the paleoenvironment of this site, and helps improve its paleotemperature reconstruction. In contrast, we also include an example from the PETM-aged Cobham lignite as a cautionary example that illustrates the limitations of the algorithm. We propose that in cases where paleoenvironments are unknown or are changing, BIGMaC can be applied in concert with other proxies to generate more refined paleoclimate records.

Keywords

GDGTs, machine learning, paleoenvironment, Taverne, Oceanography, Atmospheric Science, Palaeontology

Citation

Martínez-Sosa, P, Tierney, J E, Pérez-Angel, L C, Stefanescu, I C, Guo, J, Kirkels, F, Sepúlveda, J, Peterse, F, Shuman, B N & Reyes, A V 2023, 'Development and Application of the Branched and Isoprenoid GDGT Machine Learning Classification Algorithm (BIGMaC) for Paleoenvironmental Reconstruction', Paleoceanography and Paleoclimatology, vol. 38, no. 7, e2023PA004611, pp. 1-21. https://doi.org/10.1029/2023PA004611