Show simple item record

Files in this item


Item metadata

dc.contributor.authorDornan, Tadhg
dc.contributor.authorO'Sullivan, Gary
dc.contributor.authorO'Riain, Neal
dc.contributor.authorStueeken, Eva
dc.contributor.authorGoodhue, Robbie
dc.identifier.citationDornan , T , O'Sullivan , G , O'Riain , N , Stueeken , E & Goodhue , R 2020 , ' The application of machine learning methods to aggregate geochemistry predicts quarry source location : an example from Ireland ' , Computers & Geosciences , vol. In press , 104495 .
dc.identifier.otherRIS: urn:6EDA6CA6F8FD9C400F1D2795E04461A4
dc.identifier.otherORCID: /0000-0001-6861-2490/work/72842759
dc.descriptionThis publication has emanated from research supported in part by a research grant from Science Foundation Ireland (SFI) under Grant Number 13/RC/2092 and co-funded under the European Regional Development Fund and by iCRAG industry partners.en
dc.description.abstractAttempts using geochemical data to classify quarry sources which provided reactive rock aggregate, composed of Carboniferous aged pyritic mudrocks and limestones, which has caused structural damage to over 12, 500 homes across Ireland have not yet succeeded. In this paper, a possible solution to this problem is found by performing machine learning models, such as Logistic regression and Random Forest, upon a geochemical dataset obtained through the scanning electron microscope energy-dispersive X-ray spectroscopy (SEM-EDS) and Laser ablation-quadrupole-inductively couple plasma mass spectrometry (LA-Q-ICPMS) of pyrite and Isotope ratio mass spectrometry (IRMS) of bulk rock aggregate, to predict quarry source location. When comparing the classification scores, the LA-Q-ICPMS dataset achieved the highest average classification score of 55.38% for Random Forest and 67.73% for Logistic regression based on 10-fold cross validation testing. As a result, this dataset was then used to classify a set of known unknown samples and achieved average classification accuracies of 40.30% for random forest and 66.80% for logistic regression, based on a systematic train-test procedure. There is scope to enhance these classification scores to an accuracy of 100% by combining the geochemical datasets together. However, due to the difficulty in linking pyrites analysed by SEM-EDS to those analysed by LA-Q-ICPMS, and relating a bulk rock analytical technique (IRMS) to mineral geochemistry (SEM-EDS, LA-Q-ICPMS), median values have to be used when combining IRMS (Fe, S) and SEM-EDS (TS and δ34S) datasets with LA-Q-ICPMS data. Therefore, if these combined datasets were used as part of an applied quarry classification system, statistically meaningful mean values taken from a near normally distributed dataset would have to be used in order to accurately represent the quarry composition.
dc.relation.ispartofComputers & Geosciencesen
dc.subjectGE Environmental Sciencesen
dc.titleThe application of machine learning methods to aggregate geochemistry predicts quarry source location : an example from Irelanden
dc.typeJournal articleen
dc.contributor.institutionUniversity of St Andrews. School of Earth & Environmental Sciencesen
dc.contributor.institutionUniversity of St Andrews. St Andrews Centre for Exoplanet Scienceen
dc.description.statusPeer revieweden

This item appears in the following Collection(s)

Show simple item record