TY - JOUR
T1 - Gradient liquid chromatographic retention time prediction for suspect screening applications
T2 - A critical assessment of a generalised artificial neural network-based approach across 10 multi-residue reversed-phase analytical methods
AU - Barron, Leon P.
AU - McEneff, Gillian L.
PY - 2016/1/15
Y1 - 2016/1/15
N2 - For the first time, the performance of a generalised artificial neural network (ANN) approach for the prediction of 2492 chromatographic retention times (tR) is presented for a total of 1117 chemically diverse compounds present in a range of complex matrices and across 10 gradient reversed-phase liquid chromatography-(high resolution) mass spectrometry methods. Probabilistic, generalised regression, radial basis function as well as 2- and 3-layer multilayer perceptron-type neural networks were investigated to determine the most robust and accurate model for this purpose. Multi-layer perceptrons most frequently yielded the best correlations in 8 out of 10 methods. Averaged correlations of predicted versus measured tR across all methods were R2=0.918, 0.924 and 0.898 for the training, verification and test sets respectively. Predictions of blind test compounds (n=8-84 cases) resulted in an average absolute accuracy of 1.02±0.54 min for all methods. Within this variation, absolute accuracy was observed to marginally improve for shorter runtimes, but was found to be relatively consistent with respect to analyte retention ranges (~5%). Finally, optimised and replicated network dependency on molecular descriptor data is presented and critically discussed across all methods. Overall, ANNs were considered especially suitable for suspects screening applications and could potentially be utilised in bracketed-type analyses in combination with high resolution mass spectrometry.
AB - For the first time, the performance of a generalised artificial neural network (ANN) approach for the prediction of 2492 chromatographic retention times (tR) is presented for a total of 1117 chemically diverse compounds present in a range of complex matrices and across 10 gradient reversed-phase liquid chromatography-(high resolution) mass spectrometry methods. Probabilistic, generalised regression, radial basis function as well as 2- and 3-layer multilayer perceptron-type neural networks were investigated to determine the most robust and accurate model for this purpose. Multi-layer perceptrons most frequently yielded the best correlations in 8 out of 10 methods. Averaged correlations of predicted versus measured tR across all methods were R2=0.918, 0.924 and 0.898 for the training, verification and test sets respectively. Predictions of blind test compounds (n=8-84 cases) resulted in an average absolute accuracy of 1.02±0.54 min for all methods. Within this variation, absolute accuracy was observed to marginally improve for shorter runtimes, but was found to be relatively consistent with respect to analyte retention ranges (~5%). Finally, optimised and replicated network dependency on molecular descriptor data is presented and critically discussed across all methods. Overall, ANNs were considered especially suitable for suspects screening applications and could potentially be utilised in bracketed-type analyses in combination with high resolution mass spectrometry.
KW - Emerging contaminants
KW - High resolution mass spectrometry
KW - Non-target analysis
KW - Retention time prediction
KW - Suspect screening
UR - http://www.scopus.com/inward/record.url?scp=84943564701&partnerID=8YFLogxK
U2 - 10.1016/j.talanta.2015.09.065
DO - 10.1016/j.talanta.2015.09.065
M3 - Article
AN - SCOPUS:84943564701
SN - 0039-9140
VL - 147
SP - 261
EP - 270
JO - TALANTA
JF - TALANTA
ER -