TY - JOUR
T1 - ER-R
T2 - Improving regression by deep learning and prior knowledge utilization for fluorescence analysis
AU - Sinitsa, Sergey
AU - Sochen, Nir
AU - Borisover, Mikhail
AU - Buchanovsky, Nadia
AU - Mendlovic, David
AU - Klapp, Iftach
N1 - Publisher Copyright:
© 2023 Elsevier B.V.
PY - 2023/5/15
Y1 - 2023/5/15
N2 - Linear regression is a dominant estimation technique in chemometrics, where there is a need for inexpensive and reliable sensors for water monitoring. However, most problems are nonlinear, such as the estimation of concentration in solution from an emitted fluorescence spectrum (EFS). Even if an estimation method gives desirable results, at some point it will be used under field conditions, where poor signal quality and less control over environmental effects are expected, leading to poor performance. In this study, we overcome these problems by implementing deep neural network (DNN) models and transfer learning technique for EFS analysis. The proposed models, R (Regression module) and ER (Encoder-Regression), outperformed linear methods and a naive DNN approach for high-quality laboratory-sampled data with a maximum mean relative error of ∼11%, vs. a minimum mean relative error of 184% for the linear methods. In the case of low-quality data, which were simulated based on a real-use case, the lowest error of the linear methods climbed to 263%, whereas the proposed ER model error remained at 9%. At low concentrations, ER gave the best results for all datasets: ∼3.46 ppb in the high-quality datasets, and 2.4 ppb in the low-quality datasets.
AB - Linear regression is a dominant estimation technique in chemometrics, where there is a need for inexpensive and reliable sensors for water monitoring. However, most problems are nonlinear, such as the estimation of concentration in solution from an emitted fluorescence spectrum (EFS). Even if an estimation method gives desirable results, at some point it will be used under field conditions, where poor signal quality and less control over environmental effects are expected, leading to poor performance. In this study, we overcome these problems by implementing deep neural network (DNN) models and transfer learning technique for EFS analysis. The proposed models, R (Regression module) and ER (Encoder-Regression), outperformed linear methods and a naive DNN approach for high-quality laboratory-sampled data with a maximum mean relative error of ∼11%, vs. a minimum mean relative error of 184% for the linear methods. In the case of low-quality data, which were simulated based on a real-use case, the lowest error of the linear methods climbed to 263%, whereas the proposed ER model error remained at 9%. At low concentrations, ER gave the best results for all datasets: ∼3.46 ppb in the high-quality datasets, and 2.4 ppb in the low-quality datasets.
KW - Chemometrics
KW - Deep learning
KW - Organic contamination
KW - Regression
KW - Transfer learning
KW - Water quality
UR - http://www.scopus.com/inward/record.url?scp=85149637646&partnerID=8YFLogxK
U2 - 10.1016/j.chemolab.2023.104785
DO - 10.1016/j.chemolab.2023.104785
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85149637646
SN - 0169-7439
VL - 236
JO - Chemometrics and Intelligent Laboratory Systems
JF - Chemometrics and Intelligent Laboratory Systems
M1 - 104785
ER -