TY - JOUR
T1 - From modeling dose-response relationships to improved performance of decision-tree classifiers for predictive toxicology of nanomaterials
AU - Romano, Roni
AU - Barbul, Alexander
AU - Korenstein, Rafi
N1 - Publisher Copyright:
© 2023 Elsevier B.V.
PY - 2023/8
Y1 - 2023/8
N2 - The development and application of predictive models towards toxicity of engineered nanomaterials is still far from being satisfactory. One promising contribution to confront this challenge is to effectively augment the performance of machine learning classifiers by progressing the approach towards balancing experimental toxicity data. We propose an improved balancing methodology by fitting the in-vitro toxicological dose-response datasets of engineered nanomaterials to three, four, and five, free parameter dose-response models. The four-free parameter model displays the best fit (in terms of adjusted R2) for most of the examined data. The fitted curve yields, in each case, a continuous sequence of data points, which extends the restricted experimental data and generates additional fitted data points for the minority class, leading to the formation of balanced data for predicting the nanoparticle's toxicology by decision tree classifiers. The ability to best predict the experimental toxicity data, by applying the decision tree model, was tested by forming three versions of the same experimental data: the imbalanced raw experimental data, the balanced data by applying the common Synthetic Minority Oversampling Technique, and by using the approach of Balanced Fitted Dose-Response method, introduced in the present study. We demonstrate that our approach provides improved performance of decision trees in predicting nanoparticles’ toxicity, a method that pertains also to chemical toxicity, central in health and environmental research.
AB - The development and application of predictive models towards toxicity of engineered nanomaterials is still far from being satisfactory. One promising contribution to confront this challenge is to effectively augment the performance of machine learning classifiers by progressing the approach towards balancing experimental toxicity data. We propose an improved balancing methodology by fitting the in-vitro toxicological dose-response datasets of engineered nanomaterials to three, four, and five, free parameter dose-response models. The four-free parameter model displays the best fit (in terms of adjusted R2) for most of the examined data. The fitted curve yields, in each case, a continuous sequence of data points, which extends the restricted experimental data and generates additional fitted data points for the minority class, leading to the formation of balanced data for predicting the nanoparticle's toxicology by decision tree classifiers. The ability to best predict the experimental toxicity data, by applying the decision tree model, was tested by forming three versions of the same experimental data: the imbalanced raw experimental data, the balanced data by applying the common Synthetic Minority Oversampling Technique, and by using the approach of Balanced Fitted Dose-Response method, introduced in the present study. We demonstrate that our approach provides improved performance of decision trees in predicting nanoparticles’ toxicity, a method that pertains also to chemical toxicity, central in health and environmental research.
KW - Cell viability
KW - Decision tree performance
KW - Dose-response curve fitting
KW - Imbalanced data
KW - Machine learning
KW - Nanoparticle cytotoxicity
UR - http://www.scopus.com/inward/record.url?scp=85162001929&partnerID=8YFLogxK
U2 - 10.1016/j.comtox.2023.100277
DO - 10.1016/j.comtox.2023.100277
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85162001929
SN - 2468-1113
VL - 27
JO - Computational Toxicology
JF - Computational Toxicology
M1 - 100277
ER -