TY - JOUR
T1 - Combining statistical and physical considerations in deriving targeted QSPRs using very large molecular descriptor databases
AU - Paster, Inga
AU - Tovarovski, Greta
AU - Shacham, Mordechai
AU - Brauner, Neima
PY - 2010
Y1 - 2010
N2 - The use of 3-D molecular descriptors in Quantitative Structure Property Relationships (QSPR) is considered. Such descriptors are known to have high level of uncertainty, as different minimization algorithms tend to yield different 3-D structures, which in turn yield different descriptor values. An algorithm has been developed in which the uncertainty in a 3-D descriptor is determined by the difference between the values obtained when using minimized structures coming from different sources for the same compound. This uncertainty can be used as an estimate in the descriptor "noise"level for determining its "signal to noise"ratio. A descriptor of a low signal-to-noise ratio should not be included in QSPRs even if it is highly correlated with the property values of the training set.
AB - The use of 3-D molecular descriptors in Quantitative Structure Property Relationships (QSPR) is considered. Such descriptors are known to have high level of uncertainty, as different minimization algorithms tend to yield different 3-D structures, which in turn yield different descriptor values. An algorithm has been developed in which the uncertainty in a 3-D descriptor is determined by the difference between the values obtained when using minimized structures coming from different sources for the same compound. This uncertainty can be used as an estimate in the descriptor "noise"level for determining its "signal to noise"ratio. A descriptor of a low signal-to-noise ratio should not be included in QSPRs even if it is highly correlated with the property values of the training set.
KW - 3-D structure
KW - Molecular descriptor
KW - Property prediction
KW - QSPR
UR - http://www.scopus.com/inward/record.url?scp=77955193272&partnerID=8YFLogxK
U2 - 10.1016/S1570-7946(10)28011-2
DO - 10.1016/S1570-7946(10)28011-2
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:77955193272
SN - 1570-7946
VL - 28
SP - 61
EP - 66
JO - Computer Aided Chemical Engineering
JF - Computer Aided Chemical Engineering
IS - C
ER -