TY - JOUR
T1 - Analysis and refinement of the training set in predicting a variety of constant pure compound properties by the targeted QSPR method
AU - Shacham, Mordechai
AU - Brauner, Neima
PY - 2011/6/15
Y1 - 2011/6/15
N2 - The possibility of obtaining reliable predictions of a wide variety of constant properties is examined. To this aim, a modified version of the Targeted QSPR (Brauner et al., 2006) method is applied.In the present study new statistical indicators are introduced, which enable a reliable estimation of the prediction uncertainty for the (unknown) property of the target compound based on the training set data. It is shown that while increasing the number of descriptors in the QSPR enables better representation of the training set data, it may significantly deteriorate the prediction of the target compound property value. If necessary, improved prediction is achievable by using the statistical information to refine the training set, rather than by increasing the number of the descriptors used. It is demonstrated that by proper adjustment of the training set, the great majority of the constant properties can be predicted within the experimental error level.
AB - The possibility of obtaining reliable predictions of a wide variety of constant properties is examined. To this aim, a modified version of the Targeted QSPR (Brauner et al., 2006) method is applied.In the present study new statistical indicators are introduced, which enable a reliable estimation of the prediction uncertainty for the (unknown) property of the target compound based on the training set data. It is shown that while increasing the number of descriptors in the QSPR enables better representation of the training set data, it may significantly deteriorate the prediction of the target compound property value. If necessary, improved prediction is achievable by using the statistical information to refine the training set, rather than by increasing the number of the descriptors used. It is demonstrated that by proper adjustment of the training set, the great majority of the constant properties can be predicted within the experimental error level.
KW - Computation chemistry
KW - Molecular descriptor
KW - Parameter identification
KW - Property prediction
KW - QSPR
KW - Systems engineering
UR - http://www.scopus.com/inward/record.url?scp=79955015342&partnerID=8YFLogxK
U2 - 10.1016/j.ces.2011.03.019
DO - 10.1016/j.ces.2011.03.019
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:79955015342
SN - 0009-2509
VL - 66
SP - 2606
EP - 2615
JO - Chemical Engineering Science
JF - Chemical Engineering Science
IS - 12
ER -