TY - JOUR
T1 - Ensemble modelling or selecting the best model
T2 - Many could be better than one
AU - Barai, S. V.
AU - Reich, Yoram
PY - 1999/11
Y1 - 1999/11
N2 - In the course of data modelling, many models could be created. Much work has been done on formulating guidelines for model selection. However, by and large, these guidelines are conservative or too specific. Instead of using general guidelines, models could be selected for a particular task based on statistical tests. When selecting one model, others are discarded. Instead of losing potential sources of information, models could be combined to yield better performance. We review the basics of model selection and combination and discuss their differences. Two examples of opportunistic and principled combinations are presented. The first demonstrates that mediocre quality models could be combined to yield significantly better performance. The latter is the main contribution of the paper; it describes and illustrates a novel heuristic approach called the SG(k-NN) ensemble for the generation of good-quality and diverse models that can even improve excellent quality models.
AB - In the course of data modelling, many models could be created. Much work has been done on formulating guidelines for model selection. However, by and large, these guidelines are conservative or too specific. Instead of using general guidelines, models could be selected for a particular task based on statistical tests. When selecting one model, others are discarded. Instead of losing potential sources of information, models could be combined to yield better performance. We review the basics of model selection and combination and discuss their differences. Two examples of opportunistic and principled combinations are presented. The first demonstrates that mediocre quality models could be combined to yield significantly better performance. The latter is the main contribution of the paper; it describes and illustrates a novel heuristic approach called the SG(k-NN) ensemble for the generation of good-quality and diverse models that can even improve excellent quality models.
KW - Data Modelling
KW - Ensemble
KW - Machine Learning
KW - Model Selection
KW - Neural Networks
KW - Stacked Generalization
UR - http://www.scopus.com/inward/record.url?scp=0033231649&partnerID=8YFLogxK
U2 - 10.1017/S0890060499135029
DO - 10.1017/S0890060499135029
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:0033231649
SN - 0890-0604
VL - 13
SP - 377
EP - 386
JO - Artificial Intelligence for Engineering Design, Analysis and Manufacturing: AIEDAM
JF - Artificial Intelligence for Engineering Design, Analysis and Manufacturing: AIEDAM
IS - 5
ER -