Ensemble modelling or selecting the best model: Many could be better than one

S. V. Barai*, Yoram Reich

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

30 Scopus citations

Abstract

In the course of data modelling, many models could be created. Much work has been done on formulating guidelines for model selection. However, by and large, these guidelines are conservative or too specific. Instead of using general guidelines, models could be selected for a particular task based on statistical tests. When selecting one model, others are discarded. Instead of losing potential sources of information, models could be combined to yield better performance. We review the basics of model selection and combination and discuss their differences. Two examples of opportunistic and principled combinations are presented. The first demonstrates that mediocre quality models could be combined to yield significantly better performance. The latter is the main contribution of the paper; it describes and illustrates a novel heuristic approach called the SG(k-NN) ensemble for the generation of good-quality and diverse models that can even improve excellent quality models.

Original languageEnglish
Pages (from-to)377-386
Number of pages10
JournalArtificial Intelligence for Engineering Design, Analysis and Manufacturing: AIEDAM
Volume13
Issue number5
DOIs
StatePublished - Nov 1999

Keywords

  • Data Modelling
  • Ensemble
  • Machine Learning
  • Model Selection
  • Neural Networks
  • Stacked Generalization

Fingerprint

Dive into the research topics of 'Ensemble modelling or selecting the best model: Many could be better than one'. Together they form a unique fingerprint.

Cite this