TY - GEN
T1 - Space decomposition in data mining
T2 - 14th International Symposium on Methodologies for Intelligent Systems, ISMIS 2003
AU - Rokach, Lior
AU - Maimon, Oded
AU - Lavi, Inbal
N1 - Publisher Copyright:
© Springer-Verlag Berlin Heidelberg 2003.
PY - 2003
Y1 - 2003
N2 - Data mining algorithms aim at searching interesting patterns in large amount of data in manageable complexity and good accuracy. Decomposition methods are used to improve both criteria. As opposed to most decomposition methods, that partition the dataset via sampling, this paper presents an accuracyoriented method that partitions the instance space into mutually exclusive subsets using K-means clustering algorithm. After employing the basic divide-andinduce method on several datasets with different classifiers, its error rate is compared to that of the basic learning algorithm. An analysis of the results shows that the proposed method is well suited for datasets of numeric input attributes and that its performance is influenced by the dataset size and its homogeneity. Finally, a homogeneity threshold is developed, that can be used for deciding whether to decompose the data set or not.
AB - Data mining algorithms aim at searching interesting patterns in large amount of data in manageable complexity and good accuracy. Decomposition methods are used to improve both criteria. As opposed to most decomposition methods, that partition the dataset via sampling, this paper presents an accuracyoriented method that partitions the instance space into mutually exclusive subsets using K-means clustering algorithm. After employing the basic divide-andinduce method on several datasets with different classifiers, its error rate is compared to that of the basic learning algorithm. An analysis of the results shows that the proposed method is well suited for datasets of numeric input attributes and that its performance is influenced by the dataset size and its homogeneity. Finally, a homogeneity threshold is developed, that can be used for deciding whether to decompose the data set or not.
UR - http://www.scopus.com/inward/record.url?scp=7444228672&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-39592-8_5
DO - 10.1007/978-3-540-39592-8_5
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:7444228672
SN - 3540202560
SN - 9783540202561
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 24
EP - 31
BT - Foundations of Intelligent Systems - 14th International Symposium, ISMIS 2003, Proceedings
A2 - Zhong, Ning
A2 - Ras, Zbigniew W.
A2 - Tsumoto, Shusaku
A2 - Suzuki, Einoshin
PB - Springer Verlag
Y2 - 28 October 2003 through 31 October 2003
ER -