TY - JOUR
T1 - Curious Feature Selection
AU - Moran, Michal
AU - Gordon, Goren
N1 - Publisher Copyright:
© 2019 Elsevier Inc.
PY - 2019/6
Y1 - 2019/6
N2 - In state-of-the-art big-data applications, the process of building machine learning models can be very challenging due to continuous changes in data structures and the need for human interaction to tune the variables and models over time. Hence, expedited learning in rapidly changing environments is required. In this work, we address this challenge by implementing concepts from the field of intrinsically motivated computational learning, also known as artificial curiosity (AC). In AC, an autonomous agent acts to optimize its learning about itself and its environment by receiving internal rewards based on prediction errors. We present a novel method of intrinsically motivated learning, based on the curiosity loop, to learn the data structures in large and varied datasets. An autonomous agent learns to select a subset of relevant features in the data, i.e., feature selection, to be used later for model construction. The agent optimizes its learning about the data structure over time without requiring external supervision. We show that our method, called the Curious Feature Selection (CFS) algorithm, positively impacts the accuracy of learning models on three public datasets.
AB - In state-of-the-art big-data applications, the process of building machine learning models can be very challenging due to continuous changes in data structures and the need for human interaction to tune the variables and models over time. Hence, expedited learning in rapidly changing environments is required. In this work, we address this challenge by implementing concepts from the field of intrinsically motivated computational learning, also known as artificial curiosity (AC). In AC, an autonomous agent acts to optimize its learning about itself and its environment by receiving internal rewards based on prediction errors. We present a novel method of intrinsically motivated learning, based on the curiosity loop, to learn the data structures in large and varied datasets. An autonomous agent learns to select a subset of relevant features in the data, i.e., feature selection, to be used later for model construction. The agent optimizes its learning about the data structure over time without requiring external supervision. We show that our method, called the Curious Feature Selection (CFS) algorithm, positively impacts the accuracy of learning models on three public datasets.
KW - Big data
KW - Curiosity loop
KW - Data science
KW - Feature selection
KW - Intrinsic motivation learning
KW - Reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85061082374&partnerID=8YFLogxK
U2 - 10.1016/j.ins.2019.02.009
DO - 10.1016/j.ins.2019.02.009
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
AN - SCOPUS:85061082374
SN - 0020-0255
VL - 485
SP - 42
EP - 54
JO - Information Sciences
JF - Information Sciences
ER -