TY - GEN
T1 - Automating Literature Reviews
T2 - 5th IEEE International Conference on Big Data Intelligence and Computing, DataCom 2019
AU - Lee, Hsiao Hui
AU - Mach, Patrizia
AU - Shmueli, Galit
AU - Yahav, Inbal
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019
Y1 - 2019
N2 - From patient waiting time to consumer shopping, firms collect more and more human behavior data to assist their decision making process. This trend in business also affects academic research, especially in operations management (OM), a research area that often relies on mathematical modeling to guide business decisions. However, it is both time and labor intensive to identify applications and opportunities that use behavioral big data (BBD) in the large and growing published literature. In this paper, we introduce a procedure that applies various data mining approaches to survey a vast number of research articles across three different OM journals, and identify articles that use BBD. The goal is to reduce the number of articles that must be read manually and yet reduce the false negatives (missed BBD papers); in other words, in this classification task we emphasize the importance of sensitivity over specificity with respect to detecting BBD papers. Testing different feature engineering and classification approaches, we find that the highest sensitivity and specificity are provided by a Random Forest classifier, applied to a bag-of-words set of features.
AB - From patient waiting time to consumer shopping, firms collect more and more human behavior data to assist their decision making process. This trend in business also affects academic research, especially in operations management (OM), a research area that often relies on mathematical modeling to guide business decisions. However, it is both time and labor intensive to identify applications and opportunities that use behavioral big data (BBD) in the large and growing published literature. In this paper, we introduce a procedure that applies various data mining approaches to survey a vast number of research articles across three different OM journals, and identify articles that use BBD. The goal is to reduce the number of articles that must be read manually and yet reduce the false negatives (missed BBD papers); in other words, in this classification task we emphasize the importance of sensitivity over specificity with respect to detecting BBD papers. Testing different feature engineering and classification approaches, we find that the highest sensitivity and specificity are provided by a Random Forest classifier, applied to a bag-of-words set of features.
KW - behavioral big data
KW - data mining
KW - literature review
KW - operations management
UR - http://www.scopus.com/inward/record.url?scp=85205315225&partnerID=8YFLogxK
U2 - 10.1109/DataCom.2019.00026
DO - 10.1109/DataCom.2019.00026
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85205315225
T3 - Proceedings - 2019 IEEE 5th International Conference on Big Data Intelligence and Computing, DataCom 2019
SP - 119
EP - 122
BT - Proceedings - 2019 IEEE 5th International Conference on Big Data Intelligence and Computing, DataCom 2019
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 18 November 2019 through 21 November 2019
ER -