What should be taught in an academic program of data sciences?

Research output: Contribution to journalConference articlepeer-review

Abstract

The new academic discipline of Data Sciences (DS) has been developed in recent years mainly because of the need to make decisions based on huge amounts of data-Big Data. In parallel, there has been a huge progress in the development of technologies that enable to identify patterns, to filter big data, and to provide relevant meanings to information, due to machine learning and sophisticated inference techniques. The profession of Data Scientist (or Data Analyst) has become highly demanded in recent years. It is required in the business sector where data is the "oxygen" for business survival; it is needed in the governmental sector in order to improve its services to the citizens; and it is very imperative in the scientific world, where large data depositories collected in varied disciplines have to be integrated, mined and analyzed, in order to enable interdisciplinary research. The purpose of this paper is to demonstrate how the scientific discipline of Data Sciences fits into academic programs intended to prepare data analysts for the business, public, government, and academic sectors. The article first delineates the Data Cycle, which portrays the transformation of data and their derivatives along the route from generation to decision making. The cycle includes the following stages: Problem definition ? identifying pertinent data sources ? data collection, and storing (including cleansing and backup) ? data integration ? data mining ? processing and analysis ? visualization ? learning and decision-making ? feedback for future cycles. Within this cycle, there might be sub cycles, where a number of stages are repeated and reiterated. It should be noted that the data cycle is generic. It might have slight variations under various circumstances, however, there is not much difference between the scientific cycle and all the other cycles. Each stage within the cycle requires different tools, namely hardware and software technologies that support the stage. This article classifies these tools. The final part of the article suggests a typology for academic DS programs. It outlines an academic program that will be offered to those wishing to practice the Data Analyst profession. An introductory course that should be mandatory to all students campus-wide is sketched.

Original languageEnglish
Pages (from-to)55-64
Number of pages10
JournalDigital Presentation and Preservation of Cultural and Scientific Heritage
Volume9
StatePublished - 26 Sep 2019
Event9th International Conference on Digital Presentation and Preservation of Cultural and Scientific Heritage, DiPP 2019 - Sofia, Bulgaria
Duration: 26 Sep 201928 Sep 2019

Keywords

  • Academic program in data sciences
  • Big data
  • Data analyst
  • Data mining
  • Data sciences

Fingerprint

Dive into the research topics of 'What should be taught in an academic program of data sciences?'. Together they form a unique fingerprint.

Cite this