Provenance-based analysis of data-centric processes

Daniel Deutch*, Yuval Moskovitch, Val Tannen

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

12 Scopus citations

Abstract

We consider in this paper static analysis of the possible executions of data-dependent applications, namely applications whose control flow is guided by a finite-state machine, as well as by the state of an underlying database. We note that previous work in this context has not addressed two important features of such analysis, namely analysis under hypothetical scenarios, such as changes to the application’s state machine and/or to the underlying database, and the consideration of meta-data, such as cost or access privileges. Observing that semiring-based provenance has been proven highly effective in supporting these two features for database queries, we develop in this paper a semiring-based provenance framework for the analysis of data-dependent processes, accounting for hypothetical reasoning and meta-data. The development addresses two interacting new challenges: (1) combining provenance annotations for both information that resides in the database and information about external inputs (e.g., user choices) and (2) finitely capturing infinitely many process executions. We have implemented our framework as part of the PROPOLIS system.

Original language: English
Pages (from-to): 583-607
Number of pages: 25
Journal: VLDB Journal
Volume: 24
Issue number: 4
State: Published - 24 Aug 2015

Funding

Funders (funder number, where given):
Israeli Ministry of Science
National Science Foundation: IIS 1217798
Broadcom Foundation
United States-Israel Binational Science Foundation
Israel Science Foundation
Tel Aviv University

Keywords

• Process analysis
• Provenance semirings
