TY - GEN

T1 - The confidence in our k-Tails

AU - Cohen, Hila

AU - Maoz, Shahar

N1 - Publisher Copyright:
© 2014 ACM.

PY - 2014

Y1 - 2014

N2 - K-Tails is a popular algorithm for extracting a candidate behavioral model from a log of execution traces. The usefulness of k-Tails depends on the quality of its input log, which may include too few traces to build a representative model, or too many traces, whose analysis is a waste of resources. Given a set of traces, how can one be confident that it includes enough, but not too many, traces? While many have used the k-Tails algorithm, no previous work has yet investigated this question. In this paper we address this question by proposing a novel notion of log completeness. Roughly, a log of traces, extracted from a given system, is k-complete, iff adding any new trace to the log will not change the resulting model k-Tails would build for it. Since the system and its full set of traces is unknown, we cannot know whether a given log is k-complete. However, we can estimate its k-completeness. We call this estimation k-confidence. We formalize the notion of k-confidence and implement its computation. Preliminary experiments show that k-confidence can be efficiently computed and is a highly reliable estimator for k-completeness.

AB - K-Tails is a popular algorithm for extracting a candidate behavioral model from a log of execution traces. The usefulness of k-Tails depends on the quality of its input log, which may include too few traces to build a representative model, or too many traces, whose analysis is a waste of resources. Given a set of traces, how can one be confident that it includes enough, but not too many, traces? While many have used the k-Tails algorithm, no previous work has yet investigated this question. In this paper we address this question by proposing a novel notion of log completeness. Roughly, a log of traces, extracted from a given system, is k-complete, iff adding any new trace to the log will not change the resulting model k-Tails would build for it. Since the system and its full set of traces is unknown, we cannot know whether a given log is k-complete. However, we can estimate its k-completeness. We call this estimation k-confidence. We formalize the notion of k-confidence and implement its computation. Preliminary experiments show that k-confidence can be efficiently computed and is a highly reliable estimator for k-completeness.

KW - Dynamic specification mining

KW - Probabilistic approach

UR - http://www.scopus.com/inward/record.url?scp=84908627889&partnerID=8YFLogxK

U2 - 10.1145/2642937.2642944

DO - 10.1145/2642937.2642944

M3 - פרסום בספר כנס

AN - SCOPUS:84908627889

T3 - ASE 2014 - Proceedings of the 29th ACM/IEEE International Conference on Automated Software Engineering

SP - 605

EP - 610

BT - ASE 2014 - Proceedings of the 29th ACM/IEEE International Conference on Automated Software Engineering

PB - Association for Computing Machinery, Inc

Y2 - 15 September 2014 through 19 September 2014

ER -