TY - JOUR
T1 - Matching with don't-cares and a small number of mismatches
AU - Linhart, Chaim
AU - Shamir, Ron
N1 - Funding Information:
This study was supported in part by the Israel Science Foundation (grant 802/08). We thank the anonymous referees for their constructive comments.
PY - 2009/2/15
Y1 - 2009/2/15
N2 - In matching with don't-cares and k mismatches we are given a pattern of length m and a text of length n, both of which may contain don't-cares (a symbol that matches all symbols), and the goal is to find all locations in the text that match the pattern with at most k mismatches, where k is a parameter. We present new algorithms that solve this problem using a combination of convolutions and a dynamic programming procedure. We give randomized and deterministic solutions that run in time O(nk^2 log m) and O(nk^3 log m), respectively, and are faster than the most efficient extant methods for small values of k. Our deterministic algorithm is the first to obtain an O(polylog(k) · n log m) running time.
AB - In matching with don't-cares and k mismatches we are given a pattern of length m and a text of length n, both of which may contain don't-cares (a symbol that matches all symbols), and the goal is to find all locations in the text that match the pattern with at most k mismatches, where k is a parameter. We present new algorithms that solve this problem using a combination of convolutions and a dynamic programming procedure. We give randomized and deterministic solutions that run in time O(nk^2 log m) and O(nk^3 log m), respectively, and are faster than the most efficient extant methods for small values of k. Our deterministic algorithm is the first to obtain an O(polylog(k) · n log m) running time.
KW - Analysis of algorithms
KW - Approximate wildcard matching
KW - Pattern matching
UR - http://www.scopus.com/inward/record.url?scp=58149308448&partnerID=8YFLogxK
U2 - 10.1016/j.ipl.2008.10.016
DO - 10.1016/j.ipl.2008.10.016
M3 - Article
AN - SCOPUS:58149308448
SN - 0020-0190
VL - 109
SP - 273
EP - 277
JO - Information Processing Letters
JF - Information Processing Letters
IS - 5
ER -