Sample-Based Distance-Approximation for Subsequence-Freeness

Omer Cohen-Sidon*, Dana Ron*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this work, we study the problem of approximating the distance to subsequence-freeness in the sample-based distribution-free model. For a given subsequence (word) w = w1 . . . wk, a sequence (text) T = t1 . . . tn is said to contain w if there exist indices 1 ≤ i1 < · · · < ik ≤ n such that tij = wj for every 1 ≤ j ≤ k. Otherwise, T is w-free. Ron and Rosin (ACM TOCT 2022) showed that the number of samples both necessary and sufficient for one-sided error testing of subsequence-freeness in the sample-based distribution-free model is Θ(k/ϵ). Denoting by ∆(T, w, p) the distance of T to w-freeness under a distribution p : [n] → [0, 1], we are interested in obtaining an estimate ∆b , such that |∆b − ∆(T, w, p)| ≤ δ with probability at least 2/3, for a given distance parameter δ. Our main result is an algorithm whose sample complexity is Õ(k22). We first present an algorithm that works when the underlying distribution p is uniform, and then show how it can be modified to work for any (unknown) distribution p. We also show that a quadratic dependence on 1/δ is necessary.

Original languageEnglish
Title of host publication50th International Colloquium on Automata, Languages, and Programming, ICALP 2023
EditorsKousha Etessami, Uriel Feige, Gabriele Puppis
PublisherSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
ISBN (Electronic)9783959772785
DOIs
StatePublished - Jul 2023
Event50th International Colloquium on Automata, Languages, and Programming, ICALP 2023 - Paderborn, Germany
Duration: 10 Jul 202314 Jul 2023

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume261
ISSN (Print)1868-8969

Conference

Conference50th International Colloquium on Automata, Languages, and Programming, ICALP 2023
Country/TerritoryGermany
CityPaderborn
Period10/07/2314/07/23

Funding

FundersFunder number
Israel Science Foundation1146/18

    Keywords

    • Distance Approximation
    • Property Testing

    Fingerprint

    Dive into the research topics of 'Sample-Based Distance-Approximation for Subsequence-Freeness'. Together they form a unique fingerprint.

    Cite this