TY - GEN
T1 - Testing shape restrictions of discrete distributions
AU - Canonne, Clément L.
AU - Diakonikolas, Ilias
AU - Gouleakis, Themis
AU - Rubinfeld, Ronitt
N1 - Publisher Copyright:
© Clément L. Canonne, Ilias Diakonikolas, Themis Gouleakis, and Ronitt Rubinfeld; licensed under Creative Commons License CC-BY.
PY - 2016/2/1
Y1 - 2016/2/1
N2 - We study the question of testing structured properties (classes) of discrete distributions. Specifically, given sample access to an arbitrary distribution D over [n] and a property P, the goal is to distinguish between D ∈ P and ℓ1(D, P) > ε. We develop a general algorithm for this question, which applies to a large range of "shape-constrained" properties, including monotone, log-concave, t-modal, piecewise-polynomial, and Poisson Binomial distributions. Moreover, for all cases considered, our algorithm has near-optimal sample complexity with regard to the domain size and is computationally efficient. For most of these classes, we provide the first non-trivial tester in the literature. In addition, we also describe a generic method to prove lower bounds for this problem, and use it to show our upper bounds are nearly tight. Finally, we extend some of our techniques to tolerant testing, deriving nearly-tight upper and lower bounds for the corresponding questions.
AB - We study the question of testing structured properties (classes) of discrete distributions. Specifically, given sample access to an arbitrary distribution D over [n] and a property P, the goal is to distinguish between D ∈ P and ℓ1(D, P) > ε. We develop a general algorithm for this question, which applies to a large range of "shape-constrained" properties, including monotone, log-concave, t-modal, piecewise-polynomial, and Poisson Binomial distributions. Moreover, for all cases considered, our algorithm has near-optimal sample complexity with regard to the domain size and is computationally efficient. For most of these classes, we provide the first non-trivial tester in the literature. In addition, we also describe a generic method to prove lower bounds for this problem, and use it to show our upper bounds are nearly tight. Finally, we extend some of our techniques to tolerant testing, deriving nearly-tight upper and lower bounds for the corresponding questions.
KW - Lower bounds
KW - Probability distributions
KW - Property testing
KW - Statistics
UR - http://www.scopus.com/inward/record.url?scp=84961564543&partnerID=8YFLogxK
U2 - 10.4230/LIPIcs.STACS.2016.25
DO - 10.4230/LIPIcs.STACS.2016.25
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:84961564543
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 33rd Symposium on Theoretical Aspects of Computer Science, STACS 2016
A2 - Vollmer, Heribert
A2 - Ollinger, Nicolas
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
T2 - 33rd Symposium on Theoretical Aspects of Computer Science, STACS 2016
Y2 - 17 February 2016 through 20 February 2016
ER -