TY - GEN
T1 - Sublinear time estimation of degree distribution moments
AU - Eden, Talya
AU - Ron, Dana
AU - Seshadhri, C.
N1 - Publisher Copyright:
© Talya Eden, Dana Ron, and C. Seshadhri.
PY - 2017/7/1
Y1 - 2017/7/1
N2 - We revisit the classic problem of estimating the degree distribution moments of an undirected graph. Consider an undirected graph G = (V, E) with n (non-isolated) vertices, and define (for s > 0) μs = 1/n ∑ν∈2V dνs. Our aim is to estimate μs within a multiplicative error of (1+ϵ) (for a given approximation parameter ϵ > 0) in sublinear time. We consider the sparse graph model that allows access to: uniform random vertices, queries for the degree of any vertex, and queries for a neighbor of any vertex. For the case of s = 1 (the average degree), Õ(√n) queries suffice for any constant ϵ (Feige, SICOMP 06 and Goldreich-Ron, RSA 08). Gonen-Ron-Shavitt (SIDMA 11) extended this result to all integral s > 0, by designing an algorithms that performs Õ(n1-1/(s+1)) queries. (Strictly speaking, their algorithm approximates the number of star-subgraphs of a given size, but a slight modification gives an algorithm for moments.) We design a new, significantly simpler algorithm for this problem. In the worst-case, it exactly matches the bounds of Gonen-Ron-Shavitt, and has a much simpler proof. More importantly, the running time of this algorithm is connected to the degeneracy of G. This is (essentially) the maximum density of an induced subgraph. For the family of graphs with degeneracy at most α, it has a query complexity of Õ (n1-1/s/μ s1/s (α1/s + min{α, μ s1/s })) = Õ(n1-1/sα/μ s1/s ). Thus, for the class of bounded degeneracy graphs (which includes all minor closed families and preferential attachment graphs), we can estimate the average degree in Õ(1) queries, and can estimate the variance of the degree distribution in Õ(√n) queries. This is a major improvement over the previous worst-case bounds. Our key insight is in designing an estimator for μs that has low variance when G does not have large dense subgraphs.
AB - We revisit the classic problem of estimating the degree distribution moments of an undirected graph. Consider an undirected graph G = (V, E) with n (non-isolated) vertices, and define (for s > 0) μs = 1/n ∑ν∈2V dνs. Our aim is to estimate μs within a multiplicative error of (1+ϵ) (for a given approximation parameter ϵ > 0) in sublinear time. We consider the sparse graph model that allows access to: uniform random vertices, queries for the degree of any vertex, and queries for a neighbor of any vertex. For the case of s = 1 (the average degree), Õ(√n) queries suffice for any constant ϵ (Feige, SICOMP 06 and Goldreich-Ron, RSA 08). Gonen-Ron-Shavitt (SIDMA 11) extended this result to all integral s > 0, by designing an algorithms that performs Õ(n1-1/(s+1)) queries. (Strictly speaking, their algorithm approximates the number of star-subgraphs of a given size, but a slight modification gives an algorithm for moments.) We design a new, significantly simpler algorithm for this problem. In the worst-case, it exactly matches the bounds of Gonen-Ron-Shavitt, and has a much simpler proof. More importantly, the running time of this algorithm is connected to the degeneracy of G. This is (essentially) the maximum density of an induced subgraph. For the family of graphs with degeneracy at most α, it has a query complexity of Õ (n1-1/s/μ s1/s (α1/s + min{α, μ s1/s })) = Õ(n1-1/sα/μ s1/s ). Thus, for the class of bounded degeneracy graphs (which includes all minor closed families and preferential attachment graphs), we can estimate the average degree in Õ(1) queries, and can estimate the variance of the degree distribution in Õ(√n) queries. This is a major improvement over the previous worst-case bounds. Our key insight is in designing an estimator for μs that has low variance when G does not have large dense subgraphs.
KW - Degree distribution
KW - Graph moments
KW - Sublinear algorithms
UR - http://www.scopus.com/inward/record.url?scp=85027257195&partnerID=8YFLogxK
U2 - 10.4230/LIPIcs.ICALP.2017.7
DO - 10.4230/LIPIcs.ICALP.2017.7
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:85027257195
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 44th International Colloquium on Automata, Languages, and Programming, ICALP 2017
A2 - Muscholl, Anca
A2 - Indyk, Piotr
A2 - Kuhn, Fabian
A2 - Chatzigiannakis, Ioannis
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
Y2 - 10 July 2017 through 14 July 2017
ER -