TY - GEN
T1 - Optimal workload-based weighted wavelet synopses
AU - Matias, Yossi
AU - Urieli, Daniel
N1 - Funding Information:
IA preliminary version of this paper appeared in Proc. of ICDT’05. Research partly supported by a grant from the Israel Science Foundation. ∗Corresponding author. E-mail addresses: [email protected] (Y. Matias), [email protected] (D. Urieli).
PY - 2005
Y1 - 2005
N2 - In recent years wavelets were shown to be effective data synopses. We are concerned with the problem of finding efficiently wavelet synopses for massive data sets, in situations where information about query workload is available. We present linear time, I/O optimal algorithms for building optimal workload-based wavelet synopses for point queries. The synopses are based on a novel construction of weighted inner-products and use weighted wavelets that are adapted to those products. The synopses are optimal in the sense that the subset of retained coefficients is the best possible for the bases in use with respect to either the mean-squared absolute or relative errors. For the latter, this is the first optimal wavelet synopsis even for the regular, non-workload-based case. Experimental results demonstrate the advantage obtained by the new optimal wavelet synopses, as well as the robustness of the synopses to deviations in the actual query workload.
AB - In recent years wavelets were shown to be effective data synopses. We are concerned with the problem of finding efficiently wavelet synopses for massive data sets, in situations where information about query workload is available. We present linear time, I/O optimal algorithms for building optimal workload-based wavelet synopses for point queries. The synopses are based on a novel construction of weighted inner-products and use weighted wavelets that are adapted to those products. The synopses are optimal in the sense that the subset of retained coefficients is the best possible for the bases in use with respect to either the mean-squared absolute or relative errors. For the latter, this is the first optimal wavelet synopsis even for the regular, non-workload-based case. Experimental results demonstrate the advantage obtained by the new optimal wavelet synopses, as well as the robustness of the synopses to deviations in the actual query workload.
UR - http://www.scopus.com/inward/record.url?scp=77049119254&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-30570-5_25
DO - 10.1007/978-3-540-30570-5_25
M3 - ???researchoutput.researchoutputtypes.contributiontobookanthology.conference???
AN - SCOPUS:77049119254
SN - 3540242880
SN - 9783540242888
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 368
EP - 382
BT - Database Theory - ICDT 2005 - 10th International Conference, Proceedings
T2 - 10th International Conference on Database Theory, ICDT 2005
Y2 - 5 January 2005 through 7 January 2005
ER -