Estimating aggregates over multiple sets

Edith Cohen, Haim Kaplan

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Abstract

Many datasets, including market basket data, text or hypertext documents, and measurement data collected in different nodes or time periods, are modeled as a collection of sets over a ground set of (weighted) items. We consider the problem of estimating basic aggregates such as the weight or selectivity of a subpopulation of the items. We extend classic summarization techniques based on sampling to this scenario when we have multiple sets and selection predicates based on membership in particular sets.
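As a rough illustration of the kind of sample-based estimate the abstract refers to, the sketch below draws a weighted Poisson sample and uses Horvitz-Thompson adjusted weights to estimate the weight of a subpopulation selected by set membership. It is not the authors' method: the function names, the threshold parameter `tau`, and the single shared `membership` map are assumptions made for illustration only, and the paper's own summaries for multiple sets are not reproduced here.

```python
import random


def poisson_sample(weights, tau, rng):
    """Sample each item independently with probability min(1, w / tau).

    Returns a dict mapping sampled items to Horvitz-Thompson adjusted
    weights w / p, so that the expected adjusted weight of every item
    equals its true weight. `tau` is a hypothetical threshold parameter
    and must be positive.
    """
    sample = {}
    for item, w in weights.items():
        p = min(1.0, w / tau)
        if rng.random() < p:
            sample[item] = w / p
    return sample


def estimate_weight(sample, predicate):
    """Estimate the total weight of the items that satisfy `predicate`."""
    return sum(adj for item, adj in sample.items() if predicate(item))


if __name__ == "__main__":
    # Hypothetical toy data: item weights and the sets each item belongs to.
    weights = {"a": 1.0, "b": 2.0, "c": 3.0, "d": 8.0}
    membership = {"a": {"S1"}, "b": {"S1", "S2"}, "c": {"S2"}, "d": {"S1"}}

    sample = poisson_sample(weights, tau=4.0, rng=random.Random(0))

    # Selection predicate based on membership: in S1 but not in S2.
    def in_s1_only(item):
        return "S1" in membership[item] and "S2" not in membership[item]

    print(estimate_weight(sample, in_s1_only))
```

The estimate is unbiased for any predicate that can be evaluated on the sampled items alone, which is why the membership information is assumed to be stored alongside each sampled item.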

Original language: English
Title of host publication: Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008
Pages: 761-766
Number of pages: 6
State: Published - 2008
Event: 8th IEEE International Conference on Data Mining, ICDM 2008 - Pisa, Italy
Duration: 15 Dec 2008 → 19 Dec 2008

Publication series

Name: Proceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print): 1550-4786

Conference

Conference: 8th IEEE International Conference on Data Mining, ICDM 2008
Country/Territory: Italy
City: Pisa
Period: 15/12/08 → 19/12/08
