Absolute bounds on set intersection and union sizes from distribution information
12 min read
Rate this book:
About This Book
Estimation of set intersection and union sizes is important for access method selection for a database. Absolute bounds on sizes are often much easier to compute than size estimates, requiring no distributional or independence assumptions, and can answer many of the same needs. We present a large compendium of quick closed-form bounds on set intersection and union sizes, each applying to a different situation; they can be expressed as rules, and managed by rule-based or 'knowledge-base' architecture. These methods use general-purpose statistics precomputed on the data, and exploit homomorphisms (onto mappings) of the data items onto distributions that can be more easily analyzed. Our methods can be used anytime, but tend to work best when there are strong or complex correlations in the data. This circumstance is poorly addressed by the standard methods of independence-assumption and distributional-assumption estimates, and hence our methods fill a need.
Buy This Book
As an Amazon Associate and Bookshop.org affiliate, BookOrb earns from qualifying purchases.
Write a Review
Sign in to write a review.
More by Neil C. Rowe
Absolute bounds on the mean and standard deviation of transformed data for constant-derivative transformations
Aiding teachers in constructing virtual-reality tutors
Antisampling for estimation
Artifical Intelligence Through
Artifical Intelligence Through Prolog
Artificial intelligence through Prolog
Diophantine inferences from statistical aggregates on few-valued attributes