Set Representation and Set Intersection.

Abstract

This work discusses the representation and manipulation of sets based on two different concepts: tries, and hashing functions. The sets considered here are assumed to be static: once created, there will be no further insertions or deletions. For both trie- and hash-based strategies, a series of representations is introduced which together with the availability of preprocessing reduces the average sizes of the sets to nearly optimal values, yet retains the inherently good retrieval characteristics. The intersection procedure for trie-based representations is based on the traversal in parallel of the tries representing the sets to be intersected, and it behaves like a series of binary searches when the sets to be intersected are of very different sizes. Hashed intersection runs very fast. The average time is proportional to the size of the smallest set to be intersected and is independent of the number of sets (except for the intersection set itself which has to be checked for every set). (Author)

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Nov 01, 1978
Accession Number
ADA065283

Entities

People

  • Luis Trabb Pardo

Organizations

  • Stanford University

Tags

Communities of Interest

  • C4I
  • Counter IED

DTIC Thesaurus Topics

  • Algorithms
  • Asymptotic Series
  • Boundaries
  • Classification
  • Coding
  • Collisions
  • Computer Programming
  • Computer Science
  • Computers
  • Databases
  • Hash Tables
  • Notation
  • Probabilistic Models
  • Probability
  • Probability Distributions
  • Real Numbers
  • Sequences

Fields of Study

  • Computer science
  • Mathematics

Readers

  • Applied Combinatorial Optimization and Logic Circuit Design.
  • Graph Algorithms and Convex Optimization.