Differential Privacy for Protecting Multi-dimensional Contingency Table Data: Extensions and Applications

Abstract

The methodology of differential privacy has provided a strong definition of privacy which in some settings, using a mechanism of doubly-exponential noise addition, also allows for extraction of informative statistics from databases. In a recent paper, Barak et al.[1] extend this approach to the release of a specified set of margins from a multi-way contingency table. Privacy protection in such settings implicitly focuses on small cell counts that might allow for the identification of units that are unique in the database. We explore how well the mechanism works in the context of a series of examples, and the extent to which the proposed differential-privacy mechanism allows for sensible inferences from the released data. We conclude that the methodology, as it is currently formulated, is problematic in the context of the types of large sparse contingency tables encountered in statistical practice.

Document Details

Document Type
Pub Defense Publication
Publication Date
Jul 20, 2012
Source ID
10.29012/jpc.v4i1.613

Entities

People

  • Alessandro Rinaldo
  • Stephen E. Fienberg
  • Xiaolin Yang

Organizations

  • National Science Foundation
  • United States Army

Tags

Fields of Study

  • Mathematics

Readers

  • Neural Network Machine Learning.
  • Organizational Psychology.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Machine Learning Algorithms