Unique in the shopping mall: On the reidentifiability of credit card metadata

Abstract

Large-scale data sets of human behavior have the potential to fundamentally transform the way we fight diseases, design cities, or perform research. Metadata, however, contain sensitive information. Understanding the privacy of these data sets is key to their broad use and, ultimately, their impact. We study 3 months of credit card records for 1.1 million people and show that four spatiotemporal points are enough to uniquely reidentify 90% of individuals. We show that knowing the price of a transaction increases the risk of reidentification by 22%, on average. Finally, we show that even data sets that provide coarse information at any or all of the dimensions provide little anonymity and that women are more reidentifiable than men in credit card metadata.

Document Details

Document Type
Pub Defense Publication
Publication Date
Jan 30, 2015
Source ID
10.1126/science.1256297

Entities

People

  • Alex “sandy” Pentland
  • Laura Radaelli
  • Vivek Kumar Singh
  • Yves-alexandre De Montjoye

Organizations

  • Aarhus University
  • Belgian American Educational Foundation
  • European Commission
  • Massachusetts Institute of Technology
  • Rutgers University
  • United States Army Research Laboratory
  • Wallonie-Bruxelles International

Tags

Fields of Study

  • Computer science

Readers

  • Computer Vision.
  • Government Contracting/Procurement.
  • Psychological Intervention/Treatment for Stress, Anxiety, PTSD, and Related Emotional and Cognitive Health Symptoms.