Provenance Capture Disparities Highlighted through Datasets

Abstract

Provenance information is inherently affected by the method of its capture. Different capture mechanisms create very different provenance graphs. In this work, we describe an academic use case that has corollaries in offices everywhere. We also describe two distinct possibilities for provenance capture methods within this domain. We generate three datasets using these two capture methods: the capture methods run individually and a trace of what an omniscient capture agent would see. We describe how the different capture methods lead to such very different graphs and release the graphs for others to use via the ProvBench effort.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2014
Accession Number
AD1107703

Entities

People

  • Adriane P. Chapman
  • G. B. Coe
  • M. D. Allen
  • R. C. Doty

Organizations

  • Georgia Tech
  • MITRE Corporation

Tags

Communities of Interest

  • Materials and Manufacturing Processes

DTIC Thesaurus Topics

  • Abstracts
  • Acids
  • Automatic
  • Bibliographies
  • Communities
  • Computations
  • Corporations
  • Databases
  • Digital Information
  • Disparities
  • Education
  • Engineering
  • Governments
  • Interoperability
  • Lightweight
  • Multithreading
  • Nucleic Acids
  • Relational Databases
  • Scalability
  • Schools
  • Standards
  • Web Service
  • Workload

Fields of Study

  • Computer science

Readers

  • Computational Linguistics
  • Distributed Systems and Data Platform Development
  • Graph Algorithms and Convex Optimization.