Finding and Archiving the Internet Footprint

Abstract

With the move to "cloud" computing, archivists face the increasingly difficult task of finding and preserving the works of an originator so that they may be readily used by future historians. This paper explores the range of information that an originator may have left on computers out there on the Internet, including works that are publicly identified with the originator; information that may have been stored using a pseudonym; anonymous blog postings and private information stored on web-based services like Yahoo Calendar and Google Docs. Approaches are given for finding the content, including interviews forensic analysis of the originator's computer equipment and social network analysis. We conclude with a brief discussion of legal and ethical issues.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Feb 10, 2009
Accession Number
ADA549352

Entities

People

  • David Cox
  • Simson Garfinkel

Organizations

  • Naval Postgraduate School

Tags

Communities of Interest

  • Autonomy
  • Energy and Power Technologies
  • Human Systems

DTIC Thesaurus Topics

  • Cloud Computing
  • Commerce
  • Computer Programs
  • Computers
  • Deep Web
  • Electronic Mail
  • Forensic Analysis
  • Information Systems
  • Internet
  • Networks
  • Online Communications
  • Social Media
  • Social Networking Services
  • Social Networks
  • Web Browsers
  • Websites
  • Word Processors

Readers

  • Agent-Based Social Robotics and Mobile-Assisted Learning in Virtual Environments.
  • Computer Science.
  • Geospatial Intelligence and Artificial Intelligence Analytics