AUTOMATIC SECURITY CLASSIFICATION STUDY

Abstract

An investigation was made of the feasibility of using computers to assign the proper security classification (unclassified, confidential, secret) to textual material. The words in 998 paragraphs were transformed to computer-usable form. A set of 66 variables was computed for each paragraph by a two-stage process of attaching three scores to a word and then combining the scores in various ways over the words of a paragraph. Several experiments were conducted to validate assumptions involved in the method of scoring the words and the methods for combining the scores. The 66 variables were presented to a statistical technique which made a preferential selection of a small set of effective variables from the large set of 66 variables. The redundant or non-controlling variables were eliminated from subsequent analysis, and an objective system was developed for assigning security classifications using only the selected variables. The system was applied to an independent sample of paragraphs and 53.9 percent were correctly classified. It was concluded that the system does exhibit skill. However, the skill is probably too low to consider replacing the present system. Finally, it is concluded that the method for forming variables and the statistical technique, both apparently new to this field, show sufficient promise to merit application to other automatic indexing problems.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Oct 01, 1967
Accession Number
AD0660120

Entities

People

  • Ann L. Bussemey
  • Guy T. Merriman
  • Isadore Enger

Tags

Communities of Interest

  • Air Platforms
  • Human Systems

DTIC Thesaurus Topics

  • Accuracy
  • Air Force
  • Algorithms
  • Automatic
  • Biological Warfare
  • Computer Programs
  • Computers
  • Equations
  • Government Procurement
  • Governments
  • Information Science
  • Magnetic Tape
  • Materials
  • Numbers
  • Probability
  • Statistical Analysis
  • Statistical Decision Theory

Readers

  • Computational Linguistics
  • Psychometric Testing or Psychological Assessment.
  • Systems Analysis and Design