TOWARD EXPLOITATION OF A FILE OF RUSSIAN TEXT WITH SYNTACTIC ANNOTATIONS,

Abstract

The final report of RAND's work in linguistics for Rome Air Development Center, 1965-66. The work included the compilation of a million-word File of Russian Text with Syntactic Annotations, stored on magnetic tape, which can be duplicated for qualified requesters; and the design of a computer program called COLLECT for retrieving data from the File. The annotations, which are based on dependency theory, include not only systematic connections between dependent and governing words, but grammatical functions, indications of negation, pointers to antecedents of pronouns, and special features. Methods of automatic (statistical) classification usable in reducing File data are discussed, and steps toward ambiguity reduction and automatic parsing are described. Tentative research designs for investigating modal constructions and sentential apposition in Russian with computer assistance are outlined.

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1967
Accession Number
AD0649252

Entities

People

  • David G. Hays
  • Dean S. Worth

Organizations

  • RAND Corporation

Tags

DTIC Thesaurus Topics

  • Ambiguity
  • Automatic
  • Classification
  • Computer Programs
  • Computers
  • Computing Devices
  • Construction
  • Data Storage Systems
  • Linguistics
  • Magnetic Tape
  • Tapes

Readers

  • Computational Linguistics
  • Computer Science/Computer Engineering/Data Science/Digital Signal Processing.
  • Technical Research and Report Writing.