The COMLEX Syntax Project

Abstract

The goal of the COMLEX Syntax Project is to create a moderately-broad-coverage shareable dictionary containing the syntactic features of English words,intended for automatic language analysis. We are initially aiming for a dictionary of 35,000 to 40,000 base forms, although this of course may be enlarged if the initial effort is positively received. The dictionary should include detailed syntactic specifications, particularly for subcategofization; our intent is to provide sufficient detail so that the information required by a number of major English analyzers can be automatically derived from the information we provide. As with other Linguistic Data Consortium resources, our intent is to provide a lexicon available without license constraint to all Consortium members. Finally, our goal is to provide an initial lexicon relatively quickly within about a year, funding permitting. This implies a certain flexibility, where some of the features will probably be changed and refined as the coding is taking place.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Apr 01, 1993
Accession Number
ADA460231

Entities

People

  • Catherine Macleod
  • Ralph David Grishman
  • Susanne Wolff

Organizations

  • New York University

Tags

DTIC Thesaurus Topics

  • Coding
  • Computational Linguistics
  • Computer Science
  • Consortiums
  • Dictionaries
  • Graphical User Interface
  • Information Processing
  • Language
  • Linguistics
  • Natural Languages
  • New Mexico
  • New York
  • Notation
  • Specifications
  • Standards
  • Syntax
  • Word Lists

Readers

  • Clinical Trial Research.
  • Database Systems and Applications
  • Speech Processing/Speech Recognition.