Integrating Syntax, Semantics, and Discourse DARPA (Defense Advanced Research Projects Agency) Natural Language Understanding Program
Abstract
A merged lexicon for all domains has been created. The merged lexicon contains about 2400 distinct root words. A tool for extracting lexical items from a dictionary, given a word list, was used to obtain items from the merged lexicon for the resource management domain. Approximately 47% of the vocabulary (or 448 words) in this domain is new. However a large number of these words are proper names of ships and locations, which are relatively trivial to enter. There are about 225 other words, including about 50 verbs. Because there are so few verbs, we expect adding the lexical entries to take only a few days. Keywords: Syntax, Semantics, Natural language, Information retrieval.
Document Details
- Document Type
- Technical Report
- Publication Date
- Nov 21, 1988
- Accession Number
- ADA203747
Entities
People
- Lynette Hirschman