Language Processing Research and Technology - New Directions

Abstract

A new statistical parser has been developed which works on unconstrained newspaper text with an 89% success rate. It trains in minutes and its success is the highest in the field so far. A new robust parsing technology called "supertagging" was developed based on lexicalized tree-adjoining grammars (LTAG). This parser gives a very fine grain analysis and achieves a 93% success rate for assigning correct supertags to each word. This "almost parsing" becomes actual full parsing in many application domains. New technologies for automated entity based summarization of newspaper articles based on the conference technology developed earlier.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Nov 30, 1998
Accession Number
ADA358329

Entities

People

  • Aravind K. Joshi

Organizations

  • University of Pennsylvania

Tags

Communities of Interest

  • Human Systems

DTIC Thesaurus Topics

  • Abstracts
  • Applied Computer Science
  • Artificial Intelligence
  • Artificial Intelligence Computing
  • Artificial Intelligence Software
  • Automated Text Summarization
  • Computational Linguistics
  • Computational Science
  • Computer Languages
  • Grammars
  • Information Science
  • Language
  • Linguistics
  • Machine Translation
  • Natural Language Processing
  • Newspapers

Fields of Study

  • Computer science

Readers

  • Computational Linguistics