Language Processing Research and Technology - New Directions
Abstract
A new statistical parser has been developed which works on unconstrained newspaper text with an 89% success rate. It trains in minutes and its success is the highest in the field so far. A new robust parsing technology called "supertagging" was developed based on lexicalized tree-adjoining grammars (LTAG). This parser gives a very fine grain analysis and achieves a 93% success rate for assigning correct supertags to each word. This "almost parsing" becomes actual full parsing in many application domains. New technologies for automated entity based summarization of newspaper articles based on the conference technology developed earlier.
Document Details
- Document Type
- Technical Report
- Publication Date
- Nov 30, 1998
- Accession Number
- ADA358329
Entities
People
- Aravind K. Joshi
Organizations
- University of Pennsylvania