Development and Evaluation of a Korean Treebank and its Application to NLP
Abstract
This paper discusses issues involved in building a 54-thousand-word Korean Treebank using phrase structure annotation, along with the development of annotation guidelines based on the morpho-syntactic phenomena represented in the corpus. It describes the methods employed for quality control, an evaluation of the quality of the Treebank, and some of the Natural Language Processing (NLP) applications under development that use the Treebank.
Document Details
- Document Type: Technical Report
- Publication Date: Jan 01, 2002
- Accession Number: ADA457100
Entities
People
- Chung-hye Han
- Eon-suk Ko
- Martha Palmer
- Na-rare Han
Organizations
- Simon Fraser University