Development and Evaluation of a Korean Treebank and its Application to NLP

Abstract

This paper discusses issues involved in building a 54-thousand-word Korean Treebank using phrase structure annotation, and the development of annotation guidelines based on the morpho-syntactic phenomena represented in the corpus. The various methods employed for quality control are described. An evaluation of the quality of the Treebank and some of the Natural Language Processing (NLP) applications under development using the Treebank are also described.

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2002
Accession Number
ADA457100

Entities

People

  • Chung-hye Han
  • Eon-suk Ko
  • Martha Palmer
  • Na-rare Han

Organizations

  • Simon Fraser University

Tags

DTIC Thesaurus Topics

  • Abstracts
  • Accuracy
  • Computational Linguistics
  • Grammars
  • Information Operations
  • Language
  • Linguistics
  • Machine Translation
  • Morphology (Linguistics)
  • Natural Language Processing
  • Natural Languages
  • Pennsylvania
  • Quality Control
  • Sequences
  • Test And Evaluation
  • Universities

Readers

  • Canine Service Warrior Training Program for Wounded Warriors in the Veterinary Industry, Supported by Donors.
  • Speech Processing/Speech Recognition.
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation