GeoName: A System for Back-Transliterating Pinyin Place Names

Abstract

To be unambiguous about a Chinese geographic name represented in English text as Pinyin one needs to recover the name in Chinese characters We present our approach to this back-transliteration problem based on processes such as bilingual geographic name lookup, name suggestion using place name character and pair frequencies and confirmation via a collection of monolingual names or the WWW. Evaluation shows that about 48% to 72% of the correct names can be recovered as the top candidate and 82% to 86% within top ten, depending on the processes employed.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 2003
Accession Number
ADA458057

Entities

People

  • Kui L. Kwok
  • Qiang Deng

Organizations

  • Queens College

Tags

Communities of Interest

  • C4I

DTIC Thesaurus Topics

  • Artificial Intelligence
  • Chinese Language
  • Computational Linguistics
  • Computer Science
  • Computer Vision
  • Frequency
  • Hong Kong
  • Information Retrieval
  • Language
  • Linguistics
  • Named Entity Recognition
  • New York
  • Personality
  • Probability
  • Syllables
  • Test And Evaluation
  • Translations

Readers

  • Computational Linguistics
  • International Journalism and Media Studies.