Research in Chinese Lexicography.

Abstract

The report describes the results of a 27-month research effort in Chinese lexicography. The effort was oriented primarily toward the compilation of a Chinese-English scientific and technical dictionary in computer-processible format. Approximately 216K entries have been generated in the Information Maintenance System (IMS), of which roughly 155K require further revision. A complete entry contains the following information: Pin-Yin romanization, tone marker, radical index, grammar code, field of use, origin and date of publication, English gloss(es), romanized quotation(s) with English translation(s), comments, Yen-Ren-Chao romanization, and telecode/chicode equivalents. Simplified formats are available. Subsets of lexical data are selectively retrievable from the master store according to pre-specified descriptors (e.g., field of use). The end product will be applicable to both automated and conventional modes of translation. It will also provide a lexical data base for linguistic studies in the context of machine translation R&D. (Author)
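
The entry layout and descriptor-based retrieval described in the abstract can be illustrated with a minimal Python sketch. It is not the IMS format, which the abstract does not specify: the class name LexicalEntry, the field names, the helpers select_by_field and simplified, the grammar code "N", and the two sample entries are all assumptions made for illustration. Fields whose real values are unknown here (radical index, telecode/chicode, Yen-Ren-Chao romanization) are left empty rather than invented.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class LexicalEntry:
    """Hypothetical record mirroring the fields listed in the abstract."""
    pinyin: str                      # Pin-Yin romanization
    tones: str                       # tone marker, one digit per syllable
    grammar_code: str                # grammar code (coding scheme assumed)
    field_of_use: str                # descriptor used for retrieval
    glosses: List[str]               # English gloss(es)
    radical_index: Optional[str] = None
    source: Optional[str] = None     # origin and date of publication
    quotations: List[Tuple[str, str]] = field(default_factory=list)
    comments: str = ""
    chao_romanization: Optional[str] = None
    telecode: Optional[str] = None   # telecode/chicode equivalent


def select_by_field(store: List[LexicalEntry], field_of_use: str) -> List[LexicalEntry]:
    """Return the subset of the master store tagged with one field of use."""
    return [entry for entry in store if entry.field_of_use == field_of_use]


def simplified(entry: LexicalEntry) -> dict:
    """One possible simplified format: romanization and glosses only."""
    return {"pinyin": entry.pinyin, "glosses": entry.glosses}


# Illustrative master store; the entries are examples, not data from the report.
store = [
    LexicalEntry("dianzi", "4-3", "N", "physics", ["electron"]),
    LexicalEntry("jisuanji", "4-4-1", "N", "computing", ["computer"]),
]

physics_subset = select_by_field(store, "physics")
print([entry.pinyin for entry in physics_subset])
print(simplified(store[1]))
```

Keeping the field of use as an explicit attribute on every entry is what makes the selective retrieval a simple filter over the master store; a production system would more likely index the store by descriptor.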

Document Details

Document Type: Technical Report
Publication Date: Nov 01, 1968
Accession Number: AD0844684

Entities

People

  • Wayne L. Tosh
  • Winfred P. Lehmann

Organizations

  • University of Texas at Austin

Tags

DTIC Thesaurus Topics

  • Applied Computer Science
  • Computational Linguistics
  • Computational Science
  • Computers
  • Databases
  • Dictionaries
  • Lexicography
  • Machine Translation
  • Maintenance
  • Natural Language Processing
  • Translations

Readers

  • Computational Linguistics
  • Library and Information Science

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation