A Scheme for the Computerized Composition of Urdu Nastaliq.

Abstract

A scheme for the computerized composition of Urdu Nastaliq is described. A standardized character set was defined for the Urdu language. The script was decomposed into its constituent elements which were then defined and digitized in accordance with pattern recognition principles. An efficient mapping of Urdu characters to the standard QWERTY keyboard was achieved by matching character frequency to finger agility and by applying additional pattern recognition concepts. An analysis of finger work loads was then used to make final adjustments to the layout. A set of rules for combining characters was developed, by which a sizable portion of the Nastaliq script could be composed. Rules for certain exceptional cases were not implemented. A program for displaying well-formed Nastaliq using a Vax 11/780 and a VT-550 graphics terminal was devised; the scheme is described in hardware-independent terms. Samples of text thus obtained illustrate the true composition, style, and elegance of the Nastaliq script. Recommendations for expanding the existing software to handle the entire language are included. (Author)

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Dec 06, 1983
Accession Number
ADA138067

Entities

People

  • A. H. Kizilbash
  • R. C. Durbin

Organizations

  • Air Force Institute of Technology

Tags

Communities of Interest

  • Materials and Manufacturing Processes
  • Weapons Technologies

DTIC Thesaurus Topics

  • Air Force
  • Classification
  • Computer Programming
  • Computer Programs
  • Computers
  • Design Criteria
  • Engineering
  • Frequency
  • Graphics
  • Keyboards
  • Language
  • Pattern Recognition
  • Recognition
  • Security
  • Standardization
  • Standards
  • Word Processors

Readers

  • Computational Linguistics
  • Computer Science.
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation