Automatic Translation of English Text to Phonetics by Means of Letter-to-Sound Rules

Abstract

Speech synthesizers for computer voice output are most useful when not restricted to a prestored vocabulary. The simplest approach to unrestricted text-to-speech translation uses a small set of letter-to-sound rules, each specifying a pronunciation for one or more letters in some context. Unless this approach yields sufficient intelligibility, routine addition of text-to-speech translation to computer systems is unlikely, since more elaborate approaches embodying large pronunciation dictionaries or linguistic analysis require too much of the available computing resources. The work described here demonstrates the practicality of routine text-to-speech translation. A set of 329 letter-to-sound rules has been developed. These translate English text into the International Phonetic Alphabet (IPA), producing correct pronunciations for approximately 90% of the words in an average text sample. Most of the remaining 10% have single errors easily correctable by the listener. Another set of rules translates IPA into the phonetic coding for a particular commercial speech synthesizer. This report describes the technical approach used and the support hardware and software developed. It gives overall performance figures, detailed statistics showing the importance of each rule, and listings of a translation program and a program used in rule development.
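
The idea of a letter-to-sound rule, "a pronunciation for one or more letters in some context", can be illustrated with a small sketch. The Python below is not the report's translation program and does not reproduce any of its 329 rules: the `Rule` type, the five toy rules, the first-match-wins ordering, and the pass-through fallback are all assumptions made for illustration only.

```python
from typing import NamedTuple


class Rule(NamedTuple):
    """One hypothetical letter-to-sound rule (illustrative, not from the report)."""
    left: str   # text required immediately before the match ("" = any context)
    match: str  # letters this rule consumes
    right: str  # text required immediately after the match ("" = any context)
    sound: str  # phonetic output for the matched letters


# Toy rule list, tried in order; the first rule that fits wins (an assumption).
TOY_RULES = [
    Rule("", "ch", "", "tʃ"),
    Rule("", "sh", "", "ʃ"),
    Rule("", "c", "e", "s"),  # "c" before "e" sounds like "s"
    Rule("", "c", "i", "s"),  # "c" before "i" sounds like "s"
    Rule("", "c", "", "k"),   # any other "c" sounds like "k"
]


def translate(word: str, rules=TOY_RULES) -> str:
    """Scan left to right, applying the first rule whose letters and context fit."""
    word = word.lower()
    out = []
    i = 0
    while i < len(word):
        for rule in rules:
            end = i + len(rule.match)
            if (word.startswith(rule.match, i)
                    and word[:i].endswith(rule.left)
                    and word.startswith(rule.right, end)):
                out.append(rule.sound)
                i = end
                break
        else:
            # Placeholder fallback: no rule matched, so copy the letter unchanged.
            out.append(word[i])
            i += 1
    return "".join(out)


print(translate("cent"))  # sent
print(translate("cat"))   # kat
print(translate("chip"))  # tʃip
```

Because the toy list has no vowel rules, vowels pass through as plain letters; a usable rule set would need context-sensitive rules for every letter, which is where the bulk of a real rule inventory would go.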

Document Details

Document Type
Technical Report
Publication Date
Jan 21, 1976
Accession Number
ADA021929

Entities

People

  • Astrid Mchugh
  • Honey S. Elovitz
  • John E. Shore
  • Rodney W. Johnson

Organizations

  • United States Naval Research Laboratory

Tags

Communities of Interest

  • Weapons Technologies

DTIC Thesaurus Topics

  • Alphabets
  • Automatic
  • Coding
  • Computer Programming
  • Computers
  • Conversion
  • Databases
  • Dictionaries
  • Frequency
  • Intelligibility
  • Language
  • Linguistics
  • Numbers
  • Phonemes
  • Phonetics
  • Symbols
  • Vowels

Readers

  • Artificial Intelligence
  • Computer Science
  • Library and Information Science