AUTOMATIC SPEECH RECOGNITION: EXPERIMENTS WITH A RECOGNISER USING LINGUISTIC STATISTICS

Abstract

The problem of transmitting speech over communication channels with smaller information-carrying capacity than that of conventional telephone links is discussed. Bandwidth compression systems using articulatory constraints (vocoders) are described and this is followed by a description of devices that analyse the speech sound wave in terms of linguistic units - machines performing this task are called automatic speech recognisers. Bandwidth economy can be achieved by recognising and transmitting these linguistic units. The difficulties of automatic recognition are discussed and its processes compared with the human mechanism for speech recognition. It is suggested that, just as in human speech recognition, the performance of an automatic recogniser could be improved by using information about the statistics and the structure of the language as well as the usual acoustic cues. The design and construction of a phoneme recogniser for putting this idea to the test is described. The machine has three parts: (1) the acoustic recogniser for detecting some simple phonemic cues, (2) stored knowledge about the digram frequencies of these phonemes, and (3) a device for selecting the phoneme that is most likely to occur in the light of both acoustic information and of the relevant digram frequencies. The selection is indicated on a typewriter.

Open PDF

Document Details

Document Type
Technical Report
Publication Date
Sep 01, 1960
Accession Number
AD0255978

Entities

People

  • P. Denes

Organizations

  • University College London

Tags

Communities of Interest

  • Advanced Electronics
  • Energy and Power Technologies

DTIC Thesaurus Topics

  • Acoustic Propagation
  • Acoustic Waves
  • Acoustics
  • Air Force
  • Automated Speech Recognition
  • Bandwidth
  • Communication Channels
  • Frequency
  • Frequency Bands
  • Language
  • Recognition
  • Recording Systems
  • Resonant Circuits
  • Sound Waves
  • Speech Transmission
  • Telephone Systems
  • Word Lists

Readers

  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation