Experiments on the Annotation of Large Quantities of Speech Using the 'ZIP' Dynamic Time-Warping Algorithm.

Abstract

Research into automatic speech recognition requires large amounts of speech data. This data needs to be carefully annotated (i.e. segmented and labelled) before controlled experiments can be carried out. For example, the researcher may need to know the identity and location of each word in the database. Because of the large amount of data involved it is desirable to have some way of automatically performing this annotation process. Previous experience has demonstrated that a class of algorithms based on dynamic programming can be used to accurately align the timescales of two examples of the same word. An extension of this technique, a dynamic programming algorithm called ZIP, has been shown to be capable of aligning longer examples of speech. This report investigates the feasibility of using the ZIP algorithm as the basis of an automatic annotation system. Experiments are conducted into the possible choice of algorithm parameters and a set of optimal parameters is described. The results obtained demonstrate that this is a viable approach. Methods of automatically determining the success of the algorithm are also considered. (Great Britain)

Document Details

Document Type
Technical Report
Publication Date
Jan 01, 1986
Accession Number
ADA168221

Entities

People

  • S. M. Peeling

Organizations

  • Royal Signals and Radar Establishment

Tags

DTIC Thesaurus Topics

  • Algorithms
  • Automated Speech Recognition
  • Automatic
  • Computer Programming
  • Databases
  • Dynamic Programming
  • Heuristic Methods
  • Identities
  • Mathematics
  • Recognition
  • Segmented

Fields of Study

  • Computer science

Readers

  • Computational Fluid Dynamics (CFD)
  • Computer Vision.
  • Speech Processing/Speech Recognition.

Technology Areas

  • AI & ML
  • AI & ML - Information Retrieval
  • AI & ML - Machine Learning Algorithms
  • AI & ML - Machine Translation