Text-To-Speech Phrasing Enhancement System Using Neural Networks.

Abstract

Much progress has been made in computer text-to-speech systems over the past several years. In particular, Macintosh computer systems now provide the PlainTalk Text-To-Speech synthesizer, which can use high-quality voices with various attributes to convert text to synthesized speech. Although these new voices and the new synthesizer are great improvements over the previous MacinTalk system, the synthesized speech still sounds far from natural. One attribute that could add greatly to the naturalness of the speech is improved phrasing. The PlainTalk Text-To-Speech synthesizer provides the means to embed speech commands within text to modify the spoken output. The purpose of this project is to build a neural network that, through supervised learning, produces an algorithm for embedding pitch controls in text, yielding more natural-sounding emphasis variations in the spoken output.
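
The idea of embedding pitch controls in text can be illustrated with a minimal sketch. It is not the report's method: the function name `embed_pitch_commands` and the per-word `pitch_offsets` list (standing in for the output of a trained network) are hypothetical, and the `[[pbas ...]]` form is assumed from Apple's Speech Manager embedded-command syntax, where a signed value adjusts the pitch baseline relative to its current value. The command set actually used in the report is not stated in this abstract.

```python
def embed_pitch_commands(words, pitch_offsets):
    """Wrap words in relative pitch-baseline commands.

    words: list of tokens.
    pitch_offsets: one integer per word (hypothetical model output).
    A nonzero offset is applied before the word and undone after it,
    so the baseline does not drift across the sentence.
    """
    if len(words) != len(pitch_offsets):
        raise ValueError("need exactly one pitch offset per word")
    parts = []
    for word, offset in zip(words, pitch_offsets):
        if offset == 0:
            parts.append(word)
            continue
        # Assumed command form: [[pbas +N]] raises, [[pbas -N]] lowers.
        parts.append(f"[[pbas {offset:+d}]]")
        parts.append(word)
        parts.append(f"[[pbas {-offset:+d}]]")
    return " ".join(parts)


marked = embed_pitch_commands(["This", "is", "important"], [0, 0, 3])
print(marked)
```

Undoing each offset immediately after the word is one possible design choice; a system that shapes pitch over whole phrases would presumably emit commands at phrase boundaries instead.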

Document Details

Document Type: Technical Report
Publication Date: Aug 01, 1995
Accession Number: ADA299988

Entities

People

  • Louise F. Julig

Organizations

  • Naval Command, Control and Ocean Surveillance Center

Tags

DTIC Thesaurus Topics

  • Algorithms
  • Applied Computer Science
  • Artificial Intelligence
  • Artificial Intelligence Computing
  • Artificial Intelligence Software
  • Computational Processes
  • Computer Programs
  • Computer Science
  • Computers
  • Computing-Related Activities
  • Data Science
  • Embedding
  • Learning
  • Neural Networks
  • Supervised Machine Learning

Fields of Study

  • Computer science

Readers

  • Computer Science/Computer Engineering/Data Science/Digital Signal Processing
  • Speech Processing/Speech Recognition
  • Systems Analysis and Design

Technology Areas

  • AI & ML
  • AI & ML - Machine Translation
  • AI & ML - Neural Networks