Text-To-Speech Phrasing Enhancement System Using Neural Networks.

Abstract

Much progress has been made in computer text-to-speech systems over the past several years. In particular, the Macintosh computer systems now provide the PlainTalk Text-To-Speech synthesizer which is capable of using high quality voices with various attributes to convert text to synthesized speech. Though these new voices and speech synthesizer are great improvements over the previous Macintalk system the synthesized speech still sounds far from natural. One attribute which could add greatly to the naturalness of the speech is improved phrasing. The PlainTalk Text-To-Speech synthesizer provides the means to embed speech commands within text to modify the spoken output. The purpose of this project is to build a neural network which through supervised learning will produce an algorithm for embedding pitch controls in text which will produce more natural sounding emphasis variations for the spoken output.

Open PDF

Document Details

Document Type: Technical Report
Publication Date: Aug 01, 1995
Accession Number: ADA299988

Entities

People

Louise F. Julig

Organizations

Naval Command, Control and Ocean Surveillance Center

Text-To-Speech Phrasing Enhancement System Using Neural Networks.

Abstract

Document Details

Entities

People

Organizations

Tags

DTIC Thesaurus Topics

Fields of Study

Readers

Technology Areas