Text-To-Speech Phrasing Enhancement System Using Neural Networks.
Abstract
Computer text-to-speech systems have progressed considerably over the past several years. In particular, Macintosh computer systems now provide the PlainTalk Text-To-Speech synthesizer, which can use high-quality voices with various attributes to convert text to synthesized speech. Although these new voices and the new speech synthesizer are great improvements over the previous Macintalk system, the synthesized speech still sounds far from natural. One attribute that could add greatly to the naturalness of the speech is improved phrasing. The PlainTalk Text-To-Speech synthesizer provides a means of embedding speech commands within text to modify the spoken output. The purpose of this project is to build a neural network that, through supervised learning, will produce an algorithm for embedding pitch controls in text, yielding more natural-sounding emphasis variations in the spoken output.
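The idea of embedding pitch controls in text can be illustrated with a minimal sketch. It is not the report's method: the function name `embed_pitch_commands`, the default pitch value, and the per-word pitch targets are all hypothetical stand-ins for whatever a trained network would output. The `[[pbas N]]` form is assumed here to be the synthesizer's embedded baseline-pitch command, and the numeric values are illustrative only.

```python
def embed_pitch_commands(words, pitches, default_pitch=46.0):
    """Interleave pitch commands with words wherever the target pitch changes.

    words         -- list of word strings
    pitches       -- one target pitch per word (hypothetical network output)
    default_pitch -- assumed starting pitch of the voice (illustrative value)
    """
    if len(words) != len(pitches):
        raise ValueError("words and pitches must have the same length")

    parts = []
    current = default_pitch
    for word, pitch in zip(words, pitches):
        # Emit a command only when the pitch differs from the one in effect.
        if pitch != current:
            parts.append(f"[[pbas {pitch:g}]]")
            current = pitch
        parts.append(word)

    # Restore the default so later text is not affected.
    if current != default_pitch:
        parts.append(f"[[pbas {default_pitch:g}]]")
    return " ".join(parts)


if __name__ == "__main__":
    sentence = ["I", "never", "said", "that"]
    targets = [46, 52, 46, 46]  # raise the pitch on "never" only
    print(embed_pitch_commands(sentence, targets))
```

In this sketch the learning problem reduces to predicting one pitch target per word. The string assembly is deliberately trivial, so any supervised model that maps words to pitch values could be substituted without changing it.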
Document Details
- Document Type: Technical Report
- Publication Date: Aug 01, 1995
- Accession Number: ADA299988
Entities
People
- Louise F. Julig
Organizations
- Naval Command, Control and Ocean Surveillance Center