Spectro-Temporal Modulation Transfer Functions and Speech Intelligibility
Abstract
The most obvious feature of a speech spectrogram is the energy modulations, both in time in any given frequency channel, and along the spectral axis at any instant, due to formant peaks and their transitions, spectral edges, and rapid amplitude modulations at onsets/offsets. These modulations occur at relatively slow temporal rates (few Hz) reflecting the speed of the articulatory gestures, and hence the phonetic and syllabic rates of speech. Speech intelligibility is critically dependent on the clarity of these spectro-temporal modulations. Thus speech reconstructed from smoothed spectrograms along either dimension suffers from progressive loss of intelligibility.
Document Details
- Document Type
- Technical Report
- Publication Date
- Jan 01, 1999
- Accession Number
- ADA439776
Entities
People
- Matthew C. Guyton
- Powen Ru
- Shihab A Shamma
- Taishih Chi
- Yujie Gao
Organizations
- University of Maryland