Low Cost Voice Synthesis



All Ears for Type 'n Talk by Gordon Mc Comb, published in Creative Computing year ? page 148-152

  1. A Few Words About Voice synthesis
    1. There are three major ways to give your computer a voice. One system, pioneered by Texas Instruments, is called Linear Predictive Coding (LPC) where an announcer speaks into a microphone connec ted to a computer. The computer digitizes and condenses the speech and stores it in memory. That memory can be duplicated and inserted in a finished product. In this way, calculators, toys and other electronic devices can be given voices-but can only speak those words originally recorded. Words can be combined to create complete phrases, such as "six times six equals thirty-six." Each word is recorded separately. A small computer inside the device picks out the proper sequence of words and strings them together.
    2. Another method used by many computer hobbyists is similar to LPC, but allows the user to speak into a microphone and digitize his own voice. His voice is then "recorded" in memory, and recalled from the computer at will. Super Talker from Mountain Computer provides this type of speech capability for the Apple.
    3. Phoneme-based synthesis is perhaps the truest form of voice synthesis. It creates words by imitating the sounds produced by the human vocal tract. In this way, words, phrases and even singing can be produced without the need to prerecord or digitize speech. One such unit was built for the Radio Shack TRS-80. However, it takes several hours to input a page or two of text even for the most experienced operator. The first synthesizers of this kind were built by Bell Laboratories in the 50's. (The first working model is featured on the Philadetphia Computer Music Festival LP record; $6 from Creative Computing.) Later, less elaborate commercial versions of this type of synthesizer were shown widely in the early 70's, however, all suffered from a lack of inflection. This gave them a decided Scandinavian or Eastern European accent and did not contribute to their widespread acceptance. Text-to-speech synthesis. still in its infancy, eliminates the tedious programming of the phoneme-based synthesizer. Text is typed into a computer and is translated by a built-in language interpreter. The translator has been programmed to correct for the majority of pronunciation variances inherent in our language.-DKA