Text to Speech Basics

Text to Speech Empowers Authors

Today’s text-to-speech (TTS) voices let authors add professional voice-over narration to their presentations themselves, quickly and easily, without recording through a microphone.

Advances in text to speech technology have replaced the old robotic computer voices with new, amazingly natural and realistic ones.

Synthesized from real voice talents, these remarkable text to speech voices can read books aloud beautifully without a mistake, guided only by grammar, sentence structure and punctuation.

The exciting news is that these articulate text to speech voices have now been harnessed by Tuval Software Industries’ Speech-Over™ to add narration to PowerPoint presentations.

Speech-Over accepts user narration text and launches text-to-speech voices from within PowerPoint to record professional narrations from the text alone.

Change the narration text as often as you need and these tireless voices record new versions quickly and faithfully without complaint.

Availability of Text-to-Speech Voices

Text-to-speech voices are separate computer applications. They are available as male and female voices, in all major languages, and in various regional dialects. Many vendors offer text to speech voices, including AT&T, NeoSpeech, Cepstral, Acapela-Group and Nuance.

Nowadays, TTS voices are embedded in many types of applications, such as book-reading, GPS and kiosk software. Speech-Over is another example of software with embedded text to speech voices. It recognizes any TTS voice that conforms to SAPI 5 (Microsoft's Speech API, version 5), a standard that most TTS voices adhere to.

Tuval Software has a special arrangement to bundle Acapela TTS voices with Speech-Over software. The result is Speech-Over Professional, which has all you need to create professional narration in presentations. It is available in all major languages.

Licensing

Using text to speech voices requires a license. Low-cost TTS licenses are available for personal use, such as reading books. However, when TTS voices are used in a commercial or corporate environment, an audio-distribution license is generally required, and these licenses are more expensive.

The Acapela voices in Speech-Over Professional are provided with a commercial audio-distribution license included in the price.

What is Text-to-Speech?

Text to speech is the automated synthesis of speech from text. The heart of the system is the text to speech engine – a sophisticated piece of software that:

parses the text input,
analyzes its grammar, sentence structure, punctuation and capitalization, and
activates voice simulations to produce a vocal rendering of the text.

The data for individual voices, including regional accents, are provided in separate files called "voices". The text to speech engine can work with any of the voices interchangeably.
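
The three engine steps above, with the voice kept separate from the engine, can be illustrated with a small Python sketch. It is a toy under stated assumptions, not how any commercial engine works: the names `Sentence`, `parse` and `render` are invented here, the analysis looks only at closing punctuation, and the "voice" is just a label attached to the output rather than real audio.

```python
import re
from dataclasses import dataclass
from typing import List


@dataclass
class Sentence:
    words: List[str]  # lowercase word tokens
    kind: str         # "statement", "question" or "exclamation"


# Hypothetical mapping from closing punctuation to an intonation label.
_KIND = {".": "statement", "?": "question", "!": "exclamation"}


def parse(text: str) -> List[Sentence]:
    """Split text into sentences and label each by its closing punctuation."""
    sentences = []
    # Each match is a run of non-terminator characters plus an optional terminator.
    for body, end in re.findall(r"([^.?!]+)([.?!]?)", text):
        words = re.findall(r"[a-z']+", body.lower())
        if words:
            sentences.append(Sentence(words, _KIND.get(end, "statement")))
    return sentences


def render(sentences: List[Sentence], voice: str) -> List[str]:
    """Stand-in for synthesis: describe what the chosen voice would speak."""
    return [f"{voice}: {' '.join(s.words)} ({s.kind})" for s in sentences]


parsed = parse("Hello there. How are you?")
for line in render(parsed, "Voice A"):
    print(line)
```

Because `render` takes the voice as a parameter, the same parsed text can be passed to a different voice without repeating the analysis, which mirrors the interchangeability described above.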

Improved Text To Speech Technology

Today’s text to speech technology is much improved over that of even a few years ago. The older systems, which produced the robotic sounds that people tend to associate with computer voices, used the parametric, or formant, synthesis method to simulate the acoustic properties of speech.

See our posts on Text to Speech in eLearning Technology:

1. TTS Overview and NLP Quality
2. Digital Signal Processor and TTS
3. Using TTS in an eLearning Course
4. TTS eLearning Tools - Integrated Products
5. TTS vs Human Narration for eLearning
6. Using Punctuation and Mark-Up Language to Increase TTS Quality
7. TTS Examples
8. TTS Costs – Licensing and Pricing

Uses Real Voices

Recently, voices that use the concatenation method have become commercially available. In this method, recordings of a real human speaker are divided into phonemes, which are stored in the voice file. When an application runs, the text to speech engine assembles the phonemes according to the input text, so that the original human voice speaks the text. Because a real human voice is used, it is sometimes hard to tell the synthesized speech from the real thing.
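
The assembly step can be sketched in a few lines of Python. This is a deliberately simplified illustration: the `VOICE` table, its unit names and its sample values are made up, and a production engine would also choose among many recorded variants of each unit and smooth the joins between them.

```python
# Hypothetical voice file: each unit name maps to a short list of audio samples.
VOICE = {
    "h": [0.1, 0.2],
    "ay": [0.3, 0.4, 0.5],
    "_": [0.0],  # short silence between words
}


def synthesize(units, voice):
    """Join the recorded samples for each unit, in order, into one waveform."""
    waveform = []
    for unit in units:
        if unit not in voice:
            raise KeyError(f"voice has no recording for unit {unit!r}")
        waveform.extend(voice[unit])
    return waveform


# "hi", a pause, then "hi" again, built purely from stored recordings.
samples = synthesize(["h", "ay", "_", "h", "ay"], VOICE)
print(len(samples))
```

Swapping in a different `voice` table changes the speaker without changing `synthesize`, which is the same separation between engine and voice file described earlier.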

Text To Speech Applications

Text to speech technology can help businesses save time and money, especially when compared to the alternative of using pre-recorded speech files. With a text to speech system, the cost per minute of audio track is less than half that of studio-based recording and synchronization. The savings are especially significant in the following cases:

When the text can change – with text to speech, changes are made by simple text editing rather than expensive studio re-recording
When the application is multilingual – with text to speech, you simply switch to a voice in the desired language
When you need to meet deadlines – the text to speech system is available any time, any place

Knowledge Transfer Applications

Presentations whose purpose is to transfer knowledge are well suited to text to speech technology. They are usually given to people who need to receive and understand the information being presented. Training presentations and corporate communications are typical examples in which text to speech voices are quite acceptable.

Speech-Over Professional TTS Platform

Tuval Software Industries’ Speech-Over Professional is an efficient platform for integrating the text to speech system with PowerPoint.

Among the features and benefits offered by Speech-Over Professional:

Embedded text-to-speech engine
Speech editing for quick and easy entry and editing of speech text, including SAPI voice modulation tags, and association of speech with shapes on the screen. Text can be typed in or dictated.
Integration of multiple voices in a presentation to make it more interesting and varied
Alternative narrations of a presentation for different audiences, including in different languages.
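
As an illustration of the voice modulation tags mentioned in the list above, the Python sketch below builds narration text using the `<rate>`, `<emph>` and `<silence>` tags from the SAPI 5 XML TTS markup. The helper functions are hypothetical and are not part of Speech-Over; whether a given voice honors each tag, and whether standard XML escaping of the text is required, are assumptions to verify against the voice vendor's documentation.

```python
from xml.sax.saxutils import escape


def plain(text):
    """Escape characters that would otherwise be read as markup (assumed XML rules)."""
    return escape(text)


def rate(markup, speed):
    """Wrap markup in a <rate> tag; speed is relative, from -10 (slow) to 10 (fast)."""
    if not -10 <= speed <= 10:
        raise ValueError("speed must be between -10 and 10")
    return f'<rate speed="{speed}">{markup}</rate>'


def emph(markup):
    """Wrap markup in an <emph> tag to request emphasis."""
    return f"<emph>{markup}</emph>"


def pause(msec):
    """Insert a silence of the given length in milliseconds."""
    return f'<silence msec="{msec}"/>'


narration = (
    plain("Welcome to the course.")
    + pause(500)
    + rate(emph(plain("Please listen carefully.")), -2)
)
print(narration)
```

Building the tags through small functions keeps them balanced and makes it harder to mistype an attribute name while editing narration text by hand.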
