Embedded Windows CE 5.0 Text To Speech Synthesis Kit using Labx86 SBC
The Embedded Windows CE SAPI 5.0 TTS Developer's Kit is a complete embedded speech synthesis (text-to-speech) circuit solution for developing speech synthesis systems at the electronics level. The design is based on Festival TTS and has been ported to Windows CE, Pocket PC, Smartphone, and Symbian OS for Nokia Series 80 and above. With a memory overhead of 1 MB for narrating one sentence, the engine suits developers who want to port the advanced research behind Festival TTS to the embedded world.
Shipped Hardware Kit Includes:
- PC/104
- VGA - AGP, resolution 1,280 x 1,024, true color
- LAN - Realtek 10/100 Mbps
- Audio - AC97
- Flash Disk - PQI DiskOnModule
- SOC -
- CPU 466 MHz
- Real Time Clock with Lithium Battery
- Labx86 Single Board Computer
Dimensions: 3.5" x 3.7"
Weight: 112 g
- 128 MB RAM
- Voltage: +5 V (@ 920 mA)
- Windows CE 5.0/CE 4.2
- Speech Development Software
- Printed Manual of APIs
- SAPI - Interface Samples
- External Electronic Circuit Samples for:
  - Biometrics for User Identification
  - Speaker Verification with passwords
  - Mobile Data Security
  - Chat/Login
  - Vehicular Systems Commands for Security
  - Porting International Languages
- Connectors and screwdrivers.
Festival TTS enables you to create production-level speech synthesis software with the following tools (please read the relevant copyright notices):
Tools
Available: A complete multilingual speech synthesis workbench for research
Ported to Embedded: Edinburgh Speech Tools Library
Ported Open DSP Library: libsnd.dll
CMUdict -- pronunciation dictionary
Ported
OpenVXI -- VoiceXML browser
SALT browser - finally online!
Audio Databases -- AN4, Microphone array, etc
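As a rough illustration of how the CMUdict pronunciation dictionary listed above can feed a TTS front end, the sketch below parses a few entries written in the plain-text CMUdict layout (a headword followed by ARPAbet phones, with alternate pronunciations marked as `WORD(1)`). The sample lines and the function names `parse_cmudict` and `strip_stress` are assumptions made for this example and are not part of the kit's API.

```python
import re

# Sample lines in the plain-text CMUdict layout (assumed for this sketch):
# comment lines start with ";;;", each entry is a headword plus ARPAbet phones.
SAMPLE = """\
;;; sample entries only, not the full dictionary
HELLO  HH AH0 L OW1
SPEECH  S P IY1 CH
READ  R EH1 D
READ(1)  R IY1 D
"""

# Matches the "(1)", "(2)", ... suffix that marks an alternate pronunciation.
_VARIANT = re.compile(r"\(\d+\)$")


def parse_cmudict(text):
    """Return {word: [phone list, ...]}, grouping alternate pronunciations."""
    entries = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith(";;;"):
            continue  # skip blank lines and comments
        head, *phones = line.split()
        word = _VARIANT.sub("", head).lower()
        entries.setdefault(word, []).append(phones)
    return entries


def strip_stress(phones):
    """Drop the trailing stress digit (0, 1 or 2) from each vowel phone."""
    return [p.rstrip("012") for p in phones]


if __name__ == "__main__":
    lexicon = parse_cmudict(SAMPLE)
    for word, variants in lexicon.items():
        print(word, [" ".join(v) for v in variants])
```

Keeping every variant in a list leaves the choice between pronunciations (for example the two readings of "read") to a later stage of the synthesizer.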
Recommended Dictionary Resources for TTS:
The DICT Development Group: Clients for the RFC 2229 dictionary protocol.
Encyclopedia Britannica: Just like the twenty-book set, but it fits in your web browser.
Hypertext Webster Gateway: Search engine spanning multiple dictionaries.
IEEE Keywords: List of approved IEEE keywords for indexing publications.
Merriam-Webster Online: Perhaps the best on-line dictionary available.
Roget's Thesaurus: The all-in-one desktop reference - search the web, a dictionary, and Roget's Thesaurus.
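For the RFC 2229 dictionary protocol mentioned in the list above, the following sketch shows how a client might build a DEFINE command and pull the definition text out of a server reply before handing it to a speech engine. It does no networking: the reply in `SAMPLE_REPLY` is a made-up example in the RFC 2229 response layout, and the helper names `define_command` and `parse_define_response` are hypothetical.

```python
# Made-up server reply in the RFC 2229 layout: 150 announces the count,
# 151 starts one definition, a lone "." ends its text, 250 closes the reply.
SAMPLE_REPLY = (
    '150 1 definitions retrieved\r\n'
    '151 "speech" wn "WordNet"\r\n'
    'speech\r\n'
    '  n 1: the act of delivering a formal spoken communication\r\n'
    '.\r\n'
    '250 ok\r\n'
)


def define_command(word, database="*"):
    """Build a DEFINE command line; "*" asks for matches in all databases."""
    escaped = word.replace("\\", "\\\\").replace('"', '\\"')
    return f'DEFINE {database} "{escaped}"\r\n'


def parse_define_response(raw):
    """Return the definition texts found in a DEFINE reply (empty if none)."""
    definitions = []
    current = None  # lines of the definition being read, or None between them
    for line in raw.splitlines():
        if current is not None:
            if line == ".":
                definitions.append("\n".join(current))
                current = None
            else:
                # Undo dot-stuffing: a leading ".." stands for a single "."
                current.append(line[1:] if line.startswith("..") else line)
        elif line.startswith("151"):
            current = []
        elif line.startswith("552"):
            return []  # 552 means no match was found
    return definitions


if __name__ == "__main__":
    print(define_command("speech"), end="")
    for text in parse_define_response(SAMPLE_REPLY):
        print(text)
```

In a real client the command would be sent over a TCP connection to a DICT server (port 2628 by default), and the returned text could then be passed to the synthesizer.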
Possible Directories
555-1212: look up a telephone area code.
AnyWho: look up any address by its telephone number.
CEOExpress: highly informational site catering to business professionals.
MapQuest: get directions anywhere in the US.
United States Postal Service: look up a zip code.
General Databases
CMU ARCTIC, four single-speaker speech databases with around 1,200 phonetically balanced utterances.
CMU FAF, 107 paragraphs (15,000 words) of single-speaker monologues with interesting prosody. Based on Aesop's fables and country descriptions from the CIA World Factbook.
CMU SIN, speech in noise: speech recorded while noise is playing in the speaker's ears (and when not).
CSTR US KED TIMIT, University of Edinburgh's male US TIMIT, 452 phonetically balanced utterances.
Limited Domain Databases
Diphone Databases
MBROLA voices and binaries
Check out the MBROLA project's wide range of pre-built diphone databases for many languages, and binaries of the mbrola program itself for many platforms.
MBROLA page in Belgium, with a cute HTTP front end for copying.