TEXT-SPEECH PPT.pptx

7TH SEMESTER SEMINAR

NAME:NADIMINTI SAROJA KUMAR ROLL NUMBER:12EEE032 REGISTRATION NUMBER:1201210503 BRANCH:ELECTRICAL & ELECTRONICS ENGINEERING(EEE)

27TH JULY 2K15 1

TEXT TO VOICE SYNTHESIS

CONTENTS:-

1. Introduction to speech synthesis2. Disadvantages of braille system3. Introduction to voice stick device4. Working principle5. Advantages6. Applications7. Further research & development


27TH JULY 2K15

INTRODUCTION TO SPEECH SYNTHESYS:-

• What is speech synthesis?– Computer technology that 'constructs' human speech

from electronic circuits to replace pre-recorded human voice.

• What is the task?– Generating natural sounding speech on the fly, usually

from text.– It is used to translate written information into aural

information where it is more convenient.• What are the main difficulties?

– What to say and how to say.


27TH JULY 2K15

http://www.businessdictionary.com/definition/computer.html

http://www.businessdictionary.com/definition/technology.html

http://www.businessdictionary.com/definition/construct.html

http://www.businessdictionary.com/definition/electronic.html

http://www.businessdictionary.com/definition/circuit.html


27TH JULY 2K15 2

Disadvadvantages of Braille system: • Errors cannot be erased.

• It is more costly.• Non availability of each type of books.

• Not possible to make articles, newspapers etc.• Cannot be read by a sighted person who has not learned it.


27TH JULY 2K15 3

INTRODUCTION TO VOICE STICK DEVICE:-➢ It is a text scanning device for the visually impaired

people.➢ The stick when scanned in the printed letters, the

OCR function recognizes the text and converts the information into voice.

➢ The voice is then read back and thus helping the visually challenged person.➢ It can read books, e-mails, atms, etc. with a perfect sound.


27TH JULY 2K15 4

Working principle:-➢ The speech synthesis is often known as text to speech (TTS)

system.

➢It usually consist of two parts: ▪First it takes the raw text and converts latters, numbers etc into their

written-out word equivalents. This process is often called text normalization, pre-processing, or tokenization.▪Then it assigns phonetic transcriptions to each word, and divides and

marks the text into various linguistic units like phrases, clauses, and sentences.▪In second it takes the symbolic linguistic representation and converts it

into actual sound output.


27TH JULY 2K157

Text-to-phoneme moduleArchitecture of TTS systems:

Text input

Grapheme-to-phoneme

conversion

Prosodic modelling

Acoustic synthesis

Abbreviation lexicon

Text in orthographic formExceptions

lexicon

Orthographic rules

Phoneme string

Normalization

Grammar rulesPhoneme string +

prosodic annotation

Prosodic model

Synthetic speech output

Phoneme-to-speech module

Various methods


27TH JULY 2K15 5

o Easy to operate.

o Provides nearly natural sound.

o More accuracy in medical systems.

o It reduces the human effort in the case of any application.

o It provides talking machines for vocally impaired or deaf

people and better aids for speech therapy.

ADVANTAGES:-


27TH JULY 2K15 6

APPLICATIONS:-✓Speech synthesis walking device for blind.✓Automatic reading of computer screen.✓Voice operating mode in smart phones.✓Voice controlled vehicle.✓Railway announcement. ✓Robotics. etc


27TH JULY 2K15 7

RESEARCH & DEVELOPMENT:-


27TH JULY 2K15 10

REFERENCES:-http://www.microsoft.com/msagent/downloads/user.asp

http://www.bytecool.com/voices.htm

http://www.digitalfuturesoft.com/texttospeechproducts.php

http://www.neospeech.com/product/technologies/tts.php http://nextup.com/TextAloud/SpeechEngine/voices.html#morefreevoices

http://www.microsoft.com/msagent/downloads/user.asp

http://www.bytecool.com/voices.htm

http://www.digitalfuturesoft.com/texttospeechproducts.php

http://www.neospeech.com/product/technologies/tts.php

http://nextup.com/TextAloud/SpeechEngine/voices.html#morefreevoices

http://nextup.com/TextAloud/SpeechEngine/voices.html#morefreevoices


27TH JULY 2K15 12

Documents

TEXT-SPEECH PPT.pptx