Upload
nsaroj-kumar
View
178
Download
3
Embed Size (px)
Citation preview
7TH SEMESTER SEMINAR
NAME:NADIMINTI SAROJA KUMAR ROLL NUMBER:12EEE032 REGISTRATION NUMBER:1201210503 BRANCH:ELECTRICAL & ELECTRONICS ENGINEERING(EEE)
27TH JULY 2K15 1
TEXT TO VOICE SYNTHESIS
CONTENTS:-
1. Introduction to speech synthesis2. Disadvantages of braille system3. Introduction to voice stick device4. Working principle5. Advantages6. Applications7. Further research & development
7TH SEMESTER SEMINAR
27TH JULY 2K15
INTRODUCTION TO SPEECH SYNTHESYS:-
• What is speech synthesis?– Computer technology that 'constructs' human speech
from electronic circuits to replace pre-recorded human voice.
• What is the task?– Generating natural sounding speech on the fly, usually
from text.– It is used to translate written information into aural
information where it is more convenient.• What are the main difficulties?
– What to say and how to say.
7TH SEMESTER SEMINAR
27TH JULY 2K15
7TH SEMESTER SEMINAR
27TH JULY 2K15 2
Disadvadvantages of Braille system: • Errors cannot be erased.
• It is more costly.• Non availability of each type of books.
• Not possible to make articles, newspapers etc.• Cannot be read by a sighted person who has not learned it.
7TH SEMESTER SEMINAR
27TH JULY 2K15 3
INTRODUCTION TO VOICE STICK DEVICE:-➢ It is a text scanning device for the visually impaired
people.➢ The stick when scanned in the printed letters, the
OCR function recognizes the text and converts the information into voice.
➢ The voice is then read back and thus helping the visually challenged person.➢ It can read books, e-mails, atms, etc. with a perfect sound.
7TH SEMESTER SEMINAR
27TH JULY 2K15 4
Working principle:-➢ The speech synthesis is often known as text to speech (TTS)
system.
➢It usually consist of two parts: ▪First it takes the raw text and converts latters, numbers etc into their
written-out word equivalents. This process is often called text normalization, pre-processing, or tokenization.▪Then it assigns phonetic transcriptions to each word, and divides and
marks the text into various linguistic units like phrases, clauses, and sentences.▪In second it takes the symbolic linguistic representation and converts it
into actual sound output.
7TH SEMESTER SEMINAR
27TH JULY 2K157
Text-to-phoneme moduleArchitecture of TTS systems:
Text input
Grapheme-to-phoneme
conversion
Prosodic modelling
Acoustic synthesis
Abbreviation lexicon
Text in orthographic formExceptions
lexicon
Orthographic rules
Phoneme string
Normalization
Grammar rulesPhoneme string +
prosodic annotation
Prosodic model
Synthetic speech output
Phoneme-to-speech module
Various methods
7TH SEMESTER SEMINAR
27TH JULY 2K15 5
o Easy to operate.
o Provides nearly natural sound.
o More accuracy in medical systems.
o It reduces the human effort in the case of any application.
o It provides talking machines for vocally impaired or deaf
people and better aids for speech therapy.
ADVANTAGES:-
7TH SEMESTER SEMINAR
27TH JULY 2K15 6
APPLICATIONS:-✓Speech synthesis walking device for blind.✓Automatic reading of computer screen.✓Voice operating mode in smart phones.✓Voice controlled vehicle.✓Railway announcement. ✓Robotics. etc
7TH SEMESTER SEMINAR
27TH JULY 2K15 7
RESEARCH & DEVELOPMENT:-
7TH SEMESTER SEMINAR
27TH JULY 2K15 10
REFERENCES:-http://www.microsoft.com/msagent/downloads/user.asp
http://www.bytecool.com/voices.htm
http://www.digitalfuturesoft.com/texttospeechproducts.php
http://www.neospeech.com/product/technologies/tts.php http://nextup.com/TextAloud/SpeechEngine/voices.html#morefreevoices
7TH SEMESTER SEMINAR
27TH JULY 2K15 12