18
Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Embed Size (px)

Citation preview

Page 1: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Group membersLuke Makischuk

Abderahmane Sebaa Ameneh Sadat Yazdaninik

Asma Faizi

Professor: R HabashTA: Wei Yang

Page 2: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Introduction The purpose of our project was to implement an efficient

voice recognition algorithm and improve upon the idea’s from the IEEE paper:

“Speech Recognition and Its Application in Voice-based Robot Control System” by Luo Zhizeng and Zhao Jinghing, August 2004

In the article, they used a technique called pattern-comparison to implement the voice recognition which is very complex.

We came up with a simpler way to implement the voice recognition using Matlab and we also expanded more on how you would go about controlling the robot using Bluetooth technology along with motor control of the robot.

Page 3: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

References “Speech Recognition and Its Application in Voice-based Robot

Control System” by Luo Zhizeng and Zhao Jinghing, August 2004

“A dynamic-time-warp integrated circuit for a 1000-word speech recognition system” Kavaler, R.A.; Lowy, M.; Murveit, H.; Brodersen, R.W, Volume 22,  Issue 1,  Feb 1987

“Significance of the Modified Group Delay Feature in Speech Recognition” Rajesh M. Hegde; Hema A. Murthy; Venkata Ramana Rao Gadde; Volume 15,  Issue 1,  Jan. 2007

Lawrence Rabiner, Biing-Hwangluang, Fundomenrols Of Speech Recognition, Qinghua Press, 1999

http://cslu.cse.ogi.edu/HLTsurvey/ch1node7.html

Page 4: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Work distribution

Asma and Ameneh -Voice recognition research/algorithm and report writeup

Abderahmane and Luke-Matlab coding

Page 5: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Voice recognition• All done using Matlab

Recognized voice commands: Stop Turn Left Turn Right Move Forward Move Backwards

• Only works with one person’s voice but can easily be trained to work with anyone's voice

Page 6: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Voice recognition

How it works• Record voice command (Time domain)• Transform into frequency domain using

Fourier Transform and get the magnitude spectrum

• Compare spectrum of voice commands

Page 7: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Program block diagramVoice command

F.T

Frequency Spectrum

Minimum distancee?

Compare with stored voice commands

yes

command

Frequency Spectrum

Do they match?

No

Page 8: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Fourier transform

Speech Signal (time domain) Frequency domain

Page 9: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Communication with a robot The efficient way to communicate with

robot is using Bluetooth. We connect the Bluetooth to the serial

port at the back of the PC to transmit to the robot.

For receiving, we connect a receiver module to the robot.

Page 10: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Continue The robot receives the input commands

from the Bluetooth transmitter which is a 3 bit digital signal. Then, if the input commands are matched to the program, the robot moves based on state given. Otherwise, the robot will not respond to the given command.

Page 11: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Continue The programmed commands:

Microcontroller processor command

001 100011010 111Or 000

110101

Straight forward fast

Perfect stopStraight

backwardsTurn leftTurn right

Straight forward slow

Straight forward middle

Page 12: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

System diagram

Page 13: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Motor control

Page 14: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

ResultsCorrect Incorrect

Stop 10 0

Move Forward 9 1

Move Backwards

9 1

Turn Right 10 0

Turn Left 9 1

94 % accurate

Page 15: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Limitations The amount of words that could be

recognized by our program was limited, the more words we tried adding, the less accurate it became

The voice recognition program only works for the person’s voice who is trained for it

Program is less accurate in noisy environments

Bluetooth is an expensive technology to implement

Page 16: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Advantages Simple algorithm used for voice

recognition Easy to train a person’s voice Accurate Bluetooth is very fast and efficient Voice commands are a very easy way to

control a robot

Page 17: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Conclusion For small vocabulary applications, the

method of using the Fourier transform based speech recognition is more suitable then the other complex algorithms for it is easy to implement and has a high accuracy rate.

Page 18: Group members Luke Makischuk Abderahmane Sebaa Ameneh Sadat Yazdaninik Asma Faizi Professor: R Habash TA: Wei Yang

Thank you for your time

any Questions?