Speech Recognition in iOS Devices

Speech Recognition (SR) is a technique for translating audible words and phrases of a spoken language into a machine-readable format. Its quality depends on how accurately the software reads voices and on the capacity of its database to handle multiple languages and multiple voices.

Trained (speaker-dependent) speech recognition systems learn to decode one specific user's speech, which provides better accuracy for that user. Speaker-independent systems are not confined to one user and handle multiple voices.

Speech to Text (STT) is often used interchangeably with Speech Recognition, but it is in fact just one of the applications of SR. SR and STT let users control the device and direct it through speech, opening up a completely different kind of user experience within the elegant iOS 7 design. Apps use them for voice dialing, call routing, device control, dictation, data entry, audible output from text with Text to Speech aids, and user authentication, to mention a few.

Speech recognition apps extend support to differently abled people, enhancing the Accessibility features of iOS.

Virtual assistants like Siri need no introduction to iOS users.

Speech Recognition Benefits and Applications

Libraries for Speech Recognition in iOS

Several commercial and Open Source libraries are available for implementing SR on iOS devices, and iPhone and iPad application developers can use them.

• PocketSphinx

An Open Source library that supports both desktop and mobile apps. It is cross-platform, running on Linux, Windows, Mac OS X and iOS, and it provides a Python language binding.

• VocalKit

A free library that wraps existing libraries such as PocketSphinx, helping iOS developers create voice recognition solutions through a crisp Objective-C API.

• OpenEars

A free library for offline Speech Recognition and Text to Speech applications. OpenEars performs speech recognition and language model generation in English and in Spanish.

• NeatSpeech

A TTS plugin for OpenEars, enabling tasks in both English and Spanish. It continuously listens for speech in the background and activates automatically. Its multiple voice support covers up to 9 voices, including male and female ones, with a good range of speed and quality levels, and switching between them on the fly is easy. NeatSpeech integrates seamlessly with Bluetooth, other communication protocols and a variety of audio and speech devices. Interaction through Objective-C methods adds to the robustness of the plugin, and working in memory instead of on disk improves its speed. NeatSpeech blends smoothly into the Cocoa layer of iOS.

OpenEars also offers a few paid plugins that make it easy to integrate new speech features into apps for live, real-time speech recognition. Its responsive behavior lets it serve as a gaming input, enriching the user experience.

• Rejecto

A vocabulary rejection plugin for words and noises that are not part of the vocabulary of the language chosen for the device. Rejecto is ready to use with OpenEars and adds value to the app.

• SaveThatWave

A plugin that adds the capability of recording speech to audio files, making them directly available for later use.
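
The idea of keeping captured speech as an audio file for later use can be illustrated with a minimal sketch. It is not the SaveThatWave or OpenEars API: the function names save_pcm_as_wav and make_test_tone are hypothetical, the 16 kHz mono 16-bit format is an assumption commonly used for speech, and a generated sine tone stands in for real microphone input. Python is used only because it is easy to run on a desktop; it relies on the standard library alone.

```python
import math
import struct
import wave


def save_pcm_as_wav(path, samples, sample_rate=16000):
    """Write 16-bit mono PCM samples (ints in [-32768, 32767]) to a WAV file."""
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)            # mono (assumed format for speech)
        wav_file.setsampwidth(2)            # 2 bytes per sample = 16-bit audio
        wav_file.setframerate(sample_rate)  # samples per second
        # Pack the integers as little-endian signed 16-bit values.
        wav_file.writeframes(struct.pack("<%dh" % len(samples), *samples))


def make_test_tone(duration_s=0.5, freq_hz=440.0, sample_rate=16000):
    """Hypothetical stand-in for captured speech: a sine tone at half amplitude."""
    count = int(duration_s * sample_rate)
    return [
        int(16383 * math.sin(2 * math.pi * freq_hz * i / sample_rate))
        for i in range(count)
    ]
```

Calling save_pcm_as_wav("utterance.wav", make_test_tone()) writes a half-second file that any audio player can open. In a real app the samples would come from the recognizer's audio buffer instead of a generated tone.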

The iPhoneQualityApplications division offers iOS applications and iOS solutions using iOS 7, iOS 6 and iOS 5 for a range of Apple devices: iPhone 4, iPhone 4S, iPhone 5, iPhone 5C, iPhone 5S, iPad 2, iPad Air and iPad Mini. It demonstrates expertise in various frameworks, Cocoa Touch and core C programming. In addition, dedicated iOS testing teams provide iOS testing services.