Vision Based Gesture in Embedded Devices

8/7/2019 Vision Based Gesture in Embedded Devices

1/20

COMPUTERVISION & GESTURE

RECOGNITION

-- Mukesh Kumar


2/20

Introduction

Input devices

How System Works

Computer Vision Frameworks

OpenCV

Challenges

Few Apps Idead

Demo

Conclusions

OUTLINE


3/20

1. INTRODUCTION

Computer vision is the science where machine

is able to extract information from an image that

is necessary to solve some task. It enables

computer to understand visual input

The image data can be acquired in many forms

and from many kind of sources, such as video

sequences, views from multiple cameras, or

multi-dimensional data from a medical scanner,

video, normal 2D cameras, infrared cameras,

radars or specialized sensors.


4/20

GESTURE RECOGNITION

Gesture recognition is a language

technology with the goal of interpreting

human gestures via mathematical algorithms

Gesture recognition can be conducted with

techniques from computer vision and image

processing.


5/20


6/20

Two common technologies for hand gesture

recognition

y glove-based method

Using special glove-based device to extract hand posturey vision-based method

3D Depth data Model

Normal Color Camera / Appearance Model


7/20

Data Glove Based Gesture Recognition


8/20

VISION-BASED GESTURE RECOGNITION


9/20

HOW IT WORKS

Recognition + Gesture recognition

The classical problem in computer vision is that of determining whether or not theimage data contains some specific object, feature, activity, gesture, optical character,Motion e.t.c.

This can be achieved using following steps :

Image Acquisition :Image Acquisition is done to generate a 2D image, 3D depth data, Imagesequence from video feeds at real time or from any other sources usingspecific hardware.

Pre processing of ImagesNoise reduction, Contrast Enhancements,Gray scale conversion, Matrix of

Image creation, Histogram comparison, Image color Inversion.Few of these steps can also be done during Image Acquisition phase insidethe cameras.

Feature ExtractionLine,Edge, Corner Detection

Blobs and shape detection and extraction

Color recognition, like skin color


10/20

High Level ProcessingHigh Level Processing is done on the extracted features from the image for Gesture, pose,

Motion detection.

Pixel Classification

ImageCorrelation

Facial Recognition

Hand detectionFeature Tracking

In this phase Neural Network andArtificial intelligence is used to teach the system about the

gesture, kind of motions, faces.Adaptive algorithms are used to adapt the system for working under

different kind of scenarios and environments which learn itself and become more intelligent system after

learning from the various kind of inputs.


11/20


12/20

Training samples

Training samplesy Negative samples: images that must

not contain object representations. We

collected 500 random images as negative

samples.

y Positive samples: hand posture images

that are collected from humans hand, orgenerated with a 3D hand model. For each

posture, we collected around 450 positive

samples. As the initial test, we use the white

wall as the background.


13/20

AVAILABLE FRAMEWORKS

Few Computer Vision Frameworks available:

OpenNI Works with Microsoft Kinect and other 3D sensors generating

3D depth Images

OpenCV (started by Intel) Have a C/C++ interface for complex Image

Processing and Applying

Neural network Algorithm. Works with 2D or 3Dimage samples or live video feeds.

Aforge.Net Same as OpneCV, with a .NET interface. Have complex

Image Processing and Neural network Algorithm, genetic algorithms and

machine learning libraries for Applications. Subset of this project has a

Glyph recognition system.

NokiaCV Library

EmbedCV - An Embeddable Computer Vision Library

The OpenSURF Computer Vision Library


14/20

OPENCVINTEL OPENSOURCECOMPUTER VISIONLIBRARY

OpenCVis a library of programming functions mainly aimed at real

time computer vision, developed by Intel.

y It is free for use under the open source BSD license.

y The library is cross-platform.


15/20


16/20

MODULES

OpenCVOpenCV FunctionalityFunctionalityyy more than 350 algorithmsmore than 350 algorithms


17/20


18/20

CHALLENGES

Accuracy of gesture recognition software.

Image noise - not necessary be under consistent

lighting, or in the same location. Items in the

background or distinct features of the users maymake recognition more difficult.

Hardware Requirement like 3D depth Mapper.

Processing Power required for real time Vision.


19/20

EyeSight's hand-waving, gesture-based UIy a simple hand gesture over the phone will silence incoming calls while in a

meeting, scroll between photos in the photo gallery, skip tracks in the media

player


20/20

FEWAPPS IDEAS

Documents

Vision Based Gesture in Embedded Devices