Term Paper
on
Rohan Prabhakar
A2305218186
Submitted to:
The Author attests that permission has been obtained for the use of any
copyrighted material appearing in the Project report other than brief
excerpts requiring only proper acknowledgement in scholarly writing and all
such use is acknowledged.
Date: __________________
Rohan Prabhakar
A2305219999
CSE- (2017-21)
CERTIFICATE
_________________________
Dr Akash Punhani
Department of Computer Science and Engineering
ASET, Noida
ACKNOWLEDGEMENT
Dr Akash Punhani
ABSTRACT
This report discusses various aspects of the use of neural networks in
gesture recognition for better communication between computers and their
users. A gesture recognition system aims to identify gestures reliably and
use them to control devices or convey information. The report surveys
research in the field of gesture recognition based on Artificial Neural
Networks. Several gesture recognition methods are presented, together with
their advantages and drawbacks. The specific environment, implementation
tools and other requirements of the methods are also discussed.
INTRODUCTION
APPROACH AND METHODS
Fig 2: Vision based
Source: shutterstock.com
IMAGE EXTRACTION AND IDENTIFICATION
where
$D_{i,j}$ = the difference image at pixel (i, j)
$I_{i,j}$ = the input image at pixel (i, j)
$R_{i,j}$ = the background image at pixel (i, j)
$B_{t+1} = \alpha I_t + (1 - \alpha) B_t$
where
t = current time
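The background subtraction step can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the difference image is taken to be the per-pixel absolute difference between the input and the background, the update rule is read as producing the background for the next frame, and the `threshold` and `alpha` values are hypothetical, not taken from the report.

```python
def difference_image(image, background):
    """Per-pixel absolute difference |I - R| (the absolute value is an assumption)."""
    return [
        [abs(i_px - r_px) for i_px, r_px in zip(i_row, r_row)]
        for i_row, r_row in zip(image, background)
    ]


def update_background(image, background, alpha=0.05):
    """Running-average update: alpha * I_t + (1 - alpha) * B_t."""
    return [
        [alpha * i_px + (1.0 - alpha) * b_px for i_px, b_px in zip(i_row, b_row)]
        for i_row, b_row in zip(image, background)
    ]


def foreground_mask(diff, threshold=30):
    """Mark pixels whose difference exceeds a (hypothetical) threshold."""
    return [[1 if d > threshold else 0 for d in row] for row in diff]


# Tiny grayscale example: two pixels change noticeably against the background.
background = [[10, 10, 10], [10, 10, 10]]
frame = [[10, 200, 10], [12, 10, 90]]

diff = difference_image(frame, background)
mask = foreground_mask(diff)
background = update_background(frame, background, alpha=0.5)
```

A small alpha makes the background adapt slowly, so a briefly held gesture stays in the foreground; a large alpha absorbs changes into the background quickly.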
The image is now divided into foreground and background, and its brightness
has been corrected. The two remaining steps, segmentation and the skin
colour filter, are then applied to the image to extract a meaningful symbol
or gesture for the computer to interpret.
The following sections explain how segmentation and the skin colour filter
work.
SKIN COLOUR FILTER
Human skin colour is composed of two extreme hues, red from blood and
yellow from melanin, with moderate saturation. These skin properties are
essential information that can be used in a hand tracking algorithm. The
skin filter is modelled as follows: three-channel pixels (RGB values) are
first transformed into log-opponent values.
The green channel is used to represent intensity because the red and blue
channels have poor spatial resolution. The constant 105 simply scales the
output of the log function into the range [0, 254]. n is a random noise
value, generated from a distribution uniform over the range [0, 1). The
random noise is added to prevent banding artifacts in dark areas of the
image. The constant 1, added before the log transformation, prevents
excessive inflation of color distinctions in very dark regions. The log
transformation makes the Rg and By values, as well as differences between I
values (e.g. texture amplitude), independent of illumination level.
The hue at a pixel is defined to be atan(Rg, By), where Rg and By are the
smoothed values computed as in the previous section.
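The transformation itself is not reproduced in the text, so the following sketch assumes the commonly used log-opponent formulation that matches the constants described above: L(x) = 105 log10(x + 1 + n), I = L(G), Rg = L(R) - L(G) and By = L(B) - (L(G) + L(R)) / 2. The saturation formula (the length of the (Rg, By) vector), the use of degrees for hue, and all function names are assumptions made for illustration. The smoothing step is omitted.

```python
import math
import random


def log_scale(x, noise=None):
    """L(x) = 105 * log10(x + 1 + n), with n uniform over [0, 1)."""
    n = random.random() if noise is None else noise
    return 105.0 * math.log10(x + 1.0 + n)


def log_opponent(r, g, b, noise=None):
    """Return (I, Rg, By) for one RGB pixel.

    Assumed formulation (not shown in the text):
      I  = L(G)
      Rg = L(R) - L(G)
      By = L(B) - (L(G) + L(R)) / 2
    """
    lr, lg, lb = (log_scale(c, noise) for c in (r, g, b))
    return lg, lr - lg, lb - (lg + lr) / 2.0


def hue_and_saturation(rg, by):
    """Hue as atan2(Rg, By) in degrees; saturation as the vector length (assumed)."""
    hue = math.degrees(math.atan2(rg, by))
    saturation = math.hypot(rg, by)
    return hue, saturation


# Noise is fixed to 0 here only to make the example repeatable.
intensity, rg, by = log_opponent(200, 120, 90, noise=0.0)
hue, saturation = hue_and_saturation(rg, by)
```

Passing `noise=None` draws a fresh random value per channel, which is what breaks up banding in dark regions; a fixed value is useful only for testing.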
Once the hue and saturation are calculated, the skin regions can be marked
using the following properties:
(a) Hue is between 110 and 150 and saturation is between 20 and 60.
(b) Hue is between 130 and 170 and saturation is between 30 and 130.
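The two rules can be expressed as a small per-pixel test. In this sketch the upper hue bound of rule (a) is read as 150, the bounds are treated as inclusive, and the function names and sample values are hypothetical; the report does not state whether the limits are inclusive.

```python
def is_skin(hue, saturation):
    """Apply the two hue/saturation rules (a) and (b) from the text."""
    # Rule (a): upper hue bound taken as 150 (assumption), bounds inclusive.
    rule_a = 110 <= hue <= 150 and 20 <= saturation <= 60
    # Rule (b): bounds inclusive (assumption).
    rule_b = 130 <= hue <= 170 and 30 <= saturation <= 130
    return rule_a or rule_b


def skin_mask(hue_image, saturation_image):
    """Build a binary mask from per-pixel hue and saturation images."""
    return [
        [1 if is_skin(h, s) else 0 for h, s in zip(h_row, s_row)]
        for h_row, s_row in zip(hue_image, saturation_image)
    ]


# Made-up hue and saturation values for a 2x2 image.
hues = [[120.0, 160.0], [90.0, 140.0]]
saturations = [[40.0, 100.0], [40.0, 10.0]]
mask = skin_mask(hues, saturations)
```

Because the two hue ranges overlap between 130 and 150, a pixel in that band is accepted over the wider saturation range of 20 to 130.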
SEGMENTATION
This algorithm fails if one of the non-relevant objects has a larger area
than the actual main object. Besides, the region captured as foreground
might not be easily partitioned into disjoint regions. One way to solve this
problem is to combine several image segmentation algorithms, each of which
handles a simple characteristic, so that together they address the whole
problem. Applying an edge detection algorithm to the image defines the
outline of every object in the captured scene.
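The report does not say which edge detector is meant, so the sketch below uses the Sobel operator purely as one common choice. The `threshold` value and the function name are hypothetical, and the image is assumed to be a grayscale grid of intensities.

```python
def sobel_edges(image, threshold=100):
    """Return a binary edge map using the Sobel gradient magnitude.

    Border pixels are left as 0 because the 3x3 kernel does not fit there.
    """
    height, width = len(image), len(image[0])
    edges = [[0] * width for _ in range(height)]
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            # Horizontal gradient: right column minus left column.
            gx = (
                image[y - 1][x + 1] + 2 * image[y][x + 1] + image[y + 1][x + 1]
                - image[y - 1][x - 1] - 2 * image[y][x - 1] - image[y + 1][x - 1]
            )
            # Vertical gradient: bottom row minus top row.
            gy = (
                image[y + 1][x - 1] + 2 * image[y + 1][x] + image[y + 1][x + 1]
                - image[y - 1][x - 1] - 2 * image[y - 1][x] - image[y - 1][x + 1]
            )
            if (gx * gx + gy * gy) ** 0.5 > threshold:
                edges[y][x] = 1
    return edges


# A dark image whose right half is bright: the edge runs down the middle.
image = [
    [0, 0, 255, 255],
    [0, 0, 255, 255],
    [0, 0, 255, 255],
    [0, 0, 255, 255],
]
edges = sobel_edges(image)
```

The resulting outline can then be combined with the foreground mask to separate objects that would otherwise merge into one region.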
To obtain more information about the image, a skin color detection model
can be applied. This model identifies the regions that contain human skin
colours. However, the output may be a merged region without a clearly
defined border.
ARTIFICIAL NEURAL NETWORK (ANN)
NEURAL NETWORKS CLASSIFICATION
Feed-forward networks are the simplest type of neural network. As the name
'forward' suggests, information moves in one direction only, from the input
nodes through the hidden nodes to the output nodes, with no cycles.
Recurrent neural networks are models with bi-directional data flow, which
allows connection loops between perceptrons.
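The difference between the two classes can be illustrated with a minimal sketch. The layer sizes, the weights and the sigmoid activation are hypothetical choices made only to keep the example concrete; they are not taken from any of the surveyed systems.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dense(inputs, weights, biases):
    """One fully connected layer: sigmoid of the weighted sum plus bias."""
    return [
        sigmoid(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def feed_forward(inputs, hidden_w, hidden_b, output_w, output_b):
    """Information flows input -> hidden -> output, with no cycles."""
    hidden = dense(inputs, hidden_w, hidden_b)
    return dense(hidden, output_w, output_b)


def recurrent_step(inputs, prev_hidden, input_w, recurrent_w, biases):
    """One step of a simple recurrent layer: the previous hidden state
    is fed back in alongside the current input (a connection loop)."""
    return [
        sigmoid(
            sum(w * x for w, x in zip(in_row, inputs))
            + sum(w * h for w, h in zip(rec_row, prev_hidden))
            + b
        )
        for in_row, rec_row, b in zip(input_w, recurrent_w, biases)
    ]


# Hypothetical weights, chosen only to make the example concrete.
hidden_w = [[0.5, -0.5], [1.0, 1.0]]
hidden_b = [0.0, 0.0]
output_w = [[1.0, -1.0]]
output_b = [0.0]
recurrent_w = [[0.1, 0.0], [0.0, 0.1]]

output = feed_forward([1.0, 1.0], hidden_w, hidden_b, output_w, output_b)

# The recurrent layer carries its hidden state from one sample to the next.
hidden = [0.0, 0.0]
for sample in ([1.0, 0.0], [0.0, 1.0]):
    hidden = recurrent_step(sample, hidden, hidden_w, recurrent_w, hidden_b)
```

The feed-forward output depends only on the current input, whereas the recurrent hidden state also depends on earlier samples, which is why recurrent models suit gestures that unfold over time.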
GESTURE RECOGNITION USING NEURAL NETWORKS
to the system. First, the positional data extracted from the data glove
were augmented using a pre-wiring network, and two kinds of positional data
were used. Second, the data space was filtered: data from three different
time points were given to the input layer, and these data are shifted for
the next sample. With these two changes the input layer has 93 nodes
instead of 16 nodes.
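The three-time-point input can be pictured as a sliding window over the glove samples. The sketch below is illustrative only: the report gives the total of 93 input nodes but not how they are divided, so 31 values per time point (93 / 3) is an assumption, as are the function name and the sample data.

```python
def sliding_windows(frames, width=3):
    """Concatenate each run of `width` consecutive frames into one input vector.

    Moving to the next sample shifts the window forward by one frame.
    """
    return [
        [value for frame in frames[start:start + width] for value in frame]
        for start in range(len(frames) - width + 1)
    ]


# Hypothetical data: 5 time points with 31 values each (31 is an assumption).
FEATURES_PER_FRAME = 31
frames = [[float(t)] * FEATURES_PER_FRAME for t in range(5)]

inputs = sliding_windows(frames, width=3)
```

Each element of `inputs` is one input-layer vector, and consecutive vectors share two of their three time points.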
APPLICATIONS
With most home appliances now becoming Wi-Fi enabled, it will become easier
in the future to use gesture recognition to control such appliances.
Philips Hue, a new lighting series by Philips, is Wi-Fi enabled and can be
controlled by an app, which suggests that gesture-controlled lighting may be
possible in the near future.
DISCUSSION AND CONCLUSION
Non-geometric features such as color, silhouette and texture can also be
used, but they are inadequate for recognition. Neural network systems can be
applied to features extracted from the input gesture images after
segmentation, for example to extract the shape of the hand.