Documente Academic
Documente Profesional
Documente Cultură
IS
HUMAN
VISION
PERFECT ?
PENROSES TRIANGLE
PENROSES STAIRS
Why is it so?
Our vision system neural network has been tuned
to perform recognition, processing and
classification of phenomena that was vital to our
survival and progress. In this context, every
species has a different vision system.
Hence, we are not very good in dealing with
artificially generated images, as these phenomena
rarely occurred in nature during our evolution.
However, we are the best for natural images!
Functions of vision
Segment the image (or scene) and recognize the
objects distinguished
Compute distance to contact in every direction
Provide feedback and triggers for action
Provide a low-level summary of the 2-D and/or 3D features of the image, leaving it to the central
non-visual processes to draw conclusions
Is something left out?
Visual/spatial reasoning
Our ability to use diagrams and visual images to
reason about very abstract mathematical problems,
like thinking about the complexity of a search
strategy
Seeing that 7+5=12 by a rearrangement of dots
Seeing that angles of a triangle add up to a
straight line
Visualize infinitely thin and long lines of
Euclidean geometry
Many more examples
Visual/spatial reasoning(contd.)
Uses of spatial reasoning: Knowing where to search for an object
thrown over a wall, assembling toy crane from a toy set, uses of spatial
concepts(notion of search space) in programming design
Reasoning using a grasp of spatial structures requires at least: the
ability to see various structures involved in the proof, the possibilities
for variatins(rearrangements) in them, the invariant structures during
the rearrangements, etc.
In contrast, a reasoning system like logic is completely discrete and all
syntactic composition involves function appllication
Specification of the requirements for visual reasoning is very vague,
and would not be easy to mechanize
Visual Pathway
Hierarchical Neural Network
Architecture
Contents
Brain Mechanism of Vision
Hubels and Wiesel's hierarchy model
Cerebral Cortex
Evolution of cerebral cortex is one of the
great success in the history of living beings.
Insights of cortical organization:
Division into different regions having different
functionalities.
e. g. , Visual, auditory, somatic sensory, speech
and motor regions
Visual Pathway
Retina to the Visual Cortex
on Center.
(Excitatory Center and Inhibitory Surrounding)
off Center.
(Inhibitory Center and Excitatory Surrounding)
Extended Hubel-Weisel
Hubel-Weisel hierarchical models have been
extended to obtain a fine balance between
selectivity and invariance.
Simple and complex cells are interleaved at
different levels of the inferotemporal (IT) lobes.
Max-like pooling mechanisms have been
suggested at certain levels as opposed to a
weighted sum of afferents to boost invariancy in
scale, position and rotation.
Feedforward Architecture
The S cells (simple cells) in the previous figure passed on information
to the C cells (complex cells) by a bell-tuned weighted sum or a maxlike operation.
Feedforward Architecture
Primates have a very advanced level of attention modulation
(fixation) which is a feedback propagation from the IT lobes to
the primary visual cortex and lower levels.
This mechanism allows to shift attention from one part of the
image to another.
However, crude object recognition is done in a very small
duration after stimulus which indicates use of only the
feedforward architecture for rapid categorization.
Such a model was attempted at the McGovern Institute for Brain
Research at MIT with some simplifications.
The input consisted of 4 different orientations and several scales,
densely covering the gray-value input image of 7x 7
Results
The model was evaluated against human responses for input
stimulus of 20ms followed by varying inter-stimulus interval.
No single model parameter was adjusted to fit the human
data. All unsupervised parts were fixed and constant
throughout all the runs.
The supervised mode was tuned differently in different runs
using different test images. Humans were also shown these
test images.
An evaluation across all such runs for the identification of
animal objects was done for both humans and animals. The
results were compared.
Results
Various categories of
images in different clutter,
scale, position, rotation
were given.
Conclusions
Biologically inspired computation models have
shown very promising results. They are versatile
and fast learners. Why not learn from natures
best?
Advances in neuroscience are picking up,
allowing us greater understanding. Also,
simulations of hypothetical models will help us
validate neuroscience findings.
References
Talks by Aaron Sloman, Univ of Birmingham, UK
2005 - 2007
http://www.cs.bham.ac.uk/~axs/invited-talks.html
http://www.lifesci.sussex.ac.uk/home/George_Mather/Linked%20Pages/Physiol/Cort
ex.html