Documente Academic
Documente Profesional
Documente Cultură
Lecture 8 Prepared by R. Lathrop 11//99 Updated 3/06 Readings: ERDAS Field Guide 5th Ed. Ch 6:234-260
Learning objectives
Remote sensing science concepts
Basic concept of supervised classification Major classification algorithms Hard vs Fuzzy Classification.
Math Concepts Skills --Training set selection: Digital polygon vs. seed pixelregion growing --Training aids: plot of training data, statistical measure of separability; --Edit/evaluate signatures -- Applying Classification algorithms
Supervised
Select Training fields Edit/evaluate signatures Classify image
vs.
Unsupervised
Run clustering algorithm Identify classes Edit/evaluate signatures
Evaluate classification
Evaluate classification
N I R Red Unsupervised Posterior Decision: from Spectral Classes in Feature Space to Information Classes in the Image
Training
Training: the process of defining criteria by which spectral patterns are recognized Spectral signature: result of training that defines a training sample or cluster parametric - based on statistical parameters that assume a normal distribution (e.g., mean, covariance matrix) nonparametric - not based on statistics but on discrete objects (polygons) in feature space
Spectral Distance = 7
Spectral Distance = 10
Spectral Distance = 5
Spectral Distance = 10
Training Stage
Training set ---> training vector Training vector for each spectral classrepresents a sample in n-dimensional measurement space where n = # of bands
for a given spectral class j
Xj = [ X1 ]
[ X2 ]
X1 = mean DN band 1
X2 = mean DN band 2
Class 1
0
Class 2
Digital Number 255
Training Aids
Graphical portrayals of training data
good
bad
Training Aids
Graphical portrayals of training data
coincident spectral mean plots
Training Aids
Scatter plots: each training set sample constitutes an ellipse in feature space Provides 3 pieces of information - location of ellipse: mean vector - shape of ellipse: covariance - orientation of ellipse: slope & sign of covariance Need training vector and covariance matrix
N I R R e f l e c t a n c e
Grass
Mix: grass/trees Broadleaf Trees Conifer
Examine ellipses for gaps and overlaps. Overlapping ellipses ok within information classes; want to limit between info classes
Training Aids
Are some training sets redundant or overlap too greatly?
Statistical Measures of Separability: expressions of statistical distance that are sensitive to both mean and variance
- divergence - Mahalanobis distance
Training Aids
Training/Test Area classification: look for misclassification between information classes; training areas can be biased, better to use independent test areas Quick alarm classification: on-screen evaluation of all pixels that fall within the training decision region (e.g. parallelipiped)
N I R R e f l e c t a n c e
Upper and lower limit of each box set by either range (min/max) or # of standard devs. Note overlap in Red but not NIR band
Red Reflectance
.
ured Red reflectance
Signature ellipse
Candidate pixel
Soil 3
Water 2
Water 1 Red reflectance
Soil 1
Yd = 85-153
Distance = 111.2
180, 85 Xd = 180 -92 X
N I R R e f l e c t a n c e
Red Reflectance
Statistically-based classifiers
Defines a probability density (statistical) surface Each pixel is evaluated for its statistical probability of belonging in each category, assigned to class with maximum probability The probability density function for each spectral class can be completely described by the mean vector and covariance matrix
Class 1
0
Class 2
Digital Number 255
wj
Band 2
wi
Band 1
# of pixels
Digital Number
255
Band 1
# of pixels
wi
p (x | wi){ 0
p (x | wj) 255
Digital Number
N I R R e f l e c t a n c e
probability surfaces
Ellipses defined by class mean and covariance; creates likelihood contours around each spectral class;
Red Reflectance
N I R R e f l e c t a n c e
Some classes may have large variance and greatly overlap other spectral classes
Red Reflectance
Pro: takes the variability of the classes into account with info from COV matrix
Similar to maximum likelihood but without the weighting factors Con: parametric, therefore sensitive to large variances
Hybrid classification
Can easily mix various classification algorithms in a multi-step process First pass: some non-parametric rule (feature space or paralellipiped) to handle the most obvious cases, those pixels remaining unclassified or in overlap regions fall to second pass Second pass: some parametric rule to handle the difficult cases; the training data can be derived from unsupervised or supervised techniques
Thresholding
Statistically-based classifiers do poorest near the tails of the training sample data distributions Thresholds can be used to define those pixels that have a higher probability of misclassification; these pixels can be excluded and labeled un-classified or retrained using a clusterbusting type of approach
Threshold
Unclassified Regions
255
Thresholding
Chi square distribution used to help define a onetailed threshold
# of pixels
Chi Square
Water
Forested Wetland
Forest
Fuzzy Classification
Forested Wetland
Hard decision boundaries
Water
NIR reflectance
Homework
1 Homework: Unsupervised classification (Hand up your excel file and figure process); 2 Reading Textbook Ch. 9:337-389; 3 Reading Field Guide Ch. 7:226-231, 235253.