4
A NEW APPROACH FOR ACCURATE CLASSIFICATION OF HYPERSPECTRAL IMAGES USING VIRTUAL SAMPLE GENERATION BY CONCURRENT SELF-ORGANIZING MAPS Victor-Emil Neagoe and Adrian-Dumitru Ciotec Faculty of Electronics, Telecomm. & Information Technology, Polytechnic University of Bucharest Splaiul Independenţei No. 313, Sector 6, Bucharest, Romania Email: [email protected], [email protected] ABSTRACT This paper presents an original approach for improving performances of the supervised classifiers in hyperspectral remote sensing imagery using generation of synthetic samples to optimize the training set. The proposed model called Virtual Sample Generation by Concurrent Self- Organizing Maps (VSG-CSOM) is based on the idea of improving the training set of a supervised classifier by substituting the initial labeled sample set with the „virtual” samples generated with a system of concurrent SOMs. We have considered a Spatial-Contextual Support Vector Machine (SC-SVM) classifier, taking into consideration both intraband and also interband pixel correlation. The proposed method is implemented and evaluated using two of the most known data sets of hyperspectral images: Indian Pines and Pavia University. The experimental results prove the significant advantage of the new model. 1. INTRODUCTION The information content of hyperspectral images with hundreds of channels allows us to remotely identify ground materials, based on their spectral signature [2], [3], [8], [14]. Each hyperspectral image pixel is a high dimensional vector corresponding to the sampling of the incoming radiance at this pixel. Classification of hyperspectral images is a very challenging task due to the generally unfavorable ratio between the large number of spectral bands and the limited number of training samples available a priori, which results in the Hughes phenomenon [14]. The sample size is often limited and insufficient for most classifiers to give satisfying performance. Hence, small sample size problem becomes the main challenge to hyperspectral data analysis techniques. Kernel methods and, in particular, support vector machines (SVM) [1] have shown excellent performance in multispectral/hyperspectral image classification [2], [3], [4], [7, [14]. In order to obtain a robust classifier, the training set needs to contain a sufficient amount of samples to adequately represent the problem [1], [5], [12], [15]. The traditional way to cope with the small training set problem is interpolation, also known as noise injection [15]. By adding noise to the real training vectors, one can generate virtual samples to increase the size of the training lot and thus to improve representational capability. At the same time, enormous success has been achieved during the last decade through modeling of biological and natural intelligence under the umbrella of Computational Intelligence (CI), whose main chapter corresponds to artificial neural networks. The proposed model called Virtual Sample Generation by Concurrent Self-Organizing Maps (VSG-CSOM) is based on the idea of substituting the original labeled training set by a new set of synthetic samples generated with a system of Concurrent SOMs [12] which best describe the patterns to build an improved training set. For the hyperspectral pixel classifier, we further use a SVM based on spatial-contextual information (SC- SVM), taking into consideration both the interband and intraband pixel correlation [7]. The experimental results of the proposed model are given for two hyperspectral image data bases: Indian Pines and Pavia University. 2. VIRTUAL SAMPLE GENERATION BY CONCURRENT SELF-ORGANIZING MAPS (VSG-CSOM) The technique of Concurrent Self-Organizing Maps (CSOM) has been proposed and developed by Neagoe et al for pattern classification [9] and this has been applied for multispectral image recognition [9], [10], [11]. Recently, Neagoe et al have proposed the increasing of the training set size for a facial expression recognition neural classifier, by means of „Virtual” Sample Generation (VSG) using a set of Concurrent Self-Organizing Maps (CSOM) [12]. The novelty of the present paper based on the new model (VSG- CSOM) by comparison to that given by the same authors in [12] consists not only in applying now the mentioned model for Earth Observation in hyperspectral imagery instead of facial expression recognition, but also on the fact that instead of using the synthetic sample generator to extend a 1031 978-1-4799-1114-1/13/$31.00 ©2013 IEEE IGARSS 2013

[IEEE IGARSS 2013 - 2013 IEEE International Geoscience and Remote Sensing Symposium - Melbourne, Australia (2013.07.21-2013.07.26)] 2013 IEEE International Geoscience and Remote Sensing

Embed Size (px)

Citation preview

Page 1: [IEEE IGARSS 2013 - 2013 IEEE International Geoscience and Remote Sensing Symposium - Melbourne, Australia (2013.07.21-2013.07.26)] 2013 IEEE International Geoscience and Remote Sensing

A NEW APPROACH FOR ACCURATE CLASSIFICATION

OF HYPERSPECTRAL IMAGES USING VIRTUAL SAMPLE

GENERATION BY CONCURRENT SELF-ORGANIZING MAPS

Victor-Emil Neagoe and Adrian-Dumitru Ciotec

Faculty of Electronics, Telecomm. & Information Technology, Polytechnic University of Bucharest

Splaiul Independenţei No. 313, Sector 6, Bucharest, Romania

Email: [email protected], [email protected]

ABSTRACT

This paper presents an original approach for improving

performances of the supervised classifiers in hyperspectral

remote sensing imagery using generation of synthetic

samples to optimize the training set. The proposed model

called Virtual Sample Generation by Concurrent Self-

Organizing Maps (VSG-CSOM) is based on the idea of

improving the training set of a supervised classifier by

substituting the initial labeled sample set with the „virtual”

samples generated with a system of concurrent SOMs.

We have considered a Spatial-Contextual Support Vector

Machine (SC-SVM) classifier, taking into consideration

both intraband and also interband pixel correlation.

The proposed method is implemented and evaluated using

two of the most known data sets of hyperspectral images:

Indian Pines and Pavia University. The experimental results

prove the significant advantage of the new model.

1. INTRODUCTION

The information content of hyperspectral images with

hundreds of channels allows us to remotely identify ground

materials, based on their spectral signature [2], [3], [8], [14].

Each hyperspectral image pixel is a high dimensional vector

corresponding to the sampling of the incoming radiance at

this pixel. Classification of hyperspectral images is a very

challenging task due to the generally unfavorable ratio

between the large number of spectral bands and the limited

number of training samples available a priori, which results

in the Hughes phenomenon [14]. The sample size is often

limited and insufficient for most classifiers to give satisfying

performance. Hence, small sample size problem becomes the

main challenge to hyperspectral data analysis techniques.

Kernel methods and, in particular, support vector machines

(SVM) [1] have shown excellent performance in

multispectral/hyperspectral image classification [2], [3], [4],

[7, [14]. In order to obtain a robust classifier, the training set

needs to contain a sufficient amount of samples to

adequately represent the problem [1], [5], [12], [15]. The

traditional way to cope with the small training set problem is

interpolation, also known as noise injection [15]. By adding

noise to the real training vectors, one can generate virtual

samples to increase the size of the training lot and thus to

improve representational capability. At the same time,

enormous success has been achieved during the last decade

through modeling of biological and natural intelligence

under the umbrella of Computational Intelligence (CI),

whose main chapter corresponds to artificial neural

networks.

The proposed model called Virtual Sample Generation by

Concurrent Self-Organizing Maps (VSG-CSOM) is based on

the idea of substituting the original labeled training set by a new

set of synthetic samples generated with a system of Concurrent

SOMs [12] which best describe the patterns to build an

improved training set. For the hyperspectral pixel classifier, we

further use a SVM based on spatial-contextual information (SC-

SVM), taking into consideration both the interband and

intraband pixel correlation [7]. The experimental results of the

proposed model are given for two hyperspectral image data

bases: Indian Pines and Pavia University.

2. VIRTUAL SAMPLE GENERATION BY

CONCURRENT SELF-ORGANIZING MAPS

(VSG-CSOM)

The technique of Concurrent Self-Organizing Maps (CSOM)

has been proposed and developed by Neagoe et al for

pattern classification [9] and this has been applied for

multispectral image recognition [9], [10], [11]. Recently,

Neagoe et al have proposed the increasing of the training set

size for a facial expression recognition neural classifier, by

means of „Virtual” Sample Generation (VSG) using a set of

Concurrent Self-Organizing Maps (CSOM) [12]. The

novelty of the present paper based on the new model (VSG-

CSOM) by comparison to that given by the same authors in

[12] consists not only in applying now the mentioned model

for Earth Observation in hyperspectral imagery instead of

facial expression recognition, but also on the fact that

instead of using the synthetic sample generator to extend a

1031978-1-4799-1114-1/13/$31.00 ©2013 IEEE IGARSS 2013

Page 2: [IEEE IGARSS 2013 - 2013 IEEE International Geoscience and Remote Sensing Symposium - Melbourne, Australia (2013.07.21-2013.07.26)] 2013 IEEE International Geoscience and Remote Sensing

given sample set, here the training set is composed by virtual

samples only that completely substitute the input original

samples. The steps of the proposed algorithm VSG-CSOM

are the following (Fig.1):

a. Building M pattern subsets

One splits the original labeled sample set into M

pattern subsets corresponding to each class (M being the

number of classes).

b. Training of each SOM(k) module (k=1,…,M)

One uses the SOM unsupervised algorithm [6] to train

each of the SOM modules. Namely, each SOM module has

been trained only with the samples having the same class

with the neural module label.

c. Virtual sample generation

After training, the weight vectors the SOM modules

have become virtual samples that substitute the input real

samples, building an improved training set to obtain a more

accurate classifier.

Fig. 1. Flowchart of the VSG-CSOM algorithm.

Fig. 2. Spatial-contextual representation of the K-band pixel

by the corresponding 9K-dimensional vector corresponding

to the 3x3xK hyper rectangle centered in the current pixel.

3. SPATIAL-CONTEXTUAL CLASSIFICATIN OF

HYPERSPECTRAL PIXELS WITH SUPPORT

VECTOR MACHINE (SC-SVM)

The Support Vector Machine is a pattern classification

technique, which attempts to minimize the upper bound of

the generalization error by maximizing the margin between

the separating hyperplane and the training data [1]. SVM

achieves greater empirical accuracy and better generalization

capabilities than other standard supervised classifiers.

Recent studies show that SVMs with both spectral and

spatial information achieve effective and stable multispectral

image classification [7]. To classify the hyperspectral pixels

(K bands), one chooses a spatial-contextual representation of

the K-band pixel as a 9K dimensional vector (using a

second-order neighborhood system in the original space).

Consequently, we have used the interband and the intraband

correlation by representing the K-band pixel by the

corresponding hyper rectangle centered in the current

hyperspectral pixel (see Fig.2). This hyper rectangle has a

square base of 3x3 and a height of K, corresponding to the

9K dimensional vector representation.

4. EXPERIMENTAL RESULTS

4.1. Image data bases

4.1.1. Indian Pines

This scene was gathered by AVIRIS sensor over the Indian

Pines test site in North-western Indiana and consists of 145

x145 pixels and 224 spectral reflectance bands. The ground

truth available is designated into sixteen classes and is not

all mutually exclusive. The image resolution is of 20

meters/pixel. The number of bands has been reduced to 200

by removing bands covering the region of water absorption.

4.1.2. Pavia University

The image is acquired by ROSIS sensor over Pavia, northern

Italy. The number of spectral bands is 103 and the image

size is 610x340 pixels, but some of the samples in both

images contain no information and have to be discarded

before the analysis. The geometric resolution is 1.3

meters/pixel. The image groundtruth differentiates 9 classes.

4.2. Experimental Results

The experimental results are given in Tables 1 and 2 as well

as in Figs. 3, 4, 5, and 6.

1032

Page 3: [IEEE IGARSS 2013 - 2013 IEEE International Geoscience and Remote Sensing Symposium - Melbourne, Australia (2013.07.21-2013.07.26)] 2013 IEEE International Geoscience and Remote Sensing

Fig. 3. (a) Groundtruth of Indian Pines dataset. (b) Classified

hyperspectral pixel map (16 categories) with the model

(VSG-CSOM)-(SC-SVM), using a learning set containing

5% of the total pixels for each class.

Fig. 4. (a) Groundtruth of Pavia University dataset. (b)

Classified hyperspectral pixel map (9 categories) with the

model (VSG-CSOM)-(SC-SVM), using a learning set

containing 5% of the total.

Table 1. Maximum recognition score [%] for Indian Pines

hyperspectral image dataset (with a learning set of 5% of the

total pixels for each class)

Classifier SVM

(VSG-

CSOM)-

SVM

(VSG-

CSOM)-

(SC-SVM)

LDA-

SVM

Maximum correct

recognition score [%] 66.07 75.10 76.93 68.82

Fig. 5. Maximum recognition score [%] for Indian Pines

hyperspectral image dataset.

Table 2. Maximum recognition score [%] for

Pavia University hyperspectral image dataset (with a

learning set of 5% of the total pixels for each class)

Classifier SVM

(VSG-

CSOM)-

SVM

(VSG-

CSOM)-

(SC-SVM)

LDA-

SVM

Maximum correct

recognition score [%] 87.95 93.07 96.50 90.89

Fig. 6. Maximum recognition score [%] for Pavia University

hyperspectral image dataset.

1033

Page 4: [IEEE IGARSS 2013 - 2013 IEEE International Geoscience and Remote Sensing Symposium - Melbourne, Australia (2013.07.21-2013.07.26)] 2013 IEEE International Geoscience and Remote Sensing

5. CONCLUDING REMARKS

a. The model proves its significant advantage of substituting

the original image training samples with the synthetic ones

in challenging classification cases (when the learning set

contains only 5% of the sample set of each class).

b. For the Indian Pines dataset, choosing a learning set

containing 5% of the total pixels of each class, one obtains

with the proposed model (VSG-CSOM)-SVM an increasing

of 9.03% of the recognition score over the SVM score; for the

(VSG-CSOM)-(SC-SVM) variant (adding spatial-contextual

information to the proposed model), one obtains an increasing

of 10.86% over SVM. As a benchmark, one also uses the

LDA – SVM cascade (where LDA=Linear Discriminant

Analysis). One remark the performance of LDA-SVM is with

6.28% less than that of (VSG-CSOM)-SVM.

c. For the Pavia University dataset, with a learning set

containing 5% of the total pixels of each class, the classifier

(VSG-CSOM)-SVM leads to the recognition score of

93.07%, representing an increasing of 5.12 % over the

SVM. For the (VSG-CSOM)-(SC-SVM) variant, one obtains

a recognition score of 96.50%, representing an increasing

of 8.55% over SVM. For comparison, the LDA-SVM proves

to be less successful than the proposed model

(VSG-CSOM).

6. REFERENCES

[1] M. Bishop, Pattern Recognition and Machine Learning,

Springer, New York, 2006.

[2] C. H. Chen, Signal and Image Processing, for Remote Sensing,

Second Edition, CRC Press, New York, 2012.

[3] I. Dopido, M. Zortea, A. Villa, A. Plazza, P. Gamba,

“Unmixing Prior to Supervised Classification of Remotely Sensed

Hyperspectral Images”, IEEE Geosc. Remote Sensing Letters, vol.

8, nr. 4, pp. 760-764, July 2011.

[4] G. M. Foody and A. Mathur, “A Relative Evaluation of Multiclass

Image Classification by Support Vector Machines”, IEEE Trans.

Geosci. Remote Sens., Vol. 42, No. 6, pp. 1335-1343, Jun. 2004.

[5] A. Kohl, L. Kruger, C. Wohler, and U. Kressel, “Training the

Classifiers Using Virtual Samples only”, in Proc. of the 17th

International Conference on Pattern Recognition, Cambridge, UK,

pp. 418-421, 23-16 Aug. 2004.

[6] T. Kohonen, Self-Organizing Maps, Springer, Springer Verlag,

Berlin-Heidelberg, 1995.

[7] C.H. Li, B.C. Kuo, C.T. Lin, C. S. Huang, “A Spatial-

Contextual Support Vector Machine for Remotely Sensed Image

Classification”, IEEE Trans. Geosc. Rem. Sensing, Vol. 50, No. 3,

pp. 784-799, March 2012.

[8] J. Li, J. M. Bioukas-Diaz, A. Plaza, “Semisupervised

Hyperspectral Image Segmentation Using Multinomial Logistic

Regression with Active Learning”, IEEE Trans. Geosc. Remote

Sens. Vol. 48, nr. 11, pp. 4085-4098, Nov. 2010.

[9] V. E. Neagoe, A. Ropot, “Concurrent Self-Organizing Maps

for Pattern Classification”, Proc. First IEEE International

Conference on Cognitive Informatics, ICCI 2002, Calgary,

Alberta, Canada, pp. 304-312, 19-20 August 2002.

[10] V. E. Neagoe and G. E. Strugaru, A Concurrent Neural

Network Model for Pattern Recognition in Multispectral Satellite

Imagery, in Proc. World Automat/ Congress, 2008 (WAC 2008),

Int. Symp. on Soft Comp. in Industry (ISSCI'08), Hawaii, USA,

Sept. 28-Oct. 2, 2008.

[11] V. E. Neagoe and A. Ropot, A New Neural Approach for

Pattern Recognition in Space Imagery, in Harbour Protection

Through Data Fusion Technologies, NATO Science for Peace and

Security Series-C: Environmental Security, Springer, Netherlands,

pp. 283-289, 2009.

[12] V. E. Neagoe and A. D. Ciotec, “Virtual Sample Generation

Using Concurrent-Self-Organizing Maps and Its Application for

Facial Expression Recognition”, Proc. Internat. IEEEAM Conf.

Math. Models Eng. Science (MMES'10), Tenerife, Spain, Vol. 1,

pp. 167-181, Nov. 30-Dec. 2, 2010.

[13] V. E. Neagoe, M. Neghina, M. Datcu, “Neural Network

Techniques for Automated Land-Cover Change Detection in

Multispectral Satellite Time Series Imagery”, International

Journal of Mathematical Models and Methods In Applied

Sciences, NAUN Press, Vol. 6, Issue 1, pp. 131-139, 2012.

[14] A. Plazza et al, “Recent Advances in Techniques for

Hyperspectral Image Processing”, Remote Sensing Environ., vol.

113, pp. 110-122, Sep. 2009.

[15] M. Skurichina, S. Raudys, and R. P. W. Duin, “K Nearest

Neighbors Directed Noise Injection in Multilayer Perceptron

Training”, IEEE Trans. Neural Networks, Vol. 11, pp. 504-511,

March 2000.

1034