Upload
alessandro-giuliani
View
213
Download
0
Embed Size (px)
Citation preview
5 October 1998
PHYSICS LETTERS A
Physics Letters A 247 (1998) 47-52
On the constructive role of noise in spatial systems
Alessandro Giuliani a, Alfred0 Colosimo b, Romualdo Benigni a, Joseph l? Zbilut cpl -a Laboratory of Comparative Toxicology and Ecororicology, lstituto Superiore di Sanitd, Hale Regina Ekna 299, 00161 Rome, Italy
b reprint of Biochemistry, ~nive~~~ of Rome “La Sapierua”, Rome, Italy c repaint of ~oIecKlar Bio~hysies and ~~sio~g~ Rush ~nive~~~. 1653 K Congress, Chicago, It 60612, USA
Received 28 May 1998; revised manuscript received 13 Juty 1998; accepted for publication 13 July 1998 Communicated by CR. Doering
Abstract
The detection of weak signals has made a dramatic advance with the recognition of the constructive role that noise can play in the study of weak periodic signals. In this paper we show that the enhancement of weak signals has a wider reach, and is not contined to dynamical systems. In particular, the addition of artificial noise to a principal components analysis of a geographical problem permitted the discrimination between very weak but informative components, and the noise inherent in the data. The ~h~isrn involved is discussed. @ 1998 EIsevier Science B.V.
PACS: 43.50; 727O+m; 74.#.+k; 80 Keyrvor&: Signah; Noise: Principal components; Singular values; Stochastic resonance; Pattern recognition
1. Introduction
A crucial factor for pattern recognition is the abil- ity to discriminate between noise and signal. This dis- crimination is normally accomplished by devising a filter (either physical or ma~ematic~) , which elim- inates any signal below a given threshold [ l-31. Al- though a strong signal is certainly made clearer by a filter, semantically important information may be car- ried also by weak signals, which may be significantly degraded by the filtering [ 3,4-63. The discovery of the mechanism of stochastic resonance (SR) [7,8] has radically changed the way in which the problem of discriminating weak signals from background noise has been traditionally approached.
SR consists of a nonlinear cooperative effect, which arises when a weak periodic signal enters in resonance
’ Conesponding author; e-mail: j~ilut~ms~.~u.
with random fluctuations, thus producing the amplifi- cation of the periodic component; i.e., a maximum for the signal-to-noise ratio. More recently, claims have been made that the addition of a noise component can result in multiple maxima [ 21.
Although this phenomenon has been traditionally described with respect to dyn~ical systems, in prin- ciple, there is no reason why a spatially ordered sys- tem may not also exhibit signal enhancement due to added noise. The mathematical equivalence between principal components analysis (RCA) of a multivari- ate data set and the singular spectrum analysis of a time series [ 91 stimulated us to consider this possibil- ity. This attempt relies on the notion that a temporal structure is a correlation structure with no fundamen- tal difference from any non-time-dependent correla- tion structure 1 l&11].
0375~96X0/98/$ - see front matter @ 1998 Elsevier Science B.V. AR rights reserved. PII SO375-9601(98)00570-2
48 A. Git&zni et ~~./P~ysi&s Letters A 247 11998) 47-52
Table I Distances of European cities (km) from the main cities of Latium
Rome Latina Frosinone Viterbo Rieti
Amsterdam 430 447 449 415 409
Athens 347 321 331 346 364
Barcelona 283 305 293 292 271
Beograd 227 222 236 220 238
Berlin 393 400 409 374 373
Bern 227 249 247 220 205
Bonn 353 370 372 339 330
Brusehes 388 406 406 371 365
Bucharest 364 355 368 359 378
Budapest 268 261 274 246 259
CalaiS 418 448 446 418 405
Copenhagen 510 522 527 492 491
Dublin 622 645 641 615 600
Edinburgh 637 655 655 625 615
Frankfurt 318 333 336 302 295
Hamburg 435 448 453 417 414
Helsinki 727 729 739 706 713
tstanbut 452 430 443 443 464
Lisbon 615 637 622 624 604
London 474 494 493 464 456
Luxembourg 325 346 346 315 307
Madrid 449 470 458 460 440
Marseilfe 200 223 213 202 183
Moscow 782 773 785 7.59 774
Munich 230 245 250 216 213
Oslo 664 675 682 646 645
Paris 365 386 383 357 343
Prague 305 313 320 286 290
Sofia 294 273 286 280 301
Stockholm 653 658 668 632 636
Warsaw 435 433 444 413 421
Vienna 255 254 265 233 240
Zurich 227 246 246 214 205
2. Methods
The test material for our analysis is shown in Ta- ble 1, which reports the distances of 33 European cities from the 5 main towns of the Latium region (a province in central Italy containing Rome). The weak signal to be detected was the spatial pattern of the Eu- ropean cities, based on their distances from the Latium towns. In this case, the info~ation linked to the av- erage ch~acteristics of the data (strong signal) was highly degenerate, since it represented only the dis- tance of European cities from Latium, and did not per- mit the reconstruction of the relative spatial orienta- tion of the European cities (for example, Warsaw and
Madrid have almost the same distance from Latium) . The semantically important information, i.e., spatial orientation, was linked to the minor components of the variability of distance data (weak signal). This spe- cific problem was selected because of its difficulty: (a) the differences between the distances of the 5 Latium towns from each of the European cities are extremely small with respect to the correlated portion of the dis- tances (weak signal versus strong signal); i.e., the area ratio of Latium to Europe is 0.00163 [ 121; (b) since Latium is in an eccentric position with respect to Europe, and since there is no regular pattern in the distribution of European cities, the weak signal was very irregular and asymmetric; (c) the sample of data was rather small (33 statistical units) ; and (d) the dis- tances were measured manually on a I :3 000 000 scale European map in order to generate a certain amount of internal noise ( 1 mm = 3 km). On the other hand, the success or failure of the identification of the weak signal is easily recognizable by simple comparison of the results with the map of Europe. The problem was modeled with PCA, which can be considered as a fil- ter for correlated information [9,13- 15 ] .
We compute the principal components with an M x M covariance matrix Cx which is diagonalized and the eigenvalues are ranked in decreasing order,
Ax = ET,CxEx , (1)
where Ax = diag( At, AZ, . . . . AM) is the diagonal ma- trix with At 2 A2 2 ... 2 A,,, > 0 and Ex is the M x M matrix having the corresponding eigenvectors Ek, k = l,..., M, as its columns. The eigenvalue & gives the variance of the time series in the direction given by the eigenvector Ek; while the square roots of the eigenvalues are called the singular values. Pro- jection of the eigenvectors yields the corresponding principal components [ 16,171.
3. Results
Since the PCs are extracted in order of importance, the correlated portion of info~ation is amplified and incorporated into the first PC, whereas the minor PCs are more and more representative of noise [ 3,16,17 ] . Table 2 reports the factor loadings and the proportion of explained variance relative to the PCs generated by PCA of the 5 variables in Table 1. As expected,
A. Giuliani et al./Physics L.&ten A 247 (1998) 47-52 49
Table 2 Factor loadings and proportions of explained variance
Variables Components
PC1 PC2 PC3 PC4 PCS
Rome 0.9997 0.0137 -0.0184 -0.0120 0.0001 Frosinone 0.9973 -0.0715 0.0132 0.0011 0.0029 Latina 0.9987 -0.0420 -0.0272 0.0058 -0.0024 Rieti 0.9909 0.0162 0.0393 -0.0009 -0.0023 Viterbo 0.9964 0.0837 -0.0070 0.0060 0.0017
Explained variance 0.9965 0.0029 0.000569 0.000043 0.000005
the first PC (PC1 ) explained almost all of the vari- ance (more than 99%,), the 5 distance variables being highly interco~elat~ due to the small dimension of Latium with respect to Europe. The subsequent PCs represented a minimum amount of the initial infor- mation, and, at first glance, could be mistakenly con- sidered as noise. However, the plot of PC2 and PC3 scores (Fig. 1) clearly shows that these two PCs re- constructed the angular distance map of the European cities, as they are observed from the Latium point of view. In the figure, the European cities form a semi- circle around Latium, going from the Balkans to the Iberian peninsula. Cities having the same bearing with respect to Latium are close to each other in the fig- ure, even if they are distant from each other in real space (the size component being PCl). The analyt- ical demonstration of the above finding was that the angle, 0, which is formed by the European cities with respect to the north-south axis passing through Rome, as computed with the PC GLOBE 5.0 software [ 181, was accurately predicted based on the PC2 and PC3 values (PC1 was not significantly correlated with the angle),
B = -4.8306 + 41.~7(~2~ - 27.086(PC3),
r = 0.97, p < o.ooo1. (2)
Thus, PCA broke down the information into “size” (PC1 > and “shape” (PC2, PC3) components [ 6 1, and separated effects relative to different measurement scales. PC4 and PC5 were not correlated with any meaningful characteristic of the European map, and were considered to be noise derived from the measure- ment of distances on a European map using a ruler.
3
2
1
B cl
-1
1;
I-
,_
I-
:.0
I I I a Lisbon l Madrid
Barcelona a
Marseille a
Athens (
Calals aa Dublin
Berna 2y⁢9h . “mrbu’
0 Sofia
‘** Hamburg a
Munich l ,a l ’ Moscow .
Vienna a l Budapest
I I I
-0.75 0.50 1.75 3.00 PC2
Fig. 1. Projection of the European towns on the second and third PCs obtained from Table 1 data. It is emphasised that this projec- tion represents the angular orientation centered on Latium, thus distorting the usual visual presentation of the European map. Fur- thermore, the distance component (PC1 ) is required for complete characterimtion (see text).
In order to test the ~ssibility of inducing an SR- like phenomenon able to magnify the relative im- portance of the weak signal, increasing amounts of Gaussian noise were added to the Table 1 variables. The weak signal amplitude was estimated by mul- tiplying the mean variance of the distance variables times the proportion of variance explained by PC2 plus PC3. The signal-to-noise ratio (SNR) ranged from 52% to 939% corresponding to a numerical value of added noise of respectively 1 and 18 mm. New PCAs of the noise-corrupted distances were performed, for
50 A. Giuiiani et d/Physics Letters A 247 fI998) 47-52
0.010
0.009
0.008
0.007
0.006
0.005
0.004
0.003
0.002
0.001 0.0
0
Noise Levels
5 10 15 20 Noise Levels
0.0020 1 1 1 I 1 IC I
x 0.0015 .s
B w 8 0.0010
5 .C
s 0.0005
0.0 0 5 10 15 20
Noise Levels Fig. 2. Table 1 variables were added with noise at different SNR values, and subjected to PCA. The figure reports the variation of the proportion of explained variance relative to the first 3 PCs in the presence of different amounts of noise. The noise is the standard deviation of zero-mean Gaussian deviates added to the original variables. (a) Behavior of the strong signal PC; (b) relative to the weak signa PCs; (c) relative to the noise components. The abscissa of the graphs is expressed in standard deviation (SD) units (mm) ( I mm = 3 km).
PC1
-0.2 I 0 5 10 15 20
Noise Levels Fig. 3. The figure displays the degree of recognition (Pearson’s correlation coefficient) betwe@n the noise-cormpted PCs and their original counterparts, for different amounts of noise. The abscissa in expressed in SD units (mm) (I mm = 3 km).
different noise values, Fig. 2 displays the variance explained by the new PCs. The proportion of vari- ance explained by the strong signal (PC1 ) decreased when the added noise increased (d~reasing SNR val- ues) (Fig. 2a), whereas the variance explained by the weak signal components (PC2 and PC3) increased (Fig. 2b). From a purely computational point of view this result is due to the fact that white noise corre- sponds to a flat eigenvalues spectrum (all the compo- nents explain the same amount of variability) [ 171’. This implies that adding white noise to our data set tends to equalize the eigenvalue normalized spectrum (proportion of explained v~iabi~ity) , diminishing the proportion explained by strong signal components, and enhancing the proportion of variability relative to weak signal and noise components. The variance exptained by the noise components (PC4 and PC5) increased as well (Fig. 2~). Fig. 3 demonstrates the results of a “recognition test” between the original PCs and the PCs relative to the noise-coopted data. The correla- tion coefficients between homologous PCs ( 1 to 3) indicated that both the strong and the weak signals re- mained recognizable (statistically significant corrcla- tion coefficients for the entire simutation range). On
*We note that this approach is similar to a class of techniques termed “pmwhi~ning” and advocated by Tukey in the 1950s. See, e.g.. Ref. [241.
A. Gi~l~~i et at, /Physics Letters A 247 (1998) 47-52 51
the contrary, the pure noise components had no re- lationship with their original counterparts. Thus, the overall procedure shown in Figs. 2 and 3 (corrup- tion of the original data with noise, followed by the computation of the correlations between homologous PCs) permitted a clear discrimination between strong signal, weak signal and noise.
4. Discussion and conclusions
Our results support the view that noise can have a constructive role in the recognition of weak signals in non-dynamical frames; however, the method may not be similar to the mechanism as found in SR of dy- namical systems. The basic paradigm for SR is given by a double well potential,
V(x) = jbx4 - $a,*, (3)
the minima being located at fx,, where xnt =
(a/b) ri2. A potential barrier, whose height is given by AV = az/4b, separates the minima, while its top is located at x& = 0. With a periodic driving force, the double well potential, V( x, t) = V(x) - AOX cos( LB), tilts back and forth, raising and lowering the potential barriers to the right and left antisymmetrically. If the period of the driving approximately equals twice the noise induced escape time, a synchronized hopping to the globally stable state will occur [ 81. In our case, however, we do not have a driving force, and the noise does not derive from a time dependent process. Instead, the noise results from errors of measurement scale.
In our example, the noise may be considered a variant of the “quantization” problem encountered in analog-to-digital (A/D) conversions, and floating point calculations: it is well known that such pro- cesses generate error which is often correlated with the signal (calculation) being transformed; i.e., a mapping from a continuous range to, e.g., an integer range. Generally speaking such “rounding” error is not significant if the signal is large amplitude and wideband. If the signal, however, is small amplitude and/or narrow bandwid~ (such as in the case of our small PCs), signiticant correlations can occur, which distort the input signal. To overcome this prob- lem, audio engineers have employed the technique of “dithering”, whereby noise is added to sampled
signals to make the quantization error independent of the input signal [ 19,201. Thus if d(n) is a white noise source added to x(n), then
e(n) = y(n) - x(n) (4)
is also white, regardless of the spectrum of X, where e(n) is the error, y(n) is the output, and x(n) is the input. That this occurs in our example can be seen in the gradual (but still statistic~ly signific~t) decorre- lation of PCs 2 and 3; lack of change for PC1 (large amplitude); whereas PCs 4 and 5 are unchanged, since they already are white (Fig. 3).
A range of applications is possible for the present findings. An immediate application is to add increas- ing amounts of noise to a data set, in order to check the congruency of the hypotheses made on the weak informative components. This kind of implementation can be imagined in many instances, ranging from the study of the relationships between chemical structure and biological activity 1211 to the analysis of bio- logical signals [ 221. This is p~icul~ly impo~nt in light of the increasing role of PCA and regression on KS as tools to investigate many different phe- nomena [ 231. The trends of the correlations between noise-corrupted components and original components, and consequently their identification as proper signals, may be investigated easily by applying classical in- ferential methods, such as a simple one-way analysis of variance to check for the statistical significance of the decay of the correIation between original and de- graded homologous components at increasing noise levels, the quantity of added noise being the source of v~ahility.
Acknowledgement
Dr. Barbara Camerini and Dr. Marta Menghini are gratefully acknowledged for the continued interest in our work, Ms. Eve Silvester is acknowledged for the patient editing of the text. JPZ acknowledges useful discussions with Zeev Schuss.
References
[ 11 C.E. Shannon, N. Weaver, The Mathematical Theory of Communication (University of Illinois Press, Urbana, IL, 1949).
52 A. Giuliani et al./Physics Letters A 247 (1998) 47-52
[2] K. Wiesenfeld, F. Moss, Nature 373 (1995) 33. [3] E. Oja, Neural Networks 5 ( 1992) 927. [4] P Thompson, in: Proc. 13th Asilomar Conf. Circuits,
Systems, and Computers (Pacific Grove, CA, 1979) pp. 529- 533.
(51 H. Frauenfelder, S.G. Sligar? Science 254 (1991) 1.598. 161 J.N. Darroch, J.E. Mosimann, Biometrika 72 (1985) 241. [7] R. Benzi, A. Sutera, A. Vulpi~i, J. Phys. A 14 (1981) L453. [S] G. Gammaitoni, P Honggi. P Jung, Rev. Mod. Phys. 70
(1998) 223. [9] R.W. Preisendorfer, Develop. Atmosph. Sci. 17 (Elsevier,
Amsterdam, 1988). [ lo] F. Moss, K. Wiesenfeld, Scientific American 273(2) (1995)
50. [ 111 B.J. West, Physica D 195 (1995) 12. [ 121 Anonymous, II grande atlante dell’Eutopa e de1 Mondo
( DeAgostini E&tote, Novam, 1975). [ 131 L. Lebart, A. Morineau, K.M. Warwick, Multivariate
Descriptive Statistical Analysis (Wiley, New York, 1984).
1141 R. Benigni, A. Giuliani, Am. J. Physiol. 266 (1994) R1697. [ 151 J.A. Anderson, J.W. Silverstein, S.A. Ritz, S.J. Randall,
Psychol. Rev. 5 (1977) 413. [ 161 R. Vautard, R.P. You, M. Ghil, Physica D 58 (1992) 95. [ 171 D.S. Broomhead, G.l? King, Physica D 20 (1986) 217. [IS] PC GLOBE 5.0, PC GLOBE Inc. [ 191 I. Kolhir, Periodica Polytechnica Ser. Electrical Engineering
28 (1984) 173. [20] B. Widrow, I. Kolhir, M.-C. Liu, IEEE Trans. instrum. &
Meas. 45 (1996). 353. [21] R. Benigni, A. Giuliani, Mutat. Res. 306 (1994) 181. [22] T. Elbert, W.G. Ray, Z.J. Kowalik, J.E. Skinner, K.E. Graph,
N. Birbaumer. Physiol. Rev. 74 (1994) 1. [23] E.S. Soot?. J. Am. Stat. Assoc. 89 (1994) 1243. 1241 D.R. Bales, ed., The Collected Works of John W. Tukey.
Vol. I. Time series (Wadsworth Advanced Books, Belmont, CA, 1984); M.B. Priestley, Spectral Analysis and Time Series (Academic Press, London, 198 1)