4
Canonical cortical module as a spatial-frequency filter Yu. D. Kropotov and I. Z. Kremen Institute of the Human Brain, Russian Academy of Sciences, St. Petersburg ~Submitted March 17, 1999! Opticheskił Zhurnal 66, 81–84 ~September 1999! The authors earlier suggested a realistic neural network ~called a canonical cortical module! that simulates the structure and functions of the visual system in the brain. Computer experiments showed a wide range of possibilities of using this network for automatic image analysis. This paper concentrates on the abilities of the network to filter spatial frequencies, to discriminate textures, and to suppress high-frequency spatial noise. © 1999 The Optical Society of America. @S1070-9762~99!01109-4# I. INTRODUCTION In Ref. 1, the authors suggested a mathematical model of a canonical cortical module that imitates the structure and function of an element of the visual system of the brain, from the retina to a module of the striate cortex, inclusive. This model simulates the processes of input image filtering that take place in mammalian visual systems. During recent de- cades, enough experimental data have been gathered over the world to prove that the mammalian brain carries out piece- wise spatial frequency analysis ~Fourier analysis or Gabor decomposition! of the input visual signal. 2,3 In this paper, we concentrate on the abilities of our network to carry out this specific function. II. NETWORK STRUCTURE The network structure is shown schematically in Fig. 1. The network consists of several layers or levels, represented by parallel planes ~Fig. 1a!. Layer R simulates the retina, layers LGB on and LGB off represent two parts of the lateral geniculate body ~LGB!: the layer of on neurons and the layer of off neurons. The striate cortex is simulated by two modules. Each module contains four layers of inhibitory neurons ~In1..In4!, four layers of simple cortical neurons ~S1..S4!, and one layer of complex cortical neurons ~C!. These two modules differ in the spatial organization of the inhibitory neurons in the In- planes ~Fig. 1b!. In one module, they are assembled as di- vergent sectors ~the sector module!, in the other in the form of concentric rings ~the ring module!. For all the planes except the In-planes, the distribution densities of the model neurons are unity. For the inhibitory neurons in planes In i ( i 51..4), it is described by for the sectors, n In~ i ! on 51 2sin~ v c 1p i /2! , n In~ i ! off 51 1sin~ v c 1p i /2! , ~1! for the rings, n In~ i ! on 51 2sin~ 2 pn r 1p i /2! , n In~ i ! off 51 1sin~ 2 pn r 1p i /2! . Here, v is the number of sectors, n is the number of rings in the mosaic and r and c are the polar coordinates of an inhibitory neuron in plane In i . III. OUTPUT SIGNALS OF THE MODEL CORTICAL NEURONS The formula for the input signals and for the outputs ~responses! of model cortical neurons of different levels have been derived in Ref. 1. The input signal for a simple neuron of the ring module is determined from I Si ~ r ! ~ X , Y ! 5 EE exp~ 2a ~ r ! 2 @~ X 2x ! 2 1~ Y 2 y ! 2 # / R 2 ! sin~ 2 pn ~ x 2 1 y 2 ! 1/2 1p i /2! O R ~ x , y ! dxdy , ~2! and the input signal for a simple neuron of the sector module is I Si ~ s ! ~ X , Y ! 5 EE exp~ 2a ~ s ! 2 @~ X 2x ! 2 1~ Y 2 y ! 2 # / R 2 ! sin~ v c ~ x , y ! 1p i /2! O R ~ x , y ! dxdy . ~3! Here O R ( X , Y ) 5s ( X , Y ) 2^ s & is the signal at the output of the retina, s ( X , Y ) is the input signal arriving at the retina, and ^s& is the input signal averaged over the field of view of the network, c ~ x , y ! 5 H arccos~ x / ~ x 2 1 y 2 ! 1/2 ! , p 1arccos~ 2x / ~ x 2 1 y 2 ! 1/2 ! , y >0; y ,0. The integration is performed over the circular receptive field of a simple neuron; the field width is defined by parameters a ( r ) and a ( s ) . The weighting factors by which the response of the retina in the integrands is multiplied are similar to Gabor functions. The output signal for a simple neuron is defined as a step function with zero threshold: O Si ~ s ! ~ X , Y ! 5I Si ~ s ! ~ X , Y ! H @ I Si ~ s ! ~ X , Y !# , O Si ~ r ! ~ X , Y ! 5I Si ~ r ! ~ X , Y ! H @ I Si ~ r ! ~ X , Y !# , ~4! 832 832 J. Opt. Technol. 66 (9), September 1999 1070-9762/99/090832-04$15.00 © 1999 The Optical Society of America

Canonical cortical module as a spatial-frequency filter

  • Upload
    i-z

  • View
    225

  • Download
    1

Embed Size (px)

Citation preview

Canonical cortical module as a spatial-frequency filterYu. D. Kropotov and I. Z. Kremen

Institute of the Human Brain, Russian Academy of Sciences, St. Petersburg~Submitted March 17, 1999!Opticheski� Zhurnal66, 81–84~September 1999!

The authors earlier suggested a realistic neural network~called a canonical cortical module! thatsimulates the structure and functions of the visual system in the brain. Computer experimentsshowed a wide range of possibilities of using this network for automatic image analysis. This paperconcentrates on the abilities of the network to filter spatial frequencies, to discriminatetextures, and to suppress high-frequency spatial noise. ©1999 The Optical Society of America.@S1070-9762~99!01109-4#

lanm

hithdercer

hi

1nt,aler

c

Indi

ioor

f

tse

le

ule

f,of

eldersseto

tep

I. INTRODUCTION

In Ref. 1, the authors suggested a mathematical modea canonical cortical module that imitates the structurefunction of an element of the visual system of the brain, frothe retina to a module of the striate cortex, inclusive. Tmodel simulates the processes of input image filteringtake place in mammalian visual systems. During recentcades, enough experimental data have been gathered ovworld to prove that the mammalian brain carries out piewise spatial frequency analysis~Fourier analysis or Gabodecomposition! of the input visual signal.2,3 In this paper, weconcentrate on the abilities of our network to carry out tspecific function.

II. NETWORK STRUCTURE

The network structure is shown schematically in Fig.The network consists of several layers or levels, represeby parallel planes~Fig. 1a!. Layer R simulates the retinalayers LGBon and LGBoff represent two parts of the latergeniculate body~LGB!: the layer of on neurons and the layof off neurons.

The striate cortex is simulated by two modules. Eamodule contains four layers of inhibitory neurons~In1..In4!,four layers of simple cortical neurons~S1..S4!, and one layerof complex cortical neurons~C!. These two modules differ inthe spatial organization of the inhibitory neurons in theplanes~Fig. 1b!. In one module, they are assembled asvergent sectors~the sector module!, in the other in the formof concentric rings~the ring module!.

For all the planes except the In-planes, the distributdensities of the model neurons are unity. For the inhibitneurons in planes Ini( i 51..4), it is described by

for the sectors,

nIn~ i !on512sin~vc1p i /2!,

nIn~ i !off511sin~vc1p i /2!, ~1!

for the rings,

nIn~ i !on512sin~2pnr 1p i /2!,

nIn~ i !off511sin~2pnr 1p i /2!.

832 J. Opt. Technol. 66 (9), September 1999 1070-9762/99/09

ofd

sate-the-

s

.ed

h

--

ny

Here, v is the number of sectors,n is the number ofrings in the mosaic andr andc are the polar coordinates oan inhibitory neuron in plane Ini .

III. OUTPUT SIGNALS OF THE MODEL CORTICALNEURONS

The formula for the input signals and for the outpu~responses! of model cortical neurons of different levels havbeen derived in Ref. 1.

The input signal for a simple neuron of the ring moduis determined from

I Si~r !~X,Y!5E E exp~2a~r !2@~X2x!2

1~Y2y!2#/R2!sin~2pn~x21y2!1/2

1p i /2!OR~x,y!dxdy, ~2!

and the input signal for a simple neuron of the sector modis

I Si~s!~X,Y!5E E exp~2a~s!2@~X2x!2

1~Y2y!2#/R2!sin~vc~x,y!

1p i /2!OR~x,y!dxdy. ~3!

Here OR(X,Y)5s(X,Y)2^s& is the signal at the output othe retina,s(X,Y) is the input signal arriving at the retinaand^s& is the input signal averaged over the field of viewthe network,

c~x,y!5 H arccos~x/~x21y2!1/2!,p1arccos~2x/~x21y2!1/2!,

y>0;y,0.

The integration is performed over the circular receptive fiof a simple neuron; the field width is defined by parameta (r ) anda (s). The weighting factors by which the responof the retina in the integrands is multiplied are similarGabor functions.

The output signal for a simple neuron is defined as a sfunction with zero threshold:

OSi~s!~X,Y!5I Si

~s!~X,Y!H@ I Si~s!~X,Y!#,

OSi~r !~X,Y!5I Si

~r !~X,Y!H@ I Si~r !~X,Y!#, ~4!

8320832-04$15.00 © 1999 The Optical Society of America

pith

he

om

lex

u-theeu-

ing.on-

ar-ctortheod-flarhebee-

ec-ec-

atialnsea

heer

ori-lelums intinge of

to

s.ti-

rons.

,

whereH(z)51 if z.0, otherwiseH(z)50.The output signal for a complex neuron with~X,Y! coor-

dinates is equal to the sum of the output signals of the simneurons of all four layers of the corresponding module wthe same coordinates:

OC~s!~X,Y!5(

1

4

I Si~s!~X,Y!H@ I Si

~s!~X,Y!#,

OC~r !~X,Y!5(

1

4

I Si~r !~X,Y!H@ I Si

~r !~X,Y!#. ~5!

IV. NETWORK RESPONSES FOR SINUSOIDAL GRATINGS

To investigate the spatial-frequency sensitivity of tnetwork, sinusoidal gratings with different orientationsF,spatial frequenciesV, and spatial phasesQ were used asinput signals:

s~X,Y!511sin@~2V sin F!X1~V cosF!Y1Q#. ~6!

The dependences of the input signals of simple and cplex neurons~for sector and ring modules! on the parametersof the input grating were derived in Ref. 1@polar coordinates(R,w) are used#:

I Si~s!~R,w!5~pR2/2a~s!2!@cos~vw1p i /22Q

2VR sin~w2F!!exp~2~v21V2R2

22vVR cos~w2F!!/4a~s!2!2cos~vw

1p i /21Q1VR sin~w2F!!exp~2~v2

1V2R212vVR cos~w2F!!/4a~s!2!#,

FIG. 1. Structural diagram of canonical cortical module:~a! general schemeof network levels;~b! sector and ring mosaics of cortical inhibitory neuronR—retina, LGB—lateral geniculate body of thalamus, In—inhibitory corcal neurons, S—simple cortical neurons, C—complex cortical neurons.

833 J. Opt. Technol. 66 (9), September 1999

le

-

I Si~r !~R,w!5~pR2/2a~r !2!@cos~2pnR1p i /22Q

1VR sin~w2F!!exp~2R2~4p2n21V2

24pnV sin~w2F!!/4a~r !2!2cos~2pnR

1p i /21Q2VR sin~w2F!!

3exp~2R2~4p2n21V214pnV

3sin~w2F!!/4a~r !2!#. ~7!

To compute the responses of the simple and compneurons, Eqs.~7! were substituted into Eqs.~4! and ~5!.

High orientational selectivity of the model cortical nerons was demonstrated in Ref. 1. It was also shown thatresponses of complex neurons, unlike those of simple nrons, are invariant to the spatial phase of the input gratTherefore, the responses of the complex neurons will be csidered below to be responses of the modules.

Figure 2 shows examples of the spatial frequency chacteristics for the responses of complex neurons of seand ring modules. The results of the study proved thatoptimal input spatial frequency for neurons of a sector mule ~simple or complex! is proportional to the number osectors in the module and inversely proportional to the poradius of the neuron. As for neurons of a ring module toptimal spatial frequency of the input signal is found toproportional to the number of rings in the module and indpendent of the position of the neuron in the module.

Therefore, a sector module with a fixed number of stors is able to decompose the input image over a wide sptrum where each spatial frequency is encoded by the spradius of the specific neuron that gives the maximal respoto that frequency. As for a ring module, it can be calledspatial-frequency filter only in a very limited sense of tword, if we consider that a module with a different numbof rings selects specific spatial frequencies.

Figure 3 shows the responses of both modules to hzontal gratings. If the input signal is a sum of two paralsinusoidal gratings, it is possible to select areas of maximresponse corresponding to each of two spatial frequenciethe response of a sector module. The response to a grawith a higher frequency cannot be selected in the responsa ring module.

In what follows, the term ‘‘responses of the networktextures’’ will mean the responses of a sector module.

FIG. 2. Spatial frequency characteristics of responses of complex neu~a! for a complex neuron of a sector module~V—frequency of the inputgrating,R—polar radius of the neuron,v—number of sectors in the modulea—width parameter of receptive field!; ~b! for a complex neuron of a ringmodule~V—frequency of the input grating,n—number of rings in the mod-ule, a—width parameter of receptive field!.

833Yu. D. Kropotov and I. Z. Kremen

isis

atee-a

ss

ler’’ul

in,q

antauhes

eri-g

the. Itd by

con-of

ialres,k toith axed,isyver-

tial

hatchon

theose

-gnalble

es;.

V. TEXTURE DISCRIMINATION

The function of the discrimination of surface texturesclosely associated with spatial frequency filtering. In thcase, the surface texture is considered as a set of repeelements filling a surface in a more or less periodic mannwhich can be interpreted as a set of certain spatial frequcies and orientations.3 Our canonical cortical module performs a piecewise Gabor decomposition of the input signThe position of the regions of maximal response makepossible to define the spatial frequencies and orientationthe harmonics selected by the filter.

Figure 4 shows examples of the responses of compneurons of a sector module to ‘‘checkered’’ and ‘‘cellulatextures with different frequencies. In practice, the modperceives these textures as sums of several gratings.

VI. SPATIAL NOISE FILTERING

Gaussian filtering is a common method of suppresshigh-frequency spatial noise.4,5 Our model cortical neuronshaving a Gaussian factor in their weighting functions, E~3!, perform a similar function. However, unlike Gaussifilters, the canonical cortical module maps different orientions and spatial frequencies to different areas of the modThis allows our network to distinguish the signal from tnoise much more effectively than is done by ordinary Gauian filtering.

To study the noise sensitivity of the network, checkerand cellular textures with white noise with a normal distbution added to them were used as input images. Imawith different signal-to-noise ratio~SNR! were used.

FIG. 3. Responses of the network to spatial gratings.~a! input images;~b!responses of complex neurons~s corresponds to a sector module, andr to aring module!; ~c! response intensity scale. Model parameters:v519, n59,a (s)55.0, a (r )53.0.

834 J. Opt. Technol. 66 (9), September 1999

ingr,n-

l.itof

x

e

g

.

-le.

s-

d

es

The signal was defined as the difference betweenintensities of the light and dark section of the input texturewas set equal to 64 for each image. The noise was defineits rms value.

Four sets of input textures were generated. Each settained an image of a texture without noise and six setsnoisy images~for SNRs of 1.5, 2, 3, 4, 5 and 10!. Each setconsisted of ten noisy implementations.

The criterion for the network to be sensitive to spatnoise was chosen as follows: For each of four input textuthe maximal response of the sector module of the networthe noise-free image was calculated, and then neurons wresponse no less than 0.67 of the maximum value were fidetermining the maximal response area. Then, for all noimplementations of the same texture, we calculated the aage response for the maximal response area (Omr) and forthe remaining neurons of the module (Ob). For each set withfixed SNR, the mean valuesOmr and Ob over ten noisyimplementations were calculated. The value ofOmr /Ob

served as the criterion of sensitivity of the network to spanoise.

The results are shown in Table I. It can be seen tOmr /Ob increases with SNR in the input image for eatexture, so thatOmr /Ob may be used as an adequate criterifor noise sensitivity.

Figure 4 illustrates the responses of the neurons ofsector module to noise-free textures and Figure 5 to thwith noisy textures.

Any definition of the sensitivity threshold of an imageprocessing system has some uncertainty when defining sidetection and when determining the minimal detecta

FIG. 4. Discrimination of textures by the network~sector module!. ~a!checkerboard input textures,~b! cellular input textures~the numbers give thespatial frequencies!; ~c! responses of the network to checkerboard textur~d! responses of the network to cellular textures;~e! response intensity scaleModel parameters:v519, a (s)55.0.

TABLE I. Sensitivity of the network to spatial noise.

Imageset

Texture~with spatialfrequency, cycles per

module!

Omr /Ob

withoutnoise SNR51.5 SNR52 SNR53 SNR54 SNR55 SNR510

1 Checkerboard~12.5! 9.0 2.560.4 3.560.3 4.260.4 5.260.4 5.460.3 7.560.22 Checkerboard~8.3! 6.5 2.060.4 2.760.4 3.160.3 3.660.4 3.860.4 5.960.13 Cellular ~12! 4.5 2.360.4 3.160.4 3.360.5 3.760.4 3.860.4 4.160.34 Cellular ~8! 3.7 1.960.3 2.260.4 2.560.4 2.860.4 3.060.3 3.460.3

834Yu. D. Kropotov and I. Z. Kremen

rethfi-

re-ial-en-ly

nt

aliant

e-

r,

s

ro-

rs

SNR.6 For the sensitivity threshold of a neural network, theis the same uncertainty. Our experiments have shownthe network discriminates a texture with sufficient condence ifOmr /Ob exceeds 3.5.

FIG. 5. High-frequency spatial noise filtering by the network~sector mod-ule!. ~a! noisy checkerboard input textures~spatial frequency 12.5! ~b! noisycellular input textures~spatial frequency 8!; ~c! responses of the network tonoisy checkerboard textures;~d! responses of the network to noisy cellulatextures;~e! response intensity scale. Model parameters are the same aFig. 4.

835 J. Opt. Technol. 66 (9), September 1999

at

VII. CONCLUSION

The experimental data of the current studies lets usgard our canonical cortical module as a distinctive spatfrequency filter that can map different frequencies and oritations to different module areas and can reliabdiscriminate different textures in conditions of significanoise of the input signal.

1J. D. Kropotov, I. Z. Kremen’, and V. A. Ponomarev. ‘‘A realistic neurnetwork simulating a visual system and its usage in tasks of invarimage description,’’ Opt. Zh.65, No. 9, 40~1998! @J. Opt. Technol65,716 ~1998!#.

2D. H. Hubel,Eye, Brain and Vision~W. H. Freeman, New York, 1988;Mir. Moscow, 1990!.

3D. Marr, Vision: A Computational Investigation into the Human Reprsentation and Processing of Visual Information~W. H. Freeman, SanFrancisco, 1982; Radio i svyaz’, Moscow 1987!.

4N. N. Krasil’nikov, Theory of Image Transmission and Perception~Radioi svyaz’, Moscow, 1986!.

5B. K. P. Horn,Robot Vision~M.I.T. Press, Cambridge Mass, 1986; MiMoscow, 1989!.

6M. M. Miroshnikov, Theoretical Principles for Optoelectronic Device~Mashinostroenie, Leningrad, 1983!.

This article was published in English in the original Russian journal. Repduced here with stylistic changes by the Translation Editor.

for

835Yu. D. Kropotov and I. Z. Kremen