Upload
cheng-peng
View
213
Download
0
Embed Size (px)
Citation preview
ICONIP2006
Electrogastrogram extraction using independent componentanalysis with references
Cheng Peng Æ Xiang Qian Æ Datian Ye
Received: 9 January 2007 / Accepted: 2 March 2007 / Published online: 27 March 2007
� Springer-Verlag London Limited 2007
Abstract Electrogastrogram (EGG) is a noninvasive
measurement of gastric myoelectrical activity cutaneously,
which is usually covered by strong artifacts. In this paper,
the independent component analysis (ICA) with references
was applied to separate the gastric signal from noises. The
nonlinear uncorrelatedness between the desired component
and references was introduced as a constraint. The results
show that the proposed method can extract the desired
component corresponding to gastric slow waves directly,
avoiding the ordering indeterminacy in ICA. Furthermore,
the perturbations in EGG can be suppressed effectively. In
summary, it can be a useful method for EGG analysis in
research and clinical practice.
Keywords Independent component analysis �Independent component analysis with references �Electrogastrogram
1 Introduction
Electrogastrogram (EGG) usually refers to the surface
measurement of gastric myoelectrical activity by placing
electrodes on the abdominal skin. The gastric myoelectrical
activity of healthy human is mainly composed of rhythmic
slow waves and spikes. The normal frequency of slow
waves is about 3 cycles per minute (3 cpm or 0.05 Hz). As
recorded cutaneously, the EGG presents a weighted sum-
mation of the electrical activity of various regions of the
stomach. Studies have shown that the cutaneous electrodes
can only pick up the rhythm of the slow waves but not that
of the spikes [1].
Since the first measurement of EGG in 1921, the non-
invasiveness of EGG attracted a lot of interests. A great
deal of researchers focused on the relationship between
EGG and the gastric function, expecting to take EGG as a
clinical assessment of gastric motility disorders [2, 3].
However, the real gastric signal in EGG recording is usu-
ally weak and perturbed by stronger noises. The noises are
composed of electrocardiogram (ECG), respiratory artifact,
motion artifact, electrical interference of small intestine
and the electrode-skin interface noise, etc. [1, 2]. The fre-
quency of the respiratory artifact is around 0.2–0.4 Hz,
which is close to that of slow waves. The motion artifact is
usually a broad-band signal [1, 4]. Both artifacts are almost
inevitable and can not be suppressed without affecting the
real gastric signal by conventional frequency-dominant
filter.
Plenty of signal processing techniques have been ap-
plied to improve the quality of EGG. Besides of the classic
methods such as band-pass filter, phase-locking filter, and
autoregressive modeling, several modern methods emerged
in the last a few decades were introduced, including the
adaptive filtering [5], feature extraction [4], empirical
mode decomposition [6], etc. A blind source separation
method called independent component analysis (ICA) was
introduced to separate the real gastric signal from the
multichannel EGG recordings in 1999 [7]. Successively, an
adaptive ICA method was presented and applied in EGG
C. Peng � X. Qian � D. Ye (&)
Department of Biomedical Engineering,
Tsinghua University, Beijing, China
e-mail: [email protected]
C. Peng
e-mail: [email protected]
C. Peng � X. Qian � D. Ye
Research Center of Biomedical Engineering,
Graduate School at Shenzhen, Tsinghua University,
Shenzhen, China
123
Neural Comput & Applic (2007) 16:581–587
DOI 10.1007/s00521-007-0100-3
analysis [8]; the FastICA algorithm was applied to mag-
netogastrography [9]; and a hybrid method combining ICA
and adaptive signal enhancement was used to tracking the
gastric slow waves [10].
The model of ICA assumes that the multichannel ob-
served signals are linear mixtures of several mutual inde-
pendent sources with unknown mixing coefficients. The
assumption of independence is enough to extract the
sources and estimate the demixing matrix when only ob-
served signals are available [11]. Several approaches
including maximization of nongaussianity [12–14], maxi-
mum likelihood estimation [15], minimization of mutual
information [16, 17], etc. were proved to be effective to
solve the problem. More recently, an alternative approach,
known as constrained ICA, incorporated some priori
knowledge as constraints into the model in order to elim-
inate the indeterminacy of ICA and extract the desired
components [18–20]. The priori knowledge can be tem-
poral reference signals [21, 22], as well as spatial con-
straints of the demixing matrix [23].
In this paper, an ICA model with temporal constrained
by reference signals was introduced and applied to extract
real gastric signal from EGG recording. The respiratory
and cardiac rhythms were recorded roughly by piezoelec-
tric sensor as two reference signals. The independent
component was extracted by negentropy-based method and
further constrained by nonlinear uncorrelatedness with the
reference signals. According to this method, the component
corresponding to gastric slow waves signal can be acquired
in a single one-unit ICA algorithm. Furthermore, the per-
turbation of respiratory and cardiac rhythms can be sup-
pressed more effectively than conventional ICA
approaches.
2 Method
2.1 ICA and ICA with references
The classical ICA model assumes that N channels of ob-
served signals x(t) = [ x1(t), x2(t),..., xN(t) ]T are linear
mixtures of M (usually N ‡ M) channels of mutually
independent source signals s(t) = [ s1(t), s2(t),..., sM(t) ]T:
xðtÞ ¼ AsðtÞ ð1Þ
where A is an unknown mixing matrix of N · M. In many
practical applications, x(t) are whitened during
preprocessing, so the whitened form of observations,
denoted by z(t), is considered here. The object of ICA is
to estimate the demixing matrix and the unknown sources,
denoted by W and y(t), respectively, satisfying:
yðtÞ ¼WzðtÞ ð2Þ
A flexible approximation of negentropy was introduced as
object function for one-unit ICA by Hyvarinen [13, 14]:
JðwÞ ¼ q½EfGðyÞg � EfGðmÞg�2
¼ q½EfGðwT zÞg � EfGðmÞg�2ð3Þ
where w and y specify a certain column of W and the cor-
responding component of y(t), respectively, and the time
index t is omitted for simplicity. q is a positive constant, m is a
Gaussian variable with zero mean and unit variance, and
Gð�Þ is a nonquadratic function. The object function in (3) is
further constrained by EfwT zzT wg ¼ wT wEfzT zg¼ jwj ¼ 1:
In some applications, one or more reference signals,
denoted by r(t) = [r1(t), r2(t),..., rL(t)]T, are available be-
sides the observed signals. The object function in (3) can
be further constrained by a vector of energy functions
eðy; rÞ ¼ ½eðy; r1Þ; eðy; r2Þ; . . . ; eðy; rLÞ�T ; which measures
the closeness between each reference signal and a certain
component. Hence, the ICA model with reference signals
turns out to be a problem of constrained optimization:
maximize JðwÞ;subject to eðy; rÞ6n and wk k ¼ 1
ð4Þ
where n is a vector that specifies the thresholds of each en-
ergy function. By selecting eðy; rÞ properly, a certain com-
ponent, which is ‘close to‘ or ‘far from’ the reference signals,
can be extracted. Correlation and mean square error (MSE)
were proved to be useful energy functions. In next section,
we propose an energy function based on nonlinear uncorre-
latedness, and derive a gradient learning rule to solve the
optimization problem in (4) based on augmented Lagrangian
method.
2.2 Nonlinear uncorrelatedness as a constraint
Suppose some priori information of several components,
which are considered to be perturbations, can be acquired as
reference signals. In order to extract the very component of
desired signal, it is reasonable to constrain it as far as possible
from each reference signal under a certain measurement, i.e.
uncorrelatedness or independence. In this paper, nonlinear
uncorrelatedness was introduced as a constraint in the
framework of constrained ICA.
The desired component and each reference signal are
said to be uncorrelated if:
Efðy� EfygÞðri � EfrigÞg ¼ Efyrig � EfygEfrig ¼ 0
ð5Þ
In order to find a ‘stronger’ constraint, nonlinear functions
gcð�Þ and fcð�Þ were added into (5), satisfying
582 Neural Comput & Applic (2007) 16:581–587
123
EfgcðyÞfcðriÞg � EfgcðyÞgEffcðriÞg ¼ 0 ð6Þ
The idea in (6), known as nonlinear decorrelation, was
widely used during the early research in ICA [24]. How-
ever, it can also be a measurement of ‘distance’ between
the desired component and reference signals in the sense of
nonlinear uncorrelatedness.
Assume that gc and fc have derivatives of all orders in
the neighborhood of the origin, a zero-mean form of (6)
can be rewritten and expanded in Taylor series:
EfgcðyÞfcðrÞg ¼X1
i¼1
X1
j¼1
gifjEfyirjg ¼ 0 ð7Þ
where gi and fj are coefficients of ith power in the series
and the subscript of r corresponding to the number of
reference signals is omitted here. According to Hyvarinen
et al. [11], a sufficient (but not essential) condition for (7)
to hold is that y and r are independent, and at least one of
the nonlinear functions is an odd function. Selecting gc as
such an odd function, only odd powers exist, satisfying:
Efyig ¼ 0; i ¼ 1; 3; 5; . . . ð8Þ
Under the presume that y and r should be independent
theoretically (this is because y is the exact component we
are interested in and r is a rough rhythm of a certain
perturbation), not only uncorrelatedness but also high order
uncorrelatedness of odd powers should be satisfied:
Efyirjg ¼ EfyigEfr jg ¼ 0; i ¼ 1; 3; 5; . . . ð9Þ
and the equation in (7) should be fulfilled; if not, y should
be modified iteratively according to the algorithm. As
discussed in [11], there do exist other conditions to fulfill
the equation in (7); but for nonpolynomial functions, it
seems unlikely; and in our practical applications, the
algorithm does not fall into components that are ‘closed to’
the reference signals. In this paper, an odd and sigmoid
hyperbolic tangent function widely used in previous ICA
literatures was selected as gc and fc.
Thus, incorporating with negentropy-based object
function in (3), the constrained optimization problem can
be rewritten as
Maximize JðwÞsubject to eðy;rÞ¼EfgcðyÞfcðrÞg�EfgcðyÞgEffcðrÞg6n
and wk k¼1
ð10Þ
The first constraint in (10) written in vector form reveals
that the nonlinear correlatedness of desired component and
each reference signal should be smaller than a preset
threshold. According to (10), the augmented Lagrangian is
formed as [25]:
Lðw;kÞ¼JðwÞ
þXL
i¼1
ki maxf0;eðy;riÞ�nig
þXL
i¼1
Ki
2maxf0;eðy;riÞ�nig
2
¼JðwÞþXL
i¼1
ki maxf0;eðwTz;riÞ�nig
þXL
i¼1
Ki
2maxf0;eðwT z;riÞ�nig
2
ð11Þ
where ki, i = 1,2,..., L are Lagrange multipliers and Ki, i =
1,2,..., L are the penalty parameters. The constraint |w| = 1
is not considered here because it can be easily treated by a
projection process:
w ¼ w= wk k ð12Þ
The gradient learning rule for (11) can be derived as
w ¼ w� lw
� @J
@wþXL
i¼1
Siðki þ KiðeðwT z; riÞ � niÞÞ@eðwT z; riÞ
@w
" #
ð13Þ
where
Si ¼0 if eðwT z; rÞ6n
1 if eðwT z; rÞ[n
(ð14Þ
@J
@w¼ 2qEfzgðwT zÞÞg½EfGðwT zÞg � EfGðmÞg� ð15Þ
@eðwT z; riÞ@w
¼ Efzg0cðwT zÞfcðriÞg � Efzg0cðwT zÞgEffcðriÞg
ð16Þ
g0cð�Þ is the derivative of nonlinear function gcð�Þ: The
Lagrange multipliers should also be updated as:
ki ¼ ki þ lki
@L
@ki¼ ki þ lki
ðeðwT z; riÞ � niÞ ð17Þ
The lw and lk_i in (13) and (17) are leaning rates that
should be updated as:
uw ¼1
1=uw þ wk k2ð18Þ
Neural Comput & Applic (2007) 16:581–587 583
123
uki¼ 1
1=ukiþ k2
i
ð19Þ
3 Experimental results
3.1 The measurement of EGG
The EGG data used in this paper was obtained from five
healthy humans. The subjects lay on the back and were
asked to keep quiet and as still as possible during the
measurements.
Five Ag/AgCl electrodes were placed on the abdomen,
including four active electrodes, and one reference elec-
trode (see Fig. 1). The first active electrodes was posi-
tioned 45� upper left of the midpoint between the umbilicus
and the xiphoid process with an interval of 2–3 cm, the last
active electrode was positioned 1–3 cm right to the mid-
point mentioned above. The other two were placed between
them with proper distance. The reference electrode was
positioned on the right ribs with the same height of the first
electrode. Four-channel EGG signals were derived by
connecting each active electrode to the reference electrode.
A piezoelectric sensor was placed near the umbilicus to
record the movement of the abdomen simultaneously. As
the abdomen movement is mainly caused by respiration
and heart beating, so the recorded signal is mainly con-
sisted of the respiratory and cardiac rhythms.
The signals were recorded by a multichannel physio-
logical signal recorder (RM6280C Chengdu Instrument
Factory, Chengdu, China) with a time constant of 5 s, low-
pass cutoff frequency of 10 Hz and sample frequency of
20 Hz.
3.2 Application on EGG
The raw data were decimated to 4 Hz after an anti-aliasing
filter. Furthermore, data whitening and dimension-reducing
were achieved by the principal component analysis (PCA).
Two reference signals were obtained by filtering the
recordings of the piezoelectric sensor with proper low-pass
and high-pass filters respectively (see Fig. 2). All data were
analyzed using MATLAB 6.5 by both the conventional
ICA (FastICA was used here [13, 14]) and the proposed
ICA with references. The signals shown here were regu-
lated to unit variance excepting the original recordings. A
general-purpose function G(u) = log cosh (u), proposed by
Hyvarinen in [14], was selected in (3). The selection of
thresholds in (10) is discussed in detail as follows:
In the previous work of Lin et al. [22], the threshold n
was initialized with a small value and increased gradually;
but in this paper, the EGG signals can be considered as a
same sort of signals, so it is reasonable to use an identical
threshold for each energy function in all experiments. The
identical thresholds were fixed according to one of the
experiments and the same procedure in [22] with a minor
modification, namely, the thresholds were initialized with
large values and decreased gradually. The fixed thresholds,
both 0.01 in this paper, were used in all other experiments.
Figure 3 shows a four-channel EGG recording for about
250 s. The separated components of conventional ICA and
ICA with references are shown in Fig. 4, and Fig. 5 shows
the spectra of the desired components obtained by both
methods. The components extracted by the conventional
ICA are ordered randomly, and a further step is needed to
determine which component is desired. In the ICA with
references, only one component is obtained, which is the
very component we are interested in (see Fig. 4). Theo-
retically, the two methods should extract exactly the same
desired component; but in our practical applications, the
conventional ICA gives a little ‘noisy’ component still,
which may be ascribed to the initialization of demixing
Fig. 1 The position of electrodes. Four active electrodes (a to d) and
one reference electrode (g) for EGG. A piezoelectric sensor (e) for
reference signals
Fig. 2 The reference signals with unit variance. a Corresponding to
the respiratory rhythm; b corresponding to the cardiac rhythm
584 Neural Comput & Applic (2007) 16:581–587
123
matrix and the orthonormal process. But in the ICA with
references, due to the constraint of ‘as far as possible from
the reference signals’, the perturbations of the desired
component are suppressed more effectively (see Fig. 5).
Figure 6 shows another experiment with high amplitude
disturbance. The desired component from conventional
ICA algorithm (Fig. 7b) is still perturbed by cardiac
rhythm severely; the ICA with references, however, pro-
vides a better result (Fig. 7a).
4 Conclusion and discussion
A new approach of ICA with references is presented in this
paper. The applications on EGG signals show the proposed
method can extract the desired component successfully. By
measuring the unlikeness with two reference signals in the
framework of constrained ICA algorithm, the method can
both avoid the ordering indeterminacy of conventional ICA
algorithm and suppress the perturbation effectively. This
method shows even better performance than the conven-
tional ICA when the observed data is composed with high-
amplitude disturbances, which appear in EGG recordings
frequently.
This method proves to be an improvement to the tradi-
tional extraction of gastric slow waves from the EGG
recordings. As a consequence, it will promote applications
Fig. 4 Separated components of conventional ICA (a) and ICA with
references (b) for recording in Fig. 2. All components are regulated to
unit variance
Fig. 3 Four-channel EGG recording preprocessed
Fig. 5 Spectra of components corresponding to the gastric slow
waves obtained from the conventional ICA (lower) and the ICA with
reference (upper) for recordings in Fig. 2
Fig. 6 Four-channel EGG recording with high-amplitude disturbance
in fourth channel
Neural Comput & Applic (2007) 16:581–587 585
123
of EGG in research and clinical practice, following by
further analysis on the component of slow waves.
The signal recorded by the piezoelectric sensor near
umbilicus was used firstly in the adaptive noise canceling
system [5]. Generally, the adaptive system showed good
performance in respiratory artifacts cancellation in our
previous work [26]. However, in some situations, the
adaptive method did not give the expected results. Figure 8
shows the waveform and spectrum of the first channel of
Fig. 6 after adaptive noise cancellation, which is still per-
turbed by cardiac and respiratory rhythms; and the ICA-r
method gave a better result as mentioned above (see
Fig. 7a). This may be owed to the independent assumption
in the ICA-r method, which is more ‘physiological’ for
such biomedical applications.
Acknowledgements The authors would like to thank Prof. Z. C. Wu
and his colleagues in Acupuncture Institute of China Academy of
Traditional Chinese Medicine for their assistance in acquiring the
EGG data used in this paper.
References
1. Chen J, McCallum RW (1993) Clinical applications of electro-
gastrography. Am J Gastroenterol 88:1324–1336
2. Chen J, McCallum RW (1994) Electrogastrography: principles
and applications. Raven Press, New York
3. Chen J, McCallum RW (1991) Electrogastrography: measure-
ment, analysis and prospective applications. Med Biol Eng
Comput 29:339–350
4. Liang J, Cheung JY, Chen JD (1997) Detection and deletion of
motion artifacts in electrogastrogram using feature analysis and
neural networks. Ann Biomed Eng 25:850–857
5. Chen J, Vandewalle J, Sansen W, Vantrappen G, Janssens J
(1989) Adaptive Method for Cancellation of Respiration Artifact
in Electrogastric Measurements. Med Biol Eng Comput 27:57–63
6. Liang H, Lin Z, McCallum RW (2000) Artifact reduction in
electrogastrogram based on the empirical mode decomposition
method 3. Med Biol Eng Comput 8:35–41
7. Wang ZS, Cheung JY, Chen JD (1999) Blind separation of
multichannel electrogastrograms using independent component
analysis based on a neural network. Med Biol Eng Comput
37:80–86
8. Liang H (2001) Adaptive independent component analysis of
multichannel electrogastrograms. Med Eng Phys 23:91–97
9. Irimia A, Bradshaw LA (2005) Artifact reduction in magneto-
gastrography using fast independent component analysis. Physiol
Meas 26:1059–1073
10. Liang H (2005) Extraction of gastric slow waves from electro-
gastrograms: combining independent component analysis and
adaptive signal enhancement. Med Biol Eng Comput 43:245–251
11. Hyvarinen A, Karhunen J, Oja E (2001) Independent component
analysis. Wiley, New York
12. Delfosse N, Loubaton P (1995) Adaptive blind separation of
independent sources: a deflation approach. Signal Process 45:59–
83
13. Hyvarinen A, Oja E (1997) A fast fixed-point algorithm for
independent component analysis. Neural Comput 9:1483–1492
14. Hyvarinen A (1999) Fast and robust fixed-point algorithms for
independent component analysis. IEEE Trans Neural Netw
10:626–634
15. Bell AJ, Sejnowski TJ (1995) An iinformation-maximization
approach to blind separation and blind deconvolution. Neural
Comput 7:1129–1159
16. Comon P(1994) Independent component analysis—a new con-
cept? Signal Process 36:287–314
17. He Z, Yang L, Liu J, Lu Z, He C, Shi Y (2000) Blind source
separation using clustering based multivariate density estimation
algorithm. IEEE Trans Signal Process 48:575–579
Fig. 7 Waveforms (a) and spectra (b) of the resulted components
corresponding to the gastric slow waves from the conventional ICA
(lower) and the ICA with references (upper)
Fig. 8 Waveform (a) and spectrum (b) of the first channel of Fig. 6
after adaptive noise cancellation
586 Neural Comput & Applic (2007) 16:581–587
123
18. Lu W, Rajapakse JC (2000) Constrained independent component
analysis. In: Advances in neural information processing systems,
vol 13. MIT Press, Cambridge, pp 570–576
19. Lu W, Rajapakse JC (2003) Eliminate indeterminacy in ICA.
Neurocomputing 50:271–290
20. Lu W, Rajapakse JC (2005) Approach and applications of con-
strained ICA. IEEE Trans Neural Netw 16:203–212
21. James CJ, Gibson OJ (2003) Temporally constrained ICA: an
application to artifact rejection in electromagnetic brain signal
analysis. IEEE Trans Bio-med Eng 50:1108–1116
22. Lin Q, Zheng Y, Yin F, Liang H (2004) Speech segregation using
constrained ICA. Lect Notes Comput Sci 3173:755–760
23. Hesse CW, James CJ (2005) The fast ICA algorithm with spatial
constraints. IEEE Signal Proc Lett 12:792–795
24. Jutten C (2000) Source separation: from dusk till dawn. In:
Proceedings of the 2nd international workshop on independent
component analysis and blind source separation, Helsinki, Fin-
land, pp 15–26
25. Ham FM, Kostanic I (2001) Principles of neurocomputing for
science & engineering. McGraw-Hill, New York
26. Peng C, Ye DT (2005) Cutaneous electrical stimulation of mid-
frequency on acupiont affects the electrogastrogram. In: Pro-
ceedings of the 27th annual conference on IEEE-EMBS, pp
4933–4935
Neural Comput & Applic (2007) 16:581–587 587
123