Upload
others
View
0
Download
0
Embed Size (px)
Citation preview
1
2
3
4
5
6
7
10
11
12
13
14
15
21
22
23
24
25
26
27
28
29
30
31
32
40
41
43
44
45
46
47
48
49
50
51
S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ava i l ab l e a t www.sc i enced i rec t . com
www.e l sev i e r. com/ loca te / sc i to tenv
ARTICLE IN PRESSSTOTEN-10695; No of Pages 14
Assessment of the environmental and physiological processesdetermining the accumulation of organochlorine compoundsin European mountain lake fish through multivariate analysis(PCA and PLS)
F OM. Felipe-Sotelo⁎, R. Tauler, I. Vives, J.O. GrimaltDepartment of Environmental Chemistry, IIQAB-CSIC, C/Jordi Girona 18-26, E-08034, Barcelona, Spain OA R T I C L E I N F O
⁎ Corresponding author. Currently at the DepSpain. Fax: +34 932045904.
E-mail addresses: [email protected], mfe
0048-9697/$ – see front matter © 2008 Elsevidoi:10.1016/j.scitotenv.2008.06.020
Please cite this article as: Felipe-Sotelo Mthe accumulation of organochlorine com
A B S T R A C T
Article history:Received 10 December 2007Received in revised form 3 June 2008Accepted 4 June 2008Available online xxxx
RECT
EDPRThe presence of organochlorine compounds (OCs)–namely hexachlorobenzene (HCB),
hexachlorocyclohexanes (HCH), polychlorobiphenyls (PCBs #28, 52, 101, 118, 138, 153 and180), dichlorodiphenyltrichloroethane (DDT) and dichlorodiphenyldichloroethane (DDE)–was examined in various fish tissues (muscle and liver) sampled in 23 mountain lakes inEurope. The dependence of these organochlorine compounds on geographical parameters(altitude, longitude, latitude and temperature) and physiological parameters (lipid content,age, weight and size) was assessed. Principal components analysis (PCA) and partial leastsquares (PLS) models were used for the analyses.PCA results showed that organochlorine compound concentrations in fish tissues increasedwith increasing altitude and decreasing temperatures. This trend appeared to bemoremarkedfor the less volatile compounds. Some differenceswere found between themuscle and liver inthe effects of the percentage of lipids on the accumulation of organochlorine compounds andthe behaviour of HCB. Moreover, PCBs tended to accumulate more in liver rather than inmuscle. PCA scores clearly differentiated samples according to the lake of origin. PLS modelsconfirmed that temperature and altitude were themain factors influencing the accumulationof most organochlorine compounds in the lipids of the fish tissue.
© 2008 Elsevier B.V. All rights reserved.
Keywords:ChemometricsPCAPLSOrganochlorine compounds
2
3
4
5
6
7
8
9
0
1
2
UNCO
R1. Introduction
Organochlorine compounds are distributed worldwide, asthey are rapidly transported from their emission sites todistant places through the atmosphere. This transfer involvessingular processes having in mind the currently acceptedenvironmental paradigms, i.e. besides dispersion and dilutionconcentration steps are required since some of these pollu-tants are found at higher concentrations in remote sites thanclose to their sources (Grimalt et al., 2004). A global distillationeffect has been proposed as explanation, involving the
artment of Chemistry, U
er B.V. All rights reserved
, et al, Assessment ofpounds in European m.
5migration of pollutants from warm areas to cold places,5where they are trapped (Muir et al., 1990; Simonich and5Hites, 1995; Wania and Mackay, 1995, 1996). This effect has5also been observed in the context of mountain areas, leading5to higher concentrations of organochlorine compounds at5higher elevation (Grimalt et al., 2001). Daly and Wania (2005)5emphasized that examination of altitudinal trends of organo-5chlorine compound concentrations in mountains requires the6study of sites that are located at comparable distances from6the polluting sources. However, this is not always possible, e.g.6in the European topography.
niversity of Lleida, Av. Alcalde Rovira Roure 177, E-25198, Lleida,
Felipe-Sotelo).
.
the environmental and physiological processes determining.., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020
UNCO
RREC
TEDPR
OOF
Fig.
1–Lo
cation
ofth
em
ountain
lake
sfrom
whichfish
sam
ples
weretake
n.
2 S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ARTICLE IN PRESS
Please cite this article as: Felipe-Sotelo M, et al, Assessment of the environmental and physiological processes determiningthe accumulation of organochlorine compounds in European m..., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020
EDPR
OOF
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
Table 1 t1:1– Number of samples collected from eachmountain lake t1:2
t1:3Lake Number of samples
t1:4Musclesamples
Liversamples
Muscle+liver a
t1:51. — Øvre Neådalsvtan 25 13 13t1:62. — Fallbekktjørna 6 6 6t1:73. — Nedre Neådalsvatn 4 2 2t1:84. — Øvre Heimdalsvatnet 4 1 1t1:95. — Stavsvatn 10 – –t1:106. — Lochnagar 4 1 1t1:117. — Maan 5 – –t1:128. — Zielony Staw Gąsienicowy 5 – –t1:139. — Vel’ké Hincovo 5 3 3t1:1410. — Gossenkoellesee 17 16 12t1:1511. — Rotfelsee 5 5 5t1:1612. — Oberder Plenderlesee 4 – –t1:1713. — Schwarzsee 3 – –t1:1814. — Milchsee 6 – –t1:1915. — Lungo 6 – –t1:2016. — Jörisee 9 – –t1:2117. — Paione inferiore 5 – –t1:2218. — Aubé 15 – –t1:2319. — Redò 35 23 20t1:2420. — Okoto 5 4 4t1:2521. — Bliznaka 5 4 4t1:2622. — Cimera 5 – –t1:2723. — Escura 3 – –t1:28Total number of samples 191 78 71
a Composite samples of muscle and liver, corresponding to thesame specimen. These samples were used for the joint analysis ofliver and muscle tissues by unfolding-PCA (augmented matrixDaug). t1:29
3S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ARTICLE IN PRESS
In this context, fish have been used as sentinel organismsfor the assessment of long-range transport of organochlorinecompounds tomountain lakes, as they are the top predators inthese environments (Grimalt et al., 2001; Vives et al., 2004).However, accumulation of organochlorine compounds in fishtissues can also be dependent on many other biologicalfactors. Consequently, both environmental and physiologicalparameters must be considered to unravel the distributionpatterns of these compounds.
Chemometric methods allow valuable information to beextracted from multivariate data arrays, which are difficult tohandle using classical univariate statistical methods. Envir-onmental monitoring studies produce large amounts of data,containing multiple parameters whose interpretation is farfrom simple. Chemometric methods provide tools for findingrelationships between groups of analyzed analysed samplesand/or related variables or parameters (Massart et al., 1988).
The aim of the present study was to determine thedependences between the accumulation of organochlorinecompounds in fish tissues (muscle and liver) and differentparameters, including both geographical (altitude, longitude,latitude and average temperature) and physiological variables(age, weight, size and lipid content, see Grimalt et al., 2001 andVives et al., 2004 for further details of the physiologicalparameters used in this research). This paper addresses anew approach to this problem, using chemometric methods(principal components analysis) to unravel these depen-dences. The data are handled as a whole, in a fast andstraightforward way. Moreover, we attempt to determine therelative importance of such parameters in the accumulation oforganochlorine compounds by means of partial least squares(PLS) models.
T21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
UNCO
RREC2. Experimental
Experimental data in this work correspond to the concentra-tion of the following 12 organochlorine compounds: hexa-chlorobenzene (HCB), hexachlorocyclohexanes (α- andγ-HCH),polychlorobiphenyls (PCBs #28, 52, 101, 118, 138, 153 and 180),dichlorodiphenyltrichloroethane (DDT) and dichlorodiphenyl-dichloroethane (DDE). Fish were captured in 23 mountain lakesin Europe, whose locations are shown in Fig. 1. Table 1 indicatesthe total number of samples collected at each mountain lake.There were a total of 191muscle samples (note that the numberof samples isnot evenlydistributedbetweenthedifferent lakes).Seventy-eight liver tissuesampleswerealso takenandanalysedfrom some of the captured specimens. Full details of thesampling procedure and analysis are described elsewhere(Grimalt et al., 2001). Several additional parameters, includingboth physiological (age, weight, size and lipid content) andgeographical variables (altitude, longitude, latitude and annualaverage temperature) were included in the multivariate dataanalysis. They were obtained from previous studies (Grimaltet al., 2001; Vives et al., 2004). The altitude of lakes correspondsto the number of meters above sea level and the averagetemperature is the mean air temperature, provided by theDepartment of Geology and Geography of the University ofEdinburgh (UK). The experimental data were transferred intoMATLAB 6.5 (the Mathworks, Natick, USA) and PLS 3.5 Toolbox
Please cite this article as: Felipe-Sotelo M, et al, Assessment ofthe accumulation of organochlorine compounds in European m.
1software (Eigenvector Research Ltd. Manson, WA, USA) was1used for the multivariate analyses.
12.1. Principal components analysis (PCA) and matrix1augmentation
1PCA transforms amultivariate data array into a new data set in1which the new variables are orthonormal and explain max-1imum variance (Jolliffe, 2002). It is based on the bilinear decom-1position of the original data set: D(I× J)=U(I×N)VT
(N× J)+E(I×J ), where1D is the original data array, with I rows (observations) and J1columns (variables);U is thematrixof scores of dimensions I×N,1whereN is the reduced number of components;VT is thematrix1of loadingswith dimensionsN× J and E is thematrix of residuals1that are notmodelled by theN principal components (PC). Thus,1a reduced set of components are found that synthesize all the1information from the original matrix. These new variables can1be expressed as linear combinations of the original ones. PCA1provides the analyst with useful tools for data exploration, like1the scores U (sample-related) and the loadings VT (variable-1related). The loadings VT quantify how much of each of the1original variables are used to define each PC. Thus, the variables1that best describe the sources of variance in the data can be1identified. The scores U are the projections of the original data1onto the new vector space, defined by the loadings. Their plots1reveal the sample clusters and groups (Massart et al., 1988).
the environmental and physiological processes determining.., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020
C
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181Q1
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
4 S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ARTICLE IN PRESS
UNCO
RRE
The aim of the present study was to establish andinvestigate the possible relationships between the accumula-tion of organochlorine compounds and several physiologicaland geographical parameters. For this purpose, two data setswere built and analysed by PCA. The first one corresponded tofish muscle samples Dmuscle (191×20) and the second one toliver samples Dliver (78×20). In these two matrices, the rowscorresponded to the different samples and the columns to theorganochlorine compounds and physiological and geographi-cal variables. A discussion about the loadings revealed therelationships between the variables and the parameters.
Before carrying out the analysis, values below the detectionlimit were set to one half of the limit value, as recommendedby Farnham et al. (2002). Missing values were imputed by PCAusing the function implemented in PLS Toolbox 3.5. Pre-treatment of the data sets was autoscaling (mean centeringand scaling to variance equal to 1).
Additionally, PCA can be applied straightforwardly andsimultaneously to multiple data sets by means of data matrixaugmentation (Smilde et al., 2004). This procedurewas appliedto the joint analysis of liver and muscle tissues, to determinewhether any of the organochlorine compounds were accu-mulated preferentially in either of the two analyzed analysedtissues. In this case, only the organochlorine compoundconcentrations were considered. Consequently, the numberof variables was reduced from 20 to 12. A new subset wasextracted from the matrix Dmuscle, corresponding to the 71specimens for which a liver sample was also available. Thisresulted in the new matrix Dmuscle2 (71×12). Concatenatedwith thismatrix (column-wise datamatrix augmentation), theDliver2 (71×12) data matrix was set down on the Dmuscle2
(71×12) matrix, giving the new augmented matrix Daug, ofdimensions 142×12. The resulting matrix Daug was thenautoscaled before the calculation of the joint PCA model.
2.2. Partial least squares
Themain purpose of partial least squares (PLS) regression is tobuild a multilinear regression model, y=Xb+e, where y is thepredicted response vector, X is the predictor matrix, b is theregression vector, and e is the residuals vector (Martens andNaes, 1991). PLSmodels produce latent variables (LV)whicharelinear combinations of the original predictor X variables,covarying optimally with the sought predicted values in y.Thus, there is no correlation between the new latent variables(or factors) on the one hand and maximum correlationbetween X and y variables on the other hand. PLS modelssearch for the latent variables that maximize the covariancebetween X and y blocks. In order to optimally interpret thecorrelation structurebetweenXvariables and y variables in PLSmodels, it is better to look at the PLSweight vectors (w) insteadof looking directly at the variable coefficients in the final bregression vector. In the regression vector, the variousphenomena that correlate X and y are intermixed, combinedand averaged. This confuses interpretation when looking forthe main correlation sources between X and y. In contrast, theweights provide information separately for each latent vari-able (LV). This information can be interpreted separately asnon-directly measured hidden phenomena expressing caus-ality relationships. In particular, weight vector w1 for the first
Please cite this article as: Felipe-Sotelo M, et al, Assessment ofthe accumulation of organochlorine compounds in European m.
TEDPR
OOF
PLS latent variable gives precise information about how the Xvariables combine to form the first score vector of matrix X,which optimally correlates with the y variable.X variables thatare highly correlated with the y variable will give high positiveor negative weightsw1, depending on whether they are highlypositively or negatively correlatedwith y. Consequently, in thiswork PLS was considered a tool for unravelling the relativeimportance of the geographical, temperature and physiologi-cal (X) variables affecting the accumulation of organochlorinecompounds in fish tissues (y). In the present study, PLS wasused basically as an explanatory tool, since an optimalprediction would probably require more complete data sets,including additional X variables. Data autoscaling pretreat-ment was applied to both X predictor matrices and y predictedvectors, and cross-validation by the leave-one-outmethodwasused to select the appropriate number of latent variables ineach case and to avoid overfitting. See references for a moredetailed explanation of the PLS method (Massart et al., 1988;Martens and Naes, 1991; Vandeginste et al., 1998).
A separate PLS model was considered to investigate thechanges in concentration of each organochlorine compoundin the different samples, taking as predictor variables thecorresponding geographical and physiological parameters(altitude, longitude, latitude, temperature, age, weight, sizeand lipids). Only muscle tissue data were considered here.Initially, the PLS models were developed for all the individualmuscle samples, regardless of their origin. This gave apredictor matrix X of dimensions 191×8 (common to all themodels) and 12 predicted vectors y 191×1, each one corre-sponding to one of the organochlorine compounds. Subse-quently, new PLS models were developed, which includedlake-averaged organochlorine compound concentrations andparameters. This resulted in a new predictor matrix X ofdimensions 22×8 and 12 new predicted vectors y of dimen-sions 22×1. This latter approach provided similar conclusionswith better fits. These results are discussed in more detail inthe following section. In order to interpret the correlationbetween X and y variables, the set of PLS weight vectorsw1 forthe first PLS component was examined in detail in this work.
3. Results and discussion
The calculation of pair-wise correlations revealed some of themost important relationships between the 20 variables andwas used as an exploratory tool. The two data sets, frommuscle and liver samples, were considered separately. Theresults are shown in Tables 2 and 3 respectively, as are thevariation ranges and basic statistics for all the parametersstudied in this work. In general terms, the average concentra-tions of organochlorine compounds were higher in liver thanin muscle tissue, with the exception of DDT and DDE inparticular. The dispersion of these values was also slightlyhigher for the concentrations of organochlorine compounds inliver than inmuscle, and wasmore noticeable for PCBs of highmolecular weight.
In the case of muscle samples (Table 2), high positivecorrelations were observed between the concentrations of theheaviest PCBs (larger than 0.6, in bold in Table 2). Moreover,the percentage of lipids in muscle correlated positively with
the environmental and physiological processes determining.., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020
UNCORRECTED PROOF
Table 2t2:1 – Basic statistics and binary correlationsa between the variables for the muscle samples (all lakes considered)t2:2t2:3 Parameter Range Mean SD Correlation coefficients
HCB α-HCH
γ-HCH
DDE DDT PCB#28 PCB#52 PCB#101 PCB#118 PCB#138 PCB#153 PCB#180 Lipids Weight Size Age Altitude Temperature Latitude Longitude
t2:5 HCB (ppb) 0.006–2.66 0.51 0.44 1.00t2:6 α-HCH (ppb) 0.007–1.80 0.16 0.21 0.49 1.00t2:7 γ-HCH (ppb) 0.022–9.83 0.79 1.26 0.49 0.66 1.00t2:8 DDE (ppb) 0.041–274 18.1 38.7 0.41 0.45 0.49 1.00t2:9 DDT (ppb) 0.009–19.1 1.34 2.21 0.56 0.36 0.40 0.65 1.00t2:10 PCB#28 (ppb) 0.003–2.67 0.34 0.40 0.02 0.02 0.03 0.27 0.29 1.00t2:11 PCB#52 (ppb) 0.001–5.56 0.40 0.57 −0.08 0.00 0.07 0.25 0.22 0.72 1.00t2:12 PCB#101 (ppb) 0.005–2.44 0.58 0.51 0.31 0.43 0.43 0.60 0.55 0.28 0.32 1.00t2:13 PCB#118 (ppb) 0.006–8.80 0.75 0.95 0.21 0.15 0.26 0.54 0.57 0.32 0.36 0.70 1.00t2:14 PCB#138 (ppb) 0.050–33.7 2.13 3.25 0.37 0.22 0.29 0.66 0.67 0.14 0.13 0.62 0.74 1.00t2:15 PCB#153 (ppb) 0.085–35.7 2.26 3.77 0.32 0.27 0.25 0.67 0.65 0.14 0.14 0.61 0.72 0.96 1.00t2:16 PCB#180 (ppb) 0.033–18.8 1.56 2.21 0.31 0.30 0.36 0.72 0.65 0.16 0.17 0.64 0.75 0.94 0.95 1.00t2:17 Lipidsb (%) 0.010–8.18 2.20 1.34 0.69 0.36 0.34 0.34 0.40 0.09 −0.02 0.20 0.11 0.24 0.21 0.18 1.00t2:18 Weight (g) 20.2–765 202 137 0.16 0.03 −0.10 −0.07 −0.07 0.00 −0.13 −0.10 −0.18 −0.15 −0.18 −0.20 0.20 1.00t2:19 Size (cm) 12.6–365 73.7 99.2 0.27 0.06 0.09 −0.14 0.02 0.00 −0.05 0.01 −0.06 −0.04 −0.11 −0.12 0.24 0.15 1.00t2:20 Age (year) 1–21 6.34 3.88 0.06 0.14 0.24 0.29 0.25 0.07 0.16 0.45 0.34 0.31 0.32 0.40 −0.15 0.02 −0.05 1.00t2:21 Altitude (m) 436–2799 1779 721 0.17 0.21 0.28 0.38 0.41 0.28 0.21 0.56 0.41 0.42 0.39 0.44 0.07 −0.22 −0.05 0.26 1.00t2:22 Temperature
(°C)–1.45–8.7 2.49 2.07 −0.08 −0.09 −0.03 −0.33 −0.26 −0.15 −0.07 −0.16 −0.31 −0.36 −0.36 −0.43 −0.02 0.15 0.08 −0.34 −0.43 1.00
t2:23 Latitude (°) 40.3–62.8 50.2 8.13 −0.13 −0.21 −0.28 −0.26 −0.32 −0.28 −0.19 −0.52 −0.27 −0.26 −0.23 −0.26 −0.07 0.08 0.01 −0.11 −0.88 0.00 1.00t2:24 Longitude (°) –8.12–23.3 7.50 7.17 −0.03 −0.01 −0.11 0.22 0.14 0.29 0.08 −0.02 0.14 0.15 0.16 0.23 0.10 −0.01 −0.12 0.12 0.15 −0.76 0.13 1.00
a Correlation N0.6 (absolute value) are indicated in bold characters.t2:25b % of lipids in muscular tissue.t2:26
5SC
IEN
CE
OF
TH
ET
OT
AL
EN
VIR
ON
MEN
TX
X(2008)
XX
X–X
XX
ARTICLE
INPRESS
Pleasecite
this
articleas:Felipe-Sotelo
M,et
al,Assessm
entof
theen
vironmen
talan
dph
ysiologicalprocesses
determining
theaccu
mulation
oforganoch
lorinecom
pounds
inEu
ropeanm
...,SciTotalEn
viron(2008),doi:10.1016/j.scitoten
v.2008.06.020
UNCORRECTED PROOF
Table 3t3:1 – Basic statistics and binary correlationsa between the variables for the liver samples (all lakes considered)t3:2t3:3 Parameter Range Mean SD Correlation coefficients
HCB α-HCH
γ-HCH
DDE DDT PCB#28 PCB#52 PCB#101 PCB#118 PCB#138 PCB#153 PCB#180 Lipids Weight Size Age Altitude Temperature Latitude Longitude
t3:5 HCB (ppb) 0.006–4.89
0.74 0.83 1.00
t3:6 α-HCH (ppb) 0.002–3.86
0.35 0.70 0.12 1.00
t3:7 γ-HCH (ppb) 0.001–7.3 1.01 1.39 0.05 0.61 1.00t3:8 DDE (ppb) 0.18–114 11.4 18.9 0.15 0.57 0.57 1.00t3:9 DDT (ppb) 0.02–5.1 0.99 1.02 0.26 0.20 0.06 0.48 1.00t3:10 PCB#28 (ppb) 0.016–
0.620.62 0.75 0.15 0.01 −0.16 0.10 0.10 1.00
t3:11 PCB#52 (ppb) 0.07–1.06 1.06 1.01 0.21 0.13 0.11 0.03 0.03 0.59 1.00t3:12 PCB#101 (ppb) 0.14–1.76 1.76 1.52 0.29 −0.01 0.15 0.28 0.23 0.20 0.32 1.00t3:13 PCB#118 (ppb) 0.08–1.18 1.18 0.94 0.40 0.07 0.24 0.26 0.20 0.04 0.35 0.72 1.00t3:14 PCB#138 (ppb) 0.15–3.22 3.22 3.52 0.25 0.02 0.14 0.31 0.30 −0.06 0.06 0.63 0.72 1.00t3:15 PCB#153 (ppb) 0.16–3.63 3.64 3.68 0.25 0.18 0.30 0.59 0.43 −0.02 0.06 0.61 0.67 0.93 1.00t3:16 PCB#180 (ppb) 0.07–1.66 1.66 2.09 0.26 0.44 0.48 0.85 0.50 0.00 −0.01 0.28 0.34 0.47 0.70 1.00t3:17 Lipidsb (%) 0.01–2.12 2.12 1.29 −0.25 −0.06 −0.10 −0.20 −0.35 −0.19 −0.24 −0.38 −0.37 −0.38 −0.39 −0.36 1.00t3:18 Weight (g) 166–2220 236 266 0.06 −0.02 0.13 0.09 −0.15 −0.03 −0.11 −0.17 −0.12 −0.07 −0.06 0.06 0.31 1.00t3:19 Size (cm) 15–38.5 26.5 5.01 0.19 0.01 0.10 0.21 −0.11 0.16 0.01 −0.09 −0.02 0.08 0.07 0.15 0.17 0.52 1.00t3:20 Age (year) 1–21 6.84 4.11 0.11 0.27 0.46 0.55 0.26 0.05 0.06 0.25 0.19 0.32 0.44 0.65 −0.25 0.24 0.41 1.00t3:21 Altitude (m) 566–2485 1888 691 −0.34 0.03 0.14 0.14 0.25 −0.04 −0.10 0.16 0.04 0.22 0.23 0.07 −0.18 −0.02 −0.18 0.01 1.00t3:22 Temperature (°C) –0.74–4.22 2.04 1.50 0.01 0.24 0.42 −0.01 −0.52 −0.16 −0.01 −0.24 −0.19 −0.33 −0.28 −0.06 0.26 0.13 0.13 0.11 −0.40 1.00t3:23 Latitude (°) 42–62.8 49.7 8.44 0.39 −0.13 −0.27 −0.19 −0.08 0.09 0.18 −0.01 0.11 −0.05 −0.10 −0.06 0.07 −0.07 0.09 −0.03 −0.94 0.13 1.00t3:24 Longitude (°) 0.8–23.3 8.88 6.94 −0.14 −0.20 −0.45 0.00 0.19 0.18 −0.08 −0.08 −0.05 −0.01 −0.01 −0.10 0.04 −0.02 0.07 −0.27 0.03 −0.73 0.09 1.00
a Correlation N0.6 (absolute value) are indicated in bold characters.t3:25b % of lipids in muscular tissue.t3:26
6SC
IEN
CE
OF
TH
ET
OT
AL
EN
VIR
ON
MEN
TX
X(2008)
XX
X–X
XX
ARTICLE
INPRESS
Pleasecite
this
articleas:Felipe-Sotelo
M,et
al,Assessm
entof
theen
vironmen
talan
dph
ysiologicalprocesses
determining
theaccu
mulation
oforganoch
lorinecom
pounds
inEu
ropeanm
...,SciTotalEn
viron(2008),doi:10.1016/j.scitoten
v.2008.06.020
ECTEDPR
OOF
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
00
Fig. 2 –Loadings of the first two principal components of the PCAmodels ofmuscle tissue (a) PC1; (b) PC2 and liver (c) PC1; (d) PC2data sets.
7S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ARTICLE IN PRESS
UNCO
RRthe concentration of HCB (0.69). For PCBs, this correlation wasrather poor. The rest of the compounds correlated moderatelyand always positively with the lipids (0.34–0.4). PCBs of highmolecular weight also exhibited moderate positive correla-tions with the age of the captured specimens. Regarding thegeographical variables, temperature and latitude correlatednegatively with the concentrations of all the organochlorinecompounds. This tendency was more marked for the highermolecular weight PCBs and for DDE and DDT.
In the liver samples (Table 3), good correlations betweenthe heavy PCBs were observed, as in the muscle samples. Thecorrelations between the percentage of lipids (in muscletissue) and the concentration of organochlorine compoundsin liver were always negative in this case, in contrast to theobserved tendencies in muscles. The concentrations oforganochlorine compounds in liver were positively correlatedwith the age of the specimens. This tendency was clearer thanin muscle tissue. However, poor correlations were obtainedbetween the weight and size of the captured specimens andorganochlorine compound concentrations. Correlations oftemperature with organochlorine compound concentrations
Please cite this article as: Felipe-Sotelo M, et al, Assessment ofthe accumulation of organochlorine compounds in European m.
2in liver were in general moderate. The main accumulation2tendency was not as clear for liver as it was for muscle2samples, since both positive and negative correlation coeffi-2cients were observed for liver samples.
23.1. Principal components analysis (PCA) and matrix2augmentation
2Three data pretreatment methods were assayed: logarithmic2transformation (log scaling), column scaling and column2autoscaling. Log scaling is sometimes recommended when2either the range of values for a given variable is large, covering2several orders of magnitude, or when the data distributions2are very asymmetric (this is common in environmental data,2where some samples exhibit values that are too high/low)2(Massart et al., 2005). Previously, a constant value was added2to each individual value, to avoid negative values after log2calculation (xnew=log (K+xold)). In the column scaling pre-2treatment, each individual value was divided by the standard2deviation of its corresponding column. In autoscaling, the3mean of the corresponding column was first subtracted and
the environmental and physiological processes determining.., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
8 S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ARTICLE IN PRESS
then column scaling was applied. After preliminary studies,autoscaling was finally chosen. Column scaling and logarith-mic pretreatments gave PCA models in which data variancewas mostly assigned to the latitude and altitude of lakes(which were the variables with the highest values, in absoluteterms). Hence, they did not provide any valuable informationabout the dependence of organochlorine compounds on othergeographical and physiological parameters. Autoscaling wasused throughout the study, since it provided more interpre-table models.
UNCO
RREC
Fig. 3 –Representation of PC1 loadings versus PC2 to show the decontent (— line) and relations between temperature, latitude andand (b) liver data sets.
Please cite this article as: Felipe-Sotelo M, et al, Assessment ofthe accumulation of organochlorine compounds in European m.
PCA models developed for the muscle and liver tissuesexplained 65% and 63% of total variance respectively, when 4principal components were considered for each model.
In the PCA model for the muscle data set, PC1 (35%explained variance, Fig. 2a) gave high positive loadings forDDE, DDT and for the PCBs of higher molecular weight (PCBs #101, 118, 138, 153 and 180), followed by the remaining organo-chlorine compounds for altitude, age and lipid percentage.Temperature and latitude showed relatively high negativeloadings. PC2 (13% explained variance, Fig. 2b) clearly
TEDPR
OOF
pendence of organochlorine compound accumulation on lipidaltitude and the accumulation of OCs (---- line) for (a) muscle,
the environmental and physiological processes determining.., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
00
01
02
03
04
05
06
t4:1
t4:2t4:3
t4:4
t4:5
t4:6
t4:7
t4:8
t4:9
t4:10
t4:11
t4:12
t4:13
t4:14
t4:15
9S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ARTICLE IN PRESS
RREC
associated concentrations of HCB, α- and γ-HCH with the lipidcontent, and, to a lesser extent, with temperature, fish weightand length—all with high positive loadings. In contrast,length, age, altitude and concentration of PCBs gave negativeloadings. PC3 (9% explained variance) was mostly positivelyloaded by latitude, as opposed to altitude and temperature, aswell as to PCB#28 and 52 to a lesser extent. Finally, PC4 (8%explained variance) explained the particular behaviour ofPCB#28 and 52, with high positive loadings in this PC, as wellas in longitude.When the PCAmodel for the liver sampleswasevaluated, similar loading profiles were observed (see PC1 andPC2 loadings for liver in Fig. 2c and d respectively). Acomparison of Fig. 2a and c shows that the main differencebetween both PC1 loading profiles for muscle and liver is thechange in the sign of the correlation for lipids. In musclesamples, lipids were positively correlated with organochlorinecompounds. In contrast, in liver samples this correlation wasnegative. This could be due to the fact that a reduction of lipidsin the muscle would tend to decrease the concentration oforganochlorine compounds in this tissue and increase theiraccumulation in the liver. In addition, the age of the specimenseemed to play a more important role in organochlorinecompound concentrations in liver than in muscle (see thehigher loading of age in PC1 for liver than for muscle, Fig. 2cand a respectively). However, in the comparison betweenFig. 2b (muscle) and d (liver), the HCB loadings differ. WhereasHCB loadings for PC2 were high and positive for musclesamples, they were around zero for liver samples. The mainconclusion is that although liver and muscle tissues followedthe same trends, there were differences in some of thepatterns for specific compounds. The representation of PC1loadings versus PC2 in Fig. 3 gives a better view of therelationships between the variables. Regardingmuscle data, inFig. 3a it can be seen that concentrations of HCB, α- and γ-HCHare clearly grouped and associated with the lipid content inmuscle tissue, whereas PCB#28 and #52 showed negativecorrelations with lipids. The remaining compounds showintermediate behaviour in terms of lipids. In the samerepresentation (Fig. 3a), relationships among organochlorinecompounds and temperature, altitude and latitude are high-lighted. It can be observed that temperature opposes altitude.Latitude is in an intermediate situation. When the latitudeincreases, the altitude of the lakes tends to decrease. All the
UNCO
07
08
09
10
11
12
13
14
15
16
17
18
19
20
21
22
23
Table 4 – Vapour pressure for the organochlorinecompounds analysed (Mackay et al., 1992)
Compound Vapour pressure (Pa)
γ-HCH 10−0.70
α-HCH 10−0.92
HCB 10−1.2
PCB#28 10−1.5
PCB#52 10−1.8
DDE 10−2.5
PCB#101 10−2.5
PCB#118 10−3.0
PCB#138 10−3.2
DDT 10−3.3
PCB#153 10−3.3
PCB#180 10−3.9
Please cite this article as: Felipe-Sotelo M, et al, Assessment ofthe accumulation of organochlorine compounds in European m.
TEDPR
OOF
3organochlorine compounds were negatively correlated to3temperature and positively associated with altitude, which is3in agreement with the global distillation theory. Thus, as the3temperature decreased, and/or the altitude increased, the3concentrations of organochlorine compounds increased and3accumulated. However, the same figure shows that this3relation was not exactly the same for all the organochlorine3compounds considered. In addition, some clustering was3observed, which can be explained in terms of the relative3volatility of the compounds. First, the less volatile compounds,3like DDT, DDE and PCBs # 101, 118, 138, 153 and 180, were3found at higher positive loadings of PC1 (lower temperatures)3(see vapour pressures in Table 4). In contrast, the remaining3compounds (HCB, α- and γ-HCH, PCB#28 and #52), which are3more volatile (see Table 4) appeared at somewhat lower3positive PC1 loadings (and higher temperatures). This is in3agreement with the effect of temperature on compounds of3different volatility. As temperature decreases, the concentra-3tion of less volatile organochlorine compounds increases, as3they are more easily trapped than those with higher vapour3pressures (more volatile). The explanation for altitude would3be exactly the opposite. These conclusions agree with3previous studies showing that soils (Ribes et al., 2002) and3snow (Blais et al., 1998) located at higher altitude exhibited3higher organochlorine concentrations than at lower sites.3Note that the main purpose of the present work was to reveal3dependences among variables and to try to give a simplified3interpretation of the relationships observed. The aim was not3to deliver a “cause and effect” conclusion. Moreover, multi-3variate approaches attribute the variation observed to some3(few and independent) new variables, which are combinations3of the individual parameters. Therefore, multivariate models,3like the ones presented here, provide a better overview of the3interdependences among variables than other studies that use3univariate linear regression to evaluate the dependence3between the pollutant variation and only one (so-called)4independent variable. When the representation of PC1 versus4PC2 loadings for liver samples was analyzed analysed (Fig. 3b),4it was found that the results for liver samples also agreed well4with the global distillation effect. In the case of liver samples,4all organochlorine compounds were negatively correlated4with temperature and latitude, and positively associated4with altitude. Moreover, organochlorine compounds in liver4were grouped according to their volatility, as was previously4highlighted for muscle samples. The main difference between4results obtained for liver and muscle samples involved the4relationship between the percentage of lipids and the4concentrations of organochlorine compounds (compare4Fig. 3a and b). For muscle samples, the increment in lipids4was clearly associated with higher concentrations of HCB, α-4HCH and γ-HCH. However, in liver samples, the percentage of4lipids was opposed to all the organochlorine compounds4considered. Moreover, in the case of liver tissue samples, the4effect of lipids on the concentration of organochlorine4compounds seemed to be more marked.4The representation of PC1 versus PC2 scores (Fig. 4) shows4that samples were clustered in different groups according4to their geographical origin. In general, samples located on the4right side of the plot (high positive PC1 scores) correspond4to muscle or liver samples of specimens that were more
the environmental and physiological processes determining.., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020
ECTEDPR
OOF
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
Fig. 4 –Plot of PC1 scores versus PC2, showing the grouping of samples according to their origin; data corresponding to(a) muscle and, (b) liver samples.
10 S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ARTICLE IN PRESS
UNCO
RRcontaminated by organochlorine compounds (more positivePC1 loadings for both muscle and liver data sets). Musclesamples are illustrated in Fig. 4a. High contents of organo-chlorine compounds were found in most of the musclesamples from fish collected at Milchsee Lake and one samplefrom Schwarzsee Lake. These two lakes are at high altitudes(2500–2800 m) and middle latitudes (Central Europe, in theAlpine region). In contrast, one fishmuscle sample collected atRedò Lake and those collected at Aubé Lake (both in thePyrenees) had high positive PC2 scores, indicating a relativelyhigher concentration of HCB, α-HCH and γ-HCH (high positivePC2 loadings) and lower concentrations of the PCB family ofcompounds (negative PC2 loadings). In general, musclesamples collected at the highest latitudes (Northern locations)gave closer to zero PC1 and PC2 scores, indicating lower inputsof organochlorine compound contamination sources. Moresubtle differences among fish samples collected at differentlakes could have been found if samples with very highconcentrations of organochlorine compounds (especially forMilchsee Lake, which showed the highest scores of PC1) had
Please cite this article as: Felipe-Sotelo M, et al, Assessment ofthe accumulation of organochlorine compounds in European m.
been eliminated and PCA calculations were rerun. In the caseof liver samples (scores in Fig. 4b), Øvre Neådalsvtan andNedre Neådalsvtan Lakes showed the lowest concentrationsof organochlorine compounds (negative scores of PC1, to theleft in Fig. 4b). Similar results were observed for musclesamples (Fig. 4a). Moreover, the highest concentrations oforganochlorine compounds in liver samples corresponded toone of the samples from Lake Redò, followed by the samplesfrom the Fallbekktjørna and Vel’ké Hincovo Lakes. Oncemore,Redò Lake showed the highest concentrations of α-HCH and γ-HCH (at the top of Fig. 4b, highest scores of PC2). However, thenumber of muscle samples was higher than the number ofliver samples (Table 1). Nevertheless, for the initial purposes ofthis study and for the sake of brevity, the plots in Fig. 4 areconsidered sufficiently illustrative of the main tendencies inthe effect of different contamination sources of organochlor-ine compounds in different mountain lakes in the Europeangeographical region under study.
Furthermore, in order to determine whether any of theanalysed organochlorine compounds tended to accumulate
the environmental and physiological processes determining.., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020
EC
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
00
01
02
03
04
05
06
07
08
09
10
11
12
13
14
15
16
17
18
19
20
Fig. 5 –Biplots corresponding to the simultaneous analysis byPCA of the liver and muscle data (augmented matrix Daug),(a) PC1 versus PC2 and (b) PC2 versus PC3.
Table 5 t5:1– Characteristics of the PLS models developed foreach organochlorine compound
t5:2t5:3Compound Number
of LV%
Explainedvariance
% Error ofprediction
y X
t5:5HCB 3 83 62 29t5:6α-HCH 3 53 60 53t5:7γ-HCH 3 55 62 59t5:8DDT 4 82 73 34t5:9DDE 4 84 72 35t5:10PCB#28 4 32 70 54t5:11PCB#52 4 19 71 67t5:12PCB#101 4 82 72 24t5:13PCB#118 4 58 71 46t5:14PCB#138 4 76 72 40t5:15PCB#153 4 78 72 38t5:16PCB#180 4 79 72 35
11S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ARTICLE IN PRESS
UNCO
RRpreferentially in any of the two fish tissues previouslyconsidered, the data corresponding to liver and muscle wereanalysed simultaneously, considering only the concentrationsof organochlorine compounds (neither the geographical norphysiological variables). The PCA model including the 3principal components for the augmented data matrix Daug
explained 78.1% of the total variance. In Fig. 5a, the biplotrepresentation (simultaneous plotting of loadings and scores)for PC1 versus PC2, shows that samples tended to groupaccording to their tissue nature, i.e. either muscle or liver.Liver samples tended to be associated with relatively higherconcentrations of PCBs, and appeared to be displaced towardslower PC2 scores. The biplot of PC2 versus PC3 (Fig. 5b) revealsthis association evenmore clearly. In both plots, liver samplesexhibited higher dispersion thanmuscle samples, which weremore homogeneous among different specimens. This resultagain confirms the different behaviour of the samplescollected for the two types of tissues, particularly in relationto the dependence between organochlorine compound con-centrations and the percentage of lipids.
Please cite this article as: Felipe-Sotelo M, et al, Assessment ofthe accumulation of organochlorine compounds in European m.
TEDPR
OOF
43.2. Partial least squares
4Partial least squares (PLS) models were developed to establish4the relative importance of the geographical and physiological4parameters in the accumulation of organochlorine com-4pounds in fish muscle. The aim was not to develop good4predictive models for the accumulation of organochlorine4compounds as a function of geographical and physiological4parameters, since this would require a higher number of4variables (not available in this work) and samples. Instead, the4goal was to establish which of the parameters had more4influence on the observed concentration of organochlorine4compounds in the fish tissues of the different mountain lakes.4In addition, we aimed to find potential systematic patterns or4trends in these dependencies. Therefore, the intention was4more qualitative than quantitative. PLS was chosen because it4is not affected by correlation among predictor X variables (in5this case the geographical and physiological parameters). In5fact, it improves precision inmodelling and prediction proper-5ties when this correlation occurs. PLS models were built5separately for each organochlorine compound. The concen-5tration of each compound was taken as the predicted variable5(y) and the set of 8 geographical and physiological parameters5as predictor variables (X). Several pretreatment procedures5were tested on both y and X (autoscaling, scaling to unit5variance and mean centering). Finally, autoscaling was5chosen, since it provided better models and largely explained5the variances of y variables. Some transformations of the5original data were also evaluated, in order to consider possible5non-linearities in the model. These transformations included5augmentation of the predictormatrix using quadratic terms of5the original variables and/or logarithmic transformation of y5variables. In some cases, these transformations slightly5improved the explained y variance for some of the organo-5chlorine compounds. However, these models were not con-5sidered significantly better, as they were more complex. This5complexity makes it more difficult to interpret the depen-5dences between the accumulation of organochlorine
the environmental and physiological processes determining.., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020
CED
PROO
F521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548Q2
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
Fig. 6 –Representations of the weight vectors (w1w1) of the first latent variable, obtained for the PLS models developed for eachindividual organochlorine compound (organochlorine compounds were grouped for the representation according to theirsimilar behaviour).
12 S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ARTICLE IN PRESS
UNCO
RRE
compounds and the original variables. Twelve separatemodels were finally built using autoscaling pretreatmentand cross-validation by the leave-one-out method, to selectfor an appropriate number of latent variables. Explainedvariances as well as the number of latent variables for eachmodel are given in Table 5. Results shown in this table onlycorrespond to models obtained from individual fish values ofthe concentrations of each organochlorine compound in eachof the studied lakes. The explained concentration (y) variancesdiffered depending on the compound. Higher explainedvariances were found for HCB, DDT, DDE and most of thehigher molecular weight PCBs. In contrast, concentrations oflower molecular weight PCBs, #28 and #52, were more poorlyexplained by the geographical, temperature and physiologicalparameters investigated in this work. These two compoundsshowed clearly separate behaviour in the previous study usingPCA and their distinct behaviour is confirmed here. Com-pounds like α-HCH, γ-HCH and PCB#118 gave intermediateresults, with explained variances between 50 and 60%.
The discussion of the relative importance of the para-meters in terms of the accumulation of organochlorinecompounds can be made on the basis of the weight vectorfor the first score or latent variable, w1w1. Direct comparisonof the weight vectors can be made, as the autoscalingprocedure used to preprocess the data eliminated the scaledifferences between predictor variables or for differentpredicted (organochlorine compound concentration) vari-ables, since both y and X variables were autoscaled. Four
Please cite this article as: Felipe-Sotelo M, et al, Assessment ofthe accumulation of organochlorine compounds in European m.
Tgroups of organochlorine compounds were considered in thediscussion, as they had similar structure and behaviour,which was also revealed by previous PCA studies. Fig. 6shows the representation of the first weight PLS vectors,w1w1,for these four groups of compounds. The larger the weightvalue (either positive or negative) for one particular X variable,the greater the effect (either positive or negative accordingly)this variable had in the explanation of the correlation with they variable (the observed concentration and accumulation ofthe considered organochlorine compound).
Fig. 6 shows that similar patterns were recognized for mostof the organochlorine compounds in all cases, with relativelyhigh positive weight values (positive correlation) for Xvariables such as lipid content, specimen age and lakealtitude, and negative weight values (negative correlation)for X variables like temperature and latitude. The other Xvariables, such as specimen weight and size and geographicallongitude of the lake, showed a random, lower contribution tow1w1 vectors. A first group of compounds was formed by HCB,α-HCH and γ-HCH. Fig. 6a shows that the concentration ofthese compounds positively correlated with the lipid contentin muscle and was also positively correlated with the age ofthe specimen and the latitude of the lake. Simultaneously, itwas inversely correlated with the temperature and latitude ofthe lake. The same occurred with the second group ofcompounds formed by DDE and DDT, which showed verysimilar patterns (Fig. 6b). Again, the main factors affecting theaccumulation of these compounds in the lipids of fish muscle
the environmental and physiological processes determining.., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
87
89
90
91
92
93
13S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ARTICLE IN PRESS
UNCO
RREC
were the specimen’s specimen's age, the lake altitude (bothwith high positive weights), the temperature and lake latitude(with negative weights). The concentrations of the group ofcompounds formed by PCBs of lower molecular weight,PCB#28 and #52, were not well explained by the investigatedvariables and were poorly correlated with the lipid content.Therefore, in this case, little can be inferred from the observedpositive and negative weight values and correlations. Finally,the fourth group embraces the remaining PCBs of highermolecular weight (PCB#101, 118, 138, 158 and 180). They againshowed quite a uniform pattern, as depicted in Fig. 6d. Again,the main factors affecting the concentration and accumula-tion of these PCBs in the muscle tissue were the percentage oflipids, the age of the fish specimen, the air temperature andlatitude (these two with negative correlations). This confirmsthat the lower the temperature (and the higher the altitude)the higher the concentration of these PCBs in the lipids of themuscle tissue of fish specimens. All these effects also agreewell with those previously obtained by PCA. It is thereforeconfirmed that the concentrations of most of the organo-chlorine compounds in the muscle tissue lipids of fishspecimens collected in different mountain lakes of Europeduring this study showed a clear tendency to be higher whenlake temperatures were lower (also at higher lake altitudesand lower lake latitudes). It is also clear that this accumulationwas also higher for the older studied specimens (higher age).
Finally, looking at Fig. 6 as a whole, it can be seen that,despite some specific differences, the shapes of w1w1 profileswere very similar for all the organochlorine compoundsstudied. The w1w1 vectors describe and recognize the domi-nant pattern or phenomenon that correlates organochlorinecompound concentration with the variables considered in thisstudy. This correlation is not random or stochastic, butsystematic for most of the investigated compounds. Most ofthe compounds follow very similar patterns, in whichtemperature (and consequently lake latitude and altitude)was always shown to play an important role in the accumula-tion of these compounds. Consequently, the results clearlyreinforce the phenomenological explanation provided by the“global distillation effect” theory and show that this is alsouseful for describing the altitudinal distribution of thesecompounds in mountain areas.
As stated above, some predictor variables in the presentstudy were highly correlated. For instance, it is well knownthat as the altitude of a lake increases, the average tempera-ture tends to decrease. In Europe, higher latitudes correspondto lakes of lower altitude. Collinearity between variables doesnot affect PLS models. In PLS models for each organochlorinecompound, the same conclusions were obtained when eitheraltitude or temperature was removed from the predictorvariables. In addition, the relative importance of the variableswas the same. This is not the case for classical multilinearregression (MLR) methods (regardless of whether or not astepwise selection of variables was used), due to the intrinsicdifficulties associated with these methods when collinearityamong variables is very high. The number and identity of theinitial parameters considered in the analysis can change thefinal regression model substantially. In this work, PLS modelswere preferred as they are more robust than MLR to thepresence of collinear variables.
Please cite this article as: Felipe-Sotelo M, et al, Assessment ofthe accumulation of organochlorine compounds in European m.
TEDPR
OOF
64. Conclusions
6This paper analyses the dependence between the accumula-6tion of organochlorine compounds in fish muscle and liver6tissues and geographical and physiological parameters by6means of chemometric tools. It shows the ability of chemo-6metric tools to deal with complex environmental data sets and6problems. The multivariate data analysis methodologies used6here (PCA and PLS) were shown to be useful for investigating6and illustrating how rich environmental information can be6extracted in a relatively straightforward and fast way, by6simultaneously considering a large number of samples and6variables.6The results extracted from the PCA and applied to the6whole set of biological and environmental parameters are6in agreement with previous studies that propose a global6distillation effect to explain the presence of a higher concen-6tration of organochlorine compounds with increasing altitude6and decreasing temperatures. All the organochlorine com-6pounds considered in this work followed this general trend.6However, the trend seemed slightly more marked for organo-6chlorine compounds of lower volatility.6These conclusions were confirmed in both liver and6muscle samples. Nevertheless, there were differences be-6tween the muscle and liver in the effects of the percentage of6muscle lipids on the accumulation of organochlorine com-6pounds and the behaviour of HCB, which showed opposite6tendencies in both tissues. The joint PCA of both tissues6revealed that the PCBs had a slight tendency to accumulate6more in liver than in muscle. Moreover, the organochlorine6compound concentration range in liver was more scattered6than in muscle.6Geographical differences were found within samples.6PCA enabled samples to be clearly clustered according to6their origin. Whereas PC1 enabled us to differentiate between6lakes, PC2 allowed specimens to be individuated within each6lake.6The different PLS models developed in this study establish6the relative importance of the different parameters in relation6to the accumulation of organochlorine compounds in fish6tissues. The results reveal that the percentage of lipids, age,6altitude, latitude and temperature were the main factors6affecting the concentration and accumulation of the organo-6chlorine compounds in muscle fish samples. These results6also confirmed previous conclusions obtained by PCAmodels.6In both cases, accumulation of organochlorine compounds in6fish tissues in mountain lakes were shown to depend6significantly on altitude and latitude. Therefore, the results6support the global distillation theory and can be applied to a
description of the altitudinal profiles of these compounds in6mountain lakes.
6Acknowledgements
6M. F. S acknowledges the CSIC (Centro Superior de Investiga-6ciones Científicas) for an I3P fellowship in the IIQAB. Research6funds from Ministerio de Educación y Ciencia, Spain, research6project Nr. CTQ2006-15052 are also acknowledged.
the environmental and physiological processes determining.., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020
PR
69 4
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
14 S C I E N C E O F T H E T O T A L E N V I R O N M E N T X X ( 2 0 0 8 ) X X X – X X X
ARTICLE IN PRESS
R E F E R E N C E S
Blais JM, Schindler DW, Muir DCG, Kimpe LE, Donald DB,Rosenberg B. Accumulation of persistent organochlorinecompounds in mountains of western Canada. Nature1998;395:585–8.
Daly GL, Wania F. Organic contaminants in mountains. EnvironSci Technol 2005;39:385–98.
Farnham IM, SinghAK, StetzenbachKJ, Johannesson KH. Treatmentof nondetects in multivariate analysis of groundwatergeochemistry data. Chem Intell Lab Systems 2002;60:265–81.
Grimalt JO, Borghini F, Sánchez-Hernández JC, Barra R, TorresGarcía CJ, Focardi S. Temperature dependence of thedistribution of organochlorine compounds in themosses of theAndean Mountains. Environ Sci Technol 2004;38:5386–92.
Grimalt JO, Fernández P, Berdie L, Vilanova RM, Catalan J, PsennerR, et al. Selective trapping of organochlorine compounds inmountain lakes of temperate areas. Environ Sci Technol2001;35:2690–7.
Jolliffe IT. Principal component analysis. New York (USA):Springer-Verlag; 2002.
Mackay D, Shiu WY, Ma KC. Illustrated handbook ofphysical–chemical properties and environmental fate fororganic chemicals. Michigan (USA): Lewis Publishers; 1992.
Martens H, Naes T. Multivariate calibration. London: John Wiley &Sons; 1991. pp.
Massart DL, Smeyers-Verbeke J, Capron X, Schlesier K. Visualpresentation of data by means of box plots. LC GC Europe2005;18:215–8.
UNCO
RREC
Please cite this article as: Felipe-Sotelo M, et al, Assessment ofthe accumulation of organochlorine compounds in European m.
OOF
Massart DL, Vandeginste BGM, Deming SN, Michotte Y, KaufmanL. Chemometrics: a textbook. Amsterdam: Elsevier; 1988.
Muir DCG, Ford CA, Grift NP, Metner DA, Lockhart WL.Geographic-variation of chlorinated hydrocarbons in burbot(Lota-Lota) from remote lakes and rivers in Canada. ArchEnviron Cont Tox 1990;19:530–42.
Ribes A, Grimalt JO, García CJT, Cuevas E. Temperature and organicmatter dependence of the distribution of organochlorinecompounds in mountain soils from the subtropical Atlantic(Teide, Tenerife Island). Environ Sci Technol 2002;36:1879–85.
Simonich S, Hites R. Global distribution of persistentorganochlorine compounds. Science 1995;269:1851–4.
Smilde AK, Bro R, Geladi P. Multi-way analysis with applications inthe chemical sciences. Chichester (United Kingdom): JohnWiley & Sons; 2004. 381 pp.
Vandeginste BGM, Massart DL, Buydens LMC, de Jong S, Lewi PJ,Smeyers-Verbeke J. Handbook of chemometrics andqualimetrics. The Netherlands: Elsevier Science B. V.; 1998.
Vives I, Grimalt JO, Catalan J, Rosseland BO, Battarbee RW.Influence of altitude and age in the accumulation oforganochlorine compounds in fish from high mountain lakes.Environ Sci Technol 2004;38:690–8.
Wania F, Mackay D. A global distribution model for persistentorganic-chemicals. Sci Total Environ 1995;161:211–32.
Wania F, Mackay D. Tracking the distribution of persistent organicpollutants. Environ Sci Technol 1996;30:A390–6.
TEDthe environmental and physiological processes determining.., Sci Total Environ (2008), doi:10.1016/j.scitotenv.2008.06.020