Upload
gabriel-lr
View
215
Download
0
Embed Size (px)
Citation preview
8/13/2019 2001 Lopez (Agri for Meteorol_107)
1/13
Agricultural and Forest Meteorology 107 (2001) 279291
Estimation of hourly global photosynthetically active radiationusing artificial neural network models
G. Lpez a, M.A. Rubio a, M. Martnez b, F.J. Batlles a,
a Facultad de Ciencias, Dpto. de F sica Aplicada, Universidad de Almer a, La Canada de San Urbano s/n, 04120 Almer a, Spainb Dpto. de Lenguajes y Computacin, Universidad de Almera, 04120 Almera, Spain
Received 5 July 2000; received in revised form 21 December 2000; accepted 3 January 2001
Abstract
Photosynthetically active radiation (PAR) reaching the earths surface is a major parameter controlling many biologicaland physical processes related with the evolution of plant canopies, agricultural and environmental fields. Unfortunately, PAR
is not often measured and therefore it must be estimated. The unavailability of measurements of global solar radiation at the
place of interest and different factors affecting the linear relation between PAR and global solar radiation can preclude the
estimation of PAR from global solar radiation. In this paper, a novel approach based on a simple multilayered feedforward
perceptron has been used to analyse the non-linear relationships between PAR and different meteorological and radiometric
variables in order to determinetheirrelative relevance. An artificialneural network based model forthe estimation of thehourly
PAR involving hourly global irradiance as only measured variable has been successfully developed. The model was tested
using data recorded at six radiometric stations covering a wide range of climates. The model performance has been compared
with other existing empirical complex models showing important improvements. Next, a second artificial neural network
based model involving only sunshine duration measurements has been developed and proved to be an acceptable alternative
to calculate hourly PAR when radiometric information is not available. 2001 Elsevier Science B.V. All rights reserved.
Keywords:Photosynthetically active radiation; Artificial neural network; Global solar radiation; Sunshine duration; Estimation
1. Introduction
One critical factor for crop energy conversion for
plant growth is the solar energy in the wavelength
region 400700 nm, referred to as photosynthetically
active radiation (PAR), received by the plant. Fur-
thermore, many of the exchange processes between
vegetation canopies and the atmosphere, as well
as dry matter yield, are regulated by photosynthe-
sis, which has often been related to the amount of
Corresponding author. Tel.: +34-950-015414;
fax: +34-950-215477.
E-mail address: [email protected] (F.J. Batlles).
absorbed PAR (Hanan and Bgu, 1995; Li et al.,
1997). Therefore, the amount of PAR absorbed by
green vegetation not only governs the processes of
the plant evolution but also influences the exchange
of energy and water between the land surface and the
atmosphere. Unfortunately, measurements of PAR are
not routinely carried out at radiometric stations. This
problem necessitates estimation from other commonly
measurable meteorological and radiometric variables
available at the location of interest. In the literature
there are two kinds of models for estimating the PAR:
(i) radiative transfer models (Gueymard, 1989a,b;Olseth and Skartveit, 1993) and (ii) empirical models
(Moon, 1940; Britton and Dodd, 1976; Howell et al.,
0168-1923/01/$ see front matter 2001 Elsevier Science B.V. All rights reserved.
PII: S 0 1 6 8 - 1 9 2 3 ( 0 1 ) 0 0 2 1 7 - 9
8/13/2019 2001 Lopez (Agri for Meteorol_107)
2/13
280 G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291
Nomenclature
a zero intercept of the linear regression
fit between estimated and measured
values
b slope of the linear regression fit between
estimated and measured valuesD diffuse solar irradiance on horizontal
surface (W m2)
E sum of squares of differences between
network outputs and target outputs
G global solar irradiance on horizontal
surface (W m2)
h air relative humidity
I direct beam solar irradiance (W m2)
IIIid identity matrix
I0 extraterrestrial solar irradiance
(W m2)
I0h extraterrestrial solar irradiance
on horizontal surface (W m2)
JJJ Jacobian matrix
JJJT transpose of Jacobian matrix
k diffuse fraction
kt clearness index
mr relative optical air mass
MBE mean bias error expressed as a
percentage of the mean measured value
of the corresponding photosynthetically
active radiation
N number of nodes used by an artificial
neural network layer
Nh number of hidden nodes used by anartificial neural network
Ni number of input nodes used by an
artificial neural network
Np number of training patterns
oi artificial neural network output
forith pattern
Qp photosynthetically active radiation
on a horizontal surface (E m2 s1)
r residual vector
r T transpose vector ofr
R2 coefficient of determination of the
linear regression fit between estimated
and measured valuesS relative sunshine hours
ti target output forith pattern
T air temperature (C)
Td air dew point temperature (C)
w precipitable water thickness (cm)
w vector of weights used by anartificial neural network
wij weight for the connection betweennodei and node j in an artificial
neural network
x real-valued variable
xmax maximum value ofx
xmin minimum value ofx
xscaled scaled value ofx
y activation function of the nodes
used by an artificial neural network
Greek symbols
weight correction parameter used in the
training process of an artificial
neural network brightness of the sky
clearness of the sky
z solar zenith angle
Marquardt parameter used in the
training process of an artificial
neural network
1983; Papaioannou et al., 1993; Alados et al., 1996;
Al-Shooshan, 1997). The first ones take into account
interactions on 400700 nm waveband radiation with
terrestrial atmosphere, such as Rayleigh scattering,
ozone absorption and aerosol extinction. The currentproblem in the use of these models is the large amount
of meteorological information required, which is not
often measured at meteorological stations. On the
other hand, the empirical models relate the PAR to
other solar radiation measurements, especially global
solar irradiance on horizontal surfaces, thus requiring
simpler input data than the parametric models and
providing an estimate of the PAR both for cloudless
and cloudy conditions. In this way, these models avoid
the difficulties associated with the parameterisation
of clouds.
The linear relationship between PAR and global
radiation (2902700 nm) is usually the basis ofthe empirical models. Some studies (Szeicz, 1974;
Papaioannou et al., 1993; Alados et al., 1996) tend
8/13/2019 2001 Lopez (Agri for Meteorol_107)
3/13
G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291 281
to show that the ratio between the PAR and global
radiation depends on solar elevation, as well as dew-
point temperature and sky conditions. These relations
appear to be non-linear and difficult to handle with
standard statistical techniques. Furthermore, sky con-
ditions are characterised by means of clearness of sky
and the brightness of skylight, which are functions
of direct beam and diffuse irradiance (Perez et al.,
1990). This dependence presents an additional prob-
lem since normal beam or diffuse components are not
measured as frequently as global irradiance.
This paper presents a novel technique for mod-
elling the hourly global PAR based on artificial neural
networks (ANNs). Neural networks are widely ac-
cepted as a technology offering an alternative way to
tackle complex and ill-defined problems. The power
of neural networks in modelling complex mappings
has been demonstrated (Kohonen, 1984; Lupo, 1989;
Hammerstrom, 1993) and successfully applied to a
wide range of agricultural and engineering applica-tions reporting higher accuracy compared to classical
methods (Bolte, 1989; Zhuang and Engel, 1990;
Muttiah and Engel, 1991; Thai and Shewfelt, 1991).
Furthermore, in the meteorological field, neural net-
work models have been developed to model radiation
variables, as global or diffuse solar irradiance (Lpez
et al., 2000a,b), showing important improvements
against traditional statistical models.
In this work, some radiometric and meteorolog-
ical variables derived from physical considerations
for estimating hourly PAR have been studied. An
analysis of the relevance of these inputs based on a
Bayesian method involving ANNs (MacKay, 1994;Neal, 1996) has been carried out. This first stage se-
lected the relevant inputs to develop the simplest and
optimal neural network model including radiometric
information. In addition to this, since throughout the
Table 1
Geographical locations of the stations, period of measurement and number of hours
Country Latitude (N) Longitude (W) Altitude (a.m.s.l.) Years N
Abisko Sweden 68.35 18.82 385 19951997 5573
Almera Spain 36.83 2.41 6 19901995 15200
Granada Spain 37.18 3.58 660 19941995 7442
Bondville USA 40.05 88.37 213 19971998 3123Desert Rock USA 36.63 116.02 1007 19981999 3274
Table Mountain USA 40.13 105.3 1689 19951997 7052
world more solar data are in the form of sunshine
measurements than in the form of radiation measure-
ments (Michalsky, 1992), a similar analysis has been
performed using sunshine duration rather than radia-
tion data inputs, so that a new and alternative model
may be applied in calculating PAR values without
data on global irradiance. Once the input variables
have been determined for both input sets, we have
developed the two respective ANN models. For that,
the ANNs have been trained using meteorological and
radiometric measurements obtained from only one
radiometric station, i.e. Almeras radiometric station.
Data collected at other radiometric stations operating
in climatologically diverse regions, have been used
to test the proposed models. In addition, for a better
analysis of the statistical results, the performance of
the proposed models has been compared with the
original versions of two empirical models by Alados
et al. (1996).
2. Data and measurements
The present work is based on data sets collected
at six radiometric stations. Table 1 summarises their
geographical locations, the number of observations
and the recording period. Almeras station is lo-
cated on a seashore site on the Mediterranean coast
of south-eastern Spain. High frequency of cloudless
days, an average annual temperature of 17C, and
the persistence of high humidity regime characterise
the local climate. The remaining stations have been
chosen with the intention of best representing a widediversity of climates. Granadas station is located on
the outskirts of Granada (south-eastern Spain). Cool
winter and hot summers characterise its inland loca-
tion. Abiskos station belongs to and is administered
8/13/2019 2001 Lopez (Agri for Meteorol_107)
4/13
282 G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291
by The Royal Swedish Academy of Sciences. It is sit-
uated about 200 km north of the arctic circle, on the
south shore of Lake Tornetrsk (Sweden). The aver-
age annual temperature is approximately 1.0C andthe annual precipitation varies from about 1000 to
400 mm. The stations at Bondville, Desert Rock and
Table Mountain are part of Surface Radiation Research
Branch (SRRB) of NOAAs Air Resources Labora-
tory, and they represent inland locations with different
degrees of continental climate.
The data sets contain measurements of global hori-
zontal photosynthetic active photon flux density,
which we refer to as PAR, global and diffuse solar
irradiances on a horizontal surface, temperature, and
relative humidity. In Abiskos database, PAR and
global irradiance are only recorded. PAR was mea-
sured by LI-COR silicon quantum sensors at all sta-
tions and expressed as E m2 s1. Kipp and Zonen
pyranometers were employed to measure the global
solar irradiance in Almeras, Granadas and Abiskosstations, whereas Eppley ventilated pyranometers
model PSP were employed at the stations in the USA.
At Almeras and Granadas station, diffuse irradi-
ance was measured by a Kipp and Zonen pyranometer
equipped with an Eppley shadow band model SBS.
Because of the shadow band screens the sensor from a
portion of the diffuse radiation coming in from the sky,
a correction has been made to the measurements fol-
lowing Batlles et al. (1995). The SRRB stations used
Eppley pyranometers mounted on Eppley automatic
solar trackers model SMT-3 equipped with shade
disks model SDK. The measurements of temperature
and relative humidity were registered by means ofstandard sensors exposed in a meteorological screen.
Data were recorded and averaged on different sam-
pling basis times (1, 3, 5 and 10 min). Next, hourly
averaged values were obtained for all variables. Due
to cosine response problems of radiometric sensors,
we have only used cases corresponding to solar zenith
angles less than 85.
3. Artificial neural network
3.1. Basic architecture
ANNs learn the relationship between the input and
output variables by studying previously recorded data.
Fig. 1. A simple three-layer ANN.
Our neural network correspond to a feedforward mul-
tilayered perceptron (MLP) (Rumelhart et al., 1986),
which is among the most widely used neural network
models that learn from examples. These neural net-
works perform a non-polynomial regression and were
found to be suitable for our task. In the model for the
MLP, there is an input layer, a hidden layer, and anoutput layer (Fig. 1). Every layer is formed from el-
emental units named neurons or nodes. The neurons
in the input layer only receive the input signals and
distribute them forward to the network. In the follow-
ing layers, each neuron receives a signal, which is a
weighted sum of the outputs of the nodes in the layer
below. To node i in a posterior layer, l, the total input
xis given by
x(l)i =
N(l1)
j=0
w(l)ij y
(l1)j (1)
whereNis the number of nodes, wijthe weight for theconnection between nodej and node i andyj the output
of node j. Each node i has a bias, represented by the
weightwi0from a node with a constant unit activation
(y0 1). The output of a node in a hidden or outputlayer is obtained through a nodal activation function,
which uses the node input as argument. The activation
function used for the hidden neurons is sigmoid from
0 to 1
y = 1
1 + ex (2)
whereas the linear function (y = x) has been used for
the output nodes.Whereas the number of input and output neurons is
determined by the respective number of independent
8/13/2019 2001 Lopez (Agri for Meteorol_107)
5/13
G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291 283
and dependent variables involved in the current prob-
lem, the choice of the number of hidden neurons
is usually not as easy. In current applications, the
number of intermediate neurons is often decided in
a heuristic way. In this work, the optimal number of
intermediate neurons has been determined empiri-
cally as the minimum number of neurons for which
estimation performance on a test set is satisfying.
3.2. Network training
The network training is the procedure where the
unknowns, i.e. weights and bias terms, are adjusted
based on numerical training data. The training data
is a set of patterns consisting of input and corre-
sponding output values, so-called target values. The
training method used in this work is named as su-
pervised training. The patterns in the training set
are presented to the network one at a time, and fol-
lowing a random sequence for optimal learning. Foreach sample, we compare the outputs obtained by
the network with the desired outputs. After the entire
training pattern has been processed, the weights and
bias are updated. This updating is done in such a way
that a measure of the error in the networks results is
reduced.
The goal is to find a network that describes the
inputoutput relation represented by the training
patterns. The LevembergMarquardt method (Mar-
quardt, 1963; Finschi, 1996) has been used to adjust
the unknown network parameters in order to minimise
the sum of squares of residuals, E, calculated as the
differences between network outputs, oi , and targetoutputs, ti , where i = 1Np with Np the number ofpatterns . The problem can be formulated as
minw
{E= r Tr} (3)
where r = (ti oi ; i = 1, . . . , N p) is the residualvector, rT the transpose vector ofr , and the networkunknowns are collected into a vector, w, according to
w = (w0i=1,Ni;j=1,Nh , wk=1,Nh , wbiasl=0,Nh
) (4)
wherew 0ij are the weights from input to hidden layer,
wk the weights from hidden to output layer, wbiasl theoutput and hidden bias, Ni the number of the input
nodes, and Nh the number of the hidden nodes. At
each iteration, m, the optimisation method adjusts the
unknowns according to
w(m+1) = w(m) + (m) (5)
where the correction, , is obtained from
(JJJ(m)
JJJ(m)T
+ (m)
IIIid)(m)
= JJJ(m)
rrr(m)
(6)
In the above equation,JJJis the Jacobian matrix with
first derivates of the residuals with respect to the
unknowns,JJJT the transpose matrix ofJJJid and III the
identity matrix. The Marquardt parameter is au-
tomatically adjusted during the training (Marquardt,
1963). The method approaches GaussNewton if
0 and steepest descent with a small step lengthif . Analytical expressions are derived for cal-culation of the Jacobian (Williams and Zipser, 1989).
3.3. Training and testing data
The performance of a trained network must be
evaluated by testing it on a different data set than
the one on which it was trained. This is an important
task, since a multilayer net can approximate any con-
tinuous multivariate function to any desired degree of
accuracy, provided that sufficiently many hidden neu-
rons are available. Thus, rather than learning the basic
structure of the data, enabling it to generalise well, it
learns irrelevant details of the individual cases.
The training file was created by randomly selecting
10% of the whole data set of Almeras station. The
ANN model is tested at each location. The tested file
used at Almera corresponds to the remaining 90%
of the whole data set whereas at the other sites the
corresponding whole data set is used. It should be
noted how few training patterns are required to develop
the ANN model against traditional statistical methods.
The input and output values were linearly scaled to lie
in the range 01 using
xscaled= x xmin
xmax xmin(7)
The constantsxmaxandxminare equal to the maximum
and minimum recorded value for each variable x. This
approach reduces the training time by eliminating thepossibility of reaching the saturation regions of the
sigmoid transfer function during training.
8/13/2019 2001 Lopez (Agri for Meteorol_107)
6/13
284 G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291
4. Description of the input variables used
In the literature, there are models that use different
input sets for estimating PAR (Gueymard, 1989a,b;
Alados et al., 1996). We have tried to collect the most
common parameters used by empirical models and
analyse their input relevance as they are included in
the ANN.
Temperature, T, relative humidity, h, dew point
temperature,Td, and precipitable water thickness, w ,
influence the water vapour absorption in 400700 nm
band and solar broadband radiation on hourly PAR
estimation. Hourly precipitable water thickness has
been estimated from dew point temperature using
Reitans equation, which has been proved to be valid
for instantaneous values (Wright et al., 1989):
w = exp(0.0756 + 0.0693 Td) (8)
Concerning radiometric parameters, global and dif-
fuse solar irradiances have been considered. Thedimensionless indices kt, k, and have also been
introduced to account for the different sky conditions.
The hourly clearness index is defined as the ratio be-
tween the hourly horizontal global irradiance, G, to
the hourly horizontal extraterrestrial irradiance, I0h,
(kt = G/I0h). Hourly diffuse fraction is defined asthe ratio of hourly diffuse irradiance, D, to the hourly
horizontal global irradiance, (k = D/G). The clear-ness of the sky and the brightness of the sky are given
by Perez et al. (1990)
={(I+ D)/D} + 1.0413z
1 + 1.0413z(9)
= Dmr
I0(10)
where z is the solar zenith angle, mr the relative
optical airmass given by Kasten (1965), Ithe direct
beam solar irradiance and I0 the hourly extraterres-
trial irradiance. Values of direct solar irradiance were
obtained from measurements of global irradiance,
diffuse irradiance and solar zenith angle by means of
their relationship:
I = G D
cos z
(11)
It is clear that the kt k and representationsexpress equivalent information. The advantage of the
first representation is simpler computation whereas the
second provides a better characterisation of the clouds
and atmospheric aerosol amount. Finally, hourly sun-
shine duration represents the sum of minutes each
hour that the direct radiation,I, is above a threshold of
120Wm2 according to the WMO (Michalsky, 1992).
Hourly relative sunshine values are defined as the ratio
of actual minutes of sunshine to minutes of possible
sunshine for the hour.
5. Development of the ANN models
As we noted in Section 3, the number of input
and output units depends on the independent and
dependent variables involved in the current problem.
In our case, the output layer has had a single unit
representing the estimated hourly PAR. For the input
layer, we have considered two cases: the first one
takes into account all meteorological and radiometricinformation, whereas in the second one, the sunshine
duration and meteorological information are used in-
stead. For both cases, some input variables are not
independent of each other, and they could thus be
excluded from the ANN input layer. However, no
information about which input combination is most
relevant to the estimation of PAR, and thus, an analy-
sis of the input relevance for determining the optimal
input configuration is necessary.
We have used the Bayesian method of automatic
relevance determination (ARD) (MacKay, 1994;
Neal, 1996) for multilayer perceptron networks to ob-
tain the relevant inputs for each case. In this method,a hyperparameter is associated with each input. The
hyperparameters for an input corresponds to an in-
verse variance of the weights on connections from
that input, and represents the relevance of that input to
the task of predicting the measured output. Thus, the
smallest hyperparameter will indicate the input which
accounts for the largest part of the variability and
hence is the most relevant input, and the hyperparam-
eter with the next increase in value indicates the next
most relevant input and so on. Hence, 12 and 7 input
units form the input layer, respectively. With two neu-
rons in the hidden layer for both cases, the ANN archi-
tectures developed for the ARD are fully determined.The ARD was performed using all of Almeras
database.
8/13/2019 2001 Lopez (Agri for Meteorol_107)
7/13
G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291 285
Table 2
Results of the automatic relevance determination (ARD) method
(for definition of the symbols see Nomenclature)
Input Hyperparameter
Using radiometric
information
Using sunshine
information
Random 4639980 155907T 457 6
h 1500 8
Td 111 16
w 58 24
G 3
D 29
kt 18
k 31
19
33
cos z 19 2
S 1
Table 2 summarises the results of the ARD method.In the left-hand column are the input parameters. Two
sets of results, one for radiometric and meteorological
inputs and one for sunshine duration and meteoro-
logical inputs, are given. In both cases, the artificial
random signal presents the largest hyperparameter as
it is independent of the PAR. On the other hand, the
smallest hyperparameter obtained in the first case is
associated to the global irradiance. The inputs with
relevance similar to each other succeeding to global
irradiance are cos z,kt, and. Next,D,kand have
shown to be the following set of relevant inputs, and
lastly, the meteorological variables involving precip-
itable water and dew point temperature. It is also noted
Table 3
Statistical results obtained from various input configurations used in developing the ANN model for estimating hourly PAR from radiometric
and meteorological information (for definition of the symbols see Nomenclature)a
Inputs a (E m2 s1) b R2 RMSE (%) MBE (%)
G, cos z, , , Td , w 2.8 1.00 0.999 2.1 0.0
G, cos z, kt , k, Td, w 2.3 1.00 0.999 2.0 0.0
G, cos z, kt , , h , Td 3.7 1.00 0.998 2.3 0.0
G, cos z, , Td 2.5 1.00 0.999 2.0 0.0
G, cos z, Td 2.2 1.00 0.999 2.0 0.0
G, cos z, 2.8 1.00 0.999 2.1 0.0
G, cos z 2.6 1.00 0.999 2.1 0.0
G, kt 2.8 1.00 0.999 2.3 0.1G, Td 2.0 1.00 0.998 2.6 0.0
a Mean value of measured hourly PAR: 938 E m2 s1.
the higher relevance of the dew point temperature and
precipitable water against temperature and relative
humidity when radiometric measurements are consid-
ered. Anyway, these results show that meteorological
information accounting for water vapour absorption
can be excluded from the ANN, and the variability
of the PAR values could be explained using only ra-
diometric information. These results agree with those
obtained by Gueymard (1989a,b) which shows that
global PAR in cloudless skies is essentially dependent
on the solar zenith in most conditions. In the second
case, relative sunshine duration appears to be the most
relevant input in conjunction with cos z. Tempera-
ture and relative humidity become more relevant than
dew point temperature and precipitable water with
hyperparameter values slightly higher than the lowest
ones. Thus, the inclusion of these parameters in the
ANN cannot be deleted and must be considered. Once
the relevance of the inputs has been analysed for the
two cases following the ARD method, the optimumANN configurations are to be determined. In the first
case, when radiometric measurements are available,
various ANNs with different combinations of input
parameters different to each other and 15 hidden units
were trained and tested according to Section 3.
Table 3 shows the statistical results obtained with
some of these ANNs for estimating hourly PAR. The
accuracy of the ANN models were evaluated based on
the regression analysis of estimated versus measured
values, in terms of the intercept, a, and slope, b, of
the linear fit and the determination coefficient,R2. We
have also included the root mean square error (RMSE)
and the mean bias error (MBE). The RMSE is a
8/13/2019 2001 Lopez (Agri for Meteorol_107)
8/13
286 G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291
measure of the variation of predicted values around the
measured values, while the MBE is an indication of
the average deviation of the predicted values from the
measured values. Both indicators have been expressed
as a percentage of the mean value of the measured
hourly PAR.
It is noted that all ANN model performances are
similar to each other with RMSE values that range
from 2.0 to 2.6% and R2 values around 0.999. None
of the cases exhibit marked deviations, all the slope
values are equal to 1.00 and the intercept values
are very similar to each other with values around
2.5 E m2 s1. These results show that a ANN based
model using only global irradiance and solar zenith
angle is suitable to estimate PAR and additional input
variables as those given in Table 3 do not provide an
improvement in the ANN model performance. Thus,
we have chosen the simplest ANN model, which in-
volves only global irradiance and cos z, as MODEL
1. Once the input layer is determined, several ex-periments were conducted to find the combination
of number of hidden nodes which gave the greatest
accuracy in predicting the validation data set. As the
number of hidden nodes increased, the R2 value of
the estimated versus measured hourly PAR increased
until the number of hidden units reached the value of
10. For values higher than 10, the number of hidden
units did not seem to improve the estimates.
In a similar way, the above procedure was applied
to develop the ANN model when only relative sun-
shine duration and meteorological measurements are
available. Table 4 displays the statistical results for
various models that use different ANN input config-urations. The best model performance corresponds to
Table 4
Statistical results obtained from various input configurations used in developing the ANN model for estimating hourly PAR from sunshine
duration and meteorological information (for definition of the symbols see Nomenclature)a
Inputs a (E m2 s1) b R2 RMSE (%) MBE (%)
S, cos z, T, h , Td , w 24.5 0.98 0.981 8.0 0.2
S, cos z, T, h 25.1 0.98 0.980 8.2 0.2
S, cos z, T 27.2 0.97 0.978 8.3 0.2
S, cos z, Td 32.6 0.97 0.979 8.3 0.5
S, cos z, w 30.4 0.97 0.973 8.8 0.0
S, Td 556.8 0.40 0.397 44.3 0.8
cos z, Td 138.3 0.86 0.855 21.7 0.8S, cos z 25.4 0.97 0.977 8.6 0.1
a Mean value of measured hourly PAR: 938 E m2 s1.
that input configuration that uses all variables and the
worse ones correspond to those models that exclude
either sunshine duration or cos z. These results agree
with those obtained by the ARD method. In those
cases involving sunshine duration and cos z, all sta-
tistical tests were only slightly different to each other.
These results encourage us to use the relative sun-
shine duration and cos zas the only input parameters
in the ANN model, since the inclusion of additional
meteorological information leads to a reduction in
RMSE of only around 0.6%, and the remaining statis-
tical tests present values which are very close to each
other. Lastly, 10 hidden units have also been found to
be the optimal number of hidden units. We will refer
to this second ANN model as MODEL 2.
6. Performance of the models and discussion
To further test the performances of the proposed
models, they are compared with the original versionsof two empirical models by Alados et al. (1996). These
two models have been chosen because they were also
developed using Almeras database and use various
combinations of input parameters. The first Alados
model, which we will refer to as Alados1, calculates
the hourly PAR, Qp, from global irradiance, , , Tdand z using the following equation:
Qp
G= 1.786 0.192 ln 0.202 ln + 0.005Td
+0.032 cos2z (12)
This parameterisation tries to explain the observeddependencies of the ratio between PAR to global
8/13/2019 2001 Lopez (Agri for Meteorol_107)
9/13
G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291 287
Table 5
Statistical comparison between observed and estimated PAR by MODEL 1, MODEL 2, Alados1 and Alados2, for each individual data set
(for definition of the symbols see Nomenclature)
a (E m2 s1) b R2 RMSE (%) MBE (%)
Almera (938 E m2 s1)a
Alados1 1.9 1.01 0.998 2.7 0.9
Alados2 3.6 1.02 0.998 3.0 1.4
MODEL 1 2.6 1.00 0.999 2.0 0.0MODEL 2 25.6 0.97 0.977 8.5 0.1
Granadab (936 E m2 s1)a
Alados1 9.5 0.96 0.998 4.8 3.4
Alados2 7.0 0.99 0.998 2.7 0.7
MODEL 1 11.7 0.97 0.998 3.7 2.2
Desert Rock (970 E m2 s1)a
Alados1 33.8 0.99 0.998 5.1 4.4
Alados2 31.5 0.98 0.997 6.0 5.3
MODEL 1 19.9 0.98 0.998 4.5 3.7
MODEL 2 3.4 0.89 0.976 14.3 10.4
Bondville (912 E m2 s1)a
Alados1 13.6 0.99 0.997 5.5 3.1
Alados2 13.0 0.99 0.996 5.9 3.5
MODEL 1 7.3 0.98 0.997 5.3 3.2
MODEL 2 83.7 0.88 0.945 18.8 1.3
Table Mountain (806 E m2 s1)a
Alados1 15.2 1.03 0.995 4.8 0.6
Alados2 13.7 1.02 0.995 4.8 0.1
MODEL 1 2.6 1.01 0.996 4.5 1.0
MODEL 2 66.3 0.90 0.948 15.3 2.1
Abiskob (414 E m2 s1)a
Alados2 14.1 1.03 0.993 9.5 6.0
MODEL 1 22.1 0.99 0.991 9.2 4.0
a Mean value of the measured hourly PAR.b Sunshine duration is not available.
irradiance with these variables and to exclude the ne-
cessity of local and seasonal calibration of this ratio.
The second Alados model, which we will refer to
as Alados2, estimates the hourly PAR from global irra-
diance,G, the clearness index, kt, and the solar zenith
angle,z. This model reads as follows:
Qp
G= 1.832 0.191 ln kt + 0.099cos z (13)
A statistical summary of the overall performance of
the four models is indicated in Table 5. The statistical
tests have been the same ones that we used in the
previous section. MODEL 1 appears to present thebest global results. In fact, MODEL 1 shows the low-
est RMSE percentages for all stations, excepting for
Granadas database. Moreover, MODEL 1 provides
estimates with errors below the experimental ones, for
Almeras station. Nevertheless, the statistical results
for the models that include radiometric information
present no significant differences to each other. This
outcome shows that measurements of global solar ir-
radiance are sufficient to obtain accurate estimates of
the hourly PAR in all sky conditions. Moreover, the
ANN model, MODEL 1, capture in a more real and
robust way the spread of PAR versus global irradiance
values. In this sense, Fig. 2 displays the scatter plot of
measured PAR values versus global irradiance values.
Additionally, we have plotted MODEL 1 evaluated forfour different fixed values of cos z. It is noted how
the spread of the PAR versus global irradiance values
8/13/2019 2001 Lopez (Agri for Meteorol_107)
10/13
288 G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291
Fig. 2. Measured hourly PAR versus hourly global irradiance
values. Solid lines correspond to MODEL 1 evaluated for four
different fixed values of cos z (for a better legibility of the figure,
MODEL 1 has been evaluated from initial values of measured
global irradiance equal to 470, 450 and 310W m2, for the values
of cos z equal to 0.950, 0.650 and 0.500, respectively).
is explained by MODEL 1. Furthermore, MODEL 1
describes the slight increase of the PAR proportion of
broadband radiation as sky conditions change from
clear to cloudy, as Papaioannou et al. (1993) reported.
It is observed how the line corresponding to PAR val-
ues ofcos zequal to 0.950 tends to the upper bound of
the measured ones as global irradiance decrease, e.g.,
the sky becomes cloudy. Addition of meteorological
variables as temperature, relative humidity, dew point
temperature or precipitable water, or other radiometric
parameters as diffuse solar irradiance, clearness index,
diffuse transmittance, clearness or brightness of the
sky, do not lead to an improvement in PAR estimation.Among the multiple regression models, the statis-
tical tests show that the model Alados1 presents the
best global performance in agreement to Alados et al.
(1996). It should be noted that the model Alados1 uses
and as input variables and thus direct or diffuse
irradiance measurements are needed. However, these
variables are not often recorded. Table 5 shows that
the model MODEL 1 reduces both RMSE and MBE
against the model Alados1 at all locations other than
Table Mountain. In addition, the zero intercept values,
a, of the model MODEL 1 exhibit a slight improve-
ment against those obtained by the model Alados1 at
the USA stations. It is also noted from Table 5 that thestatistical results concerning each model as they are
evaluated with data collected at locations different to
that where the models were developed show an equiv-
alent deterioration. One advantage using the ANN
based model, MODEL 1, is the lower amount of in-
put information, since it needs only global irradiance
measurements which are available at many sites.
On the other hand, the statistical results corre-
sponding to the second ANN based model, MODEL
2, present small deviations in Almeras and SRRB
databases, with the exception of Desert Rocks one
where the MBD reaches a value of about 10% . The
variance of the PAR values explained is around 96%
and the RMSD values are around 8.5 and 15.5% for
Almeras database and the SRRB databases, respec-
tively. Although these values are relatively higher than
the RMSD values corresponding to MODEL 1, this
model provides an acceptable hourly PAR estimation
if only hourly sunshine duration measurements are
available.
An analysis of the behaviour of the residual PAR
values, which are obtained as the difference betweenestimated and measured values, has also been carried
out. Fig. 3 shows the averaged residual PAR values
for MODEL 1 and Alados1 evaluated in Almera
versus (a) global irradiance, (b) cos z, and (c) dew
point temperature. These models have been selected
because they present better performances than the
other ones. It may be seen that PAR residuals corre-
sponding to MODEL 1 do not exhibit any deviation
against global irradiance and cos z and a very slight
(0.5%) and quasi constant deviation against thedew point temperature. Dependences of PAR residual
differences against the other meteorological and ra-
diometric variables are also practically nil. In contrastthe results using the empirical model Alados1 exhibit
significant dependence of the PAR residuals on these
variables. So these results demonstrate the success of
using ANNs methodology.
Lastly, an analysis of the residual differences, sim-
ilar to the above one, using MODEL 1 and Alados1
evaluated for the data from Table Mountain has been
performed. From Fig. 4 it is firstly noted that PAR
values estimated by the ANN based model, MODEL
1, present a lower dependence on global irradiance,
cos z, and dew point temperature than those esti-
mated by Alados1, and second, the averaged residual
differences for both models show trends similar toeach one for each corresponding variable. Analysis
of the residual differences using other meteorological
8/13/2019 2001 Lopez (Agri for Meteorol_107)
11/13
G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291 289
Fig. 3. Averaged residuals from estimates of the hourly PAR by
MODEL 1 and Alados1 versus: (a) G, (b) cos z and (c) Td . Avera-
ged residuals have been set as a percentage of the mean measured
PAR value. The error bars denote one standard deviation from the
mean values of the averaged residuals. Almeras data are used.
and radiometric variables and using other databases
have shown similar behaviour. In order to developa non-local model, PAR dependencies on global
radiation and solar zenith angle for locations with
Fig. 4. Averaged residuals from estimates of the hourly PAR by
MODEL 1 and Alados1 versus: (a) G, (b) cos z and (c) Td . Avera-
ged residuals have been set as a percentage of the mean measured
PAR value. The error bars denote one standard deviation from the
mean values of the averaged residuals. Table Mountains data are
used.
meteorological conditions different to each other
should be analysed. However, a thorough descriptionof this subject is outside the scope of this paper; but
will be presented in a future paper.
8/13/2019 2001 Lopez (Agri for Meteorol_107)
12/13
290 G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291
7. Conclusions
In this work, a new method for estimating hourly
global PAR has been developed and tested. This
method is based on ANNs, particularly, on a multilayer
feedforward perceptron trained with the Levenberg
Marquardt algorithm. From this procedure, two mod-
els have been derived successfully. The first one in-
volves only global irradiance and solar zenith angle as
input variables, whereas the second one uses sunshine
duration and solar zenith angle. The models have been
extensively validated using data representative from
various climatic environments. These range from mar-
itime to high altitude deserts. For the model that uses
global irradiance and solar zenith angle as inputs, a
noticeable performance improvement is found over
multiple regression models that use a larger number
of input variables, as direct or diffuse irradiance, dew
point temperature, which are not often measured.
This result also demonstrates the viability of neu-ral network methods to solve such problems when
compared with existing methods that accomplish the
same task. The model involving relative sunshine du-
ration and solar zenith angle as input parameters also
produces acceptable results considering the limited
input information. This model provides an alternative
way to estimate hourly global PAR at many locations
where radiometric measurements are not available and
where PAR cannot be accurately calculated. It should
also be noted the relative small data sample needed
to train the ANN and to obtain a successful model.
Our study has revealed that the inclusion of other
radiometric input variables to the ANN models, suchas diffuse irradiance, clearness index, or Perezs pa-
rameters do not lead to more accurate estimations of
hourly global PAR. Similarly, meteorological infor-
mation such as temperature, relative humidity, dew
point temperature or precipitable water have very
little effect on the accuracy of PAR estimation.
Acknowledgements
This work was supported through the collaboration
convene between the Plataforma Solar de Almera
which belongs to the Centro de InvestigacionesEnergticas y Medioambientales (CIEMAT) and the
University of Almera. The authors are grateful to
SRRBs staff and The Royal Swedish Academy of
Sciences (particularly to Sir Martin Tjus) for the
facilities offered for providing the SRRB and Abisko
Research Station databases and the technical speci-
fication of the sensors, respectively. The authors are
indebted to Dr. Arahal for his helpful introductory
comments on ANNs. Lastly, the authors thank the
regional Editor Dr. J.B. Stewart, Prof. Juhan Ross and
an anonymous referee.
References
Alados, I., Foyo-Moreno, I., Alados-Arboledas, L., 1996. Photo-
synthetically active radiation: measurements and modelling.
Agric. For. Meteorol. 78, 121131.
Al-Shooshan, A.A., 1997. Estimation of photosynthetically active
radiation under an arid climate. J. Agric. Eng. Res. 66, 913.
Batlles, F.J., Olmo, F.J., Alados-Arboledas, L., 1995. On shadow-
band correction methods for diffuse irradiance measurements.
Solar Energy 54, 105114.
Bolte, J.P., 1989. Application of neural networks in agriculture.ASAE Paper No. 89-7591. American Society of Agricultural
Engineers, St. Joseph, MI.
Britton, C.M., Dodd, J.D., 1976. Relationships of photosyn-
thetically active radiation and shortwave irradiance. Agric.
Meteorol. 17, 17.
Finschi, L., 1996. An implementation of the LevenbergMarquardt
algorithm. Internal Report. Institut fuer Operations Research,
Zuerich.
Gueymard, C., 1989a. An atmospheric transmittance model for the
calculation of the clear sky beam, diffuse and global photosyn-
thetically active radiation. Agric. For. Meteorol. 45, 215229.
Gueymard, C., 1989b. A two-band model for the calculation of
clear sky solar irradiance, illuminance, and photosynthetically
active radiation at the earths surface. Solar Energy 43, 253
265.Hammerstrom, D., 1993. Working with neural network. IEEE
Spectrum 30 (7), 4653.
Hanan, N.P., Bgu, A., 1995. A method to estimate instantaneous
and daily intercepted photosynthetically active radiation using
a hemispherical sensor. Agric. For. Meteorol. 74, 155168.
Howell, T.A., Meek, D.W., Hatfield, J.L., 1983. Relationship of
photosynthetically active radiation in the San Joaquin Valley.
Agric. Meteorol. 28, 157175.
Kasten, F., 1965. A new table and approximation formula for
relative optical air mass. Arch. Meteorol. Geophys. Bioklimatol.
B 14, 206223.
Kohonen, T., 1984. Self-organization and Associative Memory.
Springer, Berlin.
Li, Z., Moreau, L., Cihlar, J., 1997. Estimation of photosynthe-
tically active radiation absorbed at the surface. J. Geophys. Res.
102, 2971729727.
Lpez, G., Martnez, M., Rubio, M.A., Tovar, J., Barbero, J.,
Batlles, F.J., 2000a. Estimation of the hourly diffuse fraction
8/13/2019 2001 Lopez (Agri for Meteorol_107)
13/13
G. Lopez et al. / Agricultural and Forest Meteorology 107 (2001) 279291 291
using a neural network based model. In: Proceedings of
the Second Asamblea Hispano-Portuguesa de Geodesia y
Geofsica, Lagos, Portugal, pp. 425426 (in Spanish, with
English abstract).
Lpez, G., Martnez, M., Rubio, Batlles, F.J., 2000b. Estimacin
de la radiacin global solar diaria mediante un modelo de red
neural. In: Proceedings of the IX Congreso Ibrico de Energ a
Solar, III Jornadas Tcnicas sobre Biomasa, Crdoba (Spain),
in press (in Spanish).Lupo, J.C., 1989. Defense applications of neural networks. IEEE
Commun. Mag. 27 (11), 8288.
MacKay, D.J.C., 1994. Bayesian non-linear modeling for the
energy prediction competition. ASHRAE Trans. 100, 1053
1062.
Marquardt, D.W., 1963. An algorithm for least-squares estimation
of nonlinear parameters. J. SIAM 11, 431441.
Michalsky, J.J., 1992. Comparison of a national weather
service foster sunshine recorder and the world meteorological
organisation standard for sunshine duration. Solar Energy 48,
133141.
Moon, P., 1940. Proposed standard solar radiation curves for
engineering use. J. Frankling Ins. 230, 538618.
Muttiah, R.S., Engel, B.A., 1991. Neural network methodology in
agriculture and natural resources. ASAE Paper No. 91-7018.American Society of Agricultural Engineers, St. Joseph, MI.
Neal, R.M., 1996. Bayesian Learning for Neural Networks, Lecture
Notes in Statistic, Vol. 118, Springer, New York.
Olseth, J.A., Skartveit, A., 1993. Luminous efficacy models and
their application for calculation of photosynthetically active
radiation. Solar Energy 52, 391399.
Papaioannou, G., Papanikolaou, N., Retails, D., 1993.
Relationships of photosynthetically radiation and shortwave
irradiance. Theor. Appl. Climatol. 48, 2327.
Perez, R., Ineichen, P., Seals, R., Michalsky, J.J., Stewart, R., 1990.
Modelling daylight availability and irradiance components from
direct and global irradiance. Solar Energy 44, 271289.Rumelhart, D.E., Hinton, G., Williams, R., 1986. Learning
internal representations by error propagation. In: Rumelhart, D.,
McClelland, J.L. (Eds.), Parallel Distributed Processings, Vol.
1, MIT Press, Cambridge, MA, pp. 318362.
Szeicz, G., 1974. Solar radiation for plant growth. J. Appl. Ecol.
11, 617636.
Thai, C.N., Shewfelt, R.L., 1991. Modeling sensory color quality
of tomato and peach: neural networks and statistical regression.
Trans. ASAE 34, 950955.
Williams, R.J., Zipser, D., 1989. A learning algorithm for
continually running fully recurrent neural networks. Neural
Comput. 1, 270280.
Wright, J., Perez, R., Michalsky, J.J., 1989. Luminous efficacy
of direct irradiance: variations with insolation and moisture
conditions. Solar Energy 42, 387394.Zhuang, X., Engel, B.A., 1990. Neural networks for applications
in agriculture. ASAE Paper No. 90-7024. American Society of
Agricultural Engineers, St. Joseph, MI.