Modeling Reference Evapotranspiration in a Semiarid Area

Modeling Reference Evapotranspiration in aSemiarid Area Using Data-driven Techniques: ACase Study in Lower Cheliff plain, AlgeriaMohammed ACHITE ( [email protected] )

Hassiba Benbouali University of Chlef: Universite Hassiba Benbouali de ChlefMuhammad Taghi Sattari

: University of Tabriz Faculty of AgricultureAbderrezak Kamel Toubal

Hassiba Benbouali University of Chlef: Universite Hassiba Benbouali de ChlefAndrzej Wałęga

Krakow University Faculty in Tychy: Krakowska Akademia Wydzial Zamiejscowy w TychachNir Krakauer

City College of the City University of New York: The City College of New YorkParveen Sihag

Shoolini UniversityQuoc Bao Pham

Ton Duc Thang University

Research Article

Keywords: Reference Evapotranspiration, FAO-56 Penman-Monteith, Data Driven Models, Lower Cheliff,Algeria

Posted Date: August 10th, 2021

DOI: https://doi.org/10.21203/rs.3.rs-773420/v1

License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License

https://doi.org/10.21203/rs.3.rs-773420/v1

mailto:[email protected]

https://doi.org/10.21203/rs.3.rs-773420/v1

https://creativecommons.org/licenses/by/4.0/

1

Modeling reference evapotranspiration in a semiarid area using data-

driven techniques: A case study in Lower Cheliff plain, Algeria

Mohammed Achite1*, Mohammad Taghi Sattari2, Abderrezak Kamel Toubal1, Andrzej Wałęga3, Nir

Krakauer4, Parveen Sihag5 and Quoc Bao Pham6,7

1Faculty of Nature and Life Sciences, Laboratory of Water & Environment, University Hassiba Benbouali of Chlef, Chlef, P.B 78C, Ouled Fares, Chlef 02180, Algeria; [email protected] 2Department of Water Engineering, Faculty of Agriculture, University of Tabriz, Tabriz, Iran; [email protected] 3Department of Sanitary Engineering and Water Management, University of Agriculture in Krakow, Mickiewicza 24/28 Street, 30-059 Krakow, Poland; [email protected] 4Department of Civil Engineering and NOAA-CREST, the City College of New York, New York 10031, NY, USA; [email protected] 5Department of Civil Engineering, Shoolini University, Solan, Himachal Pradesh, India; [email protected] Environmental Quality, Atmospheric Science and Climate Change Research Group, Ton Duc Thang University, 6

[email protected]; Ho Chi Minh City, Vietnam 7Faculty of Environment and Labour Safety, Ton Duc Thang University, Ho Chi Minh City, Vietnam

*Corresponding author: [email protected]

Abstract: Evapotranspiration (ET) is an important part of the hydrologic cycle, especially

when it comes to irrigated agriculture. For the estimation of reference evapotranspiration

(ET0), direct methods either pose difficulties or call for many inputs that may not always be

available from weather stations. This study compares Feed Forward Neural Network (FFNN),

Radial Basis Function Neural Network (RBFNN). and Gene Expression Programming (GEP)

approachs for the estimation of daily ET0 in a weather station in Lower Cheliff plain

(northwest Algeria), over a 6-year period (2006–2011). Firstly, measured air temperature,

relative humidity, wind speed, solar radiation and global radiation was used to calculate ET0

using FAO-56 Penman-Monteith equation as the reference. Then, the calculated ET0 using

FAO-56 Penman-Monteith was considered as output for data driven models, while the

measured meteorological data were considered as input of the models. The coefficient of

determination (𝑅2), root mean square error (RMSE) and Nash Sutcliffe efficiency coefficient

(EF) were used to evaluate the developed models. The results of the developed models were

https://www.researchgate.net/profile/Nir_Krakauer

https://www.researchgate.net/profile/Nir_Krakauer








2

compared with the Penman-Monteith evapotranspiration using these performance criteria. The

FFNN model proved to yield the best performance compared to all the developed data-driven

models, while the RBF-NN and GEP models also demonstrated potential for good

performance.

Key words: Reference Evapotranspiration, FAO-56 Penman-Monteith, Data Driven Models,

Lower Cheliff, Algeria

INTRODUCTION

Given increases in the global population, adequately and reliably meeting people’s nutritional

needs in arid and semi-arid climates is one of the most immediate problems of management in

water and agriculture sectors. Since water resources are very limited in climates with varying

degrees of aridity, there is an inevitable need to use existing water resources as optimally as

possible (Nairizi 2017). Evapotranspiration (ET) is one of the most important component of

the hydrology cycle and influences runoff, soil water storage, groundwater recharge,

biodiversity, and the global climate system (Amatya et al. 2018, Sun et al. 2011).

Determining the amount of ET is necessary in water resource management, irrigation

planning, watershed management, and the design of drainage systems.

To calculate the amount of ET for an agricultural system, the reference evapotranspiration

(ET0) is calculated first. However, estimating ET0 is known to be very complex. ET0 is either

measured directly (e.g. by lysimeter or pan setups) or complex physics-based experimentally

validated equations are used. It is clear that direct measurements are very costly and time

consuming. Physics-based equations (e.g.,Thornthwaite 1948; Blaney and Criddle 1950;

Makkink 1957; Turc 1961; Priestley and Taylor 1972; Hargreaves and Samani 1985),

including the FAO-56 Penman Monteith (Allen et al. 1998) are complex in that they involve

multiple parameters which may not all be known from local observations. Nevertheless, the

3

FAO-56 Penman Monteith method has been accepted as a standard and used by scientists in

different climates. A high correlation is observed between the ET0 values obtained from FAO-

56 Penman Monteith method and direct measurements even in different climatic conditions.

Therefore, scientists have considered the values computed using the FAO-56 Penman

Monteith method as the desired output of data-based artificial intelligence methods and

different combinations of meteorological variables as inputs for such methods for accurately

estimating ET0 (Kumar et al. 2008; Eslamian et al. 2012; Mohawesh 2011; Citakoglu et al.

2014; Aparecido et al. 2017; Sattari et al. 2020).

The development of computing, software, informatics, and networking has facilitated

measurement and computational estimation of meteorological variables to a great extent. As a

result of these developments, data-based models are frequently used in modeling stochastic

and complex nonlinear dynamics in water resources engineering (Sattari et al. 2017;

Rouzegari et al. 2019; Shabani et al. 2020, Apaydın et al. 2020). For example, de Oliveira

Ferreira Silva et al. (2019) presented the R package "agriwater" for the spatial modelling of

actual evapotranspiration and radiation balance. Thorp et al. (2019) developed a methodology

for unbiased evaluation and comparison of three ET algorithms in the Cotton2K

agroecosystem model. Guven et al. (2008) successfully estimated he daily amount of ET0 in

California, USA, with a genetic programming. Rahimikhoob (2010) predicted ET0 values

with artificial neural networks using temperature and relative humidity parameters in an eight-

station region of Iran with a subtropical climate. The results obtained from the ANN method

were compated with the Hargreaves equation, and it was found that the ANN method was

successful in ET0 estimation. Ozkan et al. (2011) successfully estimated daily ET0 amounts

using artificial neural networks and bee colony hybrid method using the meteorological data

of two stations in California, USA. Cobaner (2013) estimated ET0 amounts in the USA using

wavelet regression (WR) and class A pan evaporation data. WR model results were found to

4

give better results than FAO-56 Penman-Monteith equation. Ladlani et al. (2014) applied

Adaptive Neuro-Fuzzy Inference System (ANFIS) and Multiple Linear Regression (MLR)

models for daily ET0 estimation in the north of Algeria. According to the results of the study,

ANFIS yielded better results. Wen et al. (2015) calculated daily ET0 amounts using the

support vector machine (SVM) method in a region of China that was extremely arid. They

utilized limited meteorological variables as model input. It was observed that modeling is

sufficient in estimating daily ET0 based on maximum and minimum temperature. Gocić et al.

(2015) used genetic programming (GP), support vector machine-firefly optimization

algorithm (SVM-FFA), artificial neural network (ANN) and support vector machine-wavelet

(SVM-Wavelet) models for ET0 prediction in Serbia. This particular study took FAO-56

Penman-Monteith equation to be the basic method. The results pointed to SVM-Wavelet

being the best performing methodology for the estimation of ET0 under the given conditions.

Petković et al. (2016) estimated the amount of ET0 in Serbia between 1980 and 2010 using

particle swarm optimization, radial basic function network (RBFN-PSO), and back

propagation radial basic function network (RBFN-BP) methods. Pandey et al. (2017)

estimated daily ET0 by methods like ANN, support vector regression (SVR) and nonlinear

regression (NR). In this study, limited climatic parameters were used as model input. Daily

ET0 values calculated from FAO56PM method were compared to the model output. The

results pointed to the acceptability of the ANN model estimations. Fan et al. (2018) estimated

daily ET0 amounts using SVM, extreme learning machine (ELM) models, and four tree-based

ensemble methods in China’s different climatic conditions. The results pointed to the fact that

tree-based ensemble methods can yield appropriate results in different climates. Wu et al.

(2019) used cross-station and synthetic climate data to estimate the amount of ET0. They also

found that machine learning methods could perform successfully in the prediction process.

5

In this study, the aim was to investigate the potential of Feed Forward Neural Network

(FFNN), Radial Basis Function Neural Networks (RBFNN) and the Gene Expression

Programming (GEP) models for the estimation of daily ET0 values using different

combinations of climatic variables in a semi-arid farmland area, namely Lower Cheliff plain

in northwest Algeria. This research made use of the well-known FAO-56 (PM56) equation as

the basic method. In this article, the role of climatic parameters in ET0 estimation in this semi-

arid region was also determined.

MATERIALS AND METHODS

Study area and meteorological data acquisition

The region in focus for this study was Lower Cheliff Plain in the northwest of Algeria (Figure

1), lying between latitudes 34°03′12′′ and 36°05′57′′ N and between longitudes 0°40 and

01°06′08′′ E, and occupying an area of 40,000 ha (Douaoui et al. 2006). This area is classified

as semi-arid. Average annual rainfall ranges from 320 mm to 250 mm. Temperature reach its

maximum in July and August and its minimum in January. The average annual temperature

along the Lower Cheliff plain ranges from 19.5 °C in the north to 25.3 °C in the south.

The historical data for this study was provided by the Hmadna station (SYNMET Automatic

Station), located at latitude 35°55’31” N and longitude 00°45’04” E. The instrument used for

measuring ET0 at the automatic meteorological station of Hmadna is shown in Figure 2. The

units and the measuring range of sensors are listed in Table 1.

2.2. Description of data

The meteorological data included daily observations of maximum, minimum and mean air

temperatures (Tmax, Tmin and Tmean), daily mean relative humidity (RH), wind speed (Ws),

sunshine duration (Sd) and global radiation (GR). The days with data that proved to be

inadequate were excluded from the patterns. The statistical parameters pertaining to the daily

6

climatic data are given in Table 2, in which the Xmean, Xmax, Xmin, Sx and Cv stand for the

mean, maximum, minimum, standard deviation and coefficient of variation , respectively.

Evapotranspiration estimation method

The FAO Penman–Monteith method to calculate ET0 can be expressed following Allen et al.

(1998) as:

𝐸𝑇0 = 0.408𝛥(𝑅𝑛 − 𝐺) + 𝛾 900𝑇𝑚𝑒𝑎𝑛 + 273𝑈2 (1)

where ET0 is the reference crop evapotranspiration (mm day-1), Rn is the net radiation (MJ m-2

day-1), G is the soil heat flux (MJ m-2day-1), c is the psychrometric constant (kPa °C-1), es is

the pressure of saturation vapor (kPa), ea is the pressure of the actual vapor (kPa), D is the

slope of the curve for saturation vapor pressure–temperature (kPa.C-1), Ta is the average daily

air temperature (°C), and U2 is the mean daily wind speed at 2 m (m s-1). All the data needed

in the calculation of ET0 were computed on the basis of the procedures outlined in Chapter 3

of FAO-56 (Allen et al. 1998).

Artificial neural networks

Artificial neural networks (ANNs) are nonlinear mathematical models based on ideas about

the behavior of biological neural networks. An ANN consists of with a variety of layers of

interconnected nodes or neurons. Each neuron gets a linearly transformed combination of the

previous neuron’s outputs (∑wijxj), or (for the first layer) of the network inputs and returns a

nonlinear transformation of this quantity.

The weights (wij) are the parameters added to each source defining this linear combination

and typically also include an intercept term called the activation threshold (Shiri et al. 2012).

A nonlinear activation function is then applied to the linear output combination (f (∑wijxj)).

This activation function can be, for example, a sigmoid function, which constrains each

7

neuron's output values to between two asymptotes. Once the activation function is applied,

each neuron’s output feeds into the outputs of the next layer. The most frequently used

architecture for an ANN consists of an input layer in which the data is introduced into the

ANN, a hidden layer(s) in which the data undergoes processing, and the output layer in which

the effects of the input generate a predicted output value(s) (Shiri et al. 2012).

Multi-Layer Perceptron (MLP)

The literature contains many kinds of neural networks that have been put to many uses. The

Multilayer Perceptron (MLP) is a commonly used ANN configuration utilized regularly in the

hydrological modelling field (Leahy et al. 2008) (Figure 3). This study assesses the usefulness

of neural MLP networks for the estimation of EP. The MLP is the most frequently used and

simplest neural network architecture (Haciismailoglu et al. 2009).

Radial Basis Function (RBF)

Another architecture used commonly in ANN is the Radial Basis Function (RBF). Multilayer

and feed-forward RBF is often used for multi-dimensional spatial interpolation. The word

"feed-forward" means the neurons in a layered neural network are arranged in layers (Lin et

al. 2004). The underlying architecture of a neural network with three layers is presented in

Figure 4. The RBF network comprises three layers, i.e. hidden, input, and output. The

activation function of each RBF neuron has the form of an RBF, generating a response only if

the inputs are close to some central value determined for that particular neuron.

Gene Expression Programming (GEP)

While ANNs are compicated models that typically do not capture the physical relationships

between different process components in an understandable way, GEP models have the

8

capacity to express the relationship between dependent and independent variables explicitly

(Wang et al. 2015).

The procedure for modelling daily evapotranspiration (considered to be the dependent

variable) on the basis of weather variables (considered as the independent variables) involves

the following: selecting the fitness function; selecting terminals T and set of functions F for

creating chromosomes; selecting chromosome architecture; and selecting the link function

and genetic operators (Figure 5) (Shiri et al. 2012).

Evaluation criteria

The performance of the models utilized in this study were evaluated by means of standard

criteria for statistical performance evaluation. The statistical measures taken into account

were coefficient of determination (𝑅2), root mean square error (RMSE) and Nash Sutcliffe

efficiency coefficient (EF) (Moriasi et al. 2007). The calculation of the three criteria was done

according to the following equations:

𝑅2 = 1 − ∑ (𝐸𝑇𝑖(𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑) − 𝐸𝑇𝑖(𝑚𝑜𝑑𝑒𝑙))𝑁𝑖=1∑ (𝐸𝑇𝑖(𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑) − 𝐸𝑇𝑚𝑒𝑎𝑛)𝑁𝑖=1 (2)

𝑅𝑀𝑆𝐸 = √1𝑁∑(𝐸𝑇𝑖(𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑) − 𝐸𝑇𝑖(𝑚𝑜𝑑𝑒𝑙))𝑁𝑖=1

(3)

𝐸𝐹 = 1 − ∑ (𝐸𝑇𝑖(𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑) − 𝐸𝑇𝑖(𝑚𝑜𝑑𝑒𝑙))2𝑁𝑖=1∑ (𝐸𝑇𝑖(𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑) − 𝐸𝑇𝑚𝑒𝑎𝑛)2𝑁𝑖=1 (4)

where N is the number of observed ET data, ETi(observed) and ETi(model) are observed and model

estimations of ET, respectively, and ETmean is the mean of observed ET.

9

RESULTS AND DISCUSSION

In this study, firstly, ET0 values were computed by Penman-Monteith method using climatic

data. Then the following equation was used to normalize the input (meteorological data) and

output (calculated ET0 by Penman-Monteith):

𝑋𝑛 = 2. 𝑋𝑜 − 𝑋𝑚𝑖𝑛𝑋𝑚𝑎𝑥 − 𝑋𝑚𝑖𝑛 − 1 (5)

where, Xn and XO stand for the normalized and original data, while Xmin and Xmax represent

the minimum and maximum values in the original data. Approximately 70% of the available

data period (from around 2006 to 2010) was selected for the training phase; the remaining

30% belonged to the year 2011 and was used for testing process. MATLAB was used for the

modeling process.

Application of MLP

In this study, FFNN algorithm was used with a single hidden layer. More details about the

parameters used for the FFNN-based model with one hidden layer are listed in Table 3. With

the input data playing a considerable role in model development, several input combinations

were used for model development. The performances of all MLP-based input combinations

are listed in Table 4 for training and testing stages. MLP-based model development is a trial

and error process. In this study, tangent sigmoid transfer function was used in the hidden layer

and linear transfer function was used for target. To achieve ideal performance with MLP

models, the number of neurons in the hidden layer has to be optimized. The results in Table 4

suggest that Model FFNN2 including Tmax, Tmean, (Tmax-Tmin), RH, I, Ws and GR performed

better than other FFNN-based input combination models with R2 values as 0.9903, 0.9921,

RMSE values as 0.2332, 0.2342 and E values as 0.9902, 0.9902 for both training and testing

stages, respectively. Nineteen neurons were used in the hidden layer to achieve this ideal

performance. The performance and agreement plot among actual and predicted values of

FFNN2 model for both training and testing stage are mapped out in Figure 6 which shows that

10

max values lie very close to the line of 450 and follow the same pattern as the actual values in

both training and testing stages. If all the values lie on the line of 450 and follow the exact

same path, the model is ideal and predicts values similar to actual ones.

Performance evaluation results suggest than FFN2 model performed better than other input

combination based models. As to comparing various input combination based models with

one another, the results in Table 4 indicate that several other models are comparable in

performance to the best model (FFNN2), while having a lower number of required input

meteorolgical variables. Overall, going with the assessment in Table 4, the FFNN11 model

(Tmean, RH, Ws and GR) is suitable for predicting ET with R2 values as 0.9875, 0.9892,

RMSE values as 0.2656, 0.2623 and E values as 0.9873, 0.9877 for both training and testing

stages, respectively. The same number of neurons (19) is used in the single hidden layer for

achieving this performance similar to the FFNN2 model. The performance and agreement plot

among actual and predicted values of the FFNN11 model for both training and testing stage is

shown in Figure 7 which points to the fact that max values lie very close to the line of perfect

agreement and follow the same pattern as the actual values in both training and testing stages.

Application of RBF

For the RBF method as well, several input combinations were used for model development.

The performance of all input combination based RBF-NN models are listed in Table 5 for

training and testing stages. RBF-NN based model development is a trial and error process

similar to FFNN model development. In this study, the RBF models had a single hidden layer.

To achieve ideal performance with RBF-NN models, the value of the spread must be found

through a trial and error process. The results of Table 5 suggest that the RBF-NN5 model

including Tmin, Tmax, Tmean, RH, I, Ws and GR performs better than other input combination

RBF-NN based models with R2 values as 0.9907, 0.9911, RMSE values as 0.2270, 0.2374

11

and E values as 0.9907, 0.9899 for both training and testing stages, respectively. The

performance and agreement plot among actual and predicted values of RBF-NN5 model for

both training and testing stages are shown in Figure 8 which shows that max values lie very

close to the line of 450 and follow the same pattern as the actual values in both training and

testing stages.

The performance evaluation results suggest than the RBF-NN5 model performs better than

other input combination based models. On inter comparison among various input combination

based models, the results in Table 5 indicate that the performance of several other models was

comparable to the best model (RBF-NN5) and involved a lower number of inputs. Overall,

the assessment mapped out on Table 5 shows that the RBF-NN11 model (Tmean, RH, Ws and

GR) is suitable for predicting the ET with R2 values as 0.9886, 0.9892, RMSE values as

0.2514, 0.2551 and E values as 0.9886, 0.9884 for both training and testing stages,

respectively. A lower rate of spread (Table 5) was used in the development of this model than

in the case of the RBF-NN5 model. The performance and agreement plot among actual and

predicted values of the RBF-NN11 model for both training and testing stages are shown in

Figure 9 which shows that max values lie very close to the line of perfect agreement and

follow the same pattern as the actual values in both training and testing stages.

Application of GEP

The details of parameters used in the GEP model are listed in Table 6. The performance of all

input combination based GEP models are listed in Table 7 for training and testing stages. GEP

based model development is also a trial and error process similar to the model development

typical of FFNN and RBF-NN based models. For the performance of GEP models under

different input combinations, for the training phase, the R2 ranged between 0.6973 – 0.9664,

RMSE ranged 0.4830–1.3112 mm day−1 and EF ranged 0.6895- 0.9579. So, for the test phase,

12

the R2 ranged between 0.8057– 0.9775, RMSE ranged 0.3701–1.1224 mm day−1 and E ranged

0.7744 - 0.9755 (Table 7). It is clear that the presence or absence of critical meteorological

variables in the input combinations significantly affected GEP model performance. The

results of Table 7 suggest that the GEP11 model including Tmean, RH, Ws and GR

parameters in the input combination performed better than other input combination and

GEP based models with R2 values as 0.9606, 0.9775, RMSE values as 0.4830, 0.3701 and E

values as 0.9579, 0.9755 for the training and testing stages, respectively. The performance

and agreement plot among actual and predicted values of the GEP11 model for both training

and testing stages are shown in Figure 10 which indicates that max values lie very close to the

line of 450 and follow the same path as the actual values in both training and testing stages.

Table 7 concludes that the GEP11 model is the best performing model with optimum input

combinations.

Inter comparison among best and optimum input combination based models

Table 8 shows that the FFNN2 based model works better than the RBF-NN and GEP based

models. Figure 11 indicates that predicted values using FFNN2-based model lie closer to the

line of perfect agreement than the values predicted by the RBF-NN and GEP based models.

The overall performance of the FFNN2 based model is reliable and suitable for the prediction

of ET0. So Tmax, Tmean, (Tmax-Tmin), RH, I, Ws and the GR input combination based FFNN

model could be used for the prediction of ET0. However, the results mapped out in Table 9 of

single factor ANOVA suggest that there is no significant difference among observed and

predicted values using FFNN, RBF-NN and GEP best combination based models.

Figure 12 displays box plots for prediction errors for the best input combination based models

using the test period. The values of descriptive statistic of prediction errors for the best input

combinations are listed in Table 10. According to Table 10 and Figure 11, the FFNN2 model

13

followed the corresponding observed values with lower minimum error (-0.8840), lower

maximum error (1.4199), and the width of 1st quartile is less than other best input

combination based models.

The Taylor diagram of the observed and predicted ETo by different best input combination

based models over the test period is depicted by Figure 13. It is clear that the representative

points of all the applied models have nearly the same position. The FFNN2 model is located

nearest to the observed point with lower value of RMSE and SD and higher value of

coefficient of correlation, which picks out this model as the superior model.

Table 11 proposes that the RBF-NN11-based model works better than FFNN and GEP based

models. Figure 14 indicates that predicted values using the RBF-NN11-based model lie closer

to the line of perfect agreement than the values predicted by the FFNN and GEP based

models. The overall performance of the RBF-NN11 based model is reliable and suitable for

the prediction of ET0, which suggests that Tmean, RH, Ws and GR input combination based

RBF-NN model could be used for the prediction of ET0. The results in Table 12 of single

factor ANOVA suggest that there is no significant difference among observed and predicted

values using FFNN, RBF-NN and GEP optimum input combination based models.

Figure 15 displays the Box Plot for the prediction errors for the optimum input combination

based models using the test period. The descriptive statistical values of prediction errors for

the optimum input combinations are listed in Table 13. According to the Table 13 and Figure

15, the RBF-NN11 model has followed the corresponding observed values with lower

maximum error (1.3700) and the width of the 1st quartile (-0.0952) is less than other optimum

input combination based models.

The Taylor diagram of the observed and predicted ETo by different optimum input

combination based models over the test period is depicted by Figure 16. It is clear that, the

representative points of all the applied models have nearly the same position. The RBF-NN11

14

model is located nearest to the observed point with lower value of RMSE, SD and higher

value of coefficient of correlation, making this model emerge as a superior model with

optimum number of input parameters.

CONCLUSIONS

This study aimed to investigate the potential of FFNN, RBFNN and GEP to estimate daily

evapotranspiration in a semi-arid region in Algeria. Different input combinations of

meteorological variables were tried in these models. The results pointed to the fact that both

the neural network (i.e. FFNN and RBFNN) and GEP models make for optimal levels of

agreement with the ET0 obtained by the FAO PM method. They yielded reliable estimations

for the semi-arid area in question. The study also found that modeling ET0 by means of the

ANN technique leads to better estimates than the GEP model.

The current results suggested that the FFNN based model 2 outperformed all other applied

models. Another major conclusion was that the RBF-NN based model 11 performed better

than other applied models with a smaller number of required meteorological inputs. ANN and

GEP based models suggest that Tmean, RH, Ws and GR parameters are the optimum

parameters for the estimation of daily evapotranspiration in the semi-arid region of Algeria.

The overall performance of all applied models is satisfactory, as there is no significant

difference among actual and predicted values using the optimum number of input parameters

in the models.

Conflict of interest: The authors declare that they have no conflict of interest

15

REFERENCES

Allen RG, Pereira LS, Raes, D, Smith, M (1998) Crop evapotranspiration: guidelines for

computing crop water requirements. FAO Irrigation and Drainage Paper No. 56. Rome

(Italy): FAO. p. 300

Alves WB, Rolim GS & Aparecid LEO (2017) Reference evapotranspiration forecasting. J. of

Eng. Agric.v37, p1116-1125. Doi : 10.1590/1809-4430

Amatya, D.M., Muwamba, A.,Panda, S., Callahan, T., Harder, S., Pellett, A. 2018.

Assessment of Spatial and Temporal Variation of Potential Evapotranspiration Estimated

by Four Methods for South Carolina. J. of South Carolina Water Resources, 5(1), 3-24

Apaydin, H., Feizi, H., Sattari, M. T., Colak, M. S., Shamshirband, S., & Chau, K.-W. (2020).

Comparative Analysis of Recurrent Neural Network Architectures for Reservoir Inflow

Forecasting. Water, 12(5), 1500. https://doi.org/10.3390/w12051500

Blaney HF and Criddle WD (1950) Determining Water Requirements in Irrigated Areas from

Climatological and Irrigation Data. US Soil Conservation Service,TP 96.

Citakoglu H, Cobaner M & Haktanir T & Kisi O. (2014) Estimation of Monthly Mean

Reference Evapotranspiration in Turkey. Water Resour Manage (2014) 28:99–113. Doi

10.1007/s11269-013-0474-1

Cobaner, M. (2013). Reference evapotranspiration based on Class A pan evaporation via

wavelet regression technique. Irrigation Science, 31(2), 119–134.

https://doi.org/10.1007/s00271-011-0297-x

de Oliveira Ferreira Silva, C., de Castro Teixeira, A., H., Manzione, R.L. 2019. agriwater: An

R package for spatial modelling of energy balance and actual evapotranspiration using

satellite images and agrometeorological data. Environmental Modelling & Software 120,

104497, DOI: 10.1016/j.envsoft.2019.104497

https://doi.org/10.3390/w12051500

https://doi.org/10.1007/s00271-011-0297-x

16

Douaoui A, Walter Ch, Gaouar A. et Hammoudi S (2001) Assessment of the topsoil structural

degradation of the Lower Cheliff Valley (Algeria) -Application of multivariate Analysis.

4th conference of the Working Group on Pedometrics (WG-PM), Ghent, 19-21 september.

Douaoui AEK, Nicolas H and Walter C (2006) Detecting salinity hazards with in a semiarid

Context by means of combining soil and remote-sensing data. Geoderma134 (1): 217–230.

Doi: 10.1016/j.geoderma.2005.10.009

Eslamian SS, Gohari SA Zareian MJ & Firoozfar A (2012) Estimating Penman–Monteith

Reference Evapotranspiration Using Artificial Neural Networks and Genetic Algorithm: A

Case Study. Arab J Sci Eng (2012) 37:935–944. Doi : 10.1007/s13369-012-0214-5

Fan, J., Yue, W., Wu, L., Zhang, F., Cai, H., Wang, X., Lu, X., & Xiang, Y. (2018).

Evaluation of SVM, ELM and four tree-based ensemble models for predicting daily

reference evapotranspiration using limited meteorological data in different climates of

China. Agricultural and Forest Meteorology, 263, 225–241.

https://doi.org/10.1016/j.agrformet.2018.08.019

Gocić, M., Motamedi, S., Shamshirband, S., Petković, D., Ch, S., Hashim, R., & Arif, M.

(2015). Soft computing approaches for forecasting reference evapotranspiration.

Computers and Electronics in Agriculture, 113, 164–173.

https://doi.org/10.1016/j.compag.2015.02.010

Guven, A., Aytek, A., Yuce, M. I., & Aksoy, H. (2008). Genetic Programming-Based

Empirical Model for Daily Reference Evapotranspiration Estimation. CLEAN - Soil, Air,

Water, 36(10–11), 905–912. https://doi.org/10.1002/clen.200800009

Haciismailoglu MC, Kucuk I, Derebasi N (2009) Prediction of dynamic hysteresis loops of

nano-crystalline cores. Expert Syst Appl 36:2225–2227

https://doi.org/10.1016/j.geoderma.2005.10.009

https://doi.org/10.1016/j.agrformet.2018.08.019

https://doi.org/10.1016/j.compag.2015.02.010

17

Hargreaves GH and Samani ZA (1985) Reference crop evapotranspiration from temperature.

Appl. Eng. Agr. 1:96–99. Doi:10.13031/2013.26773

Kisi O (2007) The potential of different ANN techniques in evapotranspiration modeling.

Hydrol Process. 22(14):2449–2460. Doi:10.1002/hyp.6837

Kumar M, Raghuwanshi NS, Singh R, Wallender WW and Pruitt WO (2002) Estimating

evapotranspiration using artificial neural network. J. Irrig Drainage E-ASCE. 128 (4):224–

232. Doi:10.1061/(ASCE)0733-9437(2002)128:4(224

Kumar M, Bandyopadhyay A, Raghuwanshi NS & Singh R (2008) Comparative study of

conventional and artifcial neural network-based ETo estimation models. Irrig Sci, 26:531–

545. Doi 10.1007/s00271-008-0114-3

Kumar M, Raghuwanshi NS and Singh R (2011) Artifcial neural networks approach in

evapotranspiration modeling: a review. Irrig Sci (2011) 29:11–25. Doi :10.1007/s00271-

010-023 0-8

Laaboudi A, Mouhouche B and Draoui B (2012) Neural network approach to reference

evapotranspiration modeling from limited climatic data in arid regions. Int J Biometeorol

(2012) 56:831–841. DOI:10.1007/ss00484-011-0485-7

Ladlani I, Houichi L, Djemili L, Heddam S and Belouz K (2012) Modeling daily reference

evapotranspiration (ET0) in the north of Algeria using generalized regression neural

networks (GRNN) and radial basis function neural networks (RBFNN): a comparative

study, Meteorol Atmos Phys (2012) 118:163–178. Doi:10.1007/s00703-012-0205-9

Leahy P, Kiely G & Corcoran G (2008) Structural optimization and input selection of an

artificial neural network for river level prediction. J Hydrol 355:192–201. Doi:

10.1016/j.jhydrol.2008.03.017

https://doi.org/10.1002/hyp.6837

https://doi.org/10.1061/(ASCE)0733-9437(2002)128:4(224)

https://doi.org/10.1007/s00271-010-0230-8

https://doi.org/10.1007/s00271-010-0230-8

https://ui.adsabs.harvard.edu/link_gateway/2012MAP...118..163L/doi:10.1007/s00703-012-0205-9

https://doi.org/10.1016/j.jhydrol.2008.03.017

https://doi.org/10.1016/j.jhydrol.2008.03.017

18

Lin F and Chen LH (2004) A Non-Linear Rainfall-Runoff Model Using Radial Basis

Function Network,” Journal of Hydrology, Vol. 289, No. 1-4, pp. 1-8.

Doi: 10.1016/j.jhydrol.2003.10.015

Makkink GF (1957) Testing the Penman formula by means of lysimeters. J. Inst. Water Eng.

11 (3), 277–288.

Mohawesh OE (2011) Artificial neural network for estimating monthly reference

evapotranspiration under arid and semi-arid environments, Archives of Agronomy and Soil

Science, Doi:10.1080/03650340.2011.603126

Marti P, González-Altozano P, López-Urrea R. & Mancha LA & Shiri J (2015) Modeling

reference evapotranspiration with calculated targets. Assessment and implications.

Agricultural Water Management 149:81-90. Doi: 10.1016/j.agwat.2014.10.028

Moriasi, D.N., Arnold, J.G., Van Liew, M.W., Bingner, R.L., Harmel, R.D. , Veith, T.L.,

2007. Model evaluations guidelines forsystematic quantification of accuracy in watershed

simulations. Trans. of the ASABE. 50(3), 885-900.

Nairizi S. (Eds.) 2017. Irrigated Agriculture Development under Drought and Water Scarcity

Irrigated Agriculture Development under Drought and Water Scarcity. International

Commission on Irrigation and Drainage, India

Pandey, P. K., Nyori, T., & Pandey, V. (2017). Estimation of reference evapotranspiration

using data driven techniques under limited data conditions. Modeling Earth Systems and

Environment, 3(4), 1449–1461. https://doi.org/10.1007/s40808-017-0367-z

Petković, D., Gocic, M., Shamshirband, S., Qasem, S. N., & Trajkovic, S. (2016). Particle

swarm optimization-based radial basis function network for estimation of reference

evapotranspiration. Theoretical and Applied Climatology, 125(3–4), 555–563.

https://doi.org/10.1007/s00704-015-1522-y

https://www.researchgate.net/deref/http%3A%2F%2Fdx.doi.org%2F10.1016%2Fj.jhydrol.2003.10.015?_sg%5B0%5D=mCFqqZK6-SQoinpABIT8DHan2tn43qMITrkzBngBEIJK7iZDlgSqK3pZDOgQWnidFb8EMngtmNfcvgchI9v-LKKmTw.dzNn7Xd8Q_A_UbOyLnmnhHO6zuZZQHW6OV0NBsdPK77GBihhlW8kimKMq7Krn8rSwZ2ZxITdk7rzvSefwrTARg

https://www.researchgate.net/profile/Ramon_Lopez-Urrea?_sg%5B0%5D=ipa-kLY1w-gqVtGV50yU24gEnV6qCoTwz8Qlv15J7p_8Cq5WiMtAXj1qntRczYTPhtwXqec.wpMzYKH_AOCGPXemlSGu_vJZwRF1X2tTlMzql0jcbzjx9IDjyZbv_prbYqqNIRUzaCLKmeIeCSqiRCd39QsM3Q&_sg%5B1%5D=YGanvue1zm6mgMCGJWcmyeEsQqLbBC3KkVxqOhvwjHR1bhNlBAsmVDmo3OljEm1mlA6wVJ0.8zZTFbTwuK3yjxG-bA-v7GtAB7MnkLGOIdHdciqdPxcrRI5mj0-TH3HPOS3irg1Aiio-izNqFi04uz_ka0E6aw

https://www.researchgate.net/deref/http%3A%2F%2Fdx.doi.org%2F10.1016%2Fj.agwat.2014.10.028?_sg%5B0%5D=7LMYMMa0BUyqNx1Q98wspxb9YNACip0m8cw9RAXWDZfalWYmj4mOBGo6p_bnJUj9I2b2Y2Y8v6yd243SH-HKYptFXw.39cGTVo3bpRSIMbunMmf4YGbkjfGTjUSQJo54Jy4B3BOHRxKIbX4oGe-q1Si6mzwlrt-G9iQTPafF4qFrmO69g

19

Priestley CHB and Taylor RJ (1972) On the assessment of surface heat flux and evaporation

using large-scale parameters. Mon. Weather Rev. 100 (2), 81–92

RahimiKhoob A (2008) Comparative study of Hargreaves’s and artificial neural network’s

methodologies in estimating reference evapotranspiration in a semiarid environment. Irrig

Sci. 26:253–259. Doi: 10.1007/s00271-007-0090-z

Rahimikhoob, A. (2010). Estimation of evapotranspiration based on only air temperature data

using artificial neural networks for a subtropical climate in Iran. Theoretical and Applied

Climatology, 101(1–2), 83–91. https://doi.org/10.1007/s00704-009-0204-z

RahimiKhoob A (2014) Comparison between M5 Model Tree and NeuralbNetworks for

Estimating Reference Evapotranspiration in an Arid Environment. Water Resour Manage

28:657–669. Doi: 10.1007/s11269-013-0506-x

Rouzegari, N., Hassanzadeh, Y., & Sattari, M. T. (2019). Using the Hybrid Simulated

Annealing-M5 Tree Algorithms to Extract the If-Then Operation Rules in a Single

Reservoir. Water Resources Management, 33(10), 3655–3672.

https://doi.org/10.1007/s11269-019-02326-4

Sattari MT, Pal M, Yurekli K & Ünlukara A (2013a) M5 model trees and neural network

based modelling of ET0 in Ankara, Turkey. Turk J Eng Environ Sci 37:211–219.

Doi: 10.3906/muh-1212-5

Sattari, M. T., Mirabbasi, R., Sushab, R. S., & Abraham, J. (2017). Prediction of Level in

Ardebil Plain Using Support Vector Regression and M5 Tree Model. Groundwater.

https://doi.org/10.1111/gwat.12620

https://www.researchgate.net/deref/http%3A%2F%2Fdx.doi.org%2F10.1007%2Fs00271-007-0090-z?_sg%5B0%5D=ObxAkOT_go5V5ks5uU481Y7r_Ejlg7OPbtlaA0KCmMXChqw59lT73gzREBytCCupTbLH6ek8NA21mqmU3DSrHpvIZw.hGZzAF-k2de8T2aAxjZTt2Q3-1Jv0OPcN9iUyHyvDhYvPpOB-j0M3Wb3vUJ6a73nrowO_e5s-ds5kPx-fAOE8A

https://doi.org/10.1007/s11269-019-02326-4

https://www.researchgate.net/deref/http%3A%2F%2Fdx.doi.org%2F10.3906%2Fmuh-1212-5?_sg%5B0%5D=G8-3GE7DGBjrAzbJ8LQ4psU3xqu2RvT5rHCKlGr_b1jNajtjEKyUZQbHheYj4npiGXauV45FabqD_o0UVwJIGDA91g.ZVye5146Nzpe2StnJFlQhQLv3bNivK_ygKqDqeF-OFxWAfQJu3x8XfLU3zm2V2YX_ZyjA60gCV9j7_IfzI0Uaw

https://doi.org/10.1111/gwat.12620

20

Sattari, M. T., Apaydin, H., & Shamshirband, S. (2020). Performance Evaluation of Deep

Learning-Based Gated Recurrent Units (GRUs) and Tree-Based Models for Estimating

ETo by Using Limited Meteorological Variables. Mathematics, 8(6), 972.

https://doi.org/10.3390/math8060972

Shabani, S., Samadianfard, S., Sattari, M. T., Mosavi, A., Shamshirband, S., Kmet, T., &

Várkonyi-Kóczy, A. R. (2020). Modeling Pan Evaporation Using Gaussian Process

Regression K-Nearest Neighbors Random Forest and Support Vector Machines;

Comparative Analysis. Atmosphere, 11(1), 66. https://doi.org/10.3390/atmos11010066

Shiri J, Kisi Ö, Anderas G, López JJ, Nazemi AH and Stuyt LCPM (2012) Daily reference

evapotranspiration modeling by using genetic programming approach in the Basque

Country (Northern Spain, Journal of Hydrology 414–415 (2012) 302–316.

Doi:10.1016/j.jhydrol.2011.11.004.

Shiri J, Sadraddini A, Nazemi AH, Kisi O, Landeras G, Fakheri Fard A, Marti P (2014)

Generalizability of gene expression programming-based approaches for estimating daily

reference evapotranspiration in coastal stations of Iran. J Hydrol 508:1–11

Sudheer KP, Gosain AK and Ramasastri KS (2003) Estimating Actual Evapotranspiration

from Limited Climatic Data Using Neural Computing Technique. J. Irrig. Drain Eng.

129:214-218. Doi: 10.1061/~ASCE/0733-9437~2003!129:3~214

Sun G, Alstad K, Chen J, Chen S, Ford CR., Lin G, Zhang Z. 2011. A general predictive

model for estimating monthly ecosystem evapotranspiration. Ecohydrology. 4(2): 245–

255. doi:10.1002/eco.194

Thornthwaite, C.W., 1948. An approach toward a national classification of climate. Geogr.

Rev. 38 (1), 55–94.

https://doi.org/10.3390/math8060972

https://doi.org/10.3390/atmos11010066

21

Thorp, K.R., Marek, G.W., DeJonge, K.C., Evett, S.R., Lascano, R.J. 2019. Novel

methodology to evaluate and compare evapotranspiration algorithms in an agroecosystem

model. Environmental Modelling & Software 119, 214-227. DOI:

10.1016/j.envsoft.2019.06.007

Traore S, Guven A (2013) New algebraic formulations of evapotranspiration extracted from

gene-expression programming in the tropical seasonally dry regions of West Africa. Irrig

Sci 31(1):1–10

Wang S., Fu Z, Chen H, Nie Y & Wang K (2015) Modeling daily reference ET in the karst

area of northwest Guangxi (China) using gene expression programming (GEP) and

artificial neural network (ANN). Theor Appl Climatol. Doi: 10.1007/s00704-015-1602-z

Wen, X., Si, J., He, Z., Wu, J., Shao, H., & Yu, H. (2015). Support-Vector-Machine-Based

Models for Modeling Daily Reference Evapotranspiration With Limited Climatic Data in

Extreme Arid Regions. Water Resources Management, 29(9), 3195–3209.

https://doi.org/10.1007/s11269-015-0990-2

Wu, L., Peng, Y., Fan, J., & Wang, Y. (2019). Machine learning models for the estimation of

monthly mean daily reference evapotranspiration based on cross-station and synthetic data.

Hydrology Research, 50(6), 1730–1750. https://doi.org/10.2166/nh.2019.060

https://doi.org/10.2166/nh.2019.060

Figures

Figure 1

Lower Cheliff Plain situation

Figure 2

Automatic meteorological station of Hmadna

Figure 3

MLP architecture

Figure 4

RBF architecture

Figure 5

General GEP model implementation and general structure (Shiri et al., 2012).

Figure 6

Performance of the best performing FFNN M2 model for both training and testing stages

Figure 7

Performance of the optimum combination of inputs based best model FFNN M11 model for both trainingand testing stages

Figure 8

Performance of the best performing RBF-NN M5 model for both training and testing stages

Figure 9

Performance of the optimum combination of inputs based best model RBF-NN M11 model for bothtraining and testing stages

Figure 10

Performance of the optimum combination of inputs/ best input combination based model GEP M11model for both training and testing stages

Figure 11

Scatter plot among observed and predicted values using best input combination based models usingtesting data set

Figure 12

Box Plot for best performing models.

Figure 13

Taylor diagram for best performing models.

Figure 14

Scatter plot among observed and predicted values using optimum input combination based modelsusing testing data set

Figure 15

Box Plot for best performing optimum number of input combination based models.

Figure 16

Taylor diagram for best performing optimum number of input combination based models

Documents

Modeling Reference Evapotranspiration in a Semiarid Area