10
1090 IEEE INTERNET OF THINGS JOURNAL, VOL. 5, NO. 2, APRIL 2018 Solar Power Generation Forecasting With a LASSO-Based Approach Ningkai Tang, Shiwen Mao , Senior Member, IEEE, Yu Wang, and R. M. Nelms, Fellow, IEEE Abstract—The smart grid (SG) has emerged as an important form of the Internet of Things. Despite the high promises of renewable energy in the SG, it brings about great challenges to the existing power grid due to its nature of intermittent and uncontrollable generation. In order to fully harvest its potential, accurate forecasting of renewable power generation is indispens- able for effective power management. In this paper, we propose a least absolute shrinkage and selection operator (LASSO)-based forecasting model and algorithm for solar power generation forecasting. We compare the proposed scheme with two represen- tative schemes with three real world datasets. We find that the LASSO-based algorithm achieves a considerably higher accu- racy comparing to the existing methods, using fewer training data, and being robust to anomaly data points in the training data, and its variable selection capability also offers a convenient tradeoff between complexity and accuracy, which all make the proposed LASSO-based approach a highly competitive solution to forecasting of solar power generation. Index Terms—Generation forecasting, Internet of Things (IoT), least absolute shrinkage and selection operator (LASSO), machine learning, renewable energy. I. I NTRODUCTION I NTERNET of Things (IoT) is defined as uniquely identifiable objects that are organized in an Internet like structure. With technology developments and evolution of the power grid, the concept of smart grid (SG) has emerged, which is the next generation power grid, as well as an important part of the IoT [2]. An SG is an electricity network that can intelligently integrate the interactions of all users con- nected to generators, consumers, and those that assume both roles, in order to efficiently deliver sustainable, economic, and secure electricity supplies [2]. Such capabilities are enabled by the computation, communication, and control mechanisms that are incorporated with the power grid, where a large amount of interconnected wireless sensors (e.g., phasor measurement units) are deployed to obtain real-time state information, while Manuscript received September 30, 2017; revised January 8, 2018 and February 7, 2018; accepted March 1, 2018. Date of publication March 5, 2018; date of current version April 10, 2018. This work was supported in part by the U.S. NSF under Grant CNS-1702957 and Grant DMS-1736470 and in part by the Wireless Engineering Research and Education Center at Auburn University. This work was presented in part at IEEE GLOBECOM 2017, Singapore, Dec. 2017 [1]. (Corresponding author: Shiwen Mao.) N. Tang, S. Mao, and R. M. Nelms are with the Department of Electrical and Computer Engineering, Auburn University, Auburn, AL 36849 USA (e-mail: [email protected]; [email protected]; [email protected]). Y. Wang is with the Department of Electrical Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China (e-mail: [email protected]). Digital Object Identifier 10.1109/JIOT.2018.2812155 many actuators are deployed to enforce power scheduling, protection, and security decisions. Because of the IoT-based technology, SG and IoT are naturally inseparable. Recently, numerous IoT technologies has been developed to fulfill the SG’s potential, including energy distribution and manage- ment [3]–[6], load balancing [7], security and privacy [8], and the future smart building framework [9]. The SG is characterized with the two-way flow of power and information, microgrid, and distributed renew- able energy resources (DRERs) [10]. In the meantime, the rise of new energy (e.g., photovoltaic power such as solar power) has brought new challenges to such uncon- ventional power networks. Although integrating the power charge from solar power generators could reinforce the macro grid, a large and uncertain amount of power generated by micro solar grids could lead to severe energy management problems [11]. In order to fully harvest the potential of DRERs, two key techniques, load forecasting (i.e., to predict the amount of power needed to achieve the demand and supply equilib- rium) and power generation forecasting (i.e., to predict how much power will be generated at a future time), are indis- pensable. Load forecasting has been well studied in [12]–[14], with different statistics and machine learning approaches, such as nonparametric functional time series analysis, state space models, and artificial neural networks. Similarly, generation forecasting has been investigated with various models and methods as well. Since solar power generation is linked directly to solar inten- sity, the solar power forecasting problem naturally translates to a weather forecasting problem. In [15] and [16], support vector machine (SVM) and nonlinear time series are used to predict solar intensity, respectively. Other prior works such as [17]–[19] also provided various effective solutions to the solar intensity prediction problem. Although the prior works have done a good job on achieving a low error rate, there is always room for improvement for more accurate forecasting. In addition, a deeper analysis will be helpful to gain a good understanding of the problem. For example, the SVM-based method [15] achieves a low error rate, but the selection of kernel is usually based on experience. For neural network- based technologies [19], [20], the neural network structure needs to be predesigned and a quite complicated structure is needed to achieve a good precision, which, however, leads to a high computational cost. In rainy or cloudy days, the time-series-based method [21] are usually not effective to cap- ture the high variations in data. In [16], we present a local 2327-4662 c 2018 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information. Authorized licensed use limited to: Auburn University. Downloaded on April 13,2020 at 01:48:03 UTC from IEEE Xplore. Restrictions apply.

Solar Power Generation Forecasting With a LASSO-Based Approachszm0001/papers/IoT18_LASSO.pdf · Since solar power generation is linked directly to solar inten-sity, the solar power

  • Upload
    others

  • View
    2

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Solar Power Generation Forecasting With a LASSO-Based Approachszm0001/papers/IoT18_LASSO.pdf · Since solar power generation is linked directly to solar inten-sity, the solar power

1090 IEEE INTERNET OF THINGS JOURNAL, VOL. 5, NO. 2, APRIL 2018

Solar Power Generation Forecasting Witha LASSO-Based Approach

Ningkai Tang, Shiwen Mao , Senior Member, IEEE, Yu Wang, and R. M. Nelms, Fellow, IEEE

Abstract—The smart grid (SG) has emerged as an importantform of the Internet of Things. Despite the high promises ofrenewable energy in the SG, it brings about great challengesto the existing power grid due to its nature of intermittent anduncontrollable generation. In order to fully harvest its potential,accurate forecasting of renewable power generation is indispens-able for effective power management. In this paper, we proposea least absolute shrinkage and selection operator (LASSO)-basedforecasting model and algorithm for solar power generationforecasting. We compare the proposed scheme with two represen-tative schemes with three real world datasets. We find that theLASSO-based algorithm achieves a considerably higher accu-racy comparing to the existing methods, using fewer trainingdata, and being robust to anomaly data points in the trainingdata, and its variable selection capability also offers a convenienttradeoff between complexity and accuracy, which all make theproposed LASSO-based approach a highly competitive solutionto forecasting of solar power generation.

Index Terms—Generation forecasting, Internet of Things(IoT), least absolute shrinkage and selection operator (LASSO),machine learning, renewable energy.

I. INTRODUCTION

INTERNET of Things (IoT) is defined as uniquelyidentifiable objects that are organized in an Internet like

structure. With technology developments and evolution of thepower grid, the concept of smart grid (SG) has emerged, whichis the next generation power grid, as well as an importantpart of the IoT [2]. An SG is an electricity network thatcan intelligently integrate the interactions of all users con-nected to generators, consumers, and those that assume bothroles, in order to efficiently deliver sustainable, economic, andsecure electricity supplies [2]. Such capabilities are enabled bythe computation, communication, and control mechanisms thatare incorporated with the power grid, where a large amountof interconnected wireless sensors (e.g., phasor measurementunits) are deployed to obtain real-time state information, while

Manuscript received September 30, 2017; revised January 8, 2018 andFebruary 7, 2018; accepted March 1, 2018. Date of publication March 5, 2018;date of current version April 10, 2018. This work was supported in part bythe U.S. NSF under Grant CNS-1702957 and Grant DMS-1736470 and inpart by the Wireless Engineering Research and Education Center at AuburnUniversity. This work was presented in part at IEEE GLOBECOM 2017,Singapore, Dec. 2017 [1]. (Corresponding author: Shiwen Mao.)

N. Tang, S. Mao, and R. M. Nelms are with the Department of Electrical andComputer Engineering, Auburn University, Auburn, AL 36849 USA (e-mail:[email protected]; [email protected]; [email protected]).

Y. Wang is with the Department of Electrical Engineering, NanjingUniversity of Aeronautics and Astronautics, Nanjing 210016, China (e-mail:[email protected]).

Digital Object Identifier 10.1109/JIOT.2018.2812155

many actuators are deployed to enforce power scheduling,protection, and security decisions. Because of the IoT-basedtechnology, SG and IoT are naturally inseparable. Recently,numerous IoT technologies has been developed to fulfill theSG’s potential, including energy distribution and manage-ment [3]–[6], load balancing [7], security and privacy [8], andthe future smart building framework [9].

The SG is characterized with the two-way flow ofpower and information, microgrid, and distributed renew-able energy resources (DRERs) [10]. In the meantime,the rise of new energy (e.g., photovoltaic power such assolar power) has brought new challenges to such uncon-ventional power networks. Although integrating the powercharge from solar power generators could reinforce the macrogrid, a large and uncertain amount of power generated bymicro solar grids could lead to severe energy managementproblems [11].

In order to fully harvest the potential of DRERs, two keytechniques, load forecasting (i.e., to predict the amount ofpower needed to achieve the demand and supply equilib-rium) and power generation forecasting (i.e., to predict howmuch power will be generated at a future time), are indis-pensable. Load forecasting has been well studied in [12]–[14],with different statistics and machine learning approaches, suchas nonparametric functional time series analysis, state spacemodels, and artificial neural networks. Similarly, generationforecasting has been investigated with various models andmethods as well.

Since solar power generation is linked directly to solar inten-sity, the solar power forecasting problem naturally translatesto a weather forecasting problem. In [15] and [16], supportvector machine (SVM) and nonlinear time series are used topredict solar intensity, respectively. Other prior works suchas [17]–[19] also provided various effective solutions to thesolar intensity prediction problem. Although the prior workshave done a good job on achieving a low error rate, there isalways room for improvement for more accurate forecasting.In addition, a deeper analysis will be helpful to gain a goodunderstanding of the problem. For example, the SVM-basedmethod [15] achieves a low error rate, but the selection ofkernel is usually based on experience. For neural network-based technologies [19], [20], the neural network structureneeds to be predesigned and a quite complicated structure isneeded to achieve a good precision, which, however, leadsto a high computational cost. In rainy or cloudy days, thetime-series-based method [21] are usually not effective to cap-ture the high variations in data. In [16], we present a local

2327-4662 c© 2018 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.

Authorized licensed use limited to: Auburn University. Downloaded on April 13,2020 at 01:48:03 UTC from IEEE Xplore. Restrictions apply.

Page 2: Solar Power Generation Forecasting With a LASSO-Based Approachszm0001/papers/IoT18_LASSO.pdf · Since solar power generation is linked directly to solar inten-sity, the solar power

TANG et al.: SOLAR POWER GENERATION FORECASTING WITH LASSO-BASED APPROACH 1091

linear model for nonlinear time series, which leads to an accu-rate approximation and an analysis on the relationship betweenthe renewable power generation process and the weather vari-able processes. However, the importance of each variable isyet to be better identified.

In this paper, we investigate the solar power generationforecasting problem, aiming to develop an effective methodthat not only achieve a high forecasting accuracy but alsohelps to reveal the significance of weather variables. To thisend, we propose a least absolute shrinkage and selectionoperator (LASSO)-based method for solar power generationforecasting based on historical weather data. Based on a sin-gle index model (SIM) and LASSO, we develop an effectivealgorithm that maximizes Kendall’s tau coefficient to esti-mate the prediction model coefficients. The goal of variableselection is achieved by the nature of LASSO, which auto-matically reduce the weights of less important variables andincrease the sparsity of the overall coefficient vector. With theproposed algorithm, we can either maximize the predictionaccuracy using all the weather data/variables, or achieve atradeoff between accuracy and complexity by using a lim-ited number of variables. The proposed scheme is evaluatedwith the real dataset collected from a weather station [22],and comparison to two representative benchmark schemes.The proposed LASSO-based scheme outperforms both exist-ing schemes with considerable reduction in predictionerror.

The remainder of this paper is organized as follows.We introduce LASSO and the forecasting problem inSection III. Then we present our LASSO-based algorithm inSection IV. Meanwhile, we also introduce the time-series solu-tion proposed in [16], which is used as a benchmark. Wevalidate the performance of our solution and compare it withtwo benchmark schemes in Section V. We conclude this paperin Section VI.

II. RELATED WORK

Power generation prediction is an essential issue in SGnetwork, especially for system integrated with new energy,such as wind or solar generation. In order to make accurateprediction, considerable work has been done. In addition tothose discussed in the introduction section, we review severaladditional key related work here.

In [15], methods such as past-predicts-future (PPF) model,linear least square regression, and SVM regression are adoptedto solve the solar generation prediction problem. According tothe authors, linear regression and SVM outperform PPF dueto the change of weather pattern. Also, as a conclusion, SVMshows a higher accuracy while linear least square achieves aninterpretable model. Although the idea is novel, the accuracyof these models are yet to be improved.

Inspired by this paper, we propose the time-series-based algorithm TLLE [16], which separates the his-torical data into small neighborhoods and estimates thecoefficient of each neighborhood with a linear solution.While the algorithm improves the accuracy comparing tomultilinear regression (MLR) and SVM, the model also

provides simultaneous confidence interval to making furtherinterpretation for each covariate. However, the nonlinear natureof the target problem could undermine the interpretation of theresults.

Other than these statistical methods, neural network is also awidely used approach to address the problem. Zhang et al. [18]proposed several different neural network models as competi-tors. In this paper, the authors noticed the fact that weakstationarity and lack of continuity result in volatilities of solardata. Thus, the proposed models use historical data similar tothe target day to perform prediction. The accuracy for cer-tain models are great but we should be aware of the casewhen there are no historical, similar days. Also, the heavycomputation of neural networks is always a concern.

Not limited to algorithms, data preprocessing could becomea useful means to reduce error. Chen et al. [32] andShi et al. [33] proposed to classify the data by weather con-ditions before using their similar-day-based neural networkalgorithms. Specifically, Chen et al. [32] classified histori-cal data by irradiance, total cloud, and cloud cover, whileShi et al. [33] took the weather feature, such as sunny orcloudy for data classification. As certain tricks surely improvethe performance of model, we should still notice that the avail-ability and accuracy of these features are highly dependenton location of dedicated datasets, making the schemes lessflexible.

III. FORECASTING MODEL AND PROBLEM STATEMENT

A. LASSO Preliminaries

In machine learning and statistics, LASSO has become apopular method for regression analysis, ever since it was firstintroduced by Tibshirani [24] in 1996. By applying LASSOto practical problems, we benefit from two main functionsthat LASSO has: 1) regularization and 2) variable selection.Due to the nature of LASSO, while a stronger l1 penalty isused, LASSO is encouraged to shrink its coefficients of lessimportant variables to 0. In other words, it performs variableselection by dropping the corresponding variables from themodel and achieves a sparse solution in this case. On the otherhand, while a weaker l1 penalty is used, the algorithm tendsto retain most variables and predict with better regularization.The level of l1 penalty can be chosen by automatic techniqueslike cross-validation or by manually using the regularizationpath. In recent years, LASSO has been successfully applied tovarious SIMs [25]–[27] due to the above-mentioned capability.

We propose to use LASSO for solar power generationwith high accuracy. In addition, since weather data gatheredfrom the local weather station can vary in different types ofweather parameters to monitor, it is important to find outwhich variables are more important on solar power generation,especially when lacking of sufficient weather information, orwhen computation complexity is a concern. As discussed, lin-ear regression, neural networks, and SVM-based algorithmshave already been applied to the solar power generation fore-casting problem. To the best of our knowledge, this is thefirst application of LASSO to the problem, to achieve highprediction precision as well as variable selection.

Authorized licensed use limited to: Auburn University. Downloaded on April 13,2020 at 01:48:03 UTC from IEEE Xplore. Restrictions apply.

Page 3: Solar Power Generation Forecasting With a LASSO-Based Approachszm0001/papers/IoT18_LASSO.pdf · Since solar power generation is linked directly to solar inten-sity, the solar power

1092 IEEE INTERNET OF THINGS JOURNAL, VOL. 5, NO. 2, APRIL 2018

B. System Model

The solar power forecasting problem is a good match forthe SIM, which has the advantage of avoiding the so-called“curse of dimensionality” in fitting multivariate nonparametricregression functions by focusing on an index [23]. Specifically,we adopt the SIM as follows:

Y∣∣X ∼ P

(·, f(

XTβ))

(1)

where Y ∈ R is the response, X ∈ Rp are the covariates, p is

the dimension of variables, P(·, θ) represents a stochasticallyincreasing family of functions with parameter θ , β ∈ R

p isthe coefficient of the covariate X and is unit normed, and f (·)is an unknown strictly smooth increasing link function.

To relate the model in (1) to our problem, Y is our desiredestimation of solar intensity and X is the weather data collectedfrom a weather station. In our forecasting algorithm, we usea special case of the model as follows:

Y = f(

XTβ) + ε (2)

where ε is a zero mean variable with a finite variance rep-resenting error. Specifically, the weather data collected froma local weather station, X, consists of five weather data vari-ables, including temperature, humidity, dew point, wind speed,and precipitation, which compose a 5-D dataset.

C. Kendall’s Tau Coefficient

With a set of independent identically distributed data sam-ples, the proposed algorithm is capable of simultaneousvariable selection and forecasting through optimizing the rela-tionship between Y and XTβ. Although it is not clear whetherthe problem is linear or not, the MLR-based approaches didnot show a satisfying performance in [15] and [16], whichcould be an indicator that linear model is not suitable for theproblem. Therefore, we propose to use Kendall’s tau coef-ficient between Y and XTβ instead of Pearson’s correlationcoefficient [28].

Kendall’s tau coefficient is a statistic used to measure therank correlation between two quantities [28]. Comparing to thewidely used Pearson’s correlation coefficient, which is a lin-ear correlation measurement, Kendall’s tau coefficient is moresuitable for nonlinear problems. Also, due to the assumptionof monotonicity, if we ignore the outliers caused by randomerrors, the increments of Y is highly possible to be syn-chronized with XTβ. Thus, we can precisely estimate β bymaximizing the following Kendall’s tau coefficient between Yand XTβ.

Assume there are n data units, {X1, X2, . . . , Xn}, and thecorresponding response values are {Y1, Y2, . . . , Yn}, respec-tively. For discontinuous β, Kendall’s tau coefficient isexpressed as

τn(β) = 1

n(n − 1)

1≤i1 �=i2≤n

sign(

Yi2 − Yi1

) · sign(

XTi2β − XT

i1β)

(3)

where sign(·) is the signum function. For the continuous formof β, Kendall’s tau coefficient is defined as

τ ∗n (β) = 1

n(n − 1)

1≤i1 �=i2≤n

sign(

Yi2 − Yi1

)

tanh

(

XTi2β − XT

i1β

c

)

(4)

where tanh(·) is the hyperbolic tangent function and c is asmall constant, which can be seen as a given value.

IV. PROPOSED ALGORITHM

We present the proposed solution algorithm in this sec-tion. In particular, in Section IV-A, we introduce the proposedLASSO-based algorithm. In Section IV-B, we discuss how tochoose the parameters used in the algorithm. In Section IV-C,we show how to apply the proposed LASSO-based algorithmfor the solar intensity prediction problem.

A. Proposed Algorithm

With the definition of Kendall’s tau coefficient, the proposedsolution algorithm consists of two parts, i.e., coefficient esti-mation and link function estimation. The proposed algorithmconsists of the following three steps.

1) Coefficient Estimation: First, we need to find an indexj that can maximize the following value ρj, j = 1, 2, . . . , p,where p is the dimension of the variables:

ρj = 1

n(n − 1)

1≤i1 �=i2≤n

sign(Yi2 − Yi1) · sign(

Xi2j − Xi1j)

.

(5)

We call this index j1, and set β(1) = sign(ρj1)ej1 , where ej =[0, . . . , 1, . . . , 0]T is a p × 1 vector with 1 at the jth positionand 0 at all other positions.

Suppose we have Xj1 , Xj2 , . . . , Xjk−1 as the selected vari-ables, and the currently optimized coefficient is β(k−1). Forthe remaining j /∈ {j1, j2, . . . , jk−1}, we continue our procedurein parallel, solving the following problem:

βj = arg maxβj

{

τ ∗n

(

β(k−1) + βjej

)

− λ|βj|}

j /∈ {j1, j2, . . . , jk−1} (6)

where λ is a system parameter (we will discuss its selectionin detail in Section IV-B), and βj is the jth element in β(k−1).We then set jk as

jk = arg maxj/∈{j1,j2,...,jk−1}

{

τ ∗n

(

β(k−1) + βjej

)}

. (7)

The algorithm will terminate if the following condition issatisfied, where ε is a small positive threshold value:

τ ∗n

(

β(k−1) + βjk ejk

)

− τ ∗n

(

β(k−1)

)

< ε. (8)

Otherwise, we set β(k) as

β(k) = β(k−1) + βjk ejk∥∥∥β(k−1) + βjk ejk

∥∥∥

2

(9)

and repeat the above steps until the stop condition (8) issatisfied. Then we obtain the estimated coefficient vector β.

Authorized licensed use limited to: Auburn University. Downloaded on April 13,2020 at 01:48:03 UTC from IEEE Xplore. Restrictions apply.

Page 4: Solar Power Generation Forecasting With a LASSO-Based Approachszm0001/papers/IoT18_LASSO.pdf · Since solar power generation is linked directly to solar inten-sity, the solar power

TANG et al.: SOLAR POWER GENERATION FORECASTING WITH LASSO-BASED APPROACH 1093

2) Link Function Estimation: Due to the monotone assump-tion and the Kendall’s tau coefficient, we perform isotonicregression in our algorithm [29], which is usually applied fornondecreasing data. The goal is to estimate the link functionf (·), which still remains unknown. First, we define

Zi = XTi β. (10)

We then sort {Z1, Z2, . . . , Zn} in ascending order, anddenote the results as {Z(1), Z(2), . . . , Z(n)}. We also rear-range {Y1, Y2, . . . , Yn} according to {Z(1), Z(2), . . . , Z(n)}. Wenext execute the pool-adjacent-violators algorithm (PAVA, asdescribed in [30], which is a simple linear algorithm for iso-tonic regression) on the sorted Y’s, and mark the results as{Y(1), Y(2), . . . , Y(n)}.

By choosing a symmetric and smooth kernel functionKer(t), we estimate the link function f (t) as

f (t) =∑n

j=1 Ker(

1b (t − Z( j))

)

× Y( j)

∑nj=1 Ker

(1b (t − Z( j))

) (11)

where b is chosen by applying the cross validation technique.The kernal function can be any function that applies to thedata; we use the Gaussian kernel in our simulations. The kernalfunction can be any function that applies to the dataset. Weuse the Gaussian kernel in this paper, since it is widely usedand outperforms several other kernels, such as Epanechnikov,sigmoid, and quartic in our simulations.

3) Solar Intensity Prediction: With the estimated coeffi-cient vector β, the estimated link function f (t), and a newobservation X′, we can predict the solar intensity by

Y ′ = f(

X′T β)

. (12)

We manually set a threshold T for the minimum solar intensity.For example, we can set T = 0 by default since solar intensitycannot have a negative value. The final prediction result Y∗ iscomputed as

Y∗ ={

Y ′, if Y ′ > TT, otherwise.

(13)

B. Selection of System Parameter λ

The system parameter λ in (6) is one of the most importantparameters in the proposed algorithm. It is sensitive to variousproblems and should be carefully tuned. In this section, wepresent two basic methods on how to choose λ.

The first method is to use the cross validation technique.For initialization, we need to shuffle the dataset and randomlysplit the samples into five subsets (for a fivefold cross valida-tion) with equal size. Then we pick a set of possible λ valuesand τn(β) is calculated on a one-fifth data subset by using theestimated β from the remaining data. After repeating the pro-cess until all five parts have been calculated (i.e., to avoid thepossible unbalanced results caused by the randomness in data),we could choose the λ that maximizes the average estimationprecision in the cross validation process. With the proposedalgorithm, parallel computation can be employed, with which

Fig. 1. Grid search with λ from 0 to 1.

Fig. 2. Grid search with λ from 0 to 0.1.

the processing speed will be greatly increased. The cross val-idation technique is used when we have abundant time andinformation, and the best estimation precision is preferred.

Alternatively, we can use the regularization path methodto achieve a tradeoff between precision and speed. Whenwe demand more on speed and an acceptable precision isspecified, we could choose λ with the following process.

1) Choose a set of possible λ values and sort them byincreasing order.

2) Execute the proposed algorithm for each λ and recordtheir performance.

3) Plot the achieved precision performance versus thevalues of λ.

4) Choose an acceptable point on the curve to guarantee theperformance while achieving the maximized estimationspeed due to sparsity.

The regularization path method also has the potential toachieve high estimation accuracy even when information islacking.

In Figs. 1–3, we show an example of how to use the solu-tion path method with a grid search. We first search by usinga larger λ value ranging from 0.001 to 1. With the plot-ted curve and the related root mean squared error (RMSE),we zoom in the region that has smaller RMSEs, to select asmaller step size 0.001 and narrower range from 0.001 to 0.1.Then repeat this procedure. For the last round we plot thecurve with λ ranging from 0 to 0.03, since the result showsno merit to continue further, we stop here and choose themost accurate and stable λ value as 0.015. The correspond-ing solar power generation forecasting result is presented inSection V.

Authorized licensed use limited to: Auburn University. Downloaded on April 13,2020 at 01:48:03 UTC from IEEE Xplore. Restrictions apply.

Page 5: Solar Power Generation Forecasting With a LASSO-Based Approachszm0001/papers/IoT18_LASSO.pdf · Since solar power generation is linked directly to solar inten-sity, the solar power

1094 IEEE INTERNET OF THINGS JOURNAL, VOL. 5, NO. 2, APRIL 2018

Fig. 3. Grid search with λ from 0 to 0.03.

C. Prediction Methodology and Performance Measures

It is noticed that both the observational and forecastedweather dataset are time-series datasets that changes overweather patterns and time. As the result shown in [15],solar intensity depends on multiple weather variables, whichcould help us to construct an accurate prediction model. Thestructure of the dataset and the possible relationship amongthe weather variables motivate our proposed LASSO-basedmethod for developing solar intensity prediction models. Toconstruct the model, we utilize historical weather data as input,which include several forecasting data parameters and theactual solar intensity, with totally six weather variables. Theproposed algorithm establishes a function that computes solarintensity from the five forecasting weather variables. Thus,we could use it as the prediction model for future solar powergeneration. We also use part of the remaining data to test themodel’s accuracy. One unique benefit of using our proposedtechnique is the relatively low requirement for data size. Ingeneral, not too much data is needed, usually historical dataover a 15–30 day period will be sufficient.

We focus this paper on short-term forecasting for the nextfew days. We develop a model that shows a relationshipbetween solar intensity and forecasted weather data. For anytime t, we build the LASSO model by using the historical datafrom the past 30 days as an input, i.e., the data from (t − 30)

to (t−1). Using the proposed model, we then predict the solarintensity for time t. In Section V, we also compare the accu-racy of our models with different popular and efficient models,including an SVM-based model [15] and a time-series-basedmodel [16].

Using the basic SIM presented in Section III, the solarpower general prediction model is

Y ∼ P(·, f (Temperature, DewPoint, WindSpeed

Precipitation, Humidity)) (14)

where f (·) is the link function that we determine using dif-ferent prediction methods. The units of the parameters in themodel are: Temperature in degrees of Fahrenheit, DewPointin Fahrenheit, WindSpeed in miles per hour, Precipitation ininches, and Humidity in percentage between 0% and 100%.However, to avoid potential scaling problems, before applyingany selected algorithm, we normalize all feature data to havea zero mean and unit variance.

To quantify the accuracy of each model, we compute theRMSE and mean absolute percentage error (MAPE) between

the predicted solar intensity and the actually observed solarintensity. RMSE and MAPE are well-known statistical mea-sures of the accuracy of values predicted by models withrespect to the observed values. RMSE and MAPE of zeroindicate that the model exactly predicts solar intensity withno error (although this is impossible in reality). The closerthe RMSE and MAPE values are to zero, the more accuratethe model’s prediction is. RMSE and MAPE are defined as

RMSE =√√√√

1

n

n∑

t=1

(

yt − yt)2 (15)

MAPE = 100

n

n∑

t=1

∣∣∣∣

yt − yt

yt

∣∣∣∣

(16)

where n represents the number of predicted data points, yt

stands for the prediction result for data point t, and yt is theactual value of data point t.

V. SIMULATION VALIDATION

In this section, we present our simulation validation ofthe proposed LASSO-based scheme. We use three differentdatasets gathered in both U.S. and U.K. to test the proposedscheme under a variety of environments. The datasets canbe found in [22] and [31]. For comparison purpose, we usethe SVM-based method presented in [15] and the TLLEmethod [16] as benchmarks.

A. Dataset Description

The first dataset we use is gathered from a Davis Weatherstation located in Amherst, MA, USA [22]. The weather datawas collected every 5 min and the weather station is equippedwith sensors to measure temperature, wind chill, humidity,dew-point, wind speed, wind direction, rainfall, baromet-ric pressure, sunlight, and ultraviolet (UV). The dataset isrecorded for quite a long period from February 2006 to January2013. However, the dataset contains errors, which are indicatedby a value of −100 000, as well as missing data for someperiods. In the simulation study, we excluded such errors andmissing data.

We plot the recorded daily solar intensity in Amherst, MA,USA, in Fig. 4, to clearly show how the data pattern variesover time. In accordance with our general knowledge, we canobserve peaks in hot summer days and valleys in cold winterdays. Also, we can see the strong correlation between consecu-tive days. Thus, we try to use seasons and months as additionalparameters and use historical data of the past 30 consecutivedays as training samples.

The second dataset [31] is recorded in Harnhill andDiddington in the U.K. At each study location, two weatherstations are installed (four in total), each record data every30 min for rainfall, temperature, humidity, wind speed, winddirection, barometric pressure, and UV. The Harnhill datasetis from April 2011 to November 2012, while the Diddingtondataset records weather information from August 2011 toDecember 2012. Missing data in both datasets are representedby NaN. By excluding all such invalid data, the amounts

Authorized licensed use limited to: Auburn University. Downloaded on April 13,2020 at 01:48:03 UTC from IEEE Xplore. Restrictions apply.

Page 6: Solar Power Generation Forecasting With a LASSO-Based Approachszm0001/papers/IoT18_LASSO.pdf · Since solar power generation is linked directly to solar inten-sity, the solar power

TANG et al.: SOLAR POWER GENERATION FORECASTING WITH LASSO-BASED APPROACH 1095

Fig. 4. Solar intensity collected at the Davis weather station [22].

Fig. 5. Solar intensity recorded in the Diddington dataset [31].

Fig. 6. Solar intensity recorded in the Harnhill dataset [31].

of valid samples are both for 397 days. The solar intensityrecorded in Harnhill and Diddington, U.K., are plotted inFigs. 5 and 6, respectively. It can be seen weather patternvaries with time just the same as mentioned before. The samestrategy is used for both datasets, where 30 consecutive historydays are selected as training samples.

B. Results With the UMass Dataset

1) SVM-Based Method: With the UMass dataset [22], wefirst apply the SVM method since it is shown to be effectiveand widely used in prediction and classification [15]. Here,we use historical weather data as training samples and aimto predict the solar intensity data through January 1, 2013 toFebruary 28, 2013. In the simulations, we find that differentsets of training data have considerable effects on estimationaccuracy. Experimenting with all the data that is available,we achieve the optimal accuracy with the historical data fromJanuary 1, 2012 to February 28, 2012, which is exactly oneyear ahead of the target period for prediction. The predictedsolar intensity is plotted along with the observed data inFig. 7. The best RMSE achieved by the SVM-based methodis 30.1524 watts/m2. However, the MAPE for the dataset is ashigh as 468.283, which is largely due to the large deviationof the 55th day data. If we exclude that day, the MAPE of theSVM-based method will be reduced to 39.2063.

2) TLLE Method: According to [16], TLLE has beenproven to be a more accurate means when comparing toSVM and MLR-based approaches. The same UMass TraceRepository data [22] is used as in [16]. The historical datafrom January 1, 2012 to February 28, 2012 is used to constructthe TLLE model, and solar power generation is predicted forthe period from January 1, 2013 to February 28, 2013.

We plot the predicted solar intensity along with the observeddata in Fig. 8. The best RMSE we obtained with the TLLE-based method is 23.1464 watts/m2, which is about the same asthat reported in [16]. TLLE achieves a 23.2% reduction overthe SVM-based method. This result validates the advantageof TLLE comparing to the SVM-based method. We also finda very high MAPE value in this simulation, also due to theanomaly data of the 55th day. The MAPE for the remainingdata, after excluding the 55th day data, is reduced to 29.0174,which is a 26.0% reduction over the SVM-based method.

3) Proposed LASSO-Based Algorithm: Now, we apply theproposed LASSO-based method to predict solar intensity. Inthe simulation, we use a relatively smaller training sample sizeof 30, i.e., the training data here is gathered from the past 30days of the target date. Applying the proposed LASSO-basedmethod to the training data yields the prediction results thatis plotted in Fig. 9.

For the LASSO-based prediction curve in Fig. 9, the RMSEis 14.0262 watts/m2 and the MAPE is 17.817, represent-ing further 39.4% and 60.1% reductions over the TLLE-based approach, respectively. More important, these resultsare achieved with the entire 30-day original data, i.e., with-out excluding the 55th day anomaly data in the dataset. Ourmethod also achieves a very stable performance in MAPE.Furthermore, even if we reduce the number of training datafor 30 days to 15 days, the proposed LASSO-based algorithmstill achieves a reliable result, with an RMSE lower than20 watts/m2.

C. Results With the U.K. Datasets

We also apply the three algorithms to the Diddington andHarnhill datasets described in Section V-A [31] for a more

Authorized licensed use limited to: Auburn University. Downloaded on April 13,2020 at 01:48:03 UTC from IEEE Xplore. Restrictions apply.

Page 7: Solar Power Generation Forecasting With a LASSO-Based Approachszm0001/papers/IoT18_LASSO.pdf · Since solar power generation is linked directly to solar inten-sity, the solar power

1096 IEEE INTERNET OF THINGS JOURNAL, VOL. 5, NO. 2, APRIL 2018

Fig. 7. Solar power generation prediction using the SVM-based method withthe UMass dataset.

Fig. 8. Solar power generation prediction using the TLLE-based methodwith the UMass dataset.

Fig. 9. Solar power generation prediction using the proposed LASSO-basedmethod with the UMass dataset.

comprehensive evaluation. Unlike the weather in the U.S., theareas in Britain inevitably have less sunlight due to the muchmore rainy and cloudy days. This different data feature canbe a practical test to our proposed method. For both datasets

Fig. 10. Solar power prediction using SVM with the Diddington dataset.

Fig. 11. Solar power prediction using TLLE with the Diddington dataset.

from [31], we predict the solar intensity for the period fromthe 365th to 394th day.

With the Diddington dataset, we find the SVM method havegreat difficulty with the small set of training samples, whichforces us to increase the amount of training samples to 100.Here, we use the first 100 days’ weather data to train the SVMmodel and finally obtain an acceptable result as presented inFig. 10. Meanwhile, the TLLE method still works better thanSVM. We used the first 60 days’ data as the model generatingdata to achieve the best performance, which is illustrated inFig. 11. The prediction results with the proposed LASSO-based approach is presented in Fig. 12, which is obtainedwith a much smaller 30 training data size than SVM andTLLE. The overall comparison of accuracy is presented inTable I. For this dataset, the proposed LASSO-based approachachieves reductions of 76.6763% and 66.1474% in RMSE overSVM and TLLE, respectively, and reductions of 85.6069% and80.0973% in MAPE over SVM and TLLE, respectively. Notethat such considerable gains are achieved with a much smallertraining data size.

The simulation results with the Harnhill dataset arepresented in Figs. 13–15 for the three schemes. The prediction

Authorized licensed use limited to: Auburn University. Downloaded on April 13,2020 at 01:48:03 UTC from IEEE Xplore. Restrictions apply.

Page 8: Solar Power Generation Forecasting With a LASSO-Based Approachszm0001/papers/IoT18_LASSO.pdf · Since solar power generation is linked directly to solar inten-sity, the solar power

TANG et al.: SOLAR POWER GENERATION FORECASTING WITH LASSO-BASED APPROACH 1097

Fig. 12. Solar power prediction using the proposed method with theDiddington dataset.

TABLE IPREDICTION ACCURACY WITH THE DIDDINGTON DATASET

Fig. 13. Solar power prediction with SVM of Harnhill dataset.

accuracy results are summarized in Table II. Due to the dif-ferent weather pattern in U.K., the level of solar intensity isconsiderably smaller than that in U.S. Although we couldwitness a much closer RMSE achieved with all the threemethods, we should still notice their obvious difference inMAPE. For the dataset, the proposed LASSO-based approachachieves reductions of 73.4156% and 66.5391% in RMSE overSVM and TLLE, respectively, and reductions of 82.4621% and81.1672% in MAPE over SVM and TLLE, respectively. Notethat these performance gains are consistent with that observedwith the Diddington dataset in Table I.

Clearly, our LASSO-based algorithm has achieved consid-erably higher accuracy compared to the two existing methods.In addition, it requires fewer training data and is robust to

Fig. 14. Solar power prediction with TLLE of Harnhill dataset.

Fig. 15. Solar power prediction with proposed method of Harnhill dataset.

TABLE IIPREDICTION ACCURACY WITH THE HARNHILL DATASET

anomaly data points in the training data, which make it ahighly competitive solution to practical problems such asforecasting of solar power generation.

D. Variable Selection With the Proposed Scheme

A notable advantage of the LASSO-based algorithm is itsability of variable selection. By tuning the loss function withparameter λ, it allows to identify which variable(s) are more“important” to the prediction result. The process is to tune λ

in condition of an acceptable prediction accuracy, until any βj

has reached 0. Then we can treat the corresponding variable Xj

as least important. Repeating this procedure, we can identifythe second least important variable, and so forth.

Variable selection will at least provide us with two fasci-nating advantages: 1) reducing the computational complexity

Authorized licensed use limited to: Auburn University. Downloaded on April 13,2020 at 01:48:03 UTC from IEEE Xplore. Restrictions apply.

Page 9: Solar Power Generation Forecasting With a LASSO-Based Approachszm0001/papers/IoT18_LASSO.pdf · Since solar power generation is linked directly to solar inten-sity, the solar power

1098 IEEE INTERNET OF THINGS JOURNAL, VOL. 5, NO. 2, APRIL 2018

TABLE IIICORRELATION MATRIX OF THE UMASS DATASET

Fig. 16. Solar power prediction using the three selected variables with theUMass dataset.

and 2) simplifying the prediction model. Due to the struc-ture of our proposed algorithm, historical data will also beused in the prediction stage. So when less parameters areused in the prediction model, the computational cost willbe greatly reduced. In addition, a simplified model can pro-vide us a clearer understanding of the relationship betweensolar power generation and the weather parameters. We canuse the reduced model to estimate solar power generationwhen the dataset is incomplete, or to reduce the computationtime when necessary (i.e., to tradeoff between complexity andaccuracy).

As an example, we use the UMass dataset to illustrate thevariable selection procedure. Table III shows the correlationmatrix computed with the dataset. Although the problem can-not be simply defined as a linear one, we could still usethe correlation matrix to obtain an intuitive observation. Asthe matrix shows, temperature and humidity are more closelycorrelated to solar intensity, while precipitation is quite inde-pendent with most other parameters. After adjusting the λ

value to reduce the model to a 3-variable model, we havethe optimized β values listed in Table IV.

From Table IV, we find that both dewpoint and windspeedare seen as less important variables. The result for windspeedcoincides with the correlation matrix but dewpoint and pre-cipitation have a conflict. However, noticing from Table IIIthat dewpoint is tightly correlated with temperature, whileprecipitation is quite independent to temperature, the resultfor β becomes reasonable. Fig. 16 provides the predictionresult with the 3-parameter model. The RMSE in this case

TABLE IVOPTIMIZED β WITH THREE VARIABLES

is 21.2468 watts/m2, which is still better than both SVMand TLLE, but the MAPE has increased to 68.5998 due tothe inaccuracy on some small values. Such phenomenon canbe explained by the inner characteristics of variance. Whenwe use fewer variables, we actually lose a certain amountof variance and the prediction will become more unbiased tomaintain accuracy. Thus, it is expected to have a lower accu-racy and the absolute percentage error on certain small valuescould become high. The variable selection capability providesa useful tradeoff between computational/model complexity andaccuracy.

VI. CONCLUSION

In this paper, we proposed an LASSO-based algorithmthat accurately predict solar power generation with a smallamount of historical data. After presenting the detailed algo-rithm design, we compared the proposed scheme with tworepresentative existing schemes using three datasets withdifferent features. We found that the LASSO-based algo-rithm achieved considerably higher accuracy comparing tothe existing methods, using fewer training data, and beingrobust to anomaly data points in the training data. In addi-tion, the variable selection capability offered a nice tradeoffbetween complexity and accuracy. These features all made ita highly competitive solution to forecasting of solar powergeneration.

REFERENCES

[1] N. Tang, S. Mao, Y. Wang, and R. M. Nelms, “LASSO-based singleindex model for solar power generation forecasting,” in Proc. IEEEGLOBECOM, Singapore, Dec. 2017, pp. 1–6.

[2] Y. Wang, S. Mao, and R. M. Nelms, Online Algorithms for OptimalEnergy Distribution in Microgrids, 1st ed. New York, NY, USA:Springer, 2015.

[3] Y. Wang, S. Mao, and R. M. Nelms, “Distributed online algorithm foroptimal real-time energy distribution in the smart grid,” IEEE InternetThings J., vol. 1, no. 1, pp. 70–80, Feb. 2014.

[4] L. Yu, T. Jiang, and Y. Zou, “Distributed online energy management fordata centers and electric vehicles in smart grid,” IEEE Internet ThingsJ., vol. 3, no. 6, pp. 1373–1384, Dec. 2016.

[5] Y. Wang, S. Mao, and R. M. Nelms, “Online algorithm for optimal real-time energy distribution in the smart grid,” IEEE Trans. Emerg. TopicsComput., vol. 1, no. 1, pp. 10–21, Jun. 2013.

Authorized licensed use limited to: Auburn University. Downloaded on April 13,2020 at 01:48:03 UTC from IEEE Xplore. Restrictions apply.

Page 10: Solar Power Generation Forecasting With a LASSO-Based Approachszm0001/papers/IoT18_LASSO.pdf · Since solar power generation is linked directly to solar inten-sity, the solar power

TANG et al.: SOLAR POWER GENERATION FORECASTING WITH LASSO-BASED APPROACH 1099

[6] Y. Huang, S. Mao, and R. M. Nelms, “Adaptive electricity schedulingin microgrids,” IEEE Trans. Smart Grid, vol. 5, no. 1, pp. 270–281,Jan. 2014.

[7] M. Li, P. He, and L. Zhao, “Dynamic load balancing applying water-filling approach in smart grid systems,” IEEE Internet Things J., vol. 4,no. 1, pp. 247–257, Feb. 2017.

[8] Z. Guan et al., “Achieving efficient and secure data acquisition for cloud-supported Internet of Things in smart grid,” IEEE Internet Things J.,vol. 4, no. 6, pp. 1934–1944, Dec. 2017.

[9] J. Pan et al., “An Internet of Things framework for smart energy inbuildings: Designs, prototype, and experiments,” IEEE Internet ThingsJ., vol. 2, no. 6, pp. 527–537, Dec. 2015.

[10] Y. Wang, S. Mao, and R. M. Nelms, “On hierarchical power schedul-ing for the macrogrid and cooperative microgrids,” IEEE Trans. Ind.Informat., vol. 11, no. 6, pp. 1574–1584, Dec. 2015.

[11] M. J. E. Alam, K. M. Muttaqi, and D. Sutanto, “A comprehensiveassessment tool for solar PV impacts on low voltage three phase distribu-tion networks,” in Proc. IEEE ICDRET, Dhaka, Bangladesh, Jan. 2012,pp. 1–5.

[12] M. Chaouch, “Clustering-based improvement of nonparametric func-tional time series forecasting: Application to intra-day household-levelload curves,” IEEE Trans. Smart Grid, vol. 5, no. 1, pp. 411–419,Jan. 2014.

[13] V. Dordonnat, S. J. Koopman, and M. Ooms, “Dynamic factors inperiodic time-varying regressions with an application to hourly elec-tricity load modelling,” Comput. Stat. Data Anal., vol. 56, no. 11,pp. 3134–3152, Nov. 2012.

[14] H. S. Hippert, C. E. Pedreira, and R. C. Souza, “Neural networks forshort-term load forecasting: A review and evaluation,” IEEE Trans.Power Syst., vol. 16, no. 1, pp. 44–55, Feb. 2001.

[15] N. Sharma, P. Sharma, D. Irwin, and P. Shenoy, “Predicting solar gen-eration from weather forecasts using machine learning,” in Proc. IEEESmartGridComm, Brussels, Belgium, Oct. 2011, pp. 528–533.

[16] Y. Wang, G. Cao, S. Mao, and R. Nelms, “Analysis of solar generationand weather data in smart grid with simultaneous inference of non-linear time series,” in Proc. IEEE INFOCOM WKSHPS, Hong Kong,Apr./May 2015, pp. 600–605.

[17] R. J. Bessa, A. Trindade, and V. Miranda, “Spatial-temporal solar powerforecasting for smart grids,” IEEE Trans. Ind. Informat., vol. 11, no. 1,pp. 232–241, Feb. 2015.

[18] Y. Zhang, M. Beaudin, R. Taheri, H. Zareipour, and D. Wood,“Day-ahead power output forecasting for small-scale solar photo-voltaic electricity generators,” IEEE Trans. Smart Grid, vol. 6, no. 5,pp. 2253–2262, Sep. 2015.

[19] C. Wan et al., “Photovoltaic and solar power forecasting for smartgrid energy management,” CSEE J. Power Energy Syst., vol. 1, no. 4,pp. 38–46, Dec. 2015.

[20] Y. Wang, Y. Shen, S. Mao, G. Cao, and R. M. Nelms, “Adaptive learninghybrid model for solar intensity forecasting,” IEEE Trans. Ind. Informat.,to be published, doi: 10.1109/TII.2017.2789289.

[21] L. Martín et al., “Prediction of global solar irradiance based on timeseries analysis: Application to solar thermal power plants energy pro-duction planning,” Solar Energy, vol. 84, no. 10, pp. 1772–1781,Oct. 2010.

[22] University of Massachusetts. The UMass Trace Repository.Accessed: Mar. 2018. [online] Available: http://traces.cs.umass.edu/index.php/Sensors/Sensors/

[23] R. Zhang, Z. Huang, and Y. Lv, “Statistical inference for the indexparameter in single-index models,” J. Multivariate Anal., vol. 101, no. 4,pp. 1026–1041, Apr. 2010.

[24] R. Tibshirani, “Regression shrinkage and selection via the lasso,” J. Roy.Stat. Soc. B Methodol., vol. 58, no. 1, pp. 267–288, Jun. 1996.

[25] P. Zeng, T. He, and Y. Zhu, “A lasso-type approach for estimation andvariable selection in single index models,” J. Comput. Graph. Stat.,vol. 21, no. 1, pp. 92–109, Apr. 2012.

[26] W. Y. Hwang, H. H. Zhang, and S. Ghosal, “FIRST: Combining for-ward iterative selection and shrinkage in high dimensional sparse linearregression,” Stat. Interface, vol. 2, no. 3, pp. 341–348, Jun. 2009.

[27] S. Luo and S. Ghosal, “Forward selection and estimation in high dimen-sional single index models,” Stat. Methodol., vol. 33, pp. 172–179,Dec. 2016.

[28] M. G. Kendall, “A new measure of rank correlation,” Biometrika, vol. 30,nos. 1–2, pp. 81–89, Jun. 1938.

[29] R. E. Barlow, D. J. Bartholomew, J. Bremner, and H. D. Brunk,Statistical Inference Under Order Restrictions: The Theory andApplication of Isotonic Regression. New York, NY, USA: Wiley, 1972.

[30] O. Burdakov, A. Grimvall, and M. Hussian, “A generalised PAValgorithm for monotonic regression in several variables,” in Proc.COMPSTAT, Prague, Czech Republic, 2004, pp. 761–767.

[31] The Detection of Archaeological Residues using Remote-sensingTechniques (DART) Project. DART Weather Data. Accessed:Mar. 2018. [Online] Available: https://dartportal.leeds.ac.uk/dataset/dart_monitoring_weather_data/

[32] C. Chen, S. Duan, T. Cai, and B. Liu, “Online 24-h solar power forecast-ing based on weather type classification using artificial neural network,”Solar Energy, vol. 85, no. 11, pp. 2856–2870, Nov. 2011.

[33] J. Shi, W.-J. Lee, Y. Liu, Y. Yang, and P. Wang, “Forecasting power out-put of photovoltaic systems based on weather classification and supportvector machines,” IEEE Trans. Ind. Appl., vol. 48, no. 3, pp. 1064–1069,May/Jun. 2012.

Ningkai Tang received the B.E. degree in infor-mation science and engineering from SoutheastUniversity, Nanjing, China, in 2010, and the M.S.degree in electrical and computer engineering fromAuburn University, Auburn, AL, USA, in 2014,where he is currently pursuing the Ph.D. degreeat the Department of Electrical and ComputerEngineering.

He was a Technology Support Engineer withthe ZTE Corporation, Shenzhen, China. His currentresearch interests include machine learning, smart

grids, cognitive radio, full-duplex transmission, and control theory.

Shiwen Mao (S’99–M’04–SM’09) received thePh.D. degree in electrical and computer engineeringfrom Polytechnic University, Brooklyn, NY, USA,in 2004.

He is the Samuel Ginn Distinguished Professorand the Director of the Wireless EngineeringResearch and Education Center, Auburn University,Auburn, AL, USA. His current research interestsinclude wireless networks, multimedia communica-tions, and smart grids.

Dr. Mao was a recipient of the 2017 IEEEComSoc ITC Outstanding Service Award, the 2015 IEEE ComSoc TC-CSRDistinguished Service Award, the 2013 IEEE ComSoc MMTC OutstandingLeadership Award, and the NSF CAREER Award in 2010. He was a co-recipient of the Best Demo Award from IEEE SECON 2017, the BestPaper Awards from IEEE GLOBECOM in 2015 and 2016, IEEE WCNC2015, and IEEE ICC 2013, and the 2004 IEEE Communications SocietyLeonard G. Abraham Prize in the Field of Communications Systems. Heis a Distinguished Lecturer of the IEEE Vehicular Technology Society. He ison the Editorial Board of the IEEE TRANSACTIONS ON MULTIMEDIA, theIEEE INTERNET OF THINGS JOURNAL (Area Editor), IEEE MULTIMEDIA

and ACM GetMobile.

Yu Wang received the B.E. degree in measur-ing and control technology and instrumentation andthe M.E. degree in instrument science and technol-ogy from Southeast University, Nanjing, China, in2008 and 2011, respectively, and the M.S. degree inprobability and statistics and Ph.D. degree in elec-trical computer engineering from Auburn University,Auburn, AL, USA, in 2015.

Since 2015, he has been an Assistant Professorwith the Department of Electrical Engineering,Nanjing University of Aeronautics and Astronautics,

Nanjing. His current research interest includes smart grids and optimization.

R. M. Nelms (F’04) received the B.E.E. andM.S. degrees in electrical engineering from AuburnUniversity, Auburn, AL, USA, in 1980 and 1982,respectively, and the Ph.D. degree in electrical engi-neering from the Virginia Polytechnic Institute andState University, Blacksburg, VA, USA, in 1987.

He is currently a Professor and the Chair of theDepartment of Electrical and Computer Engineering,Auburn University. His current research interestsinclude power electronics, power systems, and elec-tric machinery.

Dr. Nelms is a registered Professional Engineer in the State of Alabama.

Authorized licensed use limited to: Auburn University. Downloaded on April 13,2020 at 01:48:03 UTC from IEEE Xplore. Restrictions apply.