08306874

1090 IEEE INTERNET OF THINGS JOURNAL, VOL. 5, NO.
2, APRIL 2018
Solar Power Generation Forecasting With

a LASSO-Based Approach
Ningkai Tang, Shiwen Mao , Senior Member, IEEE, Yu Wang, and R. M. Nelms, Fellow, IEEE
Abstract—The smart grid (SG) has emerged as an important many actuators are deployed to enforce power scheduling,
form of the Internet of Things. Despite the high promises of protection, and security decisions. Because of the IoT-based
renewable energy in the SG, it brings about great challenges technology, SG and IoT are naturally inseparable. Recently,
to the existing power grid due to its nature of intermittent and
uncontrollable generation. In order to fully harvest its potential, numerous IoT technologies has been developed to fulfill the
accurate forecasting of renewable power generation is indispens- SG’s potential, including energy distribution and manage-
able for effective power management. In this paper, we propose ment [3]–[6], load balancing [7], security and privacy [8], and
a least absolute shrinkage and selection operator (LASSO)-based the future smart building framework [9].
forecasting model and algorithm for solar power generation The SG is characterized with the two-way flow of
forecasting. We compare the proposed scheme with two represen-
tative schemes with three real world datasets. We find that the power and information, microgrid, and distributed renew-
LASSO-based algorithm achieves a considerably higher accu- able energy resources (DRERs) [10]. In the meantime,
racy comparing to the existing methods, using fewer training the rise of new energy (e.g., photovoltaic power such as
data, and being robust to anomaly data points in the training solar power) has brought new challenges to such uncon-
data, and its variable selection capability also offers a convenient ventional power networks. Although integrating the power
tradeoff between complexity and accuracy, which all make the
proposed LASSO-based approach a highly competitive solution charge from solar power generators could reinforce the macro
to forecasting of solar power generation. grid, a large and uncertain amount of power generated by
micro solar grids could lead to severe energy management
Index Terms—Generation forecasting, Internet of Things
(IoT), least absolute shrinkage and selection operator (LASSO), problems [11].
machine learning, renewable energy. In order to fully harvest the potential of DRERs, two key
techniques, load forecasting (i.e., to predict the amount of
power needed to achieve the demand and supply equilib-
I. I NTRODUCTION rium) and power generation forecasting (i.e., to predict how
much power will be generated at a future time), are indis-
NTERNET of Things (IoT) is defined as uniquely
I identifiable objects that are organized in an Internet like
structure. With technology developments and evolution of the
pensable. Load forecasting has been well studied in [12]–[14],
with different statistics and machine learning approaches, such
as nonparametric functional time series analysis, state space
power grid, the concept of smart grid (SG) has emerged, which models, and artificial neural networks. Similarly, generation
is the next generation power grid, as well as an important forecasting has been investigated with various models and
part of the IoT [2]. An SG is an electricity network that methods as well.
can intelligently integrate the interactions of all users con- Since solar power generation is linked directly to solar inten-
nected to generators, consumers, and those that assume both sity, the solar power forecasting problem naturally translates
roles, in order to efficiently deliver sustainable, economic, and to a weather forecasting problem. In [15] and [16], support
secure electricity supplies [2]. Such capabilities are enabled by vector machine (SVM) and nonlinear time series are used to
the computation, communication, and control mechanisms that predict solar intensity, respectively. Other prior works such
are incorporated with the power grid, where a large amount as [17]–[19] also provided various effective solutions to the
of interconnected wireless sensors (e.g., phasor measurement solar intensity prediction problem. Although the prior works
units) are deployed to obtain real-time state information, while have done a good job on achieving a low error rate, there is
Manuscript received September 30, 2017; revised January 8, 2018 and always room for improvement for more accurate forecasting.
February 7, 2018; accepted March 1, 2018. Date of publication March 5, 2018; In addition, a deeper analysis will be helpful to gain a good
date of current version April 10, 2018. This work was supported in part by understanding of the problem. For example, the SVM-based
the U.S. NSF under Grant CNS-1702957 and Grant DMS-1736470 and in
part by the Wireless Engineering Research and Education Center at Auburn method [15] achieves a low error rate, but the selection of
University. This work was presented in part at IEEE GLOBECOM 2017, kernel is usually based on experience. For neural network-
Singapore, Dec. 2017 [1]. (Corresponding author: Shiwen Mao.) based technologies [19], [20], the neural network structure
N. Tang, S. Mao, and R. M. Nelms are with the Department of Electrical and
Computer Engineering, Auburn University, Auburn, AL 36849 USA (e-mail: needs to be predesigned and a quite complicated structure is
nzt0007@tigermail.auburn.edu; smao@ieee.org; nelmsrm@auburn.edu). needed to achieve a good precision, which, however, leads
Y. Wang is with the Department of Electrical Engineering, Nanjing to a high computational cost. In rainy or cloudy days, the
University of Aeronautics and Astronautics, Nanjing 210016, China (e-mail:
yuwang15@nuaa.edu.cn). time-series-based method [21] are usually not effective to cap-
Digital Object Identifier 10.1109/JIOT.2018.2812155 ture the high variations in data. In [16], we present a local
2327-4662 c 2018 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
TANG et al.: SOLAR POWER GENERATION FORECASTING WITH LASSO-BASED APPROACH 1091
linear model for nonlinear time series, which leads to an accu- provides simultaneous confidence interval to making further
rate approximation and an analysis on the relationship between interpretation for each covariate. However, the nonlinear nature
the renewable power generation process and the weather vari- of the target problem could undermine the interpretation of the
able processes. However, the importance of each variable is results.
yet to be better identified. Other than these statistical methods, neural network is also a
In this paper, we investigate the solar power generation widely used approach to address the problem. Zhang et al. [18]
forecasting problem, aiming to develop an effective method proposed several different neural network models as competi-
that not only achieve a high forecasting accuracy but also tors. In this paper, the authors noticed the fact that weak
helps to reveal the significance of weather variables. To this stationarity and lack of continuity result in volatilities of solar
end, we propose a least absolute shrinkage and selection data. Thus, the proposed models use historical data similar to
operator (LASSO)-based method for solar power generation the target day to perform prediction. The accuracy for cer-
forecasting based on historical weather data. Based on a sin- tain models are great but we should be aware of the case
gle index model (SIM) and LASSO, we develop an effective when there are no historical, similar days. Also, the heavy
algorithm that maximizes Kendall’s tau coefficient to esti- computation of neural networks is always a concern.
mate the prediction model coefficients. The goal of variable Not limited to algorithms, data preprocessing could become
selection is achieved by the nature of LASSO, which auto- a useful means to reduce error. Chen et al. [32] and
matically reduce the weights of less important variables and Shi et al. [33] proposed to classify the data by weather con-
increase the sparsity of the overall coefficient vector. With the ditions before using their similar-day-based neural network
proposed algorithm, we can either maximize the prediction algorithms. Specifically, Chen et al. [32] classified histori-
accuracy using all the weather data/variables, or achieve a cal data by irradiance, total cloud, and cloud cover, while
tradeoff between accuracy and complexity by using a lim- Shi et al. [33] took the weather feature, such as sunny or
ited number of variables. The proposed scheme is evaluated cloudy for data classification. As certain tricks surely improve
with the real dataset collected from a weather station [22], the performance of model, we should still notice that the avail-
and comparison to two representative benchmark schemes. ability and accuracy of these features are highly dependent
The proposed LASSO-based scheme outperforms both exist- on location of dedicated datasets, making the schemes less
ing schemes with considerable reduction in prediction flexible.
error.
The remainder of this paper is organized as follows.
III. F ORECASTING M ODEL AND P ROBLEM S TATEMENT
We introduce LASSO and the forecasting problem in
Section III. Then we present our LASSO-based algorithm in A. LASSO Preliminaries
Section IV. Meanwhile, we also introduce the time-series solu- In machine learning and statistics, LASSO has become a
tion proposed in [16], which is used as a benchmark. We popular method for regression analysis, ever since it was first
validate the performance of our solution and compare it with introduced by Tibshirani [24] in 1996. By applying LASSO
two benchmark schemes in Section V. We conclude this paper to practical problems, we benefit from two main functions
in Section VI. that LASSO has: 1) regularization and 2) variable selection.
Due to the nature of LASSO, while a stronger l1 penalty is
used, LASSO is encouraged to shrink its coefficients of less
II. R ELATED W ORK important variables to 0. In other words, it performs variable
Power generation prediction is an essential issue in SG selection by dropping the corresponding variables from the
network, especially for system integrated with new energy, model and achieves a sparse solution in this case. On the other
such as wind or solar generation. In order to make accurate hand, while a weaker l1 penalty is used, the algorithm tends
prediction, considerable work has been done. In addition to to retain most variables and predict with better regularization.
those discussed in the introduction section, we review several The level of l1 penalty can be chosen by automatic techniques
additional key related work here. like cross-validation or by manually using the regularization
In [15], methods such as past-predicts-future (PPF) model, path. In recent years, LASSO has been successfully applied to
linear least square regression, and SVM regression are adopted various SIMs [25]–[27] due to the above-mentioned capability.
to solve the solar generation prediction problem. According to We propose to use LASSO for solar power generation
the authors, linear regression and SVM outperform PPF due with high accuracy. In addition, since weather data gathered
to the change of weather pattern. Also, as a conclusion, SVM from the local weather station can vary in different types of
shows a higher accuracy while linear least square achieves an weather parameters to monitor, it is important to find out
interpretable model. Although the idea is novel, the accuracy which variables are more important on solar power generation,
of these models are yet to be improved. especially when lacking of sufficient weather information, or
Inspired by this paper, we propose the time-series- when computation complexity is a concern. As discussed, lin-
based algorithm TLLE [16], which separates the his- ear regression, neural networks, and SVM-based algorithms
torical data into small neighborhoods and estimates the have already been applied to the solar power generation fore-
coefficient of each neighborhood with a linear solution. casting problem. To the best of our knowledge, this is the
While the algorithm improves the accuracy comparing to first application of LASSO to the problem, to achieve high
multilinear regression (MLR) and SVM, the model also prediction precision as well as variable selection.
1092 IEEE INTERNET OF THINGS JOURNAL, VOL. 5, NO. 2, APRIL 2018
B. System Model where sign(·) is the signum function. For the continuous form
The solar power forecasting problem is a good match for of β, Kendall’s tau coefficient is defined as
T
the SIM, which has the advantage of avoiding the so-called 1 Xi2 β − XTi1 β
∗
“curse of dimensionality” in fitting multivariate nonparametric τn (β) = sign Yi2 − Yi1 tanh
n(n − 1) c
regression functions by focusing on an index [23]. Specifically, 1≤i1 =i2 ≤n
we adopt the SIM as follows: (4)

where tanh(·) is the hyperbolic tangent function and c is a
Y X ∼ P ·, f XT β (1) small constant, which can be seen as a given value.
where Y ∈ R is the response, X ∈ Rp are the covariates, p is IV. P ROPOSED A LGORITHM

the dimension of variables, P(·, θ ) represents a stochastically We present the proposed solution algorithm in this sec-
increasing family of functions with parameter θ , β ∈ Rp is tion. In particular, in Section IV-A, we introduce the proposed
the coefficient of the covariate X and is unit normed, and f (·) LASSO-based algorithm. In Section IV-B, we discuss how to
is an unknown strictly smooth increasing link function. choose the parameters used in the algorithm. In Section IV-C,
To relate the model in (1) to our problem, Y is our desired we show how to apply the proposed LASSO-based algorithm
estimation of solar intensity and X is the weather data collected for the solar intensity prediction problem.
from a weather station. In our forecasting algorithm, we use
a special case of the model as follows: A. Proposed Algorithm
With the definition of Kendall’s tau coefficient, the proposed
Y = f XT β + (2)
solution algorithm consists of two parts, i.e., coefficient esti-
mation and link function estimation. The proposed algorithm
where is a zero mean variable with a finite variance rep- consists of the following three steps.
resenting error. Specifically, the weather data collected from 1) Coefficient Estimation: First, we need to find an index
a local weather station, X, consists of five weather data vari- j that can maximize the following value ρj , j = 1, 2, . . . , p,
ables, including temperature, humidity, dew point, wind speed, where p is the dimension of the variables:
and precipitation, which compose a 5-D dataset.
1
ρj = sign(Yi2 − Yi1 ) · sign Xi2 j − Xi1 j .
n(n − 1)
1≤i1 =i2 ≤n
C. Kendall’s Tau Coefficient
(5)
With a set of independent identically distributed data sam-
ples, the proposed algorithm is capable of simultaneous We call this index j1 , and set β̂ (1) = sign(ρj1 )ej1 , where ej =
variable selection and forecasting through optimizing the rela- [0, . . . , 1, . . . , 0]T is a p × 1 vector with 1 at the jth position
tionship between Y and XT β. Although it is not clear whether and 0 at all other positions.
the problem is linear or not, the MLR-based approaches did Suppose we have Xj1 , Xj2 , . . . , Xjk−1 as the selected vari-
not show a satisfying performance in [15] and [16], which ables, and the currently optimized coefficient is β̂ (k−1) . For
could be an indicator that linear model is not suitable for the the remaining j ∈ / {j1 , j2 , . . . , jk−1 }, we continue our procedure
problem. Therefore, we propose to use Kendall’s tau coef- in parallel, solving the following problem:

ficient between Y and XT β instead of Pearson’s correlation β̂j = arg max τn∗ β̂ (k−1) + βj ej − λ|βj |
coefficient [28]. βj
Kendall’s tau coefficient is a statistic used to measure the / {j1 , j2 , . . . , jk−1 }
j∈ (6)
rank correlation between two quantities [28]. Comparing to the
widely used Pearson’s correlation coefficient, which is a lin- where λ is a system parameter (we will discuss its selection
ear correlation measurement, Kendall’s tau coefficient is more in detail in Section IV-B), and βj is the jth element in β̂ (k−1) .
suitable for nonlinear problems. Also, due to the assumption We then set jk as

of monotonicity, if we ignore the outliers caused by random jk = arg max τn∗ β̂ (k−1) + β̂j ej . (7)
errors, the increments of Y is highly possible to be syn- / {j1 ,j2 ,...,jk−1 }
j∈
chronized with XT β. Thus, we can precisely estimate β by The algorithm will terminate if the following condition is
maximizing the following Kendall’s tau coefficient between Y satisfied, where is a small positive threshold value:
and XT β.

Assume there are n data units, {X1 , X2 , . . . , Xn }, and the τn∗ β̂ (k−1) + β̂jk ejk − τn∗ β̂ (k−1) < . (8)
corresponding response values are {Y1 , Y2 , . . . , Yn }, respec-
tively. For discontinuous β, Kendall’s tau coefficient is Otherwise, we set β̂ (k) as
expressed as β̂ (k−1) + β̂jk ejk
β̂(k) = (9)

1 β̂ (k−1) + β̂jk ejk
τn (β) = sign Yi2 − Yi1 · sign XTi2 β − XTi1 β 2
n(n − 1)
1≤i1 =i2 ≤n and repeat the above steps until the stop condition (8) is
(3) satisfied. Then we obtain the estimated coefficient vector β̂.
2) Link Function Estimation: Due to the monotone assump-

tion and the Kendall’s tau coefficient, we perform isotonic
regression in our algorithm [29], which is usually applied for
nondecreasing data. The goal is to estimate the link function
f (·), which still remains unknown. First, we define
Zi = XTi β̂. (10)
We then sort {Z1 , Z2 , . . . , Zn } in ascending order, and

denote the results as {Z(1) , Z(2) , . . . , Z(n) }. We also rear-
range {Y1 , Y2 , . . . , Yn } according to {Z(1) , Z(2) , . . . , Z(n) }. We
next execute the pool-adjacent-violators algorithm (PAVA, as Fig. 1. Grid search with λ from 0 to 1.
described in [30], which is a simple linear algorithm for iso-
tonic regression) on the sorted Y’s, and mark the results as
{Y(1) , Y(2) , . . . , Y(n) }.
By choosing a symmetric and smooth kernel function
Ker(t), we estimate the link function f (t) as
n
j=1 Ker 1
b (t − Z( j) ) × Y( j)
f̂ (t) = n
(11)
j=1 Ker 1
b (t − Z ( j) )
where b is chosen by applying the cross validation technique.

The kernal function can be any function that applies to the
data; we use the Gaussian kernel in our simulations. The kernal
function can be any function that applies to the dataset. We Fig. 2. Grid search with λ from 0 to 0.1.
use the Gaussian kernel in this paper, since it is widely used
and outperforms several other kernels, such as Epanechnikov,
sigmoid, and quartic in our simulations. the processing speed will be greatly increased. The cross val-
3) Solar Intensity Prediction: With the estimated coeffi- idation technique is used when we have abundant time and
cient vector β, the estimated link function f (t), and a new information, and the best estimation precision is preferred.
observation X , we can predict the solar intensity by Alternatively, we can use the regularization path method

to achieve a tradeoff between precision and speed. When
Ŷ = f̂ X β̂ .
T
(12) we demand more on speed and an acceptable precision is
specified, we could choose λ with the following process.
We manually set a threshold T for the minimum solar intensity. 1) Choose a set of possible λ values and sort them by
For example, we can set T = 0 by default since solar intensity increasing order.
cannot have a negative value. The final prediction result Ŷ ∗ is 2) Execute the proposed algorithm for each λ and record
computed as their performance.
3) Plot the achieved precision performance versus the
Ŷ , if Ŷ > T
Ŷ ∗ = (13) values of λ.
T, otherwise.
4) Choose an acceptable point on the curve to guarantee the
performance while achieving the maximized estimation
B. Selection of System Parameter λ speed due to sparsity.
The system parameter λ in (6) is one of the most important The regularization path method also has the potential to
parameters in the proposed algorithm. It is sensitive to various achieve high estimation accuracy even when information is
problems and should be carefully tuned. In this section, we lacking.
present two basic methods on how to choose λ. In Figs. 1–3, we show an example of how to use the solu-
The first method is to use the cross validation technique. tion path method with a grid search. We first search by using
For initialization, we need to shuffle the dataset and randomly a larger λ value ranging from 0.001 to 1. With the plot-
split the samples into five subsets (for a fivefold cross valida- ted curve and the related root mean squared error (RMSE),
tion) with equal size. Then we pick a set of possible λ values we zoom in the region that has smaller RMSEs, to select a
and τn (β) is calculated on a one-fifth data subset by using the smaller step size 0.001 and narrower range from 0.001 to 0.1.
estimated β from the remaining data. After repeating the pro- Then repeat this procedure. For the last round we plot the
cess until all five parts have been calculated (i.e., to avoid the curve with λ ranging from 0 to 0.03, since the result shows
possible unbalanced results caused by the randomness in data), no merit to continue further, we stop here and choose the
we could choose the λ that maximizes the average estimation most accurate and stable λ value as 0.015. The correspond-
precision in the cross validation process. With the proposed ing solar power generation forecasting result is presented in
algorithm, parallel computation can be employed, with which Section V.
the predicted solar intensity and the actually observed solar

intensity. RMSE and MAPE are well-known statistical mea-
sures of the accuracy of values predicted by models with
respect to the observed values. RMSE and MAPE of zero
indicate that the model exactly predicts solar intensity with
no error (although this is impossible in reality). The closer
the RMSE and MAPE values are to zero, the more accurate
the model’s prediction is. RMSE and MAPE are defined as

n
1 2
RMSE = ŷt − yt (15)
n
Fig. 3. Grid search with λ from 0 to 0.03. t=1
n
100 ŷt − yt
C. Prediction Methodology and Performance Measures MAPE = (16)
n y
t
It is noticed that both the observational and forecasted t=1
weather dataset are time-series datasets that changes over where n represents the number of predicted data points, ŷt
weather patterns and time. As the result shown in [15], stands for the prediction result for data point t, and yt is the
solar intensity depends on multiple weather variables, which actual value of data point t.
could help us to construct an accurate prediction model. The
structure of the dataset and the possible relationship among
V. S IMULATION VALIDATION
the weather variables motivate our proposed LASSO-based
method for developing solar intensity prediction models. To In this section, we present our simulation validation of
construct the model, we utilize historical weather data as input, the proposed LASSO-based scheme. We use three different
which include several forecasting data parameters and the datasets gathered in both U.S. and U.K. to test the proposed
actual solar intensity, with totally six weather variables. The scheme under a variety of environments. The datasets can
proposed algorithm establishes a function that computes solar be found in [22] and [31]. For comparison purpose, we use
intensity from the five forecasting weather variables. Thus, the SVM-based method presented in [15] and the TLLE
we could use it as the prediction model for future solar power method [16] as benchmarks.
generation. We also use part of the remaining data to test the
model’s accuracy. One unique benefit of using our proposed A. Dataset Description
technique is the relatively low requirement for data size. In The first dataset we use is gathered from a Davis Weather
general, not too much data is needed, usually historical data station located in Amherst, MA, USA [22]. The weather data
over a 15–30 day period will be sufficient. was collected every 5 min and the weather station is equipped
We focus this paper on short-term forecasting for the next with sensors to measure temperature, wind chill, humidity,
few days. We develop a model that shows a relationship dew-point, wind speed, wind direction, rainfall, baromet-
between solar intensity and forecasted weather data. For any ric pressure, sunlight, and ultraviolet (UV). The dataset is
time t, we build the LASSO model by using the historical data recorded for quite a long period from February 2006 to January
from the past 30 days as an input, i.e., the data from (t − 30) 2013. However, the dataset contains errors, which are indicated
to (t −1). Using the proposed model, we then predict the solar by a value of −100 000, as well as missing data for some
intensity for time t. In Section V, we also compare the accu- periods. In the simulation study, we excluded such errors and
racy of our models with different popular and efficient models, missing data.
including an SVM-based model [15] and a time-series-based We plot the recorded daily solar intensity in Amherst, MA,
model [16]. USA, in Fig. 4, to clearly show how the data pattern varies
Using the basic SIM presented in Section III, the solar over time. In accordance with our general knowledge, we can
power general prediction model is observe peaks in hot summer days and valleys in cold winter
Y ∼ P(·, f (Temperature, DewPoint, WindSpeed days. Also, we can see the strong correlation between consecu-
tive days. Thus, we try to use seasons and months as additional
Precipitation, Humidity)) (14)
parameters and use historical data of the past 30 consecutive
where f (·) is the link function that we determine using dif- days as training samples.
ferent prediction methods. The units of the parameters in the The second dataset [31] is recorded in Harnhill and
model are: Temperature in degrees of Fahrenheit, DewPoint Diddington in the U.K. At each study location, two weather
in Fahrenheit, WindSpeed in miles per hour, Precipitation in stations are installed (four in total), each record data every
inches, and Humidity in percentage between 0% and 100%. 30 min for rainfall, temperature, humidity, wind speed, wind
However, to avoid potential scaling problems, before applying direction, barometric pressure, and UV. The Harnhill dataset
any selected algorithm, we normalize all feature data to have is from April 2011 to November 2012, while the Diddington
a zero mean and unit variance. dataset records weather information from August 2011 to
To quantify the accuracy of each model, we compute the December 2012. Missing data in both datasets are represented
RMSE and mean absolute percentage error (MAPE) between by NaN. By excluding all such invalid data, the amounts
B. Results With the UMass Dataset

1) SVM-Based Method: With the UMass dataset [22], we
first apply the SVM method since it is shown to be effective
and widely used in prediction and classification [15]. Here,
we use historical weather data as training samples and aim
to predict the solar intensity data through January 1, 2013 to
February 28, 2013. In the simulations, we find that different
sets of training data have considerable effects on estimation
accuracy. Experimenting with all the data that is available,
we achieve the optimal accuracy with the historical data from
January 1, 2012 to February 28, 2012, which is exactly one
year ahead of the target period for prediction. The predicted
solar intensity is plotted along with the observed data in
Fig. 7. The best RMSE achieved by the SVM-based method
is 30.1524 watts/m2 . However, the MAPE for the dataset is as
Fig. 4. Solar intensity collected at the Davis weather station [22]. high as 468.283, which is largely due to the large deviation
of the 55th day data. If we exclude that day, the MAPE of the
SVM-based method will be reduced to 39.2063.
2) TLLE Method: According to [16], TLLE has been
proven to be a more accurate means when comparing to
SVM and MLR-based approaches. The same UMass Trace
Repository data [22] is used as in [16]. The historical data
from January 1, 2012 to February 28, 2012 is used to construct
the TLLE model, and solar power generation is predicted for
the period from January 1, 2013 to February 28, 2013.
We plot the predicted solar intensity along with the observed
data in Fig. 8. The best RMSE we obtained with the TLLE-
based method is 23.1464 watts/m2 , which is about the same as
that reported in [16]. TLLE achieves a 23.2% reduction over
the SVM-based method. This result validates the advantage
of TLLE comparing to the SVM-based method. We also find
a very high MAPE value in this simulation, also due to the
Fig. 5. Solar intensity recorded in the Diddington dataset [31].
anomaly data of the 55th day. The MAPE for the remaining
data, after excluding the 55th day data, is reduced to 29.0174,
which is a 26.0% reduction over the SVM-based method.
3) Proposed LASSO-Based Algorithm: Now, we apply the
proposed LASSO-based method to predict solar intensity. In
the simulation, we use a relatively smaller training sample size
of 30, i.e., the training data here is gathered from the past 30
days of the target date. Applying the proposed LASSO-based
method to the training data yields the prediction results that
is plotted in Fig. 9.
For the LASSO-based prediction curve in Fig. 9, the RMSE
is 14.0262 watts/m2 and the MAPE is 17.817, represent-
ing further 39.4% and 60.1% reductions over the TLLE-
based approach, respectively. More important, these results
are achieved with the entire 30-day original data, i.e., with-
out excluding the 55th day anomaly data in the dataset. Our
method also achieves a very stable performance in MAPE.
Fig. 6. Solar intensity recorded in the Harnhill dataset [31].
Furthermore, even if we reduce the number of training data
for 30 days to 15 days, the proposed LASSO-based algorithm
still achieves a reliable result, with an RMSE lower than
of valid samples are both for 397 days. The solar intensity 20 watts/m2 .
recorded in Harnhill and Diddington, U.K., are plotted in
Figs. 5 and 6, respectively. It can be seen weather pattern
varies with time just the same as mentioned before. The same C. Results With the U.K. Datasets
strategy is used for both datasets, where 30 consecutive history We also apply the three algorithms to the Diddington and
days are selected as training samples. Harnhill datasets described in Section V-A [31] for a more
Fig. 7. Solar power generation prediction using the SVM-based method with
the UMass dataset. Fig. 10. Solar power prediction using SVM with the Diddington dataset.
Fig. 8. Solar power generation prediction using the TLLE-based method Fig. 11. Solar power prediction using TLLE with the Diddington dataset.
with the UMass dataset.
from [31], we predict the solar intensity for the period from
the 365th to 394th day.
With the Diddington dataset, we find the SVM method have
great difficulty with the small set of training samples, which
forces us to increase the amount of training samples to 100.
Here, we use the first 100 days’ weather data to train the SVM
model and finally obtain an acceptable result as presented in
Fig. 10. Meanwhile, the TLLE method still works better than
SVM. We used the first 60 days’ data as the model generating
data to achieve the best performance, which is illustrated in
Fig. 11. The prediction results with the proposed LASSO-
based approach is presented in Fig. 12, which is obtained
with a much smaller 30 training data size than SVM and
TLLE. The overall comparison of accuracy is presented in
Table I. For this dataset, the proposed LASSO-based approach
Fig. 9. Solar power generation prediction using the proposed LASSO-based
method with the UMass dataset. achieves reductions of 76.6763% and 66.1474% in RMSE over
SVM and TLLE, respectively, and reductions of 85.6069% and
80.0973% in MAPE over SVM and TLLE, respectively. Note
comprehensive evaluation. Unlike the weather in the U.S., the that such considerable gains are achieved with a much smaller
areas in Britain inevitably have less sunlight due to the much training data size.
more rainy and cloudy days. This different data feature can The simulation results with the Harnhill dataset are
be a practical test to our proposed method. For both datasets presented in Figs. 13–15 for the three schemes. The prediction
Fig. 12. Solar power prediction using the proposed method with the Fig. 14. Solar power prediction with TLLE of Harnhill dataset.
Diddington dataset.
TABLE I
P REDICTION ACCURACY W ITH THE D IDDINGTON DATASET
Fig. 15. Solar power prediction with proposed method of Harnhill dataset.
TABLE II
P REDICTION ACCURACY W ITH THE H ARNHILL DATASET
Fig. 13. Solar power prediction with SVM of Harnhill dataset.

anomaly data points in the training data, which make it a
highly competitive solution to practical problems such as
accuracy results are summarized in Table II. Due to the dif-
forecasting of solar power generation.
ferent weather pattern in U.K., the level of solar intensity is
considerably smaller than that in U.S. Although we could
witness a much closer RMSE achieved with all the three D. Variable Selection With the Proposed Scheme
methods, we should still notice their obvious difference in A notable advantage of the LASSO-based algorithm is its
MAPE. For the dataset, the proposed LASSO-based approach ability of variable selection. By tuning the loss function with
achieves reductions of 73.4156% and 66.5391% in RMSE over parameter λ, it allows to identify which variable(s) are more
SVM and TLLE, respectively, and reductions of 82.4621% and “important” to the prediction result. The process is to tune λ
81.1672% in MAPE over SVM and TLLE, respectively. Note in condition of an acceptable prediction accuracy, until any βj
that these performance gains are consistent with that observed has reached 0. Then we can treat the corresponding variable Xj
with the Diddington dataset in Table I. as least important. Repeating this procedure, we can identify
Clearly, our LASSO-based algorithm has achieved consid- the second least important variable, and so forth.
erably higher accuracy compared to the two existing methods. Variable selection will at least provide us with two fasci-
In addition, it requires fewer training data and is robust to nating advantages: 1) reducing the computational complexity
TABLE III
C ORRELATION M ATRIX OF THE UM ASS DATASET
TABLE IV
O PTIMIZED β W ITH T HREE VARIABLES
is 21.2468 watts/m2 , which is still better than both SVM

and TLLE, but the MAPE has increased to 68.5998 due to
the inaccuracy on some small values. Such phenomenon can
be explained by the inner characteristics of variance. When
we use fewer variables, we actually lose a certain amount
of variance and the prediction will become more unbiased to
maintain accuracy. Thus, it is expected to have a lower accu-
racy and the absolute percentage error on certain small values
could become high. The variable selection capability provides
Fig. 16. Solar power prediction using the three selected variables with the
UMass dataset.
a useful tradeoff between computational/model complexity and
accuracy.
and 2) simplifying the prediction model. Due to the struc- VI. C ONCLUSION
ture of our proposed algorithm, historical data will also be
used in the prediction stage. So when less parameters are In this paper, we proposed an LASSO-based algorithm
used in the prediction model, the computational cost will that accurately predict solar power generation with a small
be greatly reduced. In addition, a simplified model can pro- amount of historical data. After presenting the detailed algo-
vide us a clearer understanding of the relationship between rithm design, we compared the proposed scheme with two
solar power generation and the weather parameters. We can representative existing schemes using three datasets with
use the reduced model to estimate solar power generation different features. We found that the LASSO-based algo-
when the dataset is incomplete, or to reduce the computation rithm achieved considerably higher accuracy comparing to
time when necessary (i.e., to tradeoff between complexity and the existing methods, using fewer training data, and being
accuracy). robust to anomaly data points in the training data. In addi-
As an example, we use the UMass dataset to illustrate the tion, the variable selection capability offered a nice tradeoff
variable selection procedure. Table III shows the correlation between complexity and accuracy. These features all made it
matrix computed with the dataset. Although the problem can- a highly competitive solution to forecasting of solar power
not be simply defined as a linear one, we could still use generation.
the correlation matrix to obtain an intuitive observation. As
the matrix shows, temperature and humidity are more closely R EFERENCES
correlated to solar intensity, while precipitation is quite inde- [1] N. Tang, S. Mao, Y. Wang, and R. M. Nelms, “LASSO-based single
pendent with most other parameters. After adjusting the λ index model for solar power generation forecasting,” in Proc. IEEE
value to reduce the model to a 3-variable model, we have GLOBECOM, Singapore, Dec. 2017, pp. 1–6.
the optimized β values listed in Table IV. [2] Y. Wang, S. Mao, and R. M. Nelms, Online Algorithms for Optimal
Energy Distribution in Microgrids, 1st ed. New York, NY, USA:
From Table IV, we find that both dewpoint and windspeed Springer, 2015.
are seen as less important variables. The result for windspeed [3] Y. Wang, S. Mao, and R. M. Nelms, “Distributed online algorithm for
coincides with the correlation matrix but dewpoint and pre- optimal real-time energy distribution in the smart grid,” IEEE Internet
Things J., vol. 1, no. 1, pp. 70–80, Feb. 2014.
cipitation have a conflict. However, noticing from Table III [4] L. Yu, T. Jiang, and Y. Zou, “Distributed online energy management for
that dewpoint is tightly correlated with temperature, while data centers and electric vehicles in smart grid,” IEEE Internet Things
precipitation is quite independent to temperature, the result J., vol. 3, no. 6, pp. 1373–1384, Dec. 2016.
[5] Y. Wang, S. Mao, and R. M. Nelms, “Online algorithm for optimal real-
for β becomes reasonable. Fig. 16 provides the prediction time energy distribution in the smart grid,” IEEE Trans. Emerg. Topics
result with the 3-parameter model. The RMSE in this case Comput., vol. 1, no. 1, pp. 10–21, Jun. 2013.
[6] Y. Huang, S. Mao, and R. M. Nelms, “Adaptive electricity scheduling [30] O. Burdakov, A. Grimvall, and M. Hussian, “A generalised PAV
in microgrids,” IEEE Trans. Smart Grid, vol. 5, no. 1, pp. 270–281, algorithm for monotonic regression in several variables,” in Proc.
Jan. 2014. COMPSTAT, Prague, Czech Republic, 2004, pp. 761–767.
[7] M. Li, P. He, and L. Zhao, “Dynamic load balancing applying water- [31] The Detection of Archaeological Residues using Remote-sensing
filling approach in smart grid systems,” IEEE Internet Things J., vol. 4, Techniques (DART) Project. DART Weather Data. Accessed:
no. 1, pp. 247–257, Feb. 2017. Mar. 2018. [Online] Available: https://dartportal.leeds.ac.uk/dataset/
[8] Z. Guan et al., “Achieving efficient and secure data acquisition for cloud- dart_monitoring_weather_data/
supported Internet of Things in smart grid,” IEEE Internet Things J., [32] C. Chen, S. Duan, T. Cai, and B. Liu, “Online 24-h solar power forecast-
vol. 4, no. 6, pp. 1934–1944, Dec. 2017. ing based on weather type classification using artificial neural network,”
[9] J. Pan et al., “An Internet of Things framework for smart energy in Solar Energy, vol. 85, no. 11, pp. 2856–2870, Nov. 2011.
buildings: Designs, prototype, and experiments,” IEEE Internet Things [33] J. Shi, W.-J. Lee, Y. Liu, Y. Yang, and P. Wang, “Forecasting power out-
J., vol. 2, no. 6, pp. 527–537, Dec. 2015. put of photovoltaic systems based on weather classification and support
[10] Y. Wang, S. Mao, and R. M. Nelms, “On hierarchical power schedul- vector machines,” IEEE Trans. Ind. Appl., vol. 48, no. 3, pp. 1064–1069,
ing for the macrogrid and cooperative microgrids,” IEEE Trans. Ind. May/Jun. 2012.
Informat., vol. 11, no. 6, pp. 1574–1584, Dec. 2015.
[11] M. J. E. Alam, K. M. Muttaqi, and D. Sutanto, “A comprehensive Ningkai Tang received the B.E. degree in infor-
assessment tool for solar PV impacts on low voltage three phase distribu- mation science and engineering from Southeast
tion networks,” in Proc. IEEE ICDRET, Dhaka, Bangladesh, Jan. 2012, University, Nanjing, China, in 2010, and the M.S.
pp. 1–5. degree in electrical and computer engineering from
[12] M. Chaouch, “Clustering-based improvement of nonparametric func- Auburn University, Auburn, AL, USA, in 2014,
tional time series forecasting: Application to intra-day household-level where he is currently pursuing the Ph.D. degree
load curves,” IEEE Trans. Smart Grid, vol. 5, no. 1, pp. 411–419, at the Department of Electrical and Computer
Jan. 2014. Engineering.
[13] V. Dordonnat, S. J. Koopman, and M. Ooms, “Dynamic factors in He was a Technology Support Engineer with
periodic time-varying regressions with an application to hourly elec- the ZTE Corporation, Shenzhen, China. His current
tricity load modelling,” Comput. Stat. Data Anal., vol. 56, no. 11, research interests include machine learning, smart
pp. 3134–3152, Nov. 2012. grids, cognitive radio, full-duplex transmission, and control theory.
[14] H. S. Hippert, C. E. Pedreira, and R. C. Souza, “Neural networks for
short-term load forecasting: A review and evaluation,” IEEE Trans. Shiwen Mao (S’99–M’04–SM’09) received the
Power Syst., vol. 16, no. 1, pp. 44–55, Feb. 2001. Ph.D. degree in electrical and computer engineering
[15] N. Sharma, P. Sharma, D. Irwin, and P. Shenoy, “Predicting solar gen- from Polytechnic University, Brooklyn, NY, USA,
eration from weather forecasts using machine learning,” in Proc. IEEE in 2004.
SmartGridComm, Brussels, Belgium, Oct. 2011, pp. 528–533. He is the Samuel Ginn Distinguished Professor
[16] Y. Wang, G. Cao, S. Mao, and R. Nelms, “Analysis of solar generation and the Director of the Wireless Engineering
and weather data in smart grid with simultaneous inference of non- Research and Education Center, Auburn University,
linear time series,” in Proc. IEEE INFOCOM WKSHPS, Hong Kong, Auburn, AL, USA. His current research interests
Apr./May 2015, pp. 600–605. include wireless networks, multimedia communica-
[17] R. J. Bessa, A. Trindade, and V. Miranda, “Spatial-temporal solar power tions, and smart grids.
forecasting for smart grids,” IEEE Trans. Ind. Informat., vol. 11, no. 1, Dr. Mao was a recipient of the 2017 IEEE
pp. 232–241, Feb. 2015. ComSoc ITC Outstanding Service Award, the 2015 IEEE ComSoc TC-CSR
[18] Y. Zhang, M. Beaudin, R. Taheri, H. Zareipour, and D. Wood, Distinguished Service Award, the 2013 IEEE ComSoc MMTC Outstanding
“Day-ahead power output forecasting for small-scale solar photo- Leadership Award, and the NSF CAREER Award in 2010. He was a co-
voltaic electricity generators,” IEEE Trans. Smart Grid, vol. 6, no. 5, recipient of the Best Demo Award from IEEE SECON 2017, the Best
pp. 2253–2262, Sep. 2015. Paper Awards from IEEE GLOBECOM in 2015 and 2016, IEEE WCNC
[19] C. Wan et al., “Photovoltaic and solar power forecasting for smart 2015, and IEEE ICC 2013, and the 2004 IEEE Communications Society
grid energy management,” CSEE J. Power Energy Syst., vol. 1, no. 4, Leonard G. Abraham Prize in the Field of Communications Systems. He
pp. 38–46, Dec. 2015. is a Distinguished Lecturer of the IEEE Vehicular Technology Society. He is
[20] Y. Wang, Y. Shen, S. Mao, G. Cao, and R. M. Nelms, “Adaptive learning on the Editorial Board of the IEEE T RANSACTIONS ON M ULTIMEDIA, the
hybrid model for solar intensity forecasting,” IEEE Trans. Ind. Informat., IEEE I NTERNET OF T HINGS J OURNAL (Area Editor), IEEE M ULTIMEDIA
to be published, doi: 10.1109/TII.2017.2789289. and ACM GetMobile.
[21] L. Martín et al., “Prediction of global solar irradiance based on time
series analysis: Application to solar thermal power plants energy pro- Yu Wang received the B.E. degree in measur-
duction planning,” Solar Energy, vol. 84, no. 10, pp. 1772–1781, ing and control technology and instrumentation and
Oct. 2010. the M.E. degree in instrument science and technol-
[22] University of Massachusetts. The UMass Trace Repository. ogy from Southeast University, Nanjing, China, in
Accessed: Mar. 2018. [online] Available: http://traces.cs.umass.edu/ 2008 and 2011, respectively, and the M.S. degree in
index.php/Sensors/Sensors/ probability and statistics and Ph.D. degree in elec-
[23] R. Zhang, Z. Huang, and Y. Lv, “Statistical inference for the index trical computer engineering from Auburn University,
parameter in single-index models,” J. Multivariate Anal., vol. 101, no. 4, Auburn, AL, USA, in 2015.
pp. 1026–1041, Apr. 2010. Since 2015, he has been an Assistant Professor
[24] R. Tibshirani, “Regression shrinkage and selection via the lasso,” J. Roy. with the Department of Electrical Engineering,
Stat. Soc. B Methodol., vol. 58, no. 1, pp. 267–288, Jun. 1996. Nanjing University of Aeronautics and Astronautics,
[25] P. Zeng, T. He, and Y. Zhu, “A lasso-type approach for estimation and Nanjing. His current research interest includes smart grids and optimization.
variable selection in single index models,” J. Comput. Graph. Stat.,
vol. 21, no. 1, pp. 92–109, Apr. 2012. R. M. Nelms (F’04) received the B.E.E. and
[26] W. Y. Hwang, H. H. Zhang, and S. Ghosal, “FIRST: Combining for- M.S. degrees in electrical engineering from Auburn
ward iterative selection and shrinkage in high dimensional sparse linear University, Auburn, AL, USA, in 1980 and 1982,
regression,” Stat. Interface, vol. 2, no. 3, pp. 341–348, Jun. 2009. respectively, and the Ph.D. degree in electrical engi-
[27] S. Luo and S. Ghosal, “Forward selection and estimation in high dimen- neering from the Virginia Polytechnic Institute and
sional single index models,” Stat. Methodol., vol. 33, pp. 172–179, State University, Blacksburg, VA, USA, in 1987.
Dec. 2016. He is currently a Professor and the Chair of the
[28] M. G. Kendall, “A new measure of rank correlation,” Biometrika, vol. 30, Department of Electrical and Computer Engineering,
nos. 1–2, pp. 81–89, Jun. 1938. Auburn University. His current research interests
[29] R. E. Barlow, D. J. Bartholomew, J. Bremner, and H. D. Brunk, include power electronics, power systems, and elec-
Statistical Inference Under Order Restrictions: The Theory and tric machinery.
Application of Isotonic Regression. New York, NY, USA: Wiley, 1972. Dr. Nelms is a registered Professional Engineer in the State of Alabama.

08306874

Încărcat de

Informații document

Drepturi de autor

Formate disponibile

Partajați acest document

Partajați sau inserați document

Opțiuni de partajare

Vi se pare util acest document?

Este necorespunzător acest conținut?

Drepturi de autor:

Formate disponibile

08306874

Încărcat de

Drepturi de autor:

Formate disponibile

1090 IEEE INTERNET OF THINGS JOURNAL, VOL. 5, NO.

Solar Power Generation Forecasting With

we adopt the SIM as follows: (4)

where Y ∈ R is the response, X ∈ Rp are the covariates, p is IV. P ROPOSED A LGORITHM

2) Link Function Estimation: Due to the monotone assump-

Zi = XTi β̂. (10)

We then sort {Z1 , Z2 , . . . , Zn } in ascending order, and

where b is chosen by applying the cross validation technique.

the predicted solar intensity and the actually observed solar

B. Results With the UMass Dataset

Fig. 13. Solar power prediction with SVM of Harnhill dataset.

is 21.2468 watts/m2 , which is still better than both SVM

S-ar putea să vă placă și