Skip to content

Univariate time series analysis and modeling project of solar power output in the United States.

Notifications You must be signed in to change notification settings

aokdata/Solar_Power_Output_Time_Series

Repository files navigation

U.S. Solar Power Output Time Series Analysis and Forecasting

Aidan O'Keefe

Stock_Solar_Panels

*Image provided by Freepik*

Overview

Univariate time series analysis and modeling project of solar power output in the United States.

Business Overview

The North American Electric Reliability Corporation (NERC) regulates the electrical grid in the United States, Canada, and the northern part of Baja California, Mexico. They want to know how much power to expect from renewable sources so they can better prepare for power output variability. NERC has asked us to try to forecast solar output in order to help them manage the US grid better.

Data Overview

Data for this project was taken from the U.S. Energy Information Administration's Electricy Data Browser. The data has monthly frequency from Jan 2001 to Dec 2022 of solar output in thousand megawatthours. Each row is a location with power output grouped by individual state, region, and US Total.

Upon exploring the data, there is obvious seasonality with more solar power being produced in the summer months. US Monthly Solar Output by Year

Methods Overview

To preprocess the data, I turned our raw csv into a datetime indexed dataframe of only Total US Solar Power Output. After trying to use rolling statistics, differencing and other transformations, I successfully stationarized the timeseries data using time series decomposition and confirmed stationarity using the Augmented Dickey-Fuller Test. I implemented a Naive model, a variety of SARIMAX/ARIMA models (including parameter search), and a Facebook Prophet model.

Evaluation Overview

An ARIMA (4,0,1) model was chosen as the final model based on it's AIC and RMSE. On test data, the final model had a RMSE of 0.033 thousand megawatthours and a MAPE of 2.32%. We then forecasted US Solar output 12 months out using our final ARIMA(4,0,1) model (as well as the FB Prophet model for comparison).

Test- ARIMA(4,0,1)

ARIMA401_Test

Forecast- ARIMA(4,0,1)

ARIMA401_Solar_Predictions

Recommendations

I would recommend that NERC create a forecast to predict solar power supply and avoid blackouts/grid failure caused by under and overproduction. The NERC can then prepare plan to meet seasonal supply and demand in conjunction with other energy sources.

Next Steps

I would testing this forecast in the next year and calculating errors to see if the model is accurate enough to be helpful moving forwad. I would also add in data on the number on power plants as well as environmental factors (weather, solar irradiation) so the model accounts for more external factors. In the future, I would look to repeat this time series modeling and forecasting at the regional and state level in order to provide more actionable data.

Repository Structure

├── data
├── .gitignore
├── README.md
├── Solar_Power_Output_Time_Series_Notebook.ipynb
└── presentation.pdf

About

Univariate time series analysis and modeling project of solar power output in the United States.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published