Neural networks have been applied to forecasting time series data. However, they cannot optimally capture the features of financial datasets. In this study, we analyzed various loss functions that can be employed to optimize the forecasting of day-ahead electricity spot prices. We first outlined a set of properties that such loss functions should possess. We then proposed Theil UII-S, a novel loss function derived from Theil's forecast accuracy coefficient. We trained five neural network models using the two most commonly used loss functions (mean squared error and mean absolute error) as well as Theil UII and Theil UII-S. Our results showed that Theil UII-S outperforms both the mean squared error and the mean absolute error in forecasting day-ahead electricity spot prices. Furthermore, we tested these models on a real-world dataset of electricity spot market prices in Norway. We believe that our study makes a significant contribution to the literature because we show that Theil UII-S provides accurate forecasts in the average, best-case, and worst-case scenarios, converges faster, is twice differentiable, and has a variable gradient.
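As a hedged illustration of the base coefficient, the sketch below implements the standard Theil UII (U2) statistic in NumPy: the RMSE of the forecast normalized by the root mean square of the actuals. The exact UII-S smoothing is defined in the paper and is not reproduced here, so treat this as an assumption about the base formula only.

```python
import numpy as np

def theil_uii(y_true, y_pred, eps=1e-12):
    """Base Theil UII (U2) statistic.

    0 indicates a perfect forecast; 1 matches a naive all-zero forecast.
    `eps` (an illustrative guard, not from the paper) avoids division by zero.
    """
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    num = np.sqrt(np.mean((y_pred - y_true) ** 2))
    den = np.sqrt(np.mean(y_true ** 2)) + eps
    return num / den
```

Note that the square roots make this form non-differentiable when the numerator is zero, which motivates a smoothed variant such as UII-S for gradient-based training.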
We recommend creating a folder entitled 'Time Series' and saving all the Python files in it. This will allow you to include file paths that point to the modules we created and that you need to import.
We analyzed the problem of day-ahead electricity spot price forecasting in Norway.
Our dataset includes consecutive recordings of 2600 days, from January 2nd, 2013 to February 14th, 2020 (source: NORDPOOL, https://www.nordpoolgroup.com/). After the preprocessing phase, the dataset was divided into a training set (the first 1600 days), a validation set (the next 400 days), and a test set (the last 600 days).
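The chronological split above can be sketched as follows. The frame here is a random placeholder standing in for the preprocessed dataset, and the column name is illustrative, not the actual schema.

```python
import numpy as np
import pandas as pd

# Placeholder frame standing in for the 2600 preprocessed daily records.
df = pd.DataFrame(
    {"price": np.random.default_rng(0).standard_normal(2600)},
    index=pd.date_range("2013-01-02", periods=2600, freq="D"),
)

# Chronological split: no shuffling, so the models never see future data.
train = df.iloc[:1600]      # first 1600 days
val = df.iloc[1600:2000]    # next 400 days
test = df.iloc[2000:]       # last 600 days
```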
We used seven selected input variables (explanatory variables):
The main dataset used is entitled:
To ensure that our results were robust and not specific to any one neural network architecture, we developed five models to test our loss function: a feed-forward neural network (FFNN), a convolutional neural network (CNN), a recurrent neural network (RNN), a long short-term memory (LSTM) neural network, and a gated recurrent unit (GRU) neural network. Our design follows the principle of simplicity: each model has one hidden layer of 64 neurons. As the activation function, we primarily used ReLU. We also used RMSprop as the stochastic gradient descent optimization algorithm for the models.
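As a minimal sketch of the shared architecture, here is a NumPy forward pass for the FFNN case: one hidden layer of 64 ReLU units followed by a linear output. The weights here are random placeholders; in the repository they are learned with RMSprop, and the other four models replace the dense hidden layer with their respective recurrent or convolutional layers.

```python
import numpy as np

rng = np.random.default_rng(0)
n_features = 7   # the seven explanatory variables
hidden = 64      # one hidden layer of 64 neurons, common to all five models

# Illustrative (untrained) weights; in practice these are fit with RMSprop.
W1 = rng.standard_normal((n_features, hidden)) * 0.1
b1 = np.zeros(hidden)
W2 = rng.standard_normal((hidden, 1)) * 0.1
b2 = np.zeros(1)

def ffnn_forward(x):
    """Forward pass: Dense(64, ReLU) -> Dense(1, linear)."""
    h = np.maximum(0.0, x @ W1 + b1)  # hidden layer with ReLU activation
    return h @ W2 + b2                # linear output = day-ahead price forecast
```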
To reproduce the results, you need the following Python files:
In this study, we deployed five different techniques to deal with missing values and compared their results before selecting the one best suited to our dataset:
KNN performed best on our dataset. Therefore, we used it to fill in the missing data points in 'Models building'.
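A minimal sketch of the KNN step, using scikit-learn's `KNNImputer`. This is an assumption about the implementation; the repository's own code and choice of `n_neighbors` may differ.

```python
import numpy as np
from sklearn.impute import KNNImputer

# Toy matrix standing in for the price/explanatory features, with gaps.
X = np.array([[1.0, 2.0],
              [np.nan, 3.0],
              [3.0, np.nan],
              [4.0, 5.0]])

# Each missing entry is replaced by the mean of that feature over the
# k nearest rows, measured with a NaN-aware Euclidean distance.
imputer = KNNImputer(n_neighbors=2)
X_filled = imputer.fit_transform(X)
```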
To reproduce the results, you need the following Python files:
To validate our choice of input variables, we explored the correlation between the dataset's features (target and explanatory) by computing the Pearson, Spearman, and Kendall correlation matrices.
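The three correlation matrices can be computed directly with pandas. The column names below are placeholders for the actual target and explanatory variables, and the data is synthetic.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(1)
# Placeholder frame: 'price' stands in for the target,
# 'x1'/'x2' for explanatory variables.
df = pd.DataFrame({"price": rng.standard_normal(200)})
df["x1"] = df["price"] * 0.8 + rng.standard_normal(200) * 0.2  # correlated
df["x2"] = rng.standard_normal(200)                            # uncorrelated

pearson = df.corr(method="pearson")    # linear association
spearman = df.corr(method="spearman")  # monotonic (rank) association
kendall = df.corr(method="kendall")    # rank concordance
```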
To reproduce the results, you need the following Python files: