Wind Power Generation Dataset#

风力功率数据集的一些特点#

Sample Time#

These data is collected from the Supervisory Control and Data Acquisition (SCADA) system. The SCADA data are sampled every 10 minutes from each wind turbine in the wind farm which consists of 134 wind turbines. The statistic of this dataset is shown below.

Days Interval # of columns # of turbines # of records
245 10minutes 13 134 4,727,520

This dataset includes critical external features, such as wind speed, wind direction, and external temperature, that influence the wind power generation; as well as essential internal features, such as the inside temperature, nacelle direction and Pitch angle of blades, which can indicate the operating status of each wind turbine.

Evaluation#

We aim at addressing the forecasting ahead of 48 hours. Fore example, given at 6:00 A.M. today, it is required to effectively forecast the wind power generation beginning from 6:00 A.M. on this day to 5:50 AM on the day after tomorrow, given a series of historical records of the wind farm and the related wind turbines. It is required to output the predicted values every 10 minutes. To be specific, at one time point, it is required to predict a future length-280 wind power supply time-series. The average of RMSE (Root Mean Square Error) and MAE (Mean Absolute Error) is used as the main evaluation score.

Caveats about the data#