WTD dataset is collected from two abnormal-record wind turbines in a large-scale wind farm. Due to the requirements of business confidentiality, the data are processed through data masking, such as renaming parameters, hiding timestamps and standardizing data. The abnormality in dataset is the cracking of main bearing, and the main reason is frequent wind speed varieties. When the abnormalities of abrasion and wearing happen in the bearing, the temperature changes because the bearing suffers from imbalanced mechanical friction in the rotating process. Therefore, we study from the collected 10 parameters such as environmental temperature, wind speed, hub speed, hub temperature, generator-side active power.
The data are collected lasting for 1 to two years, and time interval of data collection is 10 minutes. According to on-the-spot operation and maintenance (O&M) records from wind farm operators, we divide the dataset into training set and testing set. The quantity and abnormal percentage of these two datasets are shown in the following table.
Dataset | Time | Training size | Testing size(Normal/Abnormal) | Dim. | Abnormal proportion |
---|---|---|---|---|---|
WT03 | Jun. 2019 to Apr. 2021 | 24133 | 28657(14122/15516) | 10 | 54.1% |
WT23 | Jun. 2019 to May 2021 | 64966 | 21863(14535/6347) | 10 | 43.7% |
On the basis of O&M records of wind farm and visual analysis of data, the data are labelled manually. For instance, according to on-the-spot records, the abnormal data labelled in WT23 dataset on 9, Feb., 2021 are found. Then we visualize the data such as the difference in temperature and distribution, etc. before and after abnormality occurs for three months. Finally the data deviating from normal mode are labelled through observation.