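Wind Turbine Operating Data Analysis
Using R, this post analyzes operating data from seven 850 kW wind turbines at a wind farm in Germany. The data set contains roughly 50,000 records.
Read the data and preview the first rows: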
> data <- read.csv(file.choose())
> head(data)
PCTimeStamp
1 1/1/13
2 1/1/13 0:10
3 1/1/13 0:20
4 1/1/13 0:30
5 1/1/13 0:40
6 1/1/13 0:50
WTG01_Grid.Production.PossiblePower.Avg...1.
1 817
2 732
3 764
4 773
5 689
6 735
WTG02_Grid.Production.PossiblePower.Avg...2.
1 805
2 790
3 774
4 769
5 690
6 753
WTG03_Grid.Production.PossiblePower.Avg...3.
1 786
2 763
3 793
4 759
5 711
6 808
WTG04_Grid.Production.PossiblePower.Avg...4.
1 809
2 809
3 821
4 813
5 800
6 830
WTG05_Grid.Production.PossiblePower.Avg...5.
1 755
2 771
3 736
4 627
5 749
6 832
WTG06_Grid.Production.PossiblePower.Avg...6.
1 745
2 758
3 668
4 717
5 749
6 757
WTG07_Grid.Production.PossiblePower.Avg...7.
1 743
2 811
3 656
4 752
5 723
6 797
WTG01_Total.Active.power..8.
1 3109970
2 3110050
3 3110130
4 3110211
5 3110288
6 3110366
WTG02_Total.Active.power..9.
1 609852
2 609933
3 610014
4 610095
5 610176
6 610257
WTG03_Total.Active.power..10.
1 3254759
2 3254839
3 3254920
4 3255002
5 3255083
6 3255164
WTG04_Total.Active.power..11.
1 3341303
2 3341384
3 3341465
4 3341546
5 3341628
6 3341709
WTG05_Total.Active.power..12.
1 3230186
2 3230266
3 3230346
4 3230421
5 3230501
6 3230582
WTG06_Total.Active.power..13.
1 3264175
2 3264255
3 3264336
4 3264417
5 3264499
6 3264579
WTG07_Total.Active.power..14.
1 3136754
2 3136835
3 3136914
4 3136994
5 3137075
6 3137156
MET_Avg..Wind.speed.1..15.
1 11.3
2 12.0
3 11.6
4 11.8
5 11.2
6 11.1
MET_Min..Wind.speed.1..16.
1 8.1
2 9.1
3 7.1
4 9.4
5 8.1
6 7.1
MET_Max..Wind.speed.1..17. GRID1_KWH_DEL
1 14.5 2510065
2 15.4 2510615
3 16.7 2511165
4 14.2 2511714
5 14.3 2512265
6 14.4 2512815
Rename the columns:
> new.names <- c("date_time",
+   "T1_Possible_Power", "T2_Possible_Power", "T3_Possible_Power", "T4_Possible_Power",
+   "T5_Possible_Power", "T6_Possible_Power", "T7_Possible_Power",
+   "T1_Total_Active_Power", "T2_Total_Active_Power", "T3_Total_Active_Power", "T4_Total_Active_Power",
+   "T5_Total_Active_Power", "T6_Total_Active_Power", "T7_Total_Active_Power",
+   "mean_wind_mps", "min_wind_mps", "max_wind_mps", "cum_energy_delivered_kwh")
> cbind(names(data), new.names)  # compare old and new names side by side before assigning
> names(data) <- new.names
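Because the 19 names follow a regular pattern, they could also be generated instead of typed by hand. The following is a minimal sketch; the names `turbines` and `gen.names` are illustrative and not part of the original analysis, and the result is intended to match `new.names` element for element.

```r
# Build the same 19 column names programmatically (illustrative sketch).
turbines <- paste0("T", 1:7)  # "T1" ... "T7"
gen.names <- c(
  "date_time",
  paste0(turbines, "_Possible_Power"),
  paste0(turbines, "_Total_Active_Power"),
  "mean_wind_mps", "min_wind_mps", "max_wind_mps",
  "cum_energy_delivered_kwh"
)
length(gen.names)  # 1 + 7 + 7 + 3 + 1 = 19
```

A check such as `stopifnot(length(gen.names) == ncol(data))` before assigning would catch a mismatch between the name vector and the data frame.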
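Split the data into subsets by type:
Most of the time, only the cumulative energy and the wind speed data are needed.
> Cumulative <- subset(data, select = c(1, 19))    # timestamp + cumulative delivered energy
> Possible <- subset(data, select = 1:8)           # timestamp + possible power, T1 to T7
> Active <- subset(data, select = c(1, 9:15))      # timestamp + total active power, T1 to T7
> Wind <- subset(data, select = c(1, 16:18))       # timestamp + mean/min/max wind speed
> dat <- data                                      # working copy of the full data set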
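Selecting columns by position works only as long as the column order never changes. As a hedged alternative, the same subsets can be selected by name pattern. The helper `select_by_pattern` and the toy data frame below are hypothetical; the toy values are taken from the first rows shown earlier.

```r
# Toy data frame with a few of the renamed columns (values from head(data)).
toy <- data.frame(
  date_time = c("1/1/13", "1/1/13 0:10", "1/1/13 0:20"),
  T1_Possible_Power = c(817L, 732L, 764L),
  T2_Possible_Power = c(805L, 790L, 774L),
  T1_Total_Active_Power = c(3109970L, 3110050L, 3110130L),
  mean_wind_mps = c(11.3, 12.0, 11.6)
)

# Hypothetical helper: keep date_time plus every column whose name matches a pattern.
select_by_pattern <- function(df, pattern) {
  df[, c("date_time", grep(pattern, names(df), value = TRUE)), drop = FALSE]
}

Possible_toy <- select_by_pattern(toy, "_Possible_Power$")
Wind_toy     <- select_by_pattern(toy, "_wind_mps$")
names(Possible_toy)
```

On the full data set, the same call with `"_Total_Active_Power$"` would be expected to return the timestamp plus the seven active power columns.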
Descriptive statistics:
1. Data dimensions:
> dim(dat)
[1] 52560 19
In total: 52,560 rows and 19 columns.
2. Data types:
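The row count is consistent with one complete non-leap year of 10-minute records. The sketch below checks that arithmetic; that the data cover exactly the calendar year 2013 is an assumption based on the first timestamps, not something the output above confirms.

```r
# Assumption: one full non-leap year sampled every 10 minutes.
intervals_per_day <- 24 * 6              # six 10-minute intervals per hour
expected_rows <- 365 * intervals_per_day
expected_rows                            # 52560

# Same count from an explicit 10-minute grid for 2013 (UTC avoids daylight-saving gaps).
grid <- seq(as.POSIXct("2013-01-01 00:00", tz = "UTC"),
            as.POSIXct("2013-12-31 23:50", tz = "UTC"),
            by = "10 min")
length(grid) == expected_rows
```

If the real row count differed from `expected_rows`, that would point to missing or duplicated intervals worth investigating.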
> str(dat)
'data.frame': 52560 obs. of 19 variables:
$ date_time : Factor w/ 52560 levels "1/1/13","1/1/13 0:10",..: 1 2 3 4 5 6 7 8 9 10 ...
$ T1_Possible_Power : int 817 732 764 773 689 735 782 814 730 727 ...
$ T2_Possible_Power : int 805 790 774 769 690 753 818 796 735 773 ...
$ T3_Possible_Power : int 786 763 793 759 711 808 736 789 736 768 ...
$ T4_Possible_Power : int 809 809 821 813 800 830 822 835 805 832 ...
$ T5_Possible_Power : int 755 771 736 627 749 832 713 747 780 797 ...
$ T6_Possible_Power : int 745 758 668 717 749 757 638 696 716 749 ...
$ T7_Possible_Power : int 743 811 656 752 723 797 654 736 683 799 ...
$ T1_Total_Active_Power : int 3109970 3110050 3110130 3110211 3110288 3110366 3110447 3110527 3110604 3110682 ...
$ T2_Total_Active_Power : int 609852 609933 610014 610095 610176 610257 610338 610419 610499 610579 ...
$ T3_Total_Active_Power : int 3254759 3254839 3254920 3255002 3255083 3255164 3255244 3255325 3255406 3255487 ...
$ T4_Total_Active_Power : int 3341303 3341384 3341465 3341546 3341628 3341709 3341790 3341870 3341952 3342033 ...
$ T5_Total_Active_Power : int 3230186 3230266 3230346 3230421 3230501 3230582 3230662 3230742 3230823 3230904 ...
$ T6_Total_Active_Power : int 3264175 3264255 3264336 3264417 3264499 3264579 3264660 3264740 3264821 3264902 ...
$ T7_Total_Active_Power : int 3136754 3136835 3136914 3136994 3137075 3137156 3137234 3137314 3137394 3137474 ...
$ mean_wind_mps : num 11.3 12 11.6 11.8 11.2 11.1 12 11.3 11.9 11.2 ...
$ min_wind_mps : num 8.1 9.1 7.1 9.4 8.1 7.1 9 6.9 8.6 7.9 ...
$ max_wind_mps : num 14.5 15.4 16.7 14.2 14.3 14.4 14.8 14.5 15.5 14.8 ...
$ cum_energy_delivered_kwh: int 2510065 2510615 2511165 2511714 2512265 2512815 2513365 2513915 2514465 2515015 ...
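The `str` output shows that `date_time` was read as a factor with 52,560 levels, so R treats it as text rather than as time. A minimal sketch of converting it to `POSIXct` follows. Several points are assumptions: the month/day/year order (the rows shown cannot distinguish it from day/month/year), the reading of a date without a time as midnight, and the UTC time zone. The helper name `parse_ts` is hypothetical.

```r
# Hypothetical helper: convert strings like "1/1/13 0:10" to POSIXct.
# Assumptions: month/day/year order; entries without a time mean 0:00; UTC.
parse_ts <- function(x, date_fmt = "%m/%d/%y") {
  x <- as.character(x)                       # factor -> character labels
  no_time <- !grepl(":", x, fixed = TRUE)    # e.g. "1/1/13" has no time part
  x[no_time] <- paste(x[no_time], "0:00")
  as.POSIXct(x, format = paste(date_fmt, "%H:%M"), tz = "UTC")
}

ts <- parse_ts(factor(c("1/1/13", "1/1/13 0:10", "1/1/13 0:20")))
diff(ts)  # spacing between consecutive records
```

If the file actually uses day/month/year, passing `date_fmt = "%d/%m/%y"` would be the corresponding change; any `NA` in the result indicates strings that did not match the assumed format.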
3. Summary statistics (quantiles):
> summary(dat)
date_time T1_Possible_Power
1/1/13 : 1 Min. : -3.0
1/1/13 0:10: 1 1st Qu.:191.0
1/1/13 0:20: 1 Median :518.0
1/1/13 0:30: 1 Mean :475.9
1/1/13 0:40: 1 3rd Qu.:772.0
1/1/13 0:50: 1 Max. :850.0
(Other) :52554 NA's
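The summary for `T1_Possible_Power` shows a minimum of -3.0, a maximum of 850.0 (equal to the rated power), and an entry for missing values. The sketch below counts missing and out-of-range readings in one column. The valid range of 0 to 850 kW is an assumption derived from the rated power, the helper `check_power` is hypothetical, and the toy values are made up for illustration.

```r
# Toy vector standing in for one possible-power column (made-up values, kW).
toy_power <- c(817, 732, -3, NA, 850, 0, 518)

# Hypothetical helper: count missing and out-of-range readings.
# Assumption: valid possible power lies in [0, rated], with rated = 850 kW.
check_power <- function(x, rated = 850) {
  c(n_missing  = sum(is.na(x)),
    n_negative = sum(x < 0, na.rm = TRUE),
    n_above    = sum(x > rated, na.rm = TRUE))
}

res <- check_power(toy_power)
res
```

Applied to the real data, something like `sapply(Possible[-1], check_power)` would give one column of counts per turbine.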