Numeric prediction:
1. Load the data
> library(ggplot2)
> library(dplyr)
> data(diamonds)
> diamonds.data<-select(diamonds,carat:clarity,price)
> summary(diamonds.data)
     carat               cut        color        clarity          price
 Min.   :0.2000   Fair     : 1610   D: 6775   SI1    :13065   Min.   :  326
 1st Qu.:0.4000   Good     : 4906   E: 9797   VS2    :12258   1st Qu.:  950
 Median :0.7000   Very Good:12082   F: 9542   SI2    : 9194   Median : 2401
 Mean   :0.7979   Premium  :13791   G:11292   VS1    : 8171   Mean   : 3933
 3rd Qu.:1.0400   Ideal    :21551   H: 8304   VVS2   : 5066   3rd Qu.: 5324
 Max.   :5.0100                     I: 5422   VVS1   : 3655   Max.   :18823
                                    J: 2808   (Other): 2531
> dim(diamonds.data)
[1] 53940 5
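The column selection in step 1 relies on the `carat:clarity` range syntax of `select()`, which picks the contiguous columns from `carat` through `clarity` before `price` is added by name. A minimal sketch, assuming `ggplot2` and `dplyr` are installed; the names `by_range` and `by_name` are illustrative and do not appear in the original session:

```r
library(ggplot2)  # provides the diamonds data set
library(dplyr)    # provides select()

data(diamonds)

# carat:clarity selects the contiguous columns carat, cut, color, clarity;
# price is then added by name.
by_range <- select(diamonds, carat:clarity, price)

# The same selection with every column named explicitly.
by_name <- select(diamonds, carat, cut, color, clarity, price)

# Both calls should keep the same five columns in the same order.
print(names(by_range))
print(identical(names(by_range), names(by_name)))
```

Naming the columns explicitly is longer but does not depend on the column order of the source data frame.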
2. Randomly split the data
> set.seed(1234) # fix the random seed so that every run gives the same result
> samp2<-sample