1.整体性统计描述 summary():输出变量的最大值、最小值、四分位数、均值 rm(list = ls()) mtcars summary(mtcars[,3:7]) apply()/sapply():输出变量的自定义描述量 state = function(x){ x = x[!is.na(x)] m = mean(x) n = length(x) s = sd(x) skew = sum((x-m)^3/s^3)/n kurt = sum((x-m)^4/s^4)/n-3 return(c(n = n,mean = m,s = s, skew = skew, kurt = kurt)) } apply(mtcars[,3:7], 2, state) sapply(mtcars[,3:7], state) describe(): 比较常用的统计包,描述量很多 library(psych) describe(mtcars[,3:7]) 2.分组计算描述性统计量 describeBy():输出一个/多个类别型变量分组后describle描述量 library(psych) describeBy(mtcars$mpg,list(mtcars$cyl,mtcars$am)) summaryBy():输出一个/多个类别型变量分组后的单个/多个描述量 library(doBy) head(mtcars) data_describe = summaryBy( mpg+hp+wt~cyl+am, mtcars,FUN = state) # 公式左边是连续变量,右边是类别变量,FUN参数必须存在