【w3】split()、interaction()

最新推荐文章于 2023-07-07 17:53:40 发布

lalalalayahuhei

最新推荐文章于 2023-07-07 17:53:40 发布

阅读量100

点赞数

分类专栏： R

本文链接：https://blog.csdn.net/lalalalayahuhei/article/details/105216027

版权

R 专栏收录该内容

20 篇文章 0 订阅

订阅专栏

split()

按因子分组

> str(split)
function (x, f, drop = FALSE, ...

X: 向量、列表、dataframe
f：因子
drop：去除 empty factors level

split a dataframe

> library(datasets)
> head(airquality)
 Ozone Solar.R Wind Temp Month Day
1 41 190 7.4 67 5 1
2 36 118 8.0 72 5 2
3 12 149 12.6 74 5 3
4 18 313 11.5 62 5 4
5 NA NA 14.3 56 5 5
6 28 NA 14.9 66 5 6

> s <- split(airquality, airquality$Month)   // 按月分组
> lapply(s, function(x) colMeans(x[, c("Ozone", "Solar.R", "Wind")]))
$‘5‘
 Ozone Solar.R Wind
 NA NA 11.62258         // 当列中有NA时，mean结果为NA
$‘6‘
 Ozone Solar.R Wind
 NA 190.16667 10.26667
$‘7‘
 Ozone Solar.R Wind
 NA 216.483871 8.941935

使用sapply，结果

> sapply(s, function(x) colMeans(x[, c("Ozone", "Solar.R", "Wind")]))
 5 6 7 8 9
Ozone NA NA NA NA NA
Solar.R NA 190.16667 216.483871 NA 167.4333
Wind 11.62258 10.26667 8.941935 8.793548 10.1800
> sapply(s, function(x) colMeans(x[, c("Ozone", "Solar.R", "Wind")],
 na.rm = TRUE))    // 用na.rm去除NA
 5 6 7 8 9
Ozone 23.61538 29.44444 59.115385 59.961538 31.44828
Solar.R 181.29630 190.16667 216.483871 171.857143 167.43333
Wind 11.62258 10.26667 8.941935 8.793548 10.18000

用interaction（）生成交叉因子

> x <- rnorm(10)
> f1 <- gl(2, 5)
> f2 <- gl(5, 2)
> f1
 [1] 1 1 1 1 1 2 2 2 2 2
Levels: 1 2
> f2
 [1] 1 1 2 2 3 3 4 4 5 5
Levels: 1 2 3 4 5
> interaction(f1, f2)      // 生成交叉因子
 [1] 1.1 1.1 1.2 1.2 1.3 2.3 2.4 2.4 2.5 2.5
10 Levels: 1.1 2.1 1.2 2.2 1.3 2.3 1.4 ... 2.5

交叉可能会生成空level

> str(split(x, list(f1, f2)))    // list 和interaction同样效果
List of 10
 $ 1.1: num [1:2] -0.378 0.445
 $ 2.1: num(0)
 $ 1.2: num [1:2] 1.4066 0.0166
 $ 2.2: num(0)
 $ 1.3: num -0.355
 $ 2.3: num 0.315
 $ 1.4: num(0)
 $ 2.4: num [1:2] -0.907 0.723
 $ 1.5: num(0)
 $ 2.5: num [1:2] 0.732 0.360

空level可以用drop去除

> str(split(x, list(f1, f2), drop = TRUE))
List of 6
$ 1.1: num [1:2] -0.378 0.445
$ 1.2: num [1:2] 1.4066 0.0166
$ 1.3: num -0.355
$ 2.3: num 0.315
$ 2.4: num [1:2] -0.907 0.723
$ 2.5: num [1:2] 0.732 0.360

lalalalayahuhei

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
【w3】split()、interaction()

split()按因子分组> str(split)function (x, f, drop = FALSE, ...X: 向量、列表、dataframef：因子drop：去除 empty factors levelsplit a dataframe> library(datasets)> head(airquality) Ozone Solar.R Wind ...
复制链接

扫一扫