平时的学习笔记,如有文章侵权,请告知,我会立刻删除
——————————————————————————
一、创建数据框
首先准备好数据框
> patientID=c(1,2,3,4,5,6,7,8)
> age=c(25,34,28,62,33,26,37,47)
> diabetes=c("Type1","Type2","Type1","Type1","Type1","Type2","Type2","Type1")
> status=c("Poor","Improved","Excellent","Poor","Improved","Excellent","Poor","Poor")
> patientdata=data.frame(patientID,age,diabetes,status)
> patientdata
二、数据框的操作
年龄从小到大排序
> patientdata1=patientdata[order(patientdata$age),]
> patientdata1
年龄分组
方法一
> patientdata2= within(patientdata1,{
+ age[age> 55]="老年"
+ age[age>= 31& age<= 55]="中年"
+ age[age< 30]="青年"
+ })
> patientdata2
方法二
另一种写法并产生新的分组年龄
> patientdata1$agecat[patientdata1$age> 55]="Elder"
> patientdata1$agecat[patientdata1$age>= 31&
+ patientdata1$age<= 55]="Middle Aged"
> patientdata1$agecat[patientdata1$age< 55]="yong"
> patientdata1
取指定特点的数据
> rdata=patientdata2[patientdata2$status=="Poor",]
> rdata
参考文献:
[1] 《数据统计分析及R语言编程》,王斌会编著.一广州:暨南大学出版社, 2014. 8