1. Data formats in R
1.1 vector
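The vector is R's basic data format. A vector stores a collection of values that all have the same type, and in principle it can be arbitrarily long.
Vectors are classified by the type of data they store. The four common types are:
integer: whole numbers (no fractional part); numeric: numbers that may have a fractional part; character: text data; logical: TRUE/FALSE values. Two special values also appear: NULL, which means there is no data at all, and NA, which marks a missing value.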
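As a minimal sketch of the types listed above, the type of a vector can be checked with class(). The variable names count, mixed and with_na are made up for illustration.

```r
# Illustrative vectors, one per basic type (names are hypothetical)
count = c(1L, 2L, 3L)          # the L suffix creates integer values
temperature = c(35.2, 34.3)    # numeric
name = c("John", "abner")      # character
status = c(TRUE, FALSE)        # logical

class(count)        # "integer"
class(temperature)  # "numeric"
class(name)         # "character"
class(status)       # "logical"

# A vector holds one type only; mixed input is coerced to a common type
mixed = c(1, "a", TRUE)
class(mixed)        # "character"

# NA marks a missing element and keeps its position; NULL has length 0
with_na = c(35.2, NA, 36.8)
length(with_na)     # 3
is.na(with_na)      # FALSE TRUE FALSE
length(NULL)        # 0
```

Because mixed input is silently coerced, checking class() after building a vector is a cheap way to catch unintended type changes.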
#---------------------Create Vector----------------------
name = c("John", "abner", "young")
temperature = c(35.2, 34.3, 36.8)
status = c(TRUE, FALSE, TRUE)
#---------------------Operations on Vector-----------------------
temperature[2]
#[1] 34.3
temperature[2:3]
#[1] 34.3 36.8
temperature[-2]
#[1] 35.2 36.8   a negative index excludes that position: the 2nd item is left out of the result.
temperature[-1]
#[1] 34.3 36.8   a negative index excludes that position: the 1st item is left out of the result.
temperature[c(TRUE, FALSE, TRUE)]
#[1] 35.2 36.8   a logical index keeps the items marked TRUE: the 2nd item is left out of the result.
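The indexing operations above return a new vector and leave the original untouched. The sketch below illustrates this, and also shows a logical index built from a comparison. The name without_second and the value 34.5 are illustrative only.

```r
temperature = c(35.2, 34.3, 36.8)

# Indexing returns a new vector; the original is left unchanged
without_second = temperature[-2]
length(temperature)      # 3
length(without_second)   # 2

# A logical index is usually produced by a comparison
temperature[temperature > 35]   # 35.2 36.8

# To actually change the vector, assign the result back
temperature[2] = 34.5           # overwrite the 2nd item
temperature = temperature[-1]   # drop the 1st item for good
temperature                     # 34.5 36.8
```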
1.2 factor
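Data processing often involves values such as gender or blood type. Such data describe a characteristic with a category name rather than a number, and R provides the factor type to represent them.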
#--------------------Create Factor------------------------------
gender = factor(c("MALE", "FEMALE", "MALE"))
#[1] MALE FEMALE MALE
#Levels: FEMALE MALE
bleed = factor(c("O","A","AB"), levels = c("O","A","B","AB"))
#[1] O A AB
#Levels: O A B AB
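A factor stores each value as a position in its list of levels. The following sketch, which reuses the two factors created above, inspects the levels with levels(), nlevels(), as.integer() and table().

```r
bleed = factor(c("O", "A", "AB"), levels = c("O", "A", "B", "AB"))

levels(bleed)       # "O" "A" "B" "AB"
nlevels(bleed)      # 4
as.integer(bleed)   # 1 2 4  (position of each value in the levels)
table(bleed)        # counts per level; B is listed with a count of 0

# Without an explicit levels argument, the levels are the sorted unique values
gender = factor(c("MALE", "FEMALE", "MALE"))
levels(gender)      # "FEMALE" "MALE"
```

Declaring the levels explicitly, as done for bleed, keeps categories that do not occur in the data (here B) visible in later summaries.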
#--------------------Operations on Factor-------------------------------
gender[2]
#[1] FEMALE
#Levels: FEMALE MALE
gender[-2]
#[1] MALE MALE
#Levels: FEMALE MALE
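Subsetting a factor keeps all of its levels, including those that no longer occur in the result. If that is not wanted, one option is droplevels(), sketched below. The variable name males is illustrative.

```r
gender = factor(c("MALE", "FEMALE", "MALE"))

males = gender[-2]
levels(males)               # "FEMALE" "MALE"  (the unused level is kept)

# droplevels() removes the levels that have no remaining values
males = droplevels(males)
levels(males)               # "MALE"
```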