以下是导入:
- txt 格式导入:
data<-read.table("C:\\Users\\Administrator\\Desktop\\myfile.txt",header=F)#TXT读入
- CSV 格式导入:
data<-read.csv("C:\\Users\\Administrator\\Desktop\\myfile.csv") #CSV数据读入
- xlsx 格式导入
library("xlsx")
data<-read.xlsx("C:\\Users\\Administrator\\Desktop\\myfile.xlsx",sheetName="file",header=F,encoding='UTF-8')
- 剪切板 直接复制(容易出错):
data <- read.table("clipboard", header = T, sep = '\t')#直接复制
以下是导出
- csv:
write.table(mydata2, file ="C:\\Users\\Administrator\\Desktop\\newdata.csv", sep =",", row.names =FALSE)
- txt
write.table(mydata2,file="C:\\Users\\Administrator\\Desktop\\newdata.txt" , sep =" ", row.names =FALSE,col.names =TRUE, quote =FALSE)
以下为数据连接
- merge 的用法:
- 语法:
merge(x, y, by = , by.x = , by.y = , all = , all.x = , all.y = , sort = , suffixes = , incomparables = , ...)
- 语法:
- join函数 (plyr包)
- 语法:
join(x, y, by = NULL, type = "left", match = "all")
- 语法:
- inner_join/full_join/left_join/right_join 函数 (dplyr包):
- inner_join(x, y) :只包含同时出现在x,y表中的行
- left_join(x, y) :包含所有x中以及y中匹配的行
- right_join(x, y,by=c(“Name”=”name”)) :包含所有y中以及x中匹配的行
- full_join(x,y,by=c(“Name”=”name”)) :包含所以x、y中的行
- semi_join(x, y) :包含x中,在y中有匹配的行,结果为x的子集
- anti_join(x, y) :包含x中,不匹配y的行,结果为x的子集,与semi_join相反