1. 从文件读入data frame的方法之一:
mydata <- read.table("C:\\Users\\abc\\Desktop\\data.csv",sep=",",colClasses=c("character","character","character","character"))
2. data frame的截取,如截取某几列:
mydata <- mydata[,1:2] #表示截取第1至2列
3. character准换为日期类型POSIXlt:
as.POSIXlt(x, tz = "", format, ...) #x是日期字符串,为比选参数,其他参数可选
4. 日期运算:
difftime(time1, time2, tz,units = c("auto", "secs", "mins", "hours","days", "weeks")) #tz,units为可选参数
</pre><p>5. 聚合运算:</p><p><span style="white-space:pre"></span><pre name="code" class="python">aggregate(x, ...)
#如
aggregate(x$a,by=list(id = x$id),sum) #表示按x的id列分类,计算x的a列之和
6. 连接两个data frame:
如果data frame x和y都有一列“ref”,可以这么写
merge(x,y,by="ref")
7. data frame列重命名:
names(myframe) <- c("a","b","c")
8. 绘图:
散点图加线型回归
attach(data)<span style="white-space:pre"> </span>#data是一个data frame
plot(x,y)<span style="white-space:pre"> </span>#x, y是data的两列
abline(lm(y~x))
title("Regression of y on x")
detach(data)