Reading data in R: data <- read.csv (CSDN blog)

Article link: https://blog.csdn.net/qq_41198783/article/details/78652002

I. Read by full path
data <- read.csv('C:/Users/Administrator/Desktop/xx.csv')
II. Set the working directory first (recommended)
setwd("C:/Users/Administrator/Desktop") # set the working directory directly; in RStudio: Files > More > Set As Working Directory
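As a minimal sketch of why setting the working directory helps, the block below switches to a directory, reads a file by its bare name, and then restores the previous directory. The temporary directory and the sample file contents are assumptions made so that the example runs anywhere; they stand in for the Desktop folder and xx.csv used above.

```r
# Hypothetical stand-in for the Desktop folder: a temporary directory
work_dir <- tempdir()

# Create a small sample file so the example is self-contained
write.csv(data.frame(id = 1:3, value = c(10, 20, 30)),
          file.path(work_dir, "xx.csv"), row.names = FALSE)

old_dir <- setwd(work_dir)   # setwd() returns the previous working directory
data <- read.csv("xx.csv")   # the relative path is resolved against work_dir
setwd(old_dir)               # restore the original working directory

print(data)
```

Keeping the value returned by setwd() makes it easy to switch back afterwards, so later relative paths are not affected.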

1. CSV text format (comma-separated file):
data <- read.csv("xx.csv", header = FALSE, fileEncoding = "UTF-8") # header = FALSE: the file has no header row
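A short sketch of what header = FALSE does in practice: the first line is read as data and R assigns default column names, which can then be replaced. The file contents and the column names id, fruit and price are made up for illustration.

```r
# Write a small headerless CSV to a temporary file (sample data is hypothetical)
csv_path <- tempfile(fileext = ".csv")
writeLines(c("1,apple,3.5", "2,pear,4.0"), csv_path)

# header = FALSE: the first line is data, not column names
data <- read.csv(csv_path, header = FALSE, fileEncoding = "UTF-8")

default_names <- names(data)   # R assigns V1, V2, V3 when there is no header
print(default_names)

# Give the columns meaningful names afterwards
colnames(data) <- c("id", "fruit", "price")
print(data)
```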

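2. TXT text format:
data <- read.table("xx.txt", quote = "", header = TRUE)
# Note: quote = "" turns off quote handling, so quotation marks in the file are read as ordinary characters. The quote argument expects a character string, not FALSE.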
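The sketch below illustrates when disabling quoting matters: a whitespace-separated file whose fields contain apostrophes. With read.table's default quote set, an apostrophe is treated as a quote character; with quote = "" it is kept as plain text. The file contents are invented for the example.

```r
# Sample whitespace-separated file with apostrophes inside fields (hypothetical data)
txt_path <- tempfile(fileext = ".txt")
writeLines(c("name size",
             "O'Brien 5'10",
             "Smith 6'1"), txt_path)

# quote = "" disables quote handling, so ' is read as an ordinary character
data <- read.table(txt_path, quote = "", header = TRUE,
                   stringsAsFactors = FALSE)
print(data)
```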

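3. Connecting to a MySQL database: http://blog.csdn.net/cao812755156/article/details/48737337
library(RODBC) # load the RODBC package
channel <- odbcConnect("bluemoon", uid="paidan", pwd="dan.690.Zjk",DBMSencoding="UTF8") # data source name, user, password; DBMSencoding guards against garbled Chinese text, and the character set under Details in the ODBC data source must also be set to UTF8
sqlTables(channel) # list the tables
data <- sqlFetch(channel, "table_name") # read all data from a table ("table_name" is a placeholder)
sqlQuery(channel, "select * from table_name") # run an SQL query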

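III. Reading irregular text formats:
1. Encoding: make sure the file is read with the correct encoding, using fileEncoding = "UTF-8" (preferred) or encoding = "UTF-8".
2. Header row: header = T (TRUE) means the first row holds the column names; this is the default for read.csv. Change it to F (FALSE) when the file has no header row, so that the first row is read as data.
3. Separator: sep = "\t" (\t is the tab character); other single characters can be used as well.
4. Missing values: NA denotes a missing value; na.strings lists the strings that should be read as NA; NaN, Inf and -Inf are accepted in numeric columns.
5. Missing trailing fields: fill = TRUE pads rows that have fewer fields than the others.
6. Blank lines: they are skipped by default; use blank.lines.skip = FALSE together with fill = TRUE to keep them.
7. Variable types: as.is suppresses the conversion of character vectors to factors (and does nothing else); colClasses sets the class of each column in the input. Both are specified per column, not per variable, so they also apply to the row-names column, if there is one.
8. Comments: in read.table, everything after the comment character # on a line is ignored, and a line holding only a comment is treated as blank. To read # as data, use comment.char = "" (this can also speed up reading).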
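Several of the options listed above can be combined in one call. The following is a sketch on a made-up tab-separated file in which "-" marks a missing score, one field contains a # character, and the last row is one field short; the file contents and column names are assumptions for illustration only.

```r
# Build an irregular tab-separated sample file (hypothetical data)
tsv_path <- tempfile(fileext = ".txt")
writeLines(c("id\tname\tscore\tnote",
             "1\tAnn\t90\tok",
             "2\tBob\t-\t#1 seed",    # "-" marks a missing score; note contains '#'
             "3\tCid\t85"),           # last row lacks the trailing 'note' field
           tsv_path)

data <- read.table(tsv_path,
                   header = TRUE,               # first row holds column names
                   sep = "\t",                  # tab-separated fields
                   na.strings = c("NA", "-"),   # strings to read as NA
                   fill = TRUE,                 # pad rows that have fewer fields
                   comment.char = "",           # treat '#' as ordinary data
                   quote = "",                  # no quote handling
                   as.is = TRUE,                # keep character columns as character
                   fileEncoding = "UTF-8")

str(data)
print(is.na(data$score))   # the "-" entry is read as a missing value
```

Without comment.char = "", read.table would cut the second data row off at the # character, so the note for Bob would be lost.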

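IV. Basic operations

View(data) # view the data

# Show the first 10 rows of the data frame
head(data,n=10)

# Show the last 10 rows of the data frame
tail(data,n=10)

# Show the loan_amnt column of data
data$loan_amnt

# Check the data type of the loan amount column
typeof(data$loan_amnt)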

# Convert the loan amount column to integer: as.integer changes the column from its previous double type to integer.
data$loan_amnt <- as.integer(data$loan_amnt)
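A small sketch of the conversion on made-up loan amounts (the values are hypothetical). One point worth noting is that as.integer drops the fractional part rather than rounding.

```r
# Hypothetical loan amounts stored as double
data <- data.frame(loan_amnt = c(5000, 2500.75, 12000))

type_before <- typeof(data$loan_amnt)          # numeric columns are double by default
data$loan_amnt <- as.integer(data$loan_amnt)   # fractional parts are truncated toward zero
type_after <- typeof(data$loan_amnt)

print(c(before = type_before, after = type_after))
print(data$loan_amnt)
```

If rounding is wanted instead of truncation, round() can be applied before as.integer().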

Import a .csv file from the desktop into the data frame myframe
file <- "C:/Users/Administrator/Desktop/TEST.csv"
myframe <- read.csv(file, header = TRUE, sep = ",")
myframe[1:10,] # show the first 10 rows of the imported data
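Row indexing and head() behave differently when the data frame has fewer than 10 rows. This sketch uses a hypothetical three-row data frame, small_frame, to show the difference.

```r
# Hypothetical data frame with only three rows
small_frame <- data.frame(id = 1:3, amount = c(10, 20, 30))

by_index <- small_frame[1:10, ]       # rows 4 to 10 do not exist and come back as NA rows
by_head  <- head(small_frame, n = 10) # head() returns only the rows that exist

print(nrow(by_index))
print(nrow(by_head))
```

For a quick look at imported data, head() is therefore the safer choice when the number of rows is not known in advance.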