文章目录
dataframe
-
heterogeneous data structure
-
contains elements of different classes
-
2 dimensional arrangement
players.name=c("KD","Curry","Klay","Green") players.number=c(35,30,11,23) players.2K=c(87,96,91,85) players.gender=factor(c("male","male","male","male"),levels = c("male","female"))
data.frame() to create a dataframe
players=data.frame(players.name,players.number,players.gender,players.2K)
str(players)
notice the string type players.names are converted to factor
However we can fix it by adding parameter stringsAsFactors=False
make sure not typing as stringAsFactors
players=data.frame(players.name,players.number,players.gender,players.2K,stringsAsFactors=FALSE)
str(players)
players
data frame is like a spreadsheet with row numbers and column names
common operation on data frame
subsetting: extract element(s)
4 ways
use index numbers to extract elements
single bracket to return elements of the same type
players[2]
#single number in the brackets means column number
typeof(players[2])
data frame is basically a list
double bracket to return elements of its own type
players[[2]]
typeof(players[[2]])
2 numbers in the bracket means row and column number
#for example:below means extract the element at row1 and column2
players[1,2]