from pandas import Series, DataFrame
import pandas as pd
1. Series
If no index is specified, pandas automatically uses the integers 0 to N-1 as the index.
obj = Series([1, 2, 4, 0])
obj.values
obj.index
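A minimal sketch of the default index described above, using the same values as obj. It only inspects the two attributes already mentioned (values and index).

```python
import pandas as pd

# Series built without an explicit index
obj = pd.Series([1, 2, 4, 0])

values = obj.values   # underlying NumPy array holding the data
index = obj.index     # default integer index running from 0 to N-1

print(list(values))
print(list(index))
```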
The index labels every single data point, and it can be set explicitly
obj2 = Series([1, 2, 3], index=['1st', '2nd', '3rd'])
Get data by index
obj2['1st']
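As a small illustration of label-based lookup, the sketch below rebuilds obj2 and reads from it by label. Selecting with a list of labels is an extra shown here for comparison; the notes above do not cover it.

```python
import pandas as pd

obj2 = pd.Series([1, 2, 3], index=['1st', '2nd', '3rd'])

# A single label returns the scalar stored under that label
first = obj2['1st']

# A list of labels returns a new Series, in the order requested
subset = obj2[['3rd', '1st']]

print(first)
print(subset)
```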
A Series can be created directly from a Python dict; the keys become the index
dicts = {'1st': 1, '2nd':2, '3rd':3}
obj3 = Series(dicts)
NaN marks a missing value (NA). The functions below check for missing values
pd.isnull(obj3)
pd.notnull(obj3)
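One way to see these checks in action is to build a Series from a dict while asking for a label the dict does not contain. The label '4th' and the name obj4 are made up for this sketch.

```python
import pandas as pd

dicts = {'1st': 1, '2nd': 2, '3rd': 3}

# '4th' is a hypothetical label with no matching key in dicts,
# so pandas fills that position with NaN
obj4 = pd.Series(dicts, index=['1st', '2nd', '3rd', '4th'])

missing = pd.isnull(obj4)    # boolean Series, True where the value is NaN
present = pd.notnull(obj4)   # the element-wise opposite

print(missing)
```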
Both a Series and its index have a name attribute
obj.name = value
obj.index.name = value
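A short sketch of setting both names. The strings 'score' and 'rank' are placeholder values chosen only for illustration.

```python
import pandas as pd

obj2 = pd.Series([1, 2, 3], index=['1st', '2nd', '3rd'])

# Both names are hypothetical labels picked for this example
obj2.name = 'score'
obj2.index.name = 'rank'

# The names appear when the Series is printed
print(obj2)
```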
2. DataFrame
A DataFrame is a table-like structure holding an ordered collection of columns. Each column can have a different type. A dict of equal-length lists or ndarrays can be turned into a DataFrame directly
In [1]: data = {'1st': ['he', 'is', 'a', 'jerk'],
   ...:         '2nd': ['ye', 'I', 'Think', 'so']}
In [2]: frame = DataFrame(data)
In [3]: frame
Out[3]:
    1st    2nd
0    he     ye
1    is      I
2     a  Think
3  jerk     so
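To illustrate that columns may hold different types, the sketch below adds a numeric column to the same dict. The 'length' column is an assumption added for this example and is not part of the original data.

```python
import pandas as pd

data = {'1st': ['he', 'is', 'a', 'jerk'],
        '2nd': ['ye', 'I', 'Think', 'so'],
        'length': [2, 2, 1, 4]}   # hypothetical numeric column

frame = pd.DataFrame(data)

print(frame.shape)    # (rows, columns)
print(frame.dtypes)   # one dtype per column
```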
To control the column order, pass columns=[...] explicitly
frame2 = DataFrame(data, columns=['2nd', '1st'])
As with Series, if a requested column does not exist in the input data, it is filled with NA
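Both points, explicit column order and missing columns, can be sketched together. The column name '3rd' is hypothetical and deliberately absent from data.

```python
import pandas as pd

data = {'1st': ['he', 'is', 'a', 'jerk'],
        '2nd': ['ye', 'I', 'Think', 'so']}

# Columns come out in the order requested; '3rd' is a hypothetical
# column that data does not contain, so it is filled with NaN
frame2 = pd.DataFrame(data, columns=['2nd', '1st', '3rd'])

print(frame2)
```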
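A column can be retrieved either with dict-style indexing or as an attribute. Attribute access only works when the column name is a valid Python identifier that does not clash with an existing DataFrame method (for example, frame.var is the variance method, not a column)
frame['col']
frame.col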
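A rough comparison of the two access styles, using made-up column names ('word' and 'count'). The 'count' column was picked on purpose because it collides with the DataFrame.count method.

```python
import pandas as pd

# Hypothetical frame; column names are chosen for illustration only
frame = pd.DataFrame({'word': ['he', 'is'], 'count': [1, 2]})

by_key = frame['word']   # dict-style access always returns the column
by_attr = frame.word     # attribute access works for this name

# 'count' is also a DataFrame method, so the attribute is the method,
# and only dict-style access reaches the column
count_attr = frame.count
count_col = frame['count']

print(by_key.equals(by_attr))
print(callable(count_attr))
```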