Pandas has two workhorse data structures: Series and DataFrame.
Series
A Series is a one-dimensional array-like object containing an array of data (of any NumPy data type) and an associated array of data labels, called its index.
#1.create series
obj1 = pd.Series([4, 7, -5, 3])
obj2 = pd.Series([4, 7, -5, 3], index = ['a', 'b', 'c', 'd'])
sdata = {'Ohio': 35000, 'Texas': 71000, 'Oregon': 16000, 'Utah': 5000}
obj3 = pd.Series(sdata)
#2.select
obj2.values
obj2.index
obj2['a']
#3.operations
obj2[obj2 > 0]
np.exp(obj2)
data = pd.Series([1., -999., 2., -999., -1000., 3.])
data.replace(-999, np.nan) #两个-999被换成NAN
DataFrame
#1. create dataframe
data = {'state': ['Ohio', 'Ohio', 'Ohio', 'Nevada', 'Nevada'],
'y