1. 删除某一列
del df[column_name]
Code Example:
# -*- coding: utf-8 -*-
import numpy as np
from pandas import DataFrame
__author__ = 'Brown'
data = np.arange(10).reshape(5, 2)
df = DataFrame(data, columns=['c1', 'c2'])
print '删除前:'
print df
del df['c1']
print '\n删除后:'
print df
Output:
删除前:
c1 c2
0 0 1
1 2 3
2 4 5
3 6 7
4 8 9
删除后:
c2
0 1
1 3
2 5
3 7
4 9
2. 增加某一列
df[new_column_name] = iterator
Code Example:
# -*- coding: utf-8 -*-
import numpy as np
from pandas import DataFrame
__author__ = 'Brown'
data = np.arange(10).reshape(5, 2)
df = DataFrame(data, columns=['c1', 'c2'])
print '增加前:'
print df
df['c3'] = [1, 2, 3, 4, 5]
print '\n增加后:'
print df
Output:
增加前:
c1 c2
0 0 1
1 2 3
2 4 5
3 6 7
4 8 9
增加后:
c1 c2 c3
0 0 1 1
1 2 3 2
2 4 5 3
3 6 7 4
4 8 9 5
3. 转化成Dict结构
将dataframe二维表结构中每一行转化为dict结构
df.to_dict(orient='record')
Code Example:
# -*- coding: utf-8 -*-
import numpy as np
from pandas import DataFrame
__author__ = 'Brown'
data = np.arange(10).reshape(5, 2)
df = DataFrame(data, columns=['c1', 'c2'])
print df.to_dict(orient='record')
Output:
[{'c2': 1, 'c1': 0}, {'c2': 3, 'c1': 2}, {'c2': 5, 'c1': 4}, {'c2': 7, 'c1': 6}, {'c2': 9, 'c1': 8}]
4. 导出到csv
df.to_csv('test.csv',encoding='utf-8')
这样导出的csv如果用excel打开会乱码,因为excel只识别带BOM的utf8文件,所以你可以:
df.to_csv('test.csv',encoding='utf_8_sig')