#每天一点点#
python pandas 设置值
6行4列,以日期为行序,A,B,C,D为列序的df
import numpy as np
import pandas as pd
dates = pd.date_range('20130101',periods=6)
df = pd.DataFrame(np.arange(24).reshape((6,4)),index=dates,columns=['A','B','C','D'])
输出结果:
A B C D
2013-01-01 0 1 2 3
2013-01-02 4 5 6 7
2013-01-03 8 9 10 11
2013-01-04 12 13 14 15
2013-01-05 16 17 18 19
2013-01-06 20 21 22 23
1:修改某个值
将位置为[2,2]的值修改为1111
df.iloc[2,2] = 1111
将修改后的df打印出来,输出结果
A B C D
2013-01-01 0 1 2 3
2013-01-02 4 5 6 7
2013-01-03 8 9 1111 11
2013-01-04 12 13 14 15
2013-01-05 16 17 18 19
2013-01-06 20 21 22 23
2:loc方法修改
df.loc['20130101','B'] =2222
将修改后的df打印出来,输出结果
A B C D
2013-01-01 0 2222 2 3
2013-01-02 4 5 6 7
2013-01-03 8 9 1111 11
2013-01-04 12 13 14 15
2013-01-05 16 17 18 19
2013-01-06 20 21 22 23
3:筛选条件修改数据
3.1 将A列大于4的数字,都修改为0
df[df.A>4] = 0
将修改后的df打印出来,输出结果
A B C D
2013-01-01 0 2222 2 3
2013-01-02 4 5 6 7
2013-01-03 0 0 0 0
2013-01-04 0 0 0 0
2013-01-05 0 0 0 0
2013-01-06 0 0 0 0
3.2 B列的A列值大于4的,修改为0
df.B[df.A>4] = 0
将修改后的df打印出来,输出结果
A B C D
2013-01-01 0 1 2 3
2013-01-02 4 5 6 7
2013-01-03 8 0 10 11
2013-01-04 12 0 14 15
2013-01-05 16 0 18 19
2013-01-06 20 0 22 23
4:增加列,增加F列,且该列的值为nan
df['F'] = np.nan
将修改后的df打印出来,输出结果
A B C D F
2013-01-01 0 1 2 3 NaN
2013-01-02 4 5 6 7 NaN
2013-01-03 8 0 10 11 NaN
2013-01-04 12 0 14 15 NaN
2013-01-05 16 0 18 19 NaN
2013-01-06 20 0 22 23 NaN
5:将定义好的列,添加至已有df
注意:再定义的列,一定要与原有df的列相同,才能对应起来
df['E'] = pd.Series([1,2,3,4,5,6],index=pd.date_range('20130101',periods=6))
将修改后的df打印出来,输出结果
A B C D F E
2013-01-01 0 1 2 3 NaN 1
2013-01-02 4 5 6 7 NaN 2
2013-01-03 8 0 10 11 NaN 3
2013-01-04 12 0 14 15 NaN 4
2013-01-05 16 0 18 19 NaN 5
2013-01-06 20 0 22 23 NaN 6