pandas: Data Selection and Filtering, String Extraction and Manipulation, Plotting

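1. pandas: data selection and filtering:
Introduces what the pandas library is for and how to install it, then covers selecting and filtering data.

2. pandas: string extraction and manipulation:
This lesson covers extracting and otherwise manipulating string data with pandas.

3. pandas: scatter plots:
This lesson covers drawing scatter plots and trajectories with pandas, which can also be used for plotting.

4. pandas: histograms:
This lesson covers drawing histograms, bar charts, and box plots with pandas.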

In [278]:
'''

Section 1:
#pandas: data selection and filtering ************************************

'''
Out[278]:
'\n\nSection 1:\n#pandas: data selection and filtering ************************************\n\n'
In [3]:
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
dates = pd.date_range('20160715', periods=6)
dates
Out[3]:
DatetimeIndex(['2016-07-15', '2016-07-16', '2016-07-17', '2016-07-18',
               '2016-07-19', '2016-07-20'],
              dtype='datetime64[ns]', freq='D', tz=None)
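As a small aside on pd.date_range, the sketch below suggests that the same six-day index can be described either by a start date plus a count or by explicit start and end dates. The variable names are illustrative and do not come from the original notebook.

```python
import pandas as pd

# Six consecutive days, given a start date and a number of periods.
by_periods = pd.date_range('20160715', periods=6)

# The same range, given explicit start and end dates (both inclusive).
by_bounds = pd.date_range('2016-07-15', '2016-07-20')

# Daily frequency is the default; other steps are requested through freq.
every_other_day = pd.date_range('20160715', periods=3, freq='2D')

print(by_periods.equals(by_bounds))
print(list(every_other_day.strftime('%Y-%m-%d')))
```

Passing periods is convenient when the length of the table is known in advance, as it is for the six-row frame built next.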
In [115]:
df = pd.DataFrame(np.random.rand(6, 4), index=dates, columns=list('ABCD'))
df
Out[115]:
                   A         B         C         D
2016-07-15  0.257509  0.264885  0.540292  0.485975
2016-07-16  0.629827  0.079777  0.338386  0.187553
2016-07-17  0.375727  0.700579  0.384695  0.909140
2016-07-18  0.418069  0.308024  0.451242  0.758287
2016-07-19  0.114625  0.397367  0.888026  0.358038
2016-07-20  0.454836  0.182236  0.158715  0.002074
In [12]:
df2 = pd.DataFrame({'A': np.random.randn(6)})  # define a DataFrame of random numbers
df2
Out[12]:
          A
0 -2.058364
1  0.817129
2  1.630002
3  0.673549
4  0.416836
5  0.033933
In [22]:
df['A']  # select column A
Out[22]:
2016-07-15    0.736079
2016-07-16    0.183782
2016-07-17    0.198436
2016-07-18    0.865754
2016-07-19    0.095199
2016-07-20    0.731607
Freq: D, Name: A, dtype: float64
In [25]:
df[1:3]  # select the rows at positions 1 and 2 (the end of the slice is excluded)
Out[25]:
                   A         B         C         D
2016-07-16  0.183782  0.740787  0.589655  0.167018
2016-07-17  0.198436  0.452805  0.275851  0.119994
In [27]:
df['20160715':'20160718']  # slice the time series by date labels (both endpoints are included)
Out[27]:
                   A         B         C         D
2016-07-15  0.736079  0.959340  0.830599  0.627481
2016-07-16  0.183782  0.740787  0.589655  0.167018
2016-07-17  0.198436  0.452805  0.275851  0.119994
2016-07-18  0.865754  0.584943  0.381434  0.966995
In [30]:
df.loc['20160715':'20160718', ['A', 'B']]  # .loc slices by label
Out[30]:
                   A         B
2016-07-15  0.736079  0.959340
2016-07-16  0.183782  0.740787
2016-07-17  0.198436  0.452805
2016-07-18  0.865754  0.584943
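To make the difference between label slicing and position slicing concrete, here is a minimal sketch. It swaps the random data for a deterministic np.arange table (an assumption made only so the result is repeatable) and compares .loc with its positional counterpart .iloc.

```python
import numpy as np
import pandas as pd

dates = pd.date_range('20160715', periods=6)
# Deterministic stand-in for np.random.rand so the result is repeatable.
df = pd.DataFrame(np.arange(24).reshape(6, 4), index=dates, columns=list('ABCD'))

# Label-based slice: both '20160715' and '20160718' are included (4 rows).
by_label = df.loc['20160715':'20160718', ['A', 'B']]

# Position-based slice: the stop position is excluded, so 0:4 also gives 4 rows.
by_position = df.iloc[0:4, 0:2]

print(by_label.shape)
print(by_label.equals(by_position))
```

The inclusive end point of a label slice is the detail that most often surprises readers used to ordinary Python slicing.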
In [31]:
df.loc['20160715', ['A']]  # .loc selects by label
Out[31]:
A    0.736079
Name: 2016-07-15 00:00:00, dtype: float64
In [38]:
df.at[dates[0], 'B']  # .at reads a single value by row label and column label (.iat is the positional counterpart)
Out[38]:
0.95934014640962595
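The following sketch contrasts the label-based .at with the position-based .iat on a small deterministic table. The table contents are an assumption chosen for illustration, not the notebook's random data.

```python
import numpy as np
import pandas as pd

dates = pd.date_range('20160715', periods=6)
df = pd.DataFrame(np.arange(24.0).reshape(6, 4), index=dates, columns=list('ABCD'))

by_label = df.at[dates[0], 'B']            # row label dates[0], column label 'B'
by_position = df.iat[0, 1]                 # row position 0, column position 1
same_cell_via_loc = df.loc[dates[0], 'B']  # .loc also works, .at is the scalar-only form

print(by_label, by_position, same_cell_via_loc)
```

All three expressions address the same cell; .at and .iat only accept a single row and a single column.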
In [40]:
df.head(3)  # view the first 3 rows
Out[40]:
                   A         B         C         D
2016-07-15  0.736079  0.959340  0.830599  0.627481
2016-07-16  0.183782  0.740787  0.589655  0.167018
2016-07-17  0.198436  0.452805  0.275851  0.119994
In [41]:
df.tail()  # view the last 5 rows (the default)
Out[41]:
                   A         B         C         D
2016-07-16  0.183782  0.740787  0.589655  0.167018
2016-07-17  0.198436  0.452805  0.275851  0.119994
2016-07-18  0.865754  0.584943  0.381434  0.966995
2016-07-19  0.095199  0.431479  0.394274  0.041155
2016-07-20  0.731607  0.018931  0.694380  0.189079
In [42]:
df.index  # the index of the table
Out[42]:
DatetimeIndex(['2016-07-15', '2016-07-16', '2016-07-17', '2016-07-18',
               '2016-07-19', '2016-07-20'],
              dtype='datetime64[ns]', freq='D', tz=None)
In [43]:
df.columns  # view the column names
Out[43]:
Index([u'A', u'B', u'C', u'D'], dtype='object')
In [45]:
df.values  # view the data as a NumPy array
Out[45]:
array([[ 0.73607859,  0.95934015,  0.83059865,  0.62748078],
       [ 0.18378218,  0.74078748,  0.58965485,  0.16701782],
       [ 0.19843576,  0.4528047 ,  0.27585071,  0.1199943 ],
       [ 0.86575368,  0.58494297,  0.38143413,  0.96699532],
       [ 0.09519916,  0.43147948,  0.39427373,  0.0411551 ],
       [ 0.73160684,  0.01893109,  0.69437963,  0.18907934]])
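Recent pandas versions also offer a method form of this conversion, DataFrame.to_numpy(). The sketch below, on a tiny made-up table, suggests the two give the same array for homogeneous numeric data.

```python
import numpy as np
import pandas as pd

# A tiny hypothetical table, used only for illustration.
df = pd.DataFrame({'A': [1.0, 2.0], 'B': [3.0, 4.0]})

as_values = df.values      # attribute used in the cell above
as_numpy = df.to_numpy()   # method form of the same conversion

print(type(as_numpy).__name__, as_numpy.shape)
print(np.array_equal(as_values, as_numpy))
```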
In [46]:
df.describe()  # descriptive statistics for each column
Out[46]:
              A         B         C         D
count  6.000000  6.000000  6.000000  6.000000
mean   0.468476  0.531381  0.527699  0.351954
std    0.344089  0.318945  0.212599  0.364780
min    0.095199  0.018931  0.275851  0.041155
25%    0.187446  0.436811  0.384644  0.131750
50%    0.465021  0.518874  0.491964  0.178049
75%    0.734961  0.701826  0.668198  0.517880
max    0.865754  0.959340  0.830599  0.966995
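Each row of the describe() table corresponds to an ordinary reduction on the column. The sketch below checks that on a four-value Series; the data and the name col are assumptions made for illustration.

```python
import pandas as pd

col = pd.Series([1.0, 2.0, 3.0, 4.0], name='x')
summary = col.describe()

# Compare entries of describe() with the matching Series methods.
print(summary['count'], col.count())
print(summary['mean'], col.mean())
print(summary['std'], col.std())            # sample standard deviation (ddof=1)
print(summary['25%'], col.quantile(0.25))   # linear interpolation between data points
print(summary['50%'], col.median())
```

Note that std here is the sample standard deviation, which divides by n - 1 rather than n.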
In [63]:
df.T  # transpose (swap rows and columns)
Out[63]:
   2016-07-15 00:00:00  2016-07-16 00:00:00  2016-07-17 00:00:00  2016-07-18 00:00:00  2016-07-19 00:00:00  2016-07-20 00:00:00
A             0.736079             0.183782             0.198436             0.865754             0.095199             0.731607
B             0.959340             0.740787             0.452805             0.584943             0.431479             0.018931
C             0.830599             0.589655             0.275851             0.381434             0.394274             0.694380
D             0.627481             0.167018             0.119994             0.966995             0.041155             0.189079
In [ ]:
#df2 = pd.DataFrame([1,2,3],index=['a','b','c'])
#df2
In [60]:
df.sort_index()  # sort by index, ascending (DataFrame.sort() has been removed from pandas; sort_index() replaces the original df.sort())
Out[60]:
                   A         B         C         D
2016-07-15  0.736079  0.959340  0.830599  0.627481
2016-07-16  0.183782  0.740787  0.589655  0.167018
2016-07-17  0.198436  0.452805  0.275851  0.119994
2016-07-18  0.865754  0.584943  0.381434  0.966995
2016-07-19  0.095199  0.431479  0.394274  0.041155
2016-07-20  0.731607  0.018931  0.694380  0.189079
In [64]:
df.sort_values('C')  # sort rows by column C, ascending (replaces the removed df.sort('C'))
Out[64]:
                   A         B         C         D
2016-07-17  0.198436  0.452805  0.275851  0.119994
2016-07-18  0.865754  0.584943  0.381434  0.966995
2016-07-19  0.095199  0.431479  0.394274  0.041155
2016-07-16  0.183782  0.740787  0.589655  0.167018
2016-07-20  0.731607  0.018931  0.694380  0.189079
2016-07-15  0.736079  0.959340  0.830599  0.627481
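The two kinds of sorting shown above (by index and by a column's values) can be sketched on a three-row table as follows. The table and its labels are made up for illustration; sort_index and sort_values are the current pandas methods for these operations.

```python
import pandas as pd

# Hypothetical three-row table with an unsorted index.
df = pd.DataFrame({'C': [0.8, 0.2, 0.5]}, index=['x', 'z', 'y'])

by_index = df.sort_index()                             # rows ordered by index label
by_column = df.sort_values('C')                        # rows ordered by column C, ascending
by_column_desc = df.sort_values('C', ascending=False)  # largest C first

print(list(by_index.index))
print(list(by_column.index))
print(list(by_column_desc.index))
```

Both methods return a new DataFrame and leave the original unchanged unless inplace=True is passed.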
In [77]:
df1 = pd.DataFrame(np.random.randn(6, 4), columns=list('abcd'))
df1
Out[77]:
          a         b         c         d
0  1.225333 -0.694005 -0.868498  2.540235
1 -0.089846  0.075165  0.722056  0.261062
2  1.087733 -0.590180  0.139107 -0.768135
3  1.587061 -0.176495 -0.784779  0.186754
4  0.978927  0.244961 -0.576200  3.148124
5 -0.057562  0.575155  0.550383 -0.859195
In [78]:
df1[df1.d > 0]  # keep the rows where column d is positive
Out[78]:
          a         b         c         d
0  1.225333 -0.694005 -0.868498  2.540235
1 -0.089846  0.075165  0.722056  0.261062
3  1.587061 -0.176495 -0.784779  0.186754
4  0.978927  0.244961 -0.576200  3.148124
In [97]:
df1[df1.c < 0][['a', 'b']]  # rows where column c is negative, columns a and b only
Out[97]:
          a         b
0  1.225333 -0.694005
3  1.587061 -0.176495
4  0.978927  0.244961
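The boolean filters above can also be written as a single .loc call, and several conditions can be combined. The sketch below uses a small fixed table (an assumption, in place of the random df1) so that the selected rows are predictable.

```python
import pandas as pd

# Fixed stand-in for the random df1, so the selected rows are predictable.
df1 = pd.DataFrame({'a': [1.0, -0.5, 2.0, 0.3],
                    'b': [0.1, 0.2, -0.3, 0.4],
                    'c': [-1.0, 0.7, -0.2, 0.5],
                    'd': [2.5, 0.3, -0.8, -0.1]})

# Row mask and column list in one .loc call.
neg_c = df1.loc[df1['c'] < 0, ['a', 'b']]

# Combine conditions with & (and) or | (or); each condition needs its own parentheses.
both = df1[(df1['c'] < 0) & (df1['d'] > 0)]
either = df1[(df1['c'] < 0) | (df1['d'] > 0)]

print(list(neg_c.index), list(both.index), list(either.index))
```

Python's plain and / or do not work element-wise on a Series, which is why the & and | operators are used here.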
In [280]:
'''

Section 2:
#pandas: string extraction and manipulation ************************************

'''
Out[280]:
'\n\nSection 2:\n#pandas: string extraction and manipulation ************************************\n\n'
In [227]:
s = pd.Series(list('ABCDEF'))
s
Out[227]:
0    A
1    B
2    C
3    D
4    E
5    F
dtype: object
In [228]:
s.str.lower()  # .lower() converts the strings to lowercase
Out[228]:
0    a
1    b
2    c
3    d
4    e
5    f
dtype: object
In [229]:
s.str.upper()  # .upper() converts the strings to uppercase
Out[229]:
0    A
1    B
2    C
3    D
4    E
5    F
dtype: object
In [230]:
s.str.len()  # .str.len() returns the length of each string
Out[230]:
0    1
1    1
2    1
3    1
4    1
5    1
dtype: int64
In [231]:
s.str.split('')  # .split('<delimiter>') splits each string; the delimiter goes inside the quotes
Out[231]:
0    [A]
1    [B]
2    [C]
3    [D]
4    [E]
5    [F]
dtype: object
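Since the single-letter strings above contain no delimiter, the effect of splitting is easier to see on data that does. The sketch below uses a hypothetical Series with '-' as the delimiter and also shows the expand=True option.

```python
import pandas as pd

# Hypothetical data with an explicit '-' delimiter (not from the original notebook).
codes = pd.Series(['a-1', 'b-2', 'c-3'])

as_lists = codes.str.split('-')                 # each element becomes a list of pieces
as_columns = codes.str.split('-', expand=True)  # each piece becomes its own column

print(as_lists.tolist())
print(as_columns)
```

With expand=True the result is a DataFrame, which is usually more convenient when the pieces are to be used as separate fields.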
In [232]:
s.str.replace('A', 'Z')  # .replace() replaces the matched substring with another string
Out[232]:
0    Z
1    B
2    C
3    D
4    E
5    F
dtype: object
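str.replace can treat its first argument either as literal text or as a regular expression, depending on the regex parameter. A minimal sketch on made-up data, passing regex explicitly so the behaviour does not depend on the pandas version's default:

```python
import pandas as pd

s = pd.Series(['A1', 'B22', 'C333'])

# Literal replacement: only the exact substring 'A' is replaced.
literal = s.str.replace('A', 'Z', regex=False)

# Regex replacement: every run of digits is replaced by '#'.
pattern = s.str.replace(r'\d+', '#', regex=True)

print(literal.tolist())
print(pattern.tolist())
```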
In [233]:
s1 = pd.Series(['a1', 'a2', 'a3', 'a4'])  # create a new Series
s1
Out[233]:
0    a1
1    a2
2    a3
3    a4
dtype: object
In [238]:
s1.str.extract(r'[ab](\d)')  # extract the digit captured by the single group
Out[238]:
0    1
1    2
2    3
3    4
dtype: object
In [239]:
s1.str.extract(r'([abc])(\d)')  # two groups are extracted into two columns
Out[239]:
   0  1
0  a  1
1  a  2
2  a  3
3  a  4
In [240]:
s1.str.extract(r'(?P<str>[abc])(?P<digit>\d)')  # named groups become the column names
Out[240]:
  str digit
0   a     1
1   a     2
2   a     3
3   a     4
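Two details of str.extract are worth a small sketch: the expand parameter, which controls whether a single group comes back as a Series or a one-column DataFrame, and the handling of rows that do not match. The extra element 'x9' and the group name letter are assumptions added for illustration.

```python
import pandas as pd

# 'x9' is added on purpose: it does not match either pattern below.
s1 = pd.Series(['a1', 'a2', 'a3', 'x9'])

# expand=False with a single group returns a Series.
digits = s1.str.extract(r'[ab](\d)', expand=False)

# Named groups become column names; rows that do not match are filled with NaN.
parts = s1.str.extract(r'(?P<letter>[abc])(?P<digit>\d)')

print(digits.tolist())
print(parts)
```

Passing expand explicitly keeps the return type the same across pandas versions, since the default has changed over time.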
In [253]:
s2 = pd.Series(['a', 'B', 'c', 'd'])
a_z = r'[a-z]'  # regular expression
s2.str.contains(a_z)  # .contains() tests whether each element contains a match for the pattern
Out[253]:
0     True
1    False
2     True
3     True
dtype: bool
In [267]:
s3 = pd.Series(['ab', 'Ba', 'ac', 'd'])
s3
Out[267]:
0    ab
1    Ba
2    ac
3     d
dtype: object
In [268]:
s3.str.contains('^a')  # .contains('^a') tests whether each element starts with 'a' and returns a bool for each
Out[268]:
0     True
1    False
2     True
3    False
dtype: bool
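str.contains takes a few optional parameters that matter on real data. The sketch below, on a made-up Series containing a missing value, illustrates na, case, and regex.

```python
import pandas as pd

# Hypothetical data including a missing value and a literal dot.
s = pd.Series(['ab', 'Ba', None, 'a.c'])

# na=False reports missing values as False instead of propagating them.
has_a = s.str.contains('a', na=False)

# case=False ignores the difference between upper and lower case.
starts_with_b = s.str.contains('^b', case=False, na=False)

# regex=False treats the pattern as literal text, so '.' matches only a dot.
has_dot = s.str.contains('.', regex=False, na=False)

print(has_a.tolist(), starts_with_b.tolist(), has_dot.tolist())
```

Setting na=False is useful when the result is used directly as a row filter, because a mask containing missing values cannot be used for boolean indexing.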
In [271]:
Python
s3.str.startswith('a')  # .startswith('a'): True for each element that starts with "a"; same result as .contains('^a')
Out[271]:
Python
0     True
1    False
2     True
3    False
dtype: bool
In [272]:
Python
s3.str.endswith('a')  # .endswith('a'): True for each element that ends with "a"; returns a boolean Series
Out[272]:
Python
0    False
1     True
2    False
3    False
dtype: bool
In [275]:
Python
s3.str.contains('a$')  # .contains('a$'): regex form of the same test; True for each element that ends with "a"
Out[275]:
Python
0    False
1     True
2    False
3    False
dtype: bool
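The boolean Series returned by these string methods can be used directly as a filter, which ties the string tools back to the selection techniques from the first section. The following is a minimal sketch; `with_missing` is a made-up Series added only to show how the `na` argument deals with missing values.

```python
import pandas as pd

s3 = pd.Series(['ab', 'Ba', 'ac', 'd'])

# Use the boolean result as a mask to keep only the matching elements.
ends_with_a = s3[s3.str.endswith('a')]

# startswith('a') and the regex '^a' agree element by element.
same_result = bool((s3.str.startswith('a') == s3.str.contains('^a')).all())

# Hypothetical data with a missing value: na=False makes the missing entry
# count as "no match", so the result stays usable as a mask.
with_missing = pd.Series(['ab', None, 'ac'])
safe_mask = with_missing.str.startswith('a', na=False)
kept = with_missing[safe_mask]

print(ends_with_a.tolist())
print(same_result)
print(kept.tolist())
```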
In [281]:
Python
'''

Section 3:
#pandas: scatter plots ************************************

'''
Out[281]:
Python
'\n\nSection 3:\n#pandas: scatter plots ************************************\n\n'
In [14]:
Python
duqu = pd.read_csv('603318.XSHG_1m.csv')  # read a .csv file: pd.read_csv('path')
duqu.head()
Out[14]:
            Unnamed: 0  open  close  high   low  volume  money
0  2015-04-24 09:30:00  9.39   9.39  9.39  9.39     100    939
1  2015-04-24 09:31:00  9.39   9.39  9.39  9.39    2000  18780
2  2015-04-24 09:32:00  9.39   9.39  9.39  9.39    1000   9390
3  2015-04-24 09:33:00  9.39   9.39  9.39  9.39    1000   9390
4  2015-04-24 09:34:00  9.39   9.39  9.39  9.39       0      0
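The `Unnamed: 0` column in this output suggests that the first column of the CSV file has no header, so pandas gives it a placeholder name. A possible refinement, sketched below, is to load that column as a datetime index with `index_col=0` and `parse_dates=True`. The inline `csv_text` is an assumed stand-in for the real file, rebuilt from the first rows shown above.

```python
import io

import pandas as pd

# Assumed stand-in for '603318.XSHG_1m.csv': the first header field is empty.
csv_text = (
    ",open,close,high,low,volume,money\n"
    "2015-04-24 09:30:00,9.39,9.39,9.39,9.39,100,939\n"
    "2015-04-24 09:31:00,9.39,9.39,9.39,9.39,2000,18780\n"
    "2015-04-24 09:32:00,9.39,9.39,9.39,9.39,1000,9390\n"
)

# Plain read: the unnamed first column becomes 'Unnamed: 0'.
raw = pd.read_csv(io.StringIO(csv_text))

# Use the first column as the index and parse it as timestamps.
indexed = pd.read_csv(io.StringIO(csv_text), index_col=0, parse_dates=True)

print(raw.columns[0])
print(type(indexed.index).__name__)
print(indexed.loc[pd.Timestamp('2015-04-24 09:31:00'), 'volume'])
```

With a datetime index in place, time-based selection and resampling become available without converting the column afterwards.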
In [306]:
Python
pl = duqu.plot(kind='scatter', x='volume', y='money').get_figure()
# .plot(kind='<plot type>', x='<x-axis column>', y='<y-axis column>') draws a scatter plot
pl.savefig('1.jpg')  # save the figure
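The same pattern can be tried without the CSV file. The sketch below uses a small hand-made frame (its values are copied from the `head()` output above), keeps the Axes object so a title can be added, and writes the image to a temporary directory. The `Agg` backend, the title and the output path are assumptions made so the example runs without a display.

```python
import os
import tempfile

import matplotlib
matplotlib.use('Agg')  # non-interactive backend, so no window is needed
import pandas as pd

# Small stand-in for the 'volume' and 'money' columns of the CSV data.
trades = pd.DataFrame({
    'volume': [100, 2000, 1000, 1000, 0],
    'money': [939, 18780, 9390, 9390, 0],
})

# DataFrame.plot returns a matplotlib Axes; keep it to customise the plot.
ax = trades.plot(kind='scatter', x='volume', y='money')
ax.set_title('money vs. volume')

# The Axes knows which Figure it belongs to; save that figure to disk.
fig = ax.get_figure()
out_path = os.path.join(tempfile.mkdtemp(), 'scatter.png')
fig.savefig(out_path)
print(out_path)
```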
In [20]:
Python
'''

Section 4:
#pandas: bar charts / histograms / box plots ************************************

'''
Out[20]:
Python
'\n\nSection 4:\n#pandas: bar charts / histograms / box plots ************************************\n\n'
In [9]:
Python
df6 = pd.DataFrame(np.random.rand(10, 4), columns=list('ABCD'))
pd.set_option('mpl_style', 'default')  # set the plot style (this option exists only in older pandas releases); kind='bar' below selects a bar chart
plt6 = df6.plot(kind='bar', stacked=True).get_figure()  # .plot(kind='<plot type>', stacked=<bool: whether to stack the bars>)
plt6.savefig('2.jpg')
In [19]:
Python
df7 = pd.DataFrame(np.random.rand(100, 4), columns=list('abcd'))
d8 = df7['a'].hist().get_figure()  # draw a histogram of column 'a'
d8.savefig('3.jpg')
In [33]:
Python
pd.set_option('mpl_style', 'default')  # plot style (this option exists only in older pandas releases)
fig, ax = plt.subplots()
df9 = pd.DataFrame(np.random.rand(100, 2), columns=list('ab'))
df9.boxplot(ax=ax)  # draw a box plot on the given Axes
plt.show()
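The three chart types from this section can also be drawn side by side on one figure. This is a sketch under a few assumptions: `plt.style.use('ggplot')` is matplotlib's own style mechanism, used here in place of the `mpl_style` option, which as far as I know is not available in current pandas; the seeded random data, the `Agg` backend and the output path are likewise choices made only for this example.

```python
import os
import tempfile

import matplotlib
matplotlib.use('Agg')  # non-interactive backend, so no window is needed
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

plt.style.use('ggplot')  # matplotlib-level style setting

rng = np.random.default_rng(0)  # seeded so the data is reproducible
df = pd.DataFrame(rng.random((100, 4)), columns=list('abcd'))

fig, axes = plt.subplots(1, 3, figsize=(12, 4))

# Stacked bar chart of the first 10 rows: one bar segment per column.
df.head(10).plot(kind='bar', stacked=True, ax=axes[0])

# Histogram of a single column with 10 bins.
df['a'].plot(kind='hist', bins=10, ax=axes[1])

# Box plot of two columns on the third Axes.
df[['a', 'b']].boxplot(ax=axes[2])

fig.tight_layout()
combined_path = os.path.join(tempfile.mkdtemp(), 'charts.png')
fig.savefig(combined_path)
print(combined_path)
```

Passing `ax=` to each call is what keeps every chart on its own panel instead of creating a new figure each time.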

 

 




  • zeropython (WeChat official account) · 5868037 (QQ number) · 5868037@qq.com (QQ email)