第八章笔记

Pandas简介

定义

一个开源的python类库,用于数据分析、数据处理、数据可视化

特点

高性能、容易使用的数据结构、容易使用的数据分析工具

pandas数据类型

Series是由相同数据类型组成的一维数组。

DataFrame:二维的表格型数据结构,数据帧(DataFrame)是大小可变的数据结构,每列可以是不同的数据类型(整型、字符串、布尔值等)

面板(Panel)可以由不同的数据类型构成的三维数据结构,Panel是DataFrame的容器

数据结构 维数 描述 数据 大小
Series 1 数据可变, 一维数组,大小不可变,Series是由相同数据类型组成的一维数组。 数据可变 大小不可变
DataFrame 2 二维数组,大小可变的表格结构,它含有一组有序的列,每列可以是不同的数据类型(整型、字符串、布尔值等) 数据可变 大小可变
Panel 3 数据可变 大小可变的三维数组 数据可变 大小可变

pandas安装

<span style="background-color:#f8f8f8"><span style="color:#333333"><span style="color:#000000">pip</span> <span style="color:#000000">install</span> <span style="color:#000000">pandas</span>
<span style="color:#000000">pip</span> <span style="color:#3300aa">list</span></span></span>

一维数据结构:Series对象

<span style="background-color:#f8f8f8"><span style="color:#333333"><span style="color:#aa5500">#查看列索引</span>
<span style="color:#3300aa">print</span>(<span style="color:#000000">d1</span>.<span style="color:#000000">columns</span>)
<span style="color:#aa5500">#查看行索引</span>
<span style="color:#3300aa">print</span>(<span style="color:#000000">d1</span>.<span style="color:#000000">index</span>)</span></span>

Series对象的生成

使用Pandas.Series

<span style="background-color:#f8f8f8"><span style="color:#333333"><span style="color:#aa1111">"""</span>
<span style="color:#aa1111">pandas.Series( data, index, dtype, copy)</span>
<span style="color:#aa1111">data:数据,可以是序列类型,可以是int</span>
<span style="color:#aa1111">index:索引值必须是唯一的,与data的长度相同,默认为np.arange(n)</span>
<span style="color:#aa1111">dtype:数据类型</span>
<span style="color:#aa1111">copy:是否复制数据,默认为false</span>
<span style="color:#aa1111">打印左侧为索引,右侧为数据</span>
<span style="color:#aa1111">"""</span>
<span style="color:#aa5500">#如果全部是int类型,那么打印s1会显示是int类型</span>
<span style="color:#000000">s1</span> = <span style="color:#000000">pd</span>.<span style="color:#000000">Series</span>([<span style="color:#116644">1</span>,<span style="color:#116644">2</span>,<span style="color:#116644">3</span>,<span style="color:#116644">4</span>]) <span style="color:#aa5500">#dtype: int64</span>
<span style="color:#3300aa">print</span>(<span style="color:#000000">s1</span>)
<span style="color:#aa5500">#如果什么数据类型都存在,那么打印s1会显示object类型的</span>
<span style="color:#000000">s1</span> = <span style="color:#000000">pd</span>.<span style="color:#000000">Series</span>([<span style="color:#116644">1</span>,<span style="color:#aa1111">"a"</span>,<span style="color:#116644">0.5</span>,[<span style="color:#aa1111">"张三"</span>,<span style="color:#aa1111">"李四"</span>]])<span style="color:#aa5500">#dtype: object</span>
<span style="color:#3300aa">print</span>(<span style="color:#000000">s1</span>)
<span style="color:#000000">s1</span> = <span style="color:#000000">pd</span>.<span style="color:#000000">Series</span>([<span style="color:#116644">1</span>,<span style="color:#aa1111">"a"</span>,<span style="color:#116644">0.5</span>,[<span style="color:#aa1111">"张三"</span>,<span style="color:#aa1111">"李四"</span>]],<span style="color:#000000">index</span>=[<span style="color:#aa1111">'a'</span>,<span style="color:#aa1111">'b'</span>,<span style="color:#aa1111">'c'</span>,<span style="color:#aa1111">'d'</span>])
<span style="color:#000000">s1</span> = <span style="color:#000000">pd</span>.<span style="color:#000000">Series</span>([<span style="color:#116644">1</span>,<span style="color:#116644">2</span>,<span style="color:#116644">3</span>,<span style="color:#116644">4</span>],<span style="color:#000000">index</span>=[<span style="color:#aa1111">'a'</span>,<span style="color:#aa1111">'b'</span>,<span style="color:#aa1111">'c'</span>,<span style="color:#aa1111">'d'</span>],<span style="color:#000000">dtype</span>=<span style="color:#000000">np</span>.<span style="color:#000000">float32</span>)
<span style="color:#aa1111">"""</span>
<span style="color:#aa1111">copy=True后,数据x不会改变,数据s1会改变</span>
<span style="color:#aa1111">copy=False,数据x会改变,数据s1会改变</span>
<span style="color:#aa1111">"""</span>
<span style="color:#000000">x</span>=<span style="color:#000000">np</span>.<span style="color:#000000">array</span>([<span style="color:#116644">10</span>,<span style="color:#116644">20</span>,<span style="color:#116644">30</span>,<span style="color:#116644">40</span>])
<span style="color:#000000">s1</span> = <span style="color:#000000">pd</span>.<span style="color:#000000">Series</span>(<span style="color:#000000">x</span>,<span style="color:#000000">index</span>=[<span style="color:#aa1111">'a'</span>,<span style="color:#aa1111">'b'</span>,<span style="color:#aa1111">'c'</span>,<span style="color:#aa1111">'d'</span>],<span style="color:#000000">copy</span>=<span style="color:#770088">True</span>)
<span style="color:#000000">s1</span>[<span style="color:#aa1111">'a'</span>]=<span style="color:#116644">100</span>
<span style="color:#3300aa">print</span>(<span style="color:#000000">s1</span>)
<span style="color:#3300aa">print</span>(<span style="color:#000000">x</span>)
​</span></span>

从ndarray创建一个Series

<span style="background-color:#f8f8f8"><span style="color:#333333"><span style="color:#000000">x</span>=<span style="color:#000000">np</span>.<span style="color:#000000">array</span>([<span style="color:#116644">10</span>,<span style="color:#116644">20</span>,<span style="color:#116644">30</span>,<span style="color:#116644">40</span>])
<span style="color:#000000">s1</span> = <span style="color:#000000">pd</span>.<span style="color:#000000">Series</span>(<span style="color:#000000">x</span>)
<span style="color:#000000">s1</span> = <span style="color:#000000">pd</span>.<span style="color:#000000">Series</span>(<span style="color:#000000">x</span>,<span style="color:#000000">index</span>=[<span style="color:#aa1111">'a'</span>,<span style="color:#aa1111">'b'</span>,<span style="color:#aa1111">'c'</span>,<span style="color:#aa1111">'d'</span>])
<span style="color:#3300aa">print</span>(<span style="color:#000000">s1</span>)
​</span></span>

从字典创建一个Series

<span style="background-color:#f8f8f8"><span style="color:#333333"><span style="color:#aa1111">"""</span>
<span style="color:#aa1111">由于我们的Series有索引有数据,和我们的字典很像,字典是有key和vaule</span>
<span style="color:#aa1111">"""</span>
<span style="color:#aa5500">#a是索引 100数据</span>
<span style="color:#000000">x</span>={<span style="color:#aa1111">'a'</span>:<span style="color:#116644">100</span>,<span style="color:#aa1111">'b'</span>:<span style="color:#116644">200</span>,<span style="color:#aa1111">'c'</span>:<span style="color:#116644">300</span>,<span style="color:#aa1111">'d'</span>:<span style="color:#116644">400</span>}
<span style="color:#000000">s1</span> = <span style="color:#000000">pd</span>.<span style="color:#000000">Series</span>(<span style="color:#000000">x</span>)
<span style="color:#aa5500">#设置索引</span>
<span style="color:#000000">s1</span> = <span style="color:#000000">pd</span>.<span style="color:#000000">Series</span>(<span style="color:#000000">x</span>,<span style="color:#000000">index</span>=[<span style="color:#aa1111">'a'</span>,<span style="color:#aa1111">'b'</span>,<span style="color:#aa1111">'c'</span>,<span style="color:#aa1111">'d'</span>])
<span style="color:#aa5500">#如果设置的索引位置换一下呢,数据也会跟着换</span>
<span style="color:#000000">s1</span> = <span style="color:#000000">pd</span>.<span style="color:#000000">Series</span>(<span style="color:#000000">x</span>,<span style="color:#000000">index</span>=[<span style="color:#aa1111">'a'</span>,<span style="color:#aa1111">'c'</span>,<span style="color:#aa1111">'d'</span>,<span style="color:#aa1111">'b'</span>])
<span style="color:#aa5500">#如果设置的索引,在字典中不存在呢,为nan</span>
<span style="color:#000000">s1</span> = <span style="color:#000000">pd</span>.<span style="color:#000000">Series</span>(<span style="color:#000000">x</span>,<span style="color:#000000">index</span>=[<span style="color:#aa1111">'a'</span>,<span style="color:#aa1111">'b'</span>,<span style="color:#aa1111">'e'</span>,<span style="color:#aa1111">'f'</span>])
&l
  • 0
    点赞
  • 2
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值