pandas数据选择（索引）

最新推荐文章于 2024-09-02 00:58:40 发布

单明火

最新推荐文章于 2024-09-02 00:58:40 发布

阅读量3.1k

点赞数 2

分类专栏： python基础

本文链接：https://blog.csdn.net/knowmore0823/article/details/79039610

版权

python基础专栏收录该内容

11 篇文章 0 订阅

订阅专栏

import pandas as pd
import numpy as np

dates = pd.date_range('20180101',periods=6)
df = pd.DataFrame(np.arange(24).reshape(6,4),index=dates,columns=['A','B','C','D'])

print(df)  #基本数据

             A   B   C   D
2018-01-01   0   1   2   3
2018-01-02   4   5   6   7
2018-01-03   8   9  10  11
2018-01-04  12  13  14  15
2018-01-05  16  17  18  19
2018-01-06  20  21  22  23

一、索引的基本方法

print(df.A, df['A']) #引用A列的两种方式

2018-01-01     0
2018-01-02     4
2018-01-03     8
2018-01-04    12
2018-01-05    16
2018-01-06    20
Freq: D, Name: A, dtype: int32 2018-01-01     0
2018-01-02     4
2018-01-03     8
2018-01-04    12
2018-01-05    16
2018-01-06    20
Freq: D, Name: A, dtype: int32

print(df[0:2])  #引用行的两种方式

            A  B  C  D
2018-01-01  0  1  2  3
2018-01-02  4  5  6  7

print( df['2018-01-02':'2018-01-04'])

             A   B   C   D
2018-01-02   4   5   6   7
2018-01-03   8   9  10  11
2018-01-04  12  13  14  15

二、loc方法(loc——通过行标签索引行数据 )

print(df.loc['2018-01-02'])  #loc——通过行标签索引行数据

A    4
B    5
C    6
D    7
Name: 2018-01-02 00:00:00, dtype: int32

print(df.loc['2018-01-02',['A','B']]) #应用在index的基础上之后，可以选取所需的列

A    4
B    5
Name: 2018-01-02 00:00:00, dtype: int32

三、iloc方法(iloc——通过行号索引行数据 )

print(df.iloc[3:5,2:4])

             C   D
2018-01-04  14  15
2018-01-05  18  19

print(df.iloc[[1,3,5],2:4]) #跳行取数

             C   D
2018-01-02   6   7
2018-01-04  14  15
2018-01-06  22  23

四、ix方法–通过行标签或者行号索引行数据（基于loc和iloc 的混合）

print(df.ix[0:3,['A','B']])

            A  B
2018-01-01  0  1
2018-01-02  4  5
2018-01-03  8  9


C:\Users\Administrator\Anaconda3\lib\site-packages\ipykernel_launcher.py:1: DeprecationWarning: 
.ix is deprecated. Please use
.loc for label based indexing or
.iloc for positional indexing

五、布尔型索引

print(df[df.A>=8])

             A   B   C   D
2018-01-03   8   9  10  11
2018-01-04  12  13  14  15
2018-01-05  16  17  18  19
2018-01-06  20  21  22  23

print(df.A>=8)

2018-01-01    False
2018-01-02    False
2018-01-03     True
2018-01-04     True
2018-01-05     True
2018-01-06     True
Freq: D, Name: A, dtype: bool

单明火

关注

2
点赞
踩
3

收藏

觉得还不错? 一键收藏
1
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录