pandas 和 numpy 中 where 使用

最新推荐文章于 2024-04-19 19:43:47 发布

liu_liuqiu

最新推荐文章于 2024-04-19 19:43:47 发布

阅读量5.4k

点赞数 4

分类专栏： python

原文链接：https://www.cnblogs.com/zz22--/p/8654395.html

版权

python 专栏收录该内容

34 篇文章 1 订阅

订阅专栏

注意： df1.where(cond,df2) 等价于 np.where(cond, df1, df2)

1. pandas.DataFrame.where

DataFrame.where(cond, other=nan, inplace=False, axis=None, level=None, try_cast=False, raise_on_error=True)
功能：按条件查找替换，cond 为 True 则 self 值保持不变；False 改为参数 other 对应的值。结果返回与 self 相同的对象。
参数：
cond 查找条件
other cond为False时要替换的值
inplace 是否在原数据上操作

>>> import numpy as np
>>> import pandas as pd
>>> data = pd.DataFrame(np.arange(12).reshape(3,4),columns=['a','b','c','d'])
>>> data
   a  b   c   d
0  0  1   2   3
1  4  5   6   7
2  8  9  10  11
>>> data.where(data['a']>3, 3.0)
   a  b   c   d
0  3  3   3   3
1  4  5   6   7
2  8  9  10  11

2.numpy.where

声明：首先强调一下，where()函数对于不同的输入，返回值是不同的。

当数组是一维数组时，返回的值是一维的索引，所以只有一组索引数组
当数组是多维数组时，满足条件的数组值返回的是值的位置索引，因此会有两组索引数组来表示值的位置。

>>> import numpy as np
>>> a = np.arange(10)
>>> a
array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
>>> np.where(a>5)
(array([6, 7, 8, 9], dtype=int64),)

>>> b = np.arange(20).reshape(4,5)
>>> b
array([[ 0,  1,  2,  3,  4],
       [ 5,  6,  7,  8,  9],
       [10, 11, 12, 13, 14],
       [15, 16, 17, 18, 19]])
>>> np.where(b>10)
(array([2, 2, 2, 2, 3, 3, 3, 3, 3], dtype=int64), array([1, 2, 3, 4, 0, 1, 2, 3, 4], dtype=int64))

具体实现

numpy.where(condition[, x, y])
功能：
参数：
condition：判定条件，如果True，选择 x；False，选择y（数据类型为数组，bool 值）x,y（可选）： x 和 y 的 shape 必须和 condition 相同（可以采用 broadcast，广播机制）
①如果参数有condition，x和y，它们三个参数的shape是相同的。那么，当condition中的值是true时返回x对应位置的值，false是返回y的。
②如果参数只有condition的话，返回值是condition中元素值为true的位置索引，且是以元组形式返回，元组的元素是ndarray数组，表示位置的索引

>>> np.where([[True, False], [True, True]],	# 官网例子
...          [[1, 2], [3, 4]],
...          [[9, 8], [7, 6]])
array([[1, 8],
       [3, 4]])
       
 >>> x = np.arange(9).reshape(3, 3)
>>> np.where(x>5)		# ndarray 数组分别表示对应的 行和列
(array([2, 2, 2], dtype=int64), array([0, 1, 2], dtype=int64))

https://www.cnblogs.com/zz22–/p/8654395.html

liu_liuqiu

关注

4
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
pandas 和 numpy 中 where 使用

注意： df1.where(cond,df2) 等价于 np.where(cond, df1, df2)1. pandas.DataFrame.whereDataFrame.where(cond, other=nan, inplace=False, axis=None, level=None, try_cast=False, raise_on_error=True)功能：按条件查找替换，c...
复制链接

扫一扫