python二维数组去重复_删除numpy数组的重复行

最新推荐文章于 2022-06-22 16:38:17 发布

weixin_39928017

最新推荐文章于 2022-06-22 16:38:17 发布

阅读量867

点赞数

文章标签： python二维数组去重复

使用^{}-# Perform lex sort and get sorted data

sorted_idx = np.lexsort(data.T)

sorted_data = data[sorted_idx,:]

# Get unique row mask

row_mask = np.append([True],np.any(np.diff(sorted_data,axis=0),1))

# Get unique rows

out = sorted_data[row_mask]

样本运行-In [199]: data

Out[199]:

array([[1, 8, 3, 3, 4],

[1, 8, 9, 9, 4],

[1, 8, 3, 3, 4],

[1, 8, 3, 3, 4],

[1, 8, 0, 3, 4],

[1, 8, 9, 9, 4]])

In [200]: sorted_idx = np.lexsort(data.T)

...: sorted_data = data[sorted_idx,:]

...: row_mask = np.append([True],np.any(np.diff(sorted_data,axis=0),1))

...: out = sorted_data[row_mask]

...:

In [201]: out

Out[201]:

array([[1, 8, 0, 3, 4],

[1, 8, 3, 3, 4],

[1, 8, 9, 9, 4]])

运行时测试-

本节乘以迄今为止提出的解决方案中提出的所有方法。In [34]: data = np.random.randint(0,10,(10000,10))

In [35]: def tuple_based(data):

...: new_array = [tuple(row) for row in data]

...: return np.unique(new_array)

...:

...: def lexsort_based(data):

...: sorted_data = data[np.lexsort(data.T),:]

...: row_mask = np.append([True],np.any(np.diff(sorted_data,axis=0),1))

...: return sorted_data[row_mask]

...:

...: def unique_based(a):

...: a = np.ascontiguousarray(a)

...: unique_a = np.unique(a.view([('', a.dtype)]*a.shape[1]))

...: return unique_a.view(a.dtype).reshape((unique_a.shape[0], a.shape[1]))

...:

In [36]: %timeit tuple_based(data)

10 loops, best of 3: 63.1 ms per loop

In [37]: %timeit lexsort_based(data)

100 loops, best of 3: 8.92 ms per loop

In [38]: %timeit unique_based(data)

10 loops, best of 3: 29.1 ms per loop

weixin_39928017

关注

0
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
python二维数组去重复_删除numpy数组的重复行

使用^{}-# Perform lex sort and get sorted datasorted_idx = np.lexsort(data.T)sorted_data = data[sorted_idx,:]# Get unique row maskrow_mask = np.append([True],np.any(np.diff(sorted_data,axis=0),1))# Get...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。