Summary
In Python, assignment with '=' does not create a new object: it merely binds another name to the original one. When we want a new, independent object that leaves the original data untouched, we have to make a copy; that is the difference between assignment and copying.
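For example, a minimal sketch with a plain Python list (the names a, b, c are only for illustration):

>>> import copy
>>> a = [1, 2, 3]
>>> b = a             # assignment only binds a new name to the same list
>>> c = copy.copy(a)  # copy creates a new list object
>>> b[0] = 99
>>> a                 # the original is affected through b
[99, 2, 3]
>>> c                 # the copy is not
[1, 2, 3]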
NumPy offers two kinds of indexing: slice indexing (slice array indexing) and integer indexing (integer array indexing).
Slice indexing into a NumPy array produces a view of the original array, so writes through it affect the original. Integer indexing is more complicated: if it is basic indexing, the result is still a view of the original array; if it is advanced indexing, a new array is created from the original data, and modifying it does not affect the original.
x[1, 3:8] and x[2:5, 6:9] are slice indexing, x[1, 2] is basic integer indexing, and x[[1, 2], [1, 4]] is advanced integer indexing.
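A minimal sketch of this difference, using a small 3x4 array (np.shares_memory() reports whether two arrays overlap in memory):

>>> import numpy as np
>>> x = np.arange(12).reshape(3, 4)
>>> s = x[1, 1:3]             # slice (basic) indexing: a view
>>> np.shares_memory(x, s)
True
>>> s[0] = -1                 # writing through the view changes x
>>> x[1]
array([ 4, -1,  6,  7])
>>> a = x[[0, 1], [1, 3]]     # advanced integer indexing: a copy
>>> np.shares_memory(x, a)
False
>>> a[0] = 100                # writing to the copy leaves x untouched
>>> x[0]
array([0, 1, 2, 3])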
For PyTorch tensors, you can use data_ptr() to check whether the address of the underlying data (the first element) is the same:
>>> t = torch.rand(4, 4)
>>> b = t.view(2, 8)
>>> t.storage().data_ptr() == b.storage().data_ptr() # `t` and `b` share the same underlying data.
True
# Modifying view tensor changes base tensor as well.
>>> b[0][0] = 3.14
>>> t[0][0]
tensor(3.14)
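The same kind of check separates slicing from advanced indexing (a small sketch continuing from the snippet above):

>>> s = t[1:3]          # basic slicing: a view of t
>>> s.storage().data_ptr() == t.storage().data_ptr()
True
>>> a = t[[1, 2]]       # advanced (integer) indexing: a new tensor with its own storage
>>> a.storage().data_ptr() == t.storage().data_ptr()
False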
In PyTorch
A more detailed explanation: https://pytorch.org/docs/stable/tensor_view.html
For reference, the linked page gives a full list of view ops in PyTorch; an excerpt:

Basic slicing and indexing ops, e.g. tensor[0, 2:, 1:7:2], return a view of the base tensor (see the note on the linked page).
view_as_real()
view_as_imag()
split_with_sizes()
indices() (sparse tensor only)
values() (sparse tensor only)

It's also worth mentioning a few ops with special behaviors:

reshape(), reshape_as() and flatten() can return either a view or a new tensor; user code shouldn't rely on whether it's a view or not.
contiguous() returns itself if the input tensor is already contiguous; otherwise it returns a new contiguous tensor by copying data.
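A small sketch of those two special behaviors; whether reshape() copies depends on the memory layout, so the True/False results below hold only for this particular case:

>>> import torch
>>> t = torch.rand(4, 4)
>>> t.contiguous() is t        # already contiguous, so the tensor itself comes back
True
>>> v = t.t()                  # the transpose is a view, but not contiguous
>>> c = v.contiguous()         # so contiguous() copies the data into a new tensor
>>> c.storage().data_ptr() == t.storage().data_ptr()
False
# Whether reshape() returns a view or a copy depends on the layout:
>>> t.reshape(16).storage().data_ptr() == t.storage().data_ptr()  # view here
True
>>> v.reshape(16).storage().data_ptr() == t.storage().data_ptr()  # copy here
False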
In NumPy
PyTorch's behavior here mimics NumPy's, and the NumPy documentation spells out in detail when you get a view and when you get a copy:
https://numpy.org/doc/stable/reference/arrays.indexing.html
Advanced indexing is triggered when the selection object, obj, is a non-tuple sequence object, an ndarray (of data type integer or bool), or a tuple with at least one sequence object or ndarray (of data type integer or bool). There are two types of advanced indexing: integer and Boolean. Advanced indexing always returns a copy of the data (contrast with basic slicing that returns a view).
See the link above for the full details.
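As a quick illustration of the Boolean case (a minimal sketch; the array y is made up for the example):

>>> import numpy as np
>>> y = np.arange(6)
>>> m = y[y % 2 == 0]      # Boolean advanced indexing: always a copy
>>> np.shares_memory(y, m)
False
>>> m[:] = 0               # modifying the copy does not touch y
>>> y
array([0, 1, 2, 3, 4, 5])

Note that assigning through an advanced index on the left-hand side, e.g. y[y % 2 == 0] = 0, does modify y in place, because that is an indexed assignment rather than a read that produces a copy.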