陷阱一:数据结构混乱
array 和 matrix 都可以用来表示多维矩阵
In [98]: a = np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
In [99]: a
Out[99]:
array([[1, 2, 3],
[4, 5, 6],
[7, 8, 9]])
In [100]: A = np.matrix([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
In [101]: A
Out[101]:
matrix([[1, 2, 3],
[4, 5, 6],
[7, 8, 9]])
In [102]: a.shape
Out[102]: (3, 3)
In [103]: A.shape
Out[103]: (3, 3)
看起来效果不错。假设我们要对数据进行筛选,取第 1 列的第 1 行和第 3 行数据构成一个 2 x 1 的列向量。先看对 array 的做法
In [99]: a
Out[99]:
array([[1, 2, 3],
[4, 5, 6],
[7, 8, 9]])
In [100]: y
Out[100]:
matrix([[1],
[0],
[1]])
In [101]: a[:, 0]
Out[101]: array([1, 4, 7])
In [102]: a[:, 0].shape
Out[102]: (3,)
In [110]: a[:, 0][y == 1]
---------------------------------------------------------------------------
IndexError Traceback (most recent call last)
<ipython-input-110-f32ed63aa2a8> in <module>()
----> 1 a[:, 0][y == 1]
IndexError: too many indices for array
In [111]: a[:, 0].reshape(3, 1)[y == 1]
Out[111]: array([1, 7])
从 Out[101] 可以看到一个陷阱,a