我试图将主成分分析应用于大型稀疏矩阵,在下面的链接中,它说明sklearn的randomizedPCA可以处理scipy稀疏格式的稀疏矩阵。
Apply PCA on very large sparse matrix
但是,我总是犯错误。有人能指出我做错了什么吗。
输入矩阵“X_train”包含float64中的数字:>>>type(X_train)
>>>X_train.shape
(2365436, 1617899)
>>>X_train.ndim
2
>>>X_train[0]
<1x1617899 sparse matrix of type ''
with 81 stored elements in Compressed Sparse Row format>
我想做的是:>>>from sklearn.decomposition import RandomizedPCA
>>>pca = RandomizedPCA()
>>>pca.fit(X_train)
Traceback (most recent call last):
File "", line 1, in
File "/home/RT11/.pyenv/versions/2.7.9/lib/python2.7/site-packages/sklearn/decomposition/pca.py", line 567, in fit
self._fit(check_array(X))
File "/home/RT11/.pyenv/versio