我正在尝试实施这样的高斯NB训练。但是国民生产总值()如果X的维数不相等(即X内的所有列表的长度都必须相同),则抛出异常。如果我的训练样本是不同长度的向量,那么调用fit()的正确方法是什么?在def train(X, Y):
gnb = GaussianNB()
gnb.fit(X, Y)
return gnb
>>> X = [[1,2,3], [4,5,6,7], [8,9]]
>>> Y = [1,1,1]
>>> snb.train(X, Y)
/Library/Python/2.7/site-packages/sklearn/utils/validation.py:395:
DeprecationWarning: Passing 1d arrays as data is deprecated in 0.17
and will raise ValueError in 0.19. Reshape your data either using
X.reshape(-1, 1) if your data has a single feature or X.reshape(1, -1)
if it contains a single sample.
DeprecationWarning)
Traceback (most recent call last):
File "", line 1, in
File "snb.py", line 113, in train
gnb.fit(X, Y)
File "/Library/Python/2.7/site-packages/sklearn/naive_bayes.py", line
182, in fit
X, y = check_X_y(X, y)
File "/Library/Python/2.7/site-packages/sklearn/utils/validation.py",
line 521, in check_X_y
ensure_min_features, warn_on_dtype, estimator)
File "/Library/Python/2.7/site-packages/sklearn/utils/validation.py",
line 402, in check_array
array = array.astype(np.float64)
ValueError: setting an array element with a sequence.