A Simple Example of TF-IDF and Logistic Regression

zj1244

Article link: https://blog.csdn.net/zj1244/article/details/79150925

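import pandas as pd
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
# LogisticRegression is imported from sklearn.linear_model; the private
# sklearn.linear_model.logistic module path no longer exists in recent scikit-learn.
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split, cross_val_score

# Tab-separated file with no header row: column 0 is the label, column 1 is the message text.
df = pd.read_csv('e:\\SMSSpamCollection', delimiter='\t', header=None)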

In [10]:

df.head()

Out[10]:

	0	1
0	ham	Go until jurong point, crazy.. Available only ...
1	ham	Ok lar... Joking wif u oni...
2	spam	Free entry in 2 a wkly comp to win FA Cup fina...
3	ham	U dun say so early hor... U c already then say...
4	ham	Nah I don't think he goes to usf, he lives aro...

In [12]:

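# Split messages (column 1) and labels (column 0) into training and test parts.
X_train_raw, X_test_raw, y_train, y_test = train_test_split(df[1], df[0])

# Fit the TF-IDF vocabulary and IDF weights on the training messages only.
vectorizer = TfidfVectorizer()
X_train = vectorizer.fit_transform(X_train_raw)
classifier = LogisticRegression()
classifier.fit(X_train, y_train)

# Transform new messages with the same fitted vectorizer, then predict.
X_test = vectorizer.transform(['URGENT! Your Mobile No 1234 was awarded a Prize', 'Hey honey, whats up?'])
predictions = classifier.predict(X_test)
print(predictions)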

['spam' 'ham']
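The split above also produces X_test_raw and y_test, which the example never uses. A minimal sketch of how the held-out part could be scored is shown here. To stay self-contained it uses a small made-up corpus instead of the SMSSpamCollection file, so the texts, the labels, and the `test_size`, `stratify` and `random_state` arguments are all assumptions for illustration.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Toy corpus invented for this sketch; it is NOT the SMSSpamCollection data.
texts = [
    "URGENT! You have won a free prize, call now",
    "Free entry to win cash, text WIN now",
    "Congratulations, claim your free prize today",
    "Win a free mobile, reply now to claim",
    "Hey, are we still meeting for lunch?",
    "Ok, see you at home later",
    "Can you call me when you get home?",
    "I will be late for lunch, sorry",
]
labels = ["spam", "spam", "spam", "spam", "ham", "ham", "ham", "ham"]

# Hold out a quarter of the messages; stratify keeps both classes in each part.
X_train_raw, X_test_raw, y_train, y_test = train_test_split(
    texts, labels, test_size=0.25, stratify=labels, random_state=0
)

# Fit on the training text only, then reuse the fitted vectorizer for the test text.
vectorizer = TfidfVectorizer()
X_train = vectorizer.fit_transform(X_train_raw)
X_test = vectorizer.transform(X_test_raw)

classifier = LogisticRegression()
classifier.fit(X_train, y_train)

# score() returns the mean accuracy on the held-out messages.
accuracy = classifier.score(X_test, y_test)
print("same feature count:", X_train.shape[1] == X_test.shape[1])
print("held-out accuracy:", accuracy)
```

With the real dataset, the same pattern would apply to the X_test_raw and y_test returned by the original train_test_split call.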

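Creating a new TfidfVectorizer instance at this point raises an error: the new instance has never been fitted, so it has no vocabulary. New text must be transformed with the original, already fitted vectorizer. The following code fails: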

new_vectorizer = TfidfVectorizer()
# Fails here: new_vectorizer was never fitted, so transform() raises NotFittedError.
new_test = new_vectorizer.transform(['URGENT! Your Mobile No 1234 was awarded a Prize', 'Hey honey, whats up?'])
predictions = classifier.predict(new_test)
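One way to avoid mixing up vectorizer instances is to bundle the vectorizer and the classifier into a single scikit-learn pipeline, so the fitted vocabulary always travels with the model. This is a sketch of that idea, not part of the original post. The toy corpus and the names `pipeline` and `raised` are made up for illustration.

```python
from sklearn.exceptions import NotFittedError
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy corpus invented for this sketch; it is NOT the SMSSpamCollection data.
texts = [
    "URGENT! You have won a free prize, call now",
    "Free entry to win cash, text WIN now",
    "Congratulations, claim your free prize today",
    "Hey, are we still meeting for lunch?",
    "Ok, see you at home later",
    "Can you call me when you get home?",
]
labels = ["spam", "spam", "spam", "ham", "ham", "ham"]

# Reproduce the failure: an unfitted vectorizer has no vocabulary to transform with.
try:
    TfidfVectorizer().transform(["free prize"])
    raised = False
except NotFittedError:
    raised = True

# The pipeline fits the vectorizer and the classifier together on raw text,
# and predict() reuses that same fitted vectorizer internally.
pipeline = make_pipeline(TfidfVectorizer(), LogisticRegression())
pipeline.fit(texts, labels)
predictions = pipeline.predict(["Win a free prize now", "See you at lunch"])

print("unfitted transform raised NotFittedError:", raised)
print(predictions)
```

Even a freshly created vectorizer that had been fitted on different text would not help, because its vocabulary, and therefore its feature columns, would not match the ones the classifier was trained on.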
