CountVectorizer

import sklearn

from sklearn.feature_extraction.text import CountVectorizer

vector = CountVectorizer()
res = vector.fit_transform(["life is is short,I like python","life is long,I dislike python"])

print(vector.get_feature_names())
print(res.toarray())

C:\Python38\python.exe D:/Project/Study/python/machine/test.py
['dislike', 'is', 'life', 'like', 'long', 'python', 'short']
[[0 2 1 1 0 1 1]
 [1 1 1 0 1 1 0]]
import sklearn

from sklearn.feature_extraction.text import CountVectorizer

vector = CountVectorizer()
res = vector.fit_transform(["人生 苦 短,我喜欢 python","人生 漫长,不 喜欢python"])

print(vector.get_feature_names())
print(res.toarray())

C:\Python38\python.exe D:/Project/Study/python/machine/test.py
['python', '人生', '喜欢python', '我喜欢', '漫长']
[[1 1 0 1 0]
 [0 1 1 0 1]]

 

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值