【机器学习】读取txt文本内容计算TF-IDF值，算法，python

最新推荐文章于 2023-09-27 21:34:52 发布

HelenLee01

最新推荐文章于 2023-09-27 21:34:52 发布

阅读量1.6k

点赞数 2

分类专栏：机器学习文章标签： TF-IDF python 机器学习 sklearn 算法

本文链接：https://blog.csdn.net/weixin_43289135/article/details/104649809

版权

Sklearn库的学习之TF-IDF算法：

# coding:utf-8
import jieba
import jieba.posseg as pseg
import os
import sys
from sklearn import feature_extraction
from sklearn.feature_extraction.text import TfidfTransformer
from sklearn.feature_extraction.text import CountVectorizer
one = open(r'one.txt',encoding = "utf-8")
onee = list(one)
two = open(r'two.txt',encoding = "utf-8")
twoo = list(two)
three = open(r'three.txt',encoding = "utf-8")
threee = list(three)
four = open(r'four.txt',encoding = "utf-8")
fourr = list(four)
five = open(r'five.txt',encoding = "utf-8")
fivee = list(five)
six = open(r'six.txt',encoding = "utf-8")
sixx = list(six)
one.close()
two.close()
three.close()
if __name__ == "__main__":
    corpus= onee + twoo + threee + fourr + fivee

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

HelenLee01

关注关注

2
点赞
踩
9

收藏

觉得还不错? 一键收藏
0
评论
【机器学习】读取txt文本内容计算TF-IDF值，算法，python

Sklearn库的学习之TF-IDF算法：# coding:utf-8import jiebaimport jieba.posseg as psegimport osimport sysfrom sklearn import feature_extractionfrom sklearn.feature_extraction.text import TfidfTransformer...
复制链接

扫一扫