探索文本数据

weixin_45271076

于 2021-05-14 16:21:02 发布

阅读量7k

点赞数

本文链接：https://blog.csdn.net/weixin_45271076/article/details/116792890

版权

#探索文本数据
from sklearn.datasets import fetch_20newsgroups
data=fetch_20newsgroups()#类字典的方式

#不同类型的新闻，标签的分类
data.target_names

在这里插入图片描述

import numpy as np
import pandas as pd
categories=["sci.space"
           ,"rec.sport.hockey"
           ,"talk.politics.guns"
           ,"talk.politics.mideast"]
train=fetch_20newsgroups(subset="train",categories=categories)
test=fetch_20newsgroups(subset="test",categories=categories)