Original: Python Learning Journey - NLP (Character Extraction)
import os
import jieba.posseg as psg
import jieba
import time
import re

novel = 'hlm'
dir = './Text/'+novel
#jieba.load_userdict(dir+'/mydict.txt')

# Merge dictionaries
def combineListToDic(list):
    if len(list) <= 1:
        return list
    temp = {}
    for l i.
2021-06-11 18:00:18 3351 1
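The excerpt above breaks off inside `combineListToDic`, so its body cannot be recovered from this listing. As a rough illustration of what a "merge dictionaries" step might look like in a character-extraction pipeline, the sketch below sums per-chapter name counts with `collections.Counter`. The function name `merge_name_counts` and the sample data are hypothetical and are not taken from the original post.

```python
from collections import Counter


def merge_name_counts(count_dicts):
    """Sum a list of {name: count} dicts into one dict (hypothetical helper)."""
    total = Counter()
    for counts in count_dicts:
        # Counter.update adds the incoming counts to the running totals
        total.update(counts)
    return dict(total)


# Hypothetical per-chapter counts, made up for illustration only
chapters = [{"Baoyu": 3, "Daiyu": 2}, {"Baoyu": 1, "Baochai": 4}]
merged = merge_name_counts(chapters)
```

Using `Counter` avoids the manual "is the key already present" check that a plain dict would need when the same name shows up in several chapters.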
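Original: Python Learning Journey - Web Scraping (Four Great Classical Novels)
Continuing the study today: scraping the Four Great Classical Novels from the static site http://www.purepen.com/index.html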
2021-06-10 20:22:22 1456
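Scraping a static site usually starts by collecting the chapter links from an index page. The sketch below shows that one step using only the standard library, and it parses an inline HTML string rather than fetching the live page. The sample markup, the relative paths in it, and the class name `LinkCollector` are assumptions for illustration and do not reflect the actual structure of purepen.com or the code in the original post.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect absolute URLs from every <a href="..."> tag (hypothetical helper)."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the index page URL
                self.links.append(urljoin(self.base_url, href))


# Made-up markup standing in for an index page; the paths are hypothetical
SAMPLE_HTML = (
    '<html><body>'
    '<a href="hlm/001.htm">Chapter 1</a>'
    '<a href="hlm/002.htm">Chapter 2</a>'
    '<a name="top">anchor without href</a>'
    '</body></html>'
)

collector = LinkCollector("http://www.purepen.com/index.html")
collector.feed(SAMPLE_HTML)
links = collector.links
```

In a real run the HTML would come from an HTTP request, and each collected link would then be fetched and saved in turn.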
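Original: Python Learning Journey - Web Scraping (Baidu & Weibo Hot Searches)
Two programs, lightly modified from earlier work by others.
Baidu Hot Search Top 50
# Baidu Hot Search TOP 50
import requests
from lxml import etree
head = {}
url = "http://top.baidu.com/buzz?b=341&fr=topindex"
head["User-Agent"] = "Mozilla/5.0 (Windows NT 10.0; WOW64; rv:63.0) Gecko/20100101 Firefox/63.0"
head["Accept"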
2021-06-09 19:39:50 274 2