Original: Python Learning Path - NLP (Character Extraction)
import os
import jieba.posseg as psg
import jieba
import time
import re

novel = 'hlm'
dir = './Text/'+novel
#jieba.load_userdict(dir+'/mydict.txt')

# merge dictionaries
def combineListToDic(list):
    if len(list) <= 1:
        return list
    temp = {}
    for l i.
2021-06-11 18:00:18
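The excerpt above is cut off before the extraction logic, so the following is only a minimal sketch of one common way to pull character names out of part-of-speech-tagged text, not the post's actual method. It assumes the input is a sequence of (word, flag) pairs and that person names carry the tag `nr`, which is the tag jieba's tagger uses for them. The function name `count_person_names` and the `min_len` parameter are hypothetical, introduced here for illustration.

```python
from collections import Counter


def count_person_names(tagged_words, min_len=2):
    """Count words tagged 'nr' (person name) in an iterable of (word, flag) pairs.

    min_len is an illustrative filter that drops very short tokens,
    which are often mis-tagged as names.
    """
    counts = Counter()
    for word, flag in tagged_words:
        if flag == "nr" and len(word) >= min_len:
            counts[word] += 1
    return counts


# Hand-written pairs standing in for real tagger output (assumed sample data)
sample = [("宝玉", "nr"), ("笑", "v"), ("黛玉", "nr"), ("宝玉", "nr"), ("说", "v")]
top_names = count_person_names(sample).most_common(2)
print(top_names)  # [('宝玉', 2), ('黛玉', 1)]
```

With jieba installed, the same function should accept the tagger output directly, for example `count_person_names(psg.cut(text))`, since each item yielded by `jieba.posseg.cut` unpacks into a word and its flag.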
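Original: Python Learning Path - Web Scraping (Four Great Classical Novels)
Continuing my studies today: scraping the Four Great Classical Novels from the static site http://www.purepen.com/index.html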
2021-06-10 20:22:22
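Scraping a static site usually starts by collecting the chapter links from an index page. The sketch below shows that step using only the standard library; it is not the post's code, and it parses an inline HTML string rather than fetching the page. The names `LinkCollector` and `extract_links`, and the `hlm/001.htm` style paths in the sample, are assumptions made for illustration and may not match the real site layout.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collect absolute URLs from every <a href="..."> tag fed to the parser."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page they came from
                self.links.append(urljoin(self.base_url, href))


def extract_links(html, base_url):
    """Return all link targets found in html, resolved against base_url."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    return parser.links


# Hypothetical index markup; the real page structure is not shown in the source
sample_html = '<a href="hlm/001.htm">Chapter 1</a><a href="hlm/002.htm">Chapter 2</a>'
links = extract_links(sample_html, "http://www.purepen.com/index.html")
print(links)
```

In a real run, the HTML string would come from an HTTP response body, decoded with whatever encoding the site declares.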
Original: Python Learning Path - Web Scraping (Baidu & Weibo Hot Search)
Two programs, lightly modified from earlier work by others.
Baidu Hot Search Top 50
#Baidu Hot Search TOP50
import requests
from lxml import etree

head = {}
url = "http://top.baidu.com/buzz?b=341&fr=topindex"
head["User-Agent"] = "Mozilla/5.0 (Windows NT 10.0; WOW64; rv:63.0) Gecko/20100101 Firefox/63.0"
head["Accept"
2021-06-09 19:39:50