![](https://img-blog.csdnimg.cn/20201014180756918.png?x-oss-process=image/resize,m_fixed,h_64,w_64)
python
薛定喵的颚
精通点亮流水灯
展开
-
爬虫爬取b站弹幕遇到乱码问题
今天在b站爬取弹幕的时候发现爬取的弹幕是乱码。最后发现是编码问题。综合整理如下:#首先准备request库和lxml库import requestsfrom lxml import etree#b站网址url="https://api.bilibili.com/x/v2/dm/history?type=1&oid=129023838&date=2021-01-09"#设置请求头防止反扒headers={ "User-Agent": "Mozilla/5.0 (Windows原创 2021-01-12 20:51:41 · 3049 阅读 · 2 评论 -
python爬虫爬取豆瓣一周榜单
#首先准备request库和lxml库import requestsfrom lxml import etree#豆瓣网址url="https://movie.douban.com/chart"#设置请求头防止反扒headers={ "User-Agent": "Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/86.0.4240.75 Safari/537.36"原创 2021-01-12 15:21:14 · 232 阅读 · 0 评论