python
the_power
A Simple Crawl of the Open Access Library Website
Goal
Target site: Open Access Library
URL: https://www.oalib.com/
Content to crawl: https://www.oalib.com/journal/3174/1 (岩石力学与工程学报, Chinese Journal of Rock Mechanics and Engineering)
Code:
import requests
import time
from scrapy import Selector
class OalibSpider:
    """
    1. Build the paginated URL https://www.oalib.com/journal/3174/1 ...
Original · 2021-01-17 13:17:18 · 1830 views · 1 comment
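The excerpt breaks off at the first step, building the paginated URL. A minimal sketch of that step follows, assuming the last path segment of https://www.oalib.com/journal/3174/1 is the page number (the excerpt suggests this but does not confirm it). The names `build_page_urls` and `crawl` and the `delay_seconds` parameter are hypothetical and do not come from the original post; the download step is passed in as a function so the sketch does not depend on how the author fetched pages.

```python
import time
from typing import Callable, Iterable, Iterator, List, Tuple

# Assumption: the trailing path segment is the page number.
BASE_URL = "https://www.oalib.com/journal/{journal_id}/{page}"


def build_page_urls(journal_id: int, first_page: int, last_page: int) -> List[str]:
    """Return one URL per page, from first_page to last_page inclusive."""
    return [
        BASE_URL.format(journal_id=journal_id, page=page)
        for page in range(first_page, last_page + 1)
    ]


def crawl(
    urls: Iterable[str],
    fetch: Callable[[str], str],
    delay_seconds: float = 1.0,
) -> Iterator[Tuple[str, str]]:
    """Yield (url, body) pairs, pausing between requests to stay polite."""
    for index, url in enumerate(urls):
        if index > 0:
            time.sleep(delay_seconds)  # no pause before the first request
        yield url, fetch(url)


urls = build_page_urls(3174, 1, 3)
```

With `requests` installed, something like `crawl(urls, lambda url: requests.get(url, timeout=10).text)` would supply the download step, and each returned body could then be handed to `Selector(text=body)` for extraction.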
The BeautifulSoup Library (Content-Based Search)
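The BeautifulSoup library (content-based search section)
# Import the library
from bs4 import BeautifulSoup
Open a file and initialize soup (practice setup)
with open('test.html','r',encoding='utf-8') as test:
    soup = BeautifulSoup(test)
Note: if reading the file raises an error, UnicodeDecodeErr...
Original · 2019-01-25 18:19:07 · 3748 views · 0 comments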
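The excerpt stops before it shows any content-based lookup, so here is a small sketch of what searching by text might look like. The contents of `test.html` are not shown in the post, so the inline `HTML` sample and all variable names below are invented for illustration. The parser is named explicitly as `"html.parser"`, which avoids the warning bs4 emits when no parser is given.

```python
import re

from bs4 import BeautifulSoup

# Invented sample markup; the original post reads from test.html instead.
HTML = """
<html><body>
<h1>Rock Mechanics</h1>
<p class="intro">Open access journals</p>
<p>Closed access journals</p>
<a href="/journal/1">Journal list</a>
</body></html>
"""

soup = BeautifulSoup(HTML, "html.parser")

# Exact match on the text of a node: returns the matching strings, not tags.
exact = soup.find_all(string="Rock Mechanics")

# Regex match: any text node containing "access".
pattern_matches = soup.find_all(string=re.compile("access"))

# Each matched string knows its enclosing tag through .parent.
parent_tags = [text.parent.name for text in pattern_matches]

# Combining a tag name with string= returns the tag itself.
link = soup.find("a", string="Journal list")
```

Searching with `string=` alone yields text nodes, so `.parent` is the usual way back to the element; adding a tag name to the same call returns the element directly.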