爬取小说斗破苍穹

最新推荐文章于 2021-11-04 22:04:46 发布

@派大星@

最新推荐文章于 2021-11-04 22:04:46 发布

阅读量181

点赞数

分类专栏：爬虫文章标签： python 正则表达式

本文链接：https://blog.csdn.net/weixin_45919561/article/details/104413277

版权

爬虫专栏收录该内容

13 篇文章 1 订阅

订阅专栏

from urllib.request import urlopen
from urllib.request import Request
import re
headers = {
    'User-Agent':
        'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/74.0.3729.157 Safari/537.36'
}
url="http://www.doupoxs.com/doupocangqiong/1.html"
resp=Request(url,headers=headers)
response = urlopen(resp)
#使用正则表达式匹配信息
#re模块的findall(pattern,string[,flag])方法：在字符串 string 中查找正则表达式模式 pattern 的所有(非重复)出现；返回一个匹配对象的列表
res=re.findall('<p>(.*?)</p>',response.read().decode('utf-8'))
with open('E:/python/myPython/doupochangqiong.txt','a+') as f:
    f.write(str(res))
print(res)

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

@派大星@

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
爬取小说斗破苍穹

from urllib.request import urlopenfrom urllib.request import Requestimport reheaders = { 'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) ...
复制链接

扫一扫