Python3爬虫小说章节内容

最新推荐文章于 2022-09-20 19:52:14 发布

Cep�Murphy laws

最新推荐文章于 2022-09-20 19:52:14 发布

阅读量398

点赞数

分类专栏：爬虫人工智能文章标签： python

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/weixin_44600471/article/details/100791052

版权

人工智能同时被 2 个专栏收录

9 篇文章 0 订阅

订阅专栏

8 篇文章 0 订阅

订阅专栏

import requests
from bs4 import BeautifulSoup
import txtread

url = ‘https://www.biqukan.com/0_790/’

responce = requests.get(url)
responce.encoding = ‘gbk’
html = responce.text
soup = BeautifulSoup(html, ‘lxml’)
#print(soup)

#获取标题名字
title = soup.find_all(‘dt’)
title_text = title[1].string[:-3]
print(title_text)

#获取章节链接和章节名在 'a’标签的列表里面分析
CharAll = soup.find_all(‘div’,class_=‘listmain’)
#print(type(CharAll))
#print(CharAll[0])
HrefList = BeautifulSoup(str(CharAll[0]), ‘lxml’)
Server = ‘https://www.biqukan.com’
for each in HrefList.find_all(‘a’)[12:]:
href = Server + each.get(‘href’)
CharName = each.string
txtChar = txtread.CharTxt(href)
print(CharName, href)
print(txtChar)

Cep�Murphy laws

关注

0
点赞
踩
2

收藏

觉得还不错? 一键收藏
0
评论
Python3爬虫小说章节内容

import requestsfrom bs4 import BeautifulSoupimport txtreadurl = ‘https://www.biqukan.com/0_790/’responce = requests.get(url)responce.encoding = ‘gbk’html = responce.textsoup = BeautifulSoup(htm...
复制链接

扫一扫

专栏目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。