Python: a simple demo for scraping a novel and saving it to a txt file

weixin_39783771

Published 2020-11-23 16:11:58


Article tags: scraping a novel with Python and writing it to txt

import os
import re

import requests
from lxml import etree

# Request pages with a browser-like User-Agent
headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/78.0.3904.108 Safari/537.36'
}


# Get the chapter titles from the table of contents, together with their links
def get_info(url):
    response = requests.get(url, headers=headers)
    response.encoding = 'utf-8'
    get_info_list = []
    html = etree.HTML(response.text)
    # Each <dd> under the element with id="list" holds one chapter link
    dd_list = html.xpath('//*[@id="list"]/dl/dd')
    for dd in dd_list:
        title = dd.xpath('a/text()')[0]
        # Assumes every href is relative to the table-of-contents URL
        href = url + dd.xpath('a/@href')[0]
        chapter = {'title': title, 'href': href}
        get_info_list.append(chapter)
    return get_info_list
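
The `url + href` concatenation in `get_info` only produces a valid link when the site uses hrefs relative to the table-of-contents page. A minimal sketch of a more tolerant alternative, using `urljoin` from the standard library, is shown here. The helper name `resolve_chapter_url` and the `example.com` URLs are hypothetical and are not part of the original demo.

```python
from urllib.parse import urljoin


def resolve_chapter_url(index_url, href):
    """Resolve a chapter href against the table-of-contents URL.

    Handles both relative hrefs ('456.html') and root-relative
    hrefs ('/book/123/456.html').
    """
    return urljoin(index_url, href)


# Hypothetical table-of-contents URL, for illustration only
index_url = 'https://example.com/book/123/'

print(resolve_chapter_url(index_url, '456.html'))
print(resolve_chapter_url(index_url, '/book/123/456.html'))
# Both lines print: https://example.com/book/123/456.html
```

With plain concatenation, the second href would instead yield `https://example.com/book/123//book/123/456.html`, so this substitution may be worth considering if the target site uses root-relative links.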

# Store everything in a single file
def get_demo(get_info, txt):
    for chapter_info in get_info:
        response = requests.get
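
The listing stops at `requests.get`, so the rest of `get_demo` is not available. As a rough sketch of what "store everything in a single file" could look like, and not the author's implementation, the function below appends each chapter's title and text to one UTF-8 file. The names `save_chapters` and `fetch_text` are assumptions: `fetch_text` stands for any callable that downloads a chapter page and returns its plain text (for example, one built on `requests.get` and an XPath that matches the target site). A stub is used here so the example runs without network access.

```python
import os
import tempfile


def save_chapters(chapters, txt_path, fetch_text):
    """Append every chapter's title and body to a single text file.

    chapters   -- list of {'title': ..., 'href': ...} dicts, as built by get_info
    txt_path   -- path of the output .txt file
    fetch_text -- hypothetical callable: chapter URL -> chapter text
    """
    with open(txt_path, 'a', encoding='utf-8') as f:
        for chapter in chapters:
            body = fetch_text(chapter['href'])
            f.write(chapter['title'] + '\n\n')
            f.write(body.strip() + '\n\n')


def fake_fetch(url):
    # Stub standing in for a real download; returns placeholder text
    return '  text of ' + url + '  '


# Usage with made-up chapter data and a temporary output file
chapters = [
    {'title': 'Chapter 1', 'href': 'https://example.com/book/123/1.html'},
    {'title': 'Chapter 2', 'href': 'https://example.com/book/123/2.html'},
]
out_path = os.path.join(tempfile.mkdtemp(), 'novel.txt')
save_chapters(chapters, out_path, fake_fetch)

with open(out_path, encoding='utf-8') as f:
    print(f.read())
```

Passing the downloader in as a parameter keeps the file-writing logic independent of any particular site layout. A real `fetch_text` would likely also want a timeout and a short delay between requests.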
