Python web scraping: a basic tutorial on fetching text (novels, etc.)

Latest recommended article published 2024-07-28 15:46:11

CJ130923



Tags: python, web scraping

Article link: https://blog.csdn.net/CJ130923/article/details/82911219


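A simple program for scraping text from a web page. Together with the previous three blog posts, it covers the basics of web scraping. I hope we can learn from each other.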

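import requests
from lxml import etree


def get_url():
    # Page to scrape
    url = 'https://share.html5.qq.com/fx/u?r=rBHXbBC'
    r = requests.get(url)
    r.encoding = 'UTF-8'
    # print(r.text)
    html = etree.HTML(r.text)
    # Select the text nodes of every paragraph span in the article body
    ts = html.xpath('//div[@class="item article"]/section/article/p/span/text()')
    # print(ts)
    for t in ts:
        # Strip leading/trailing spaces, newlines, and similar whitespace
        d = t.strip()
        print(d)
        save1File(d)


def save1File(d):
    print('Saving')
    # Append one line to the output file; the target folder must already exist
    with open('F:python//test//爬虫学习//保存文字//datas.txt', 'a', encoding='utf-8') as fp:
        fp.write(d + '\n')


# get_url() already saves every line, so save1File() is not called separately
# (calling it without an argument would raise a TypeError)
get_url()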
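
The program above reopens the output file for every line and also writes fragments that become empty after strip(). As a rough sketch of one possible alternative (not part of the original program), the cleaning and saving steps could be split into two small functions. The names clean_lines and save_lines, the sample strings, and the temporary output path are all made up for illustration; in the real program the list would come from html.xpath(...).

```python
import tempfile
from pathlib import Path


def clean_lines(texts):
    """Strip surrounding whitespace and drop fragments that end up empty."""
    stripped = (t.strip() for t in texts)
    return [s for s in stripped if s]


def save_lines(lines, path):
    """Append all lines to `path` with a single open(), creating folders if needed."""
    path = Path(path)
    path.parent.mkdir(parents=True, exist_ok=True)
    with path.open('a', encoding='utf-8') as fp:
        fp.writelines(line + '\n' for line in lines)
    return len(lines)


# Stand-in for the list returned by html.xpath(...); these strings are invented.
sample = ['  第一章  ', '\n', '\u3000\u3000正文第一段。\n', '']

# Hypothetical output location inside a temporary folder, so the demo runs anywhere.
out_path = Path(tempfile.mkdtemp()) / 'demo' / 'datas.txt'

written = save_lines(clean_lines(sample), out_path)
print(written, out_path.read_text(encoding='utf-8').splitlines())
```

Keeping the file handling in one place means the file is opened once per run instead of once per line, and creating the parent folder up front avoids a FileNotFoundError when the folder does not exist yet.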
