起点中文网字体反爬注意事项

请首先阅读:
Code皮皮虾
Python爬虫进阶之起点中文网字体反扒保姆级教程!!!
https://blog.csdn.net/llllllkkkkkooooo/article/details/108430930?ops_request_misc=%25257B%252522request%25255Fid%252522%25253A%252522161119264116780255297604%252522%25252C%252522scm%252522%25253A%25252220140713.130102334.pc%25255Fall.%252522%25257D&request_id=161119264116780255297604&biz_id=0&utm_medium=distribute.pc_search_result.none-task-blog-2allfirst_rank_v2~rank_v29-1-108430930.first_rank_v2_pc_rank_v29&utm_term=Python%E7%88%AC%E8%99%AB%E8%BF%9B%E9%98%B6%E4%B9%8B%E8%B5%B7%E7%82%B9%E4%B8%AD%E6%96%87%E7%BD%91%E5%AD%97%E4%BD%93%E5%8F%8D

import requests
import re
from fontTools.ttLib import TTFont
from lxml import etree
url = "https://book.qidian.com/info/1025457786"
response = requests.get(url=url)
response.encoding = 'utf-8'
html_data = response.text
with open("d:/zhusc/反扒成功.html","w",encoding="utf-8") as f:
    f.write(html_data)
t1 = re.findall("(\S{27})</span></em><cite>万字",html_data)[0]
print(t1)
t2 = re.findall("(\S{27})</span></em><cite>万总推荐",html_data)[0]
print(t2)
t3 = re.findall("(\S{27})</span></em><cite>周推荐",html_data)[0]
print(t3)
selector = etree.HTML(html_data) 
x1='/html/body/div/div[6]/div[1]/div[2]/p[3]/em[1]/span/@class' #万字
a1=selector.xpath(x1)[0]
print(a1)
x2='/html/body/div/div[6]/div[1]/div[2]/p[3]/em[2]/span/@class' #万总推荐
a2=selector.xpath(x2)[0]
print(a2) 
x3='/html/body/div/div[6]/div[1]/div[2]/p[3]/em[3]/span/@class' #周推荐
a3=selector.xpath(x3)[0]
print(a3) 

如图示,运行结果次次不同。界面一刷新,加密的数字马上变,span class也变
在这里插入图片描述
在这里插入图片描述
在这里插入图片描述

import requests
import time
from lxml import etree
url="https://book.qidian.com/info/1025457786"
x='/html/body/div/div[6]/div[1]/div[2]/p[3]/em[1]/span/text()'
res=requests.get(url)
html=res.content
selector = etree.HTML(html)
target=str(selector.xpath(x)[0])
print(target)   #𘡦𘡨𘡦

lxml.etree.HTML(text) 解析HTML文档,
像这种:
𘡤𘡠𘡤
会变成:𘡦𘡨𘡦

评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值