爬取短美文付费文章
代码如下:
import requests
from lxml import etree
//获取url
url = "https://www.duanmeiwen.com/yulu/shanggan/2060493.html"
# 发起网络请求
content = requests.get(url).content.decode('GBK')
# 加载数据, 过滤
doc = etree.HTML(content).xpath("//div[@class='content']/p")
# 循环打印
for item in doc:
print(item.text)
运行结果演示: