xpath爬虫二手房案例代码

最新推荐文章于 2024-01-24 22:45:20 发布

fsafgfhujff

最新推荐文章于 2024-01-24 22:45:20 发布

阅读量1.7k

点赞数

文章标签： python

本文链接：https://blog.csdn.net/y1366210615/article/details/123916332

版权

import requests
from lxml import etree

if name == ‘main’:
# ua 伪装 =》模拟浏览器上网
headers = {
“User-Agent”: ‘Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/98.0.4758.102 Safari/537.36’
}

url = "https://dl.58.com/ershoufang"

#1.通用爬虫
page_info = requests.get(url,headers=headers)

#2.数据解析
root = etree.HTML(page_info.text)

#3.标签定位
div_list = root.xpath('//section[@class="list"]/div')

fp = open("D:\sxwang\project\pycharm\python-sk\data\二手房.txt","w",encoding="jbk")
for div in div_list:
    #标签定位
    title = div.xpath('./a/div[@class="property-content"]/div[@class="property-content-detail"]'
                      '/div[@class="property-content-title"]/h3/text()')[0]
    fp.write(title+"\n")
    print(title,"=>爬虫ok")

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

fsafgfhujff

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
xpath爬虫二手房案例代码

import requestsfrom lxml import etreeif name == ‘main’:# ua 伪装 =》模拟浏览器上网headers = {“User-Agent”: ‘Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/98.0.4758.102 Safari/537.36’}url = "https://dl.58.com/ershoufa
复制链接

扫一扫