Lianjia (BeautifulSoup)

cheng535

Category: python

Article link: https://blog.csdn.net/cheng535/article/details/81988025


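import requests
from bs4 import BeautifulSoup

# Fetch the five URLs rp1 ... rp5 under the subway-rental (ditiezufang) section.
for i in range(1, 6):
    url = 'https://bj.lianjia.com/ditiezufang/rp%s/' % i

    # A timeout keeps the loop from hanging on an unresponsive request.
    response = requests.get(url, timeout=10)

    # Optional: save the raw page for offline inspection.
    # with open('lianjia.html', 'wb') as f:
    #     f.write(response.content)

    soup = BeautifulSoup(response.text, 'lxml')

    # The listings are the <li> children of <ul id="house-lst">.
    ul_tag = soup.find('ul', id="house-lst")
    if ul_tag is None:
        # find() returns None when the element is absent; skip this URL
        # instead of failing with AttributeError on ul_tag.find_all().
        print('ul#house-lst not found: %s' % url)
        continue

    li_tags = ul_tag.find_all('li')

    for li_tag in li_tags:
        # select() returns a list; [0] takes the first match and raises
        # IndexError if a listing lacks the element.
        title = li_tag.select('div.info-panel > h2 > a')[0].text
        print(title)
        info = li_tag.select('div.where')[0].text
        print(info)
        info_lou = li_tag.select('div.con')[0].text
        print(info_lou)
        tags = li_tag.select('div.view-label')[0].text
        print(tags)
        price = li_tag.select('div.price')[0].text
        print(price)
        update_time = li_tag.select('div.price-pre')[0].text
        print(update_time)
        print('-' * 50)

    # Unfinished: reading the page number from the pagination links.
    # Selector copied from the browser for reference:
    # body > div.wrapper > div.main-box.clear > div > div.list-wrap > div > a:nth-child(5)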
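
The script above indexes each `select(...)` result with `[0]`, which raises `IndexError` as soon as one listing lacks a field. Below is a minimal sketch of a more tolerant variant that uses `select_one` and collects each listing into a dict. The names `SAMPLE_HTML`, `FIELD_SELECTORS`, `text_or_none` and `parse_listings` are hypothetical helpers introduced for illustration, and the sample markup only imitates the class names used in the script; the real page structure may differ. The sketch uses the built-in `html.parser` so it runs without `lxml`.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup that imitates the class names used in the
# script above; it is not copied from the real site.
SAMPLE_HTML = """
<ul id="house-lst">
  <li>
    <div class="info-panel">
      <h2><a href="#">Sample listing A</a></h2>
      <div class="where">Community A 2 rooms 60 sqm</div>
      <div class="price">5000 yuan/month</div>
    </div>
  </li>
  <li>
    <div class="info-panel">
      <h2><a href="#">Sample listing B</a></h2>
    </div>
  </li>
</ul>
"""

# Field name -> CSS selector, taken from the selectors in the script above.
FIELD_SELECTORS = {
    "title": "div.info-panel > h2 > a",
    "info": "div.where",
    "info_lou": "div.con",
    "tags": "div.view-label",
    "price": "div.price",
    "update_time": "div.price-pre",
}


def text_or_none(tag, selector):
    """Return the stripped text of the first match, or None if nothing matches."""
    node = tag.select_one(selector)
    return node.get_text(strip=True) if node is not None else None


def parse_listings(html, parser="html.parser"):
    """Return one dict per <li> in ul#house-lst, or [] if the <ul> is absent."""
    soup = BeautifulSoup(html, parser)
    ul_tag = soup.find("ul", id="house-lst")
    if ul_tag is None:
        return []
    return [
        {name: text_or_none(li_tag, sel) for name, sel in FIELD_SELECTORS.items()}
        for li_tag in ul_tag.find_all("li")
    ]


listings = parse_listings(SAMPLE_HTML)
for listing in listings:
    print(listing)
```

To apply this to a live page, `parse_listings(response.text, "lxml")` could take the place of the inner loop. Missing fields then appear as `None` instead of stopping the run.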
