python爬取智联招聘_【原创源码】python 爬取智联招聘

最新推荐文章于 2024-04-08 08:24:53 发布

weixin_39619433

最新推荐文章于 2024-04-08 08:24:53 发布

阅读量411

点赞数

文章标签： python爬取智联招聘

[Python] 纯文本查看复制代码from selenium import webdriver

from selenium.webdriver.chrome.options import Options

from selenium.webdriver.common.keys import Keys

from pyquery import PyQuery as pq

import time

class ZhiLian:

def __init__(self):

# 设置 chrome 无界面化模式

self.chrome_options = Options()

self.chrome_options.add_argument('--headless')

self.chrome_options.add_argument('--disable-gpu')

self.driver = webdriver.Chrome(chrome_options=self.chrome_options)

def get_url(self, search='python'):

"""

获取搜索职位的url, demo里面默认搜索python

:param search:

:return:

"""

self.driver.get("https://www.zhaopin.com/")

element = self.driver.find_element_by_class_name("zp-search__input")

element.send_keys(f"{search}")

element.send_keys(Keys.ENTER)

# 切换窗口

self.driver.switch_to.window(self.driver.window_handles[1])

# 等待js渲染完成后，在获取html

time.sleep(4)

html = self.driver.find_element_by_xpath("//*").get_attribute("outerHTML")

return html

def data_processing(self):

"""

处理数据

:return:

"""

html = self.get_url()

doc = pq(html)

contents = doc(".contentpile__content__wrapper")

for content in contents.items():

jobname = content(".contentpile__content__wrapper__item__info__box__jobname__title").text()

companyname = content(".contentpile__content__wrapper__item__info__box__cname").text()

saray = content(".contentpile__content__wrapper__item__info__box__job__saray").text()

demand = content(".contentpile__content__wrapper__item__info__box__job__demand").text()

yield jobname, companyname, saray, ",".join(demand.split("\n"))

datas = ZhiLian().data_processing()

for data in datas:

print(data)

weixin_39619433

关注

0
点赞
踩
3

收藏

觉得还不错? 一键收藏
0
评论
python爬取智联招聘_【原创源码】python 爬取智联招聘

[Python] 纯文本查看复制代码from selenium import webdriverfrom selenium.webdriver.chrome.options import Optionsfrom selenium.webdriver.common.keys import Keysfrom pyquery import PyQuery as pqimport timeclass Z...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。