步骤:
1.在智联招聘网站选择好职位关键词和作用地址。
2.运行代码。
其中注意点
1.用driver爬取首页时,会弹出如下图窗口。此时可以在代码中设置睡眠2秒,自己手动取消窗口。
2.在这一次爬取中,发现网页的翻页按钮不能单纯用buttonTag.click()
点击,于是改为self.driver.execute_script("arguments[0].click()",nextBtn)
代码如下
# encoding: utf-8
from lxml import etree
from selenium import webdriver
import time
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.by import By
import csv
class ZhiLian(object):
driver_path = r"C:\...\chromedriver.exe"
def __init__(self):
self.driver = webdriver.Chrome(executable_path=ZhiLian.driver_path)
self.url = "https://sou.zhaopin.com/?jl=768&sf=0&st=0&kw=%E7%A0%94%E5%8F%91%E5%B7%A5%E7%A8%8B%E5%B8%88&kt=3"
self.fp = open('zhilian_yanfa.csv','a',encoding='utf-8')
self.writer = csv.DictWriter(self.fp,['title',