用Jupyter—Notebook爬取网页数据实例14

最新推荐文章于 2023-06-10 19:58:25 发布

HongMeng07

最新推荐文章于 2023-06-10 19:58:25 发布

阅读量1.9k

点赞数

分类专栏：学习实例文章标签：大数据

本文链接：https://blog.csdn.net/HongMeng07/article/details/110415737

版权

学习实例专栏收录该内容

13 篇文章 15 订阅

订阅专栏

用selenium库爬取中华英才网校招信息

看来都是姚老板的

在这里插入图片描述
哦，正事差点忘了，上代码

#引入selenium、 pandas、openpyxl库
from selenium import webdriver
import pandas as pd
import openpyxl
#定义存储变量
zwgs=[]
xixl=[]
wssj=[]
#获取网页源代码
for i in range(4):
    url='http://campus.chinahr.com/qz/p'+str(i)+'/'
    browser = webdriver.Chrome()
    browser.get(url)
#解析源代码，提取所需数据信息     
    for i in browser.find_elements_by_class_name('item'):
        zwgs.append(i.find_elements_by_class_name('top-area')[0].text.replace('\n',''))
        xixl.append(i.find_elements_by_class_name('center-area')[0].find_elements_by_class_name('job-info')[0].text.replace('\n',''))
        wssj.append(i.find_elements_by_class_name('bottom-area')[0].text.replace('\n',''))
pd.DataFrame({'职位公司':zwgs,'薪资学历':xixl,'网申时间':wssj})
data=pd.DataFrame({'职位公司':zwgs,'薪资学历':xixl,'网申时间':wssj})
writer=pd.ExcelWriter('zhonghuayingcaiwang.xlsx')
data.to_excel(writer,'爬虫数据')
writer.save()

HongMeng07

关注

0
点赞
踩
4

收藏

觉得还不错? 一键收藏
1
评论
用Jupyter—Notebook爬取网页数据实例14

用selenium库爬取中华英才网校招信息看来都是姚老板的哦，正事差点忘了，上代码#引入selenium、 pandas、openpyxl库from selenium import webdriverimport pandas as pdimport openpyxl#定义存储变量zwgs=[]xixl=[]wssj=[]#获取网页源代码for i in range(4): url='http://campus.chinahr.com/qz/p'+str(i)+'/'
复制链接

扫一扫