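Continuing my web-scraping experiments: this time the target site is https://www.yuanjisong.com/job/shanghai
The page is purely static: the job listings are already present in the HTML returned by a plain GET request, so the requests module is all that is needed.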
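As a rough sketch of that approach, the snippet below downloads the page with requests and pulls out the job titles. It assumes the titles sit in `<div class="topic_title">` elements, as in the page source quoted in this post. The names `fetch_page`, `TitleParser` and `extract_titles`, the timeout, and the User-Agent header are my own illustrative choices, not anything the site requires.

```python
from html.parser import HTMLParser

URL = "https://www.yuanjisong.com/job/shanghai"


def fetch_page(url, timeout=10.0):
    """Download a page with a plain GET request and return its HTML text."""
    # Imported here so the parser below still works without requests installed.
    import requests

    # The User-Agent value is an assumption; some sites reject the default one.
    response = requests.get(
        url, headers={"User-Agent": "Mozilla/5.0"}, timeout=timeout
    )
    response.raise_for_status()  # fail loudly on 4xx/5xx responses
    return response.text


class TitleParser(HTMLParser):
    """Collect the text of every <div class="topic_title"> element."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        classes = (dict(attrs).get("class") or "").split()
        if tag == "div" and "topic_title" in classes:
            self._in_title = True
            self.titles.append("")

    def handle_data(self, data):
        if self._in_title:
            self.titles[-1] += data

    def handle_endtag(self, tag):
        # Assumes the title div has no nested divs, as in the quoted markup.
        if tag == "div" and self._in_title:
            self._in_title = False


def extract_titles(html):
    """Return the stripped text of all topic_title divs found in html."""
    parser = TitleParser()
    parser.feed(html)
    return [title.strip() for title in parser.titles]
```

Calling `extract_titles(fetch_page(URL))` should then give a list of job titles, provided the site still serves the same markup. I used the standard-library `html.parser` here only to keep the sketch dependency-light; a library such as BeautifulSoup would work just as well.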
The HTML for each job listing looks like this:
<div class="weui_panel weui_panel_access weui_panel_access_adapt db_adapt margin-top-2 ">
<a href="https://www.yuanjisong.com/job/104128" target="_blank">
<div class="weui_panel_hd weui_panel_hd_adapt media_desc_adapt_url">
<div class="topic_title">系统二次开发</div></div></a>
<div class="job_list_item_div">
<div class="weui_panel_bd ">
<div class="weui_media_box weui_media_text media_box_adapt">
<a href="https://www.yuanjisong.com/job/104128" class="media_desc_content_adapt" target="_blank">
<p class="media_desc_adapt ">
<span class="glyphicon glyphicon-th-large" aria-hidden="true"></span>
<span class="job_list_item_title ">描述:</span>在系统基础上增加新模块。具体需求加附件QQ我发给你。要求1. 3年以上**********MVC 开发经验;; 2. 至少掌握一种SQL关系型数据库(mysql或sqlserver); 3. 熟练掌握EasyUI、HTML、CSS、JavaScript、jQuery、AJAX、JSON等Web前端技术; 4. 使用Redis、MongoDB参与过实际项目的优先考虑<!-- <span class="more_text">详情...</span> --></p></a></div></div>
<div class="weui_panel_bd" >
<a href="https://www.yuanjisong.com/employer/134659" class="weui_media_box weui_media_