Python Crawler Framework Scrapy in Practice - Scraping Job Listings from BOSS Zhipin

Author: jtahstu

Categories: Summary, study. Tags: python, crawler, recruitment

Article link: https://blog.csdn.net/jtahstu/article/details/78774400

Original post: http://www.jtahstu.com/blog/scrapy_zhipin_spider.html

0. Development Environment

MacBook Pro (13-inch, 2016, Two Thunderbolt 3 ports)
CPU: 2 GHz Intel Core i5
RAM: 8 GB 1867 MHz LPDDR3
Python version: v3.6.3 [GCC 4.2.1 (Apple Inc. build 5666) (dot 3)] on darwin
MongoDB version: v3.4.7
MongoDB GUI tool: MongoBooster v4.1.3

1. Preparation

Install Scrapy

pip3 install scrapy

If all goes well, pip pulls in a long list of dependency packages along with Scrapy, as it did for me.
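To confirm that the installation worked, one option is a small check like the sketch below. The helper name `scrapy_status` is hypothetical and not part of the original article; it relies only on the standard library and on Scrapy's `__version__` attribute.

```python
import importlib
import importlib.util


def scrapy_status():
    """Return the installed Scrapy version string, or None if Scrapy is not importable."""
    # find_spec locates the package without importing it.
    if importlib.util.find_spec("scrapy") is None:
        return None
    scrapy = importlib.import_module("scrapy")
    return getattr(scrapy, "__version__", "unknown")


if __name__ == "__main__":
    version = scrapy_status()
    if version is None:
        print("Scrapy is not installed")
    else:
        print(f"Scrapy {version} is installed")
```

Running `scrapy version` in the terminal should give the same information.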

For details, see the installation guide in the translated (Chinese) documentation: http://scrapy-chs.readthedocs.io/zh_CN/latest/intro/install.html

Official GitHub repository: https://github.com/scrapy/scrapy

2. Create the Project

scrapy startproject www_zhipin_com

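If all goes well, Scrapy reports that the project was created and generates a www_zhipin_com directory containing the project skeleton.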
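As a rough way to check the generated skeleton, the sketch below lists which of the usual template files are missing. The file list assumes the default project template of a reasonably recent Scrapy release, and `missing_project_files` is a hypothetical helper, not something from the original article.

```python
from pathlib import Path

# Files that `scrapy startproject www_zhipin_com` is expected to generate
# (assumption: default project template of a recent Scrapy release).
EXPECTED_FILES = [
    "scrapy.cfg",
    "www_zhipin_com/__init__.py",
    "www_zhipin_com/items.py",
    "www_zhipin_com/middlewares.py",
    "www_zhipin_com/pipelines.py",
    "www_zhipin_com/settings.py",
    "www_zhipin_com/spiders/__init__.py",
]


def missing_project_files(project_root):
    """Return the expected files that do not exist under project_root."""
    root = Path(project_root)
    return [rel for rel in EXPECTED_FILES if not (root / rel).is_file()]


if __name__ == "__main__":
    missing = missing_project_files("www_zhipin_com")
    if missing:
        print(f"Missing: {missing}")
    else:
        print("Project layout looks complete")
```

Of these files, items.py is the one edited in the next step.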

3. Define the Item to Scrape

Define a class in the items.py file:

import scrapy


class WwwZhipinComItem(scrapy.Item):
    # One scrapy.Field() per attribute of a job posting.
    pid = scrapy.Field()
    positionName = scrapy.Field()
    positionLables = scrapy.Field()
    # Any further fields are not shown in the available source.
