使用scrapy爬取豆瓣电影排行top250的电影，并存入mongoDB

最新推荐文章于 2021-03-16 17:59:54 发布

好好生活ying

最新推荐文章于 2021-03-16 17:59:54 发布

阅读量1.5k

点赞数 1

分类专栏：网络爬虫 scrapy框架一起学python 文章标签： scrapy 爬虫存储到数据库爬虫爬虫框架

本文链接：https://blog.csdn.net/qq_42281826/article/details/81040165

版权

一.scrapy startproject 项目名；并进入项目目录；建立爬虫：scrapy genspider 爬虫名爬取域名二.在pycharm中进行编程1.item文件的编写：需要获取标题，电影演职员信息，评分，简介import scrapyclass MongotestItem(scrapy.Item): # define the fields for your item ...

摘要由CSDN通过智能技术生成

一.scrapy startproject 项目名；并进入项目目录；建立爬虫：scrapy genspider 爬虫名爬取域名

二.在pycharm中进行编程

1.item文件的编写：需要获取标题，电影演职员信息，评分，简介

import scrapy


class MongotestItem(scrapy.Item):
    # define the fields for your item here like:
    # name = scrapy.Field()
    title=scrapy.Field()
    info=scrapy.Field()
    content=scrapy.Field()
    scores=scrapy.Field()

2.编写爬虫文件

import scrapy
from mongotest.items import MongotestItem

class Test1Spider(scrapy.Spider):
    name = 'test1'
    allowed_domains = ['movie.douban.com']
    off_set=0
    url=

最低0.47元/天解锁文章

好好生活ying

关注

1
点赞
踩
9

收藏

觉得还不错? 一键收藏
0
评论
使用scrapy爬取豆瓣电影排行top250的电影，并存入mongoDB

一.scrapy startproject 项目名；并进入项目目录；建立爬虫：scrapy genspider 爬虫名爬取域名二.在pycharm中进行编程1.item文件的编写：需要获取标题，电影演职员信息，评分，简介import scrapyclass MongotestItem(scrapy.Item): # define the fields for your item ...
复制链接

扫一扫