scrapy从命令行传值

最新推荐文章于 2022-10-17 09:17:57 发布

bangfeng7363

最新推荐文章于 2022-10-17 09:17:57 发布

阅读量206

点赞数

文章标签： python 爬虫

原文链接：http://www.cnblogs.com/ptwg/p/11538301.html

版权

1.新建文件run.py

from scrapy.cmdline import execute


# tmall：爬虫的名字
# pro=男装为需要传入的参数值
execute(['scrapy', 'crawl', 'tmall', '-a', 'pro=男装', '--nolog'])

# 下面是无参数用法
# execute(['scrapy', 'crawl', 'tmall', '--nolog'])

2.爬虫.py中重写init方法，传入参数；（字典编码以字符串形式拼接到url后边）

# 倒入头文件 （字典编码后以参数形式拼接到url）
from urllib.parse import urlencode


class TmallSpider(scrapy.Spider):
    name = 'tmall'
    allowed_domains = ['tmall.com']

    def __init__(self, pro=None, *args, **kwargs):
        super(TmallSpider, self).__init__(*args, **kwargs)
        self.params = {
            'q': pro,
            'total_Page': 1,
            'jumpto': 1,
        }

        self.start_url = 'https://list.tmall.com/search_product.htm?' + urlencode(self.params)

    def start_requests(self):
        print('self.start_url:' + self.start_url)
        # yield scrapy.Request(
        #     url=self.start_url,
        #     callback=self.get_total_page,
        #     dont_filter=True,
        # )

    def get_total_page(self, response):
        pass

转载于:https://www.cnblogs.com/ptwg/p/11538301.html

优惠劵

bangfeng7363

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
scrapy从命令行传值

1.新建文件run.pyfrom scrapy.cmdline import execute# tmall：爬虫的名字# pro=男装为需要传入的参数值execute(['scrapy', 'crawl', 'tmall', '-a', 'pro=男装', '--nolog'])# 下面是无参数用法# execute(['scrapy', 'cra...
复制链接

扫一扫