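Scrapy's Request class accepts a cookies argument. To send cookies with a spider's initial requests, override the Spider's start_requests method and yield Request objects with cookies set: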
from scrapy.spiders import Spider
from scrapy.http import Request


class InfoqSpider(Spider):
    name = "techbrood"
    allowed_domains = ["techbrood.com"]
    start_urls = [
        "http://techbrood.com",
    ]

    def start_requests(self):
        # Build the initial requests by hand so each one carries the cookie.
        for url in self.start_urls:
            yield Request(url, cookies={'techbrood.com': 'true'})
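In the dict form of cookies, each key is a cookie name and each value is that cookie's value, so the spider above sends a cookie literally named techbrood.com. As a rough illustration of what that amounts to on the wire, here is a minimal standard-library sketch. It is not Scrapy's own code, and cookie_header and build_request are hypothetical helper names introduced only for this example. It builds the request without sending it.

```python
from urllib.request import Request


def cookie_header(cookies: dict) -> str:
    """Serialize a name -> value mapping into a Cookie header value."""
    return "; ".join(f"{name}={value}" for name, value in cookies.items())


def build_request(url: str, cookies: dict) -> Request:
    """Build (but do not send) a stdlib request that carries the cookies."""
    return Request(url, headers={"Cookie": cookie_header(cookies)})


# Same URL and cookie dict as the spider; nothing is sent over the network.
req = build_request("http://techbrood.com", {"techbrood.com": "true"})
print(req.get_header("Cookie"))  # prints: techbrood.com=true
```

If the site expects a differently named cookie (for example a session id), the dict key should be that cookie's name rather than the domain.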