scrapy爬取新浪微博关键字微博

最新推荐文章于 2024-04-19 16:33:21 发布

salome_

最新推荐文章于 2024-04-19 16:33:21 发布

阅读量1.6k

点赞数 2

分类专栏： python

本文链接：https://blog.csdn.net/nio_jiy/article/details/83759239

版权

#weibo.py
# -*- coding: utf-8 -*-
from scrapy import Spider, Request, FormRequest
import re
from weibosearch.items  import WeiboItem
import  json


class WeiboSpider(Spider):
    name = "weibo"
    allowed_domains = ["weibo.cn"]
    search_url = 'https://weibo.cn/search/mblog'
    max_page=100

    cookie_raw=''#插入自己的cookie
    headers={
        'Accept': ' text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8',
        'Accept-Encoding': ' gzip, deflate, br',
        'Accept-Language': ' zh-CN,zh;q=0.9,en-US;q=0.8,en;q=0.7',
        'Cache-Control': ' max-age=0',
        'Connection': ' keep-alive',
        'Content-Type': ' application/x-www-form-urlencoded',
        'Host': ' weibo.cn',
        'Origin':

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

salome_

关注关注

2
点赞
踩
9

收藏

觉得还不错? 一键收藏
2
评论
scrapy爬取新浪微博关键字微博

#weibo.py# -*- coding: utf-8 -*-from scrapy import Spider, Request, FormRequestimport refrom weibosearch.items import WeiboItemimport jsonclass WeiboSpider(Spider): name = "weibo" a...
复制链接

扫一扫