Python爬虫Headers处理脚本-字符串转字典

最近学python爬虫,复制chrom浏览器里的header到py里还要自己加引号,逗号,难受。

搜索查到的是 ”转换请求头参数的python脚本,自动添加引号“
链接
https://blog.csdn.net/weixin_43795201/article/details/84986421

心想一步到位。直接转字典好了,于是用正则拼了个 ,已经满足需求。还没大量测试,不知道会不出bug,有需要的可以试试。

import re
formstr = '''
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8
Accept-Encoding: gzip, deflate, br
Accept-Language: zh-CN,zh;q=0.9
Cache-Control: max-age=0
Connection: keep-alive
Cookie: __jsluid=e0f50b42c0133479609728104ecfa0bdb; _ga=GA1.2.733796313.1544613362; _gid=GA1.2.1036781813.15446313362; Hm_lvt_c4e8e5b919a5c12647962ea08462e63b=1544613362; gr_user_id=29e5f963-4dbb-411a-9233-fb3caf3c7692; is_read_index_notice_=1; __guid=XynHr45c10edf29685b6.52148503; FISKCDDCC=b92099aa55b9f8ab4f431843e799e41d; _nyjy_newadv_=1; SHADOWMAN=%7B%22key%22%3A%2250442c70686cf47a4a6e3196e074f17df%22%2C%22val%22%3A%22e058341874ca0adecba52db27besf94f0%22%2C%22tm%22%3A1544614351%7D; channel_id=16; location_city_id=5; NY_VALIDATE_KEY=c20a31c6de4eaf9ad8b1a0d9670a525a; gr_session_id_88c697c1877e5045=ea12172c-4b96-4fdc-8398-3e8830a50124; gr_session_id_88c697c1877e5045_ea12172c-4b96-4fdc-8398-3e8830a50124=true; ip_city=fuyang; Hm_lpvt_c4e8e5b919a5c12647962ea08462e63b=1544627691
Host: www.91160.com
Upgrade-Insecure-Requests: 1
User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_12_1) AppleWebKit/5327.36 (KHTML, like Gecko) Chrome/70.0.3538.110 Safari/537.36
'''

def str2dict(formstr):
    form = {}
    #正则获取k,v列表
    r = re.findall(r'(.*?):(.*?)\n', formstr)
    for i in r:
        #移除k,v前后空格
        form[i[0].strip()] = i[1].strip()
    return form

header=str2dict(formstr)
print(header)

输出

{
    'Accept': 'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8', 
    'Accept-Encoding': 'gzip, deflate, br',
    'Accept-Language': 'zh-CN,zh;q=0.9',
    'Cache-Control': 'max-age=0',
    'Connection': 'keep-alive',
    'Cookie': '__jsluid=e0f50b42c0133479609728104ecfa0bdb; _ga=GA1.2.733796313.1544613362; _gid=GA1.2.1036781813.15446313362; Hm_lvt_c4e8e5b919a5c12647962ea08462e63b=1544613362; gr_user_id=29e5f963-4dbb-411a-9233-fb3caf3c7692; is_read_index_notice_=1; __guid=XynHr45c10edf29685b6.52148503; FISKCDDCC=b92099aa55b9f8ab4f431843e799e41d; _nyjy_newadv_=1; SHADOWMAN=%7B%22key%22%3A%2250442c70686cf47a4a6e3196e074f17df%22%2C%22val%22%3A%22e058341874ca0adecba52db27besf94f0%22%2C%22tm%22%3A1544614351%7D; channel_id=16; location_city_id=5; NY_VALIDATE_KEY=c20a31c6de4eaf9ad8b1a0d9670a525a; gr_session_id_88c697c1877e5045=ea12172c-4b96-4fdc-8398-3e8830a50124; gr_session_id_88c697c1877e5045_ea12172c-4b96-4fdc-8398-3e8830a50124=true; ip_city=fuyang; Hm_lpvt_c4e8e5b919a5c12647962ea08462e63b=1544627691',
    'Host': 'www.91160.com',
    'Upgrade-Insecure-Requests': '1',
    'User-Agent': 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_12_1) AppleWebKit/5327.36 (KHTML, like Gecko) Chrome/70.0.3538.110 Safari/537.36'
}
  • 1
    点赞
  • 2
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值