淘宝模拟请求的坑

3 篇文章 0 订阅
3 篇文章 0 订阅

前言

最近在搞淘宝模拟请求,上次把请求的加密参数搞定,所以接着把整个流程搞完.

sign坑

  1. sign中签名参数c.data千万不能用json序列化.因为签名的顺序是固定的,用json将字典序列化后是顺序是随机的.

page坑

  1. page=1请求在url中是字符串,page=2请求在url中是数字.真是巨坑,调试了好久.

cookie坑

  1. 没有登录时,只能获取page=1的商品数据.无法获取其他分页的数据,如果请求会返回一个longing的url.
  2. 登录后,在cookie中有token,这个token有一个小时的有效期.

最后附上代码

# -*- coding: utf-8 -*-
# @Time    : 2020-03-30 16:11
# @Author  : ken
'''
1.sign参数顺序一致
2.需要有登录操作
'''
import requests
import json
import time
import execjs


class TianMao:
    '''天猫指定店铺爬取'''

    # t = str(int(time.time() * 1000))
    lists = []

    def __init__(self, token):
        self.token = token

    def get_goods_data(self, page):
        '''
        获取分页商品信息
        :param page: 页数
        :return:
        '''
        t = str(int(time.time() * 1000))
        url = None
        '''page=1 url中是str,page!=1 url中是int'''
        if page == 1:
            url = f'https://h5api.m.taobao.com/h5/mtop.taobao.wsearch.appsearch/1.0/?jsv=2.5.1' \
                f'&appKey=12574478' \
                f'&t={t}' \
                f'&sign={self.sign(self.token, page, t)}' \
                '&api=mtop.taobao.wsearch.appSearch' \
                '&v=1.0' \
                '&H5Request=true' \
                '&AntiCreep=true' \
                '&type=jsonp' \
                '&timeout=3000' \
                '&dataType=jsonp' \
                '&callback=mtopjsonp9' \
                f'&data=%7B%22m%22%3A%22shopitemsearch%22%2C%22vm%22%3A%22nw%22%2C%22sversion%22%3A%224.6%22%2C%22shopId%22%3A%22575905230%22%2C%22sellerId%22%3A%222200723654612%22%2C%22style%22%3A%22wf%22%2C%22page%22%3A%22{page}%22%2C%22sort%22%3A%22_coefp%22%2C%22catmap%22%3A%22%22%2C%22wirelessShopCategoryList%22%3A%22%22%7D'
        if page != 1:
            url = f'https://h5api.m.taobao.com/h5/mtop.taobao.wsearch.appsearch/1.0/?jsv=2.5.1' \
                f'&appKey=12574478' \
                f'&t={t}' \
                f'&sign={self.sign(self.token, page, t)}' \
                '&api=mtop.taobao.wsearch.appSearch' \
                '&v=1.0' \
                '&H5Request=true' \
                '&AntiCreep=true' \
                '&type=jsonp' \
                '&timeout=3000' \
                '&dataType=jsonp' \
                '&callback=mtopjsonp9' \
                f'&data=%7B%22m%22%3A%22shopitemsearch%22%2C%22vm%22%3A%22nw%22%2C%22sversion%22%3A%224.6%22%2C%22shopId%22%3A%22575905230%22%2C%22sellerId%22%3A%222200723654612%22%2C%22style%22%3A%22wf%22%2C%22page%22%3A{page}%2C%22sort%22%3A%22_coefp%22%2C%22catmap%22%3A%22%22%2C%22wirelessShopCategoryList%22%3A%22%22%7D'

        headers = {
            'authority': 'h5api.m.taobao.com',
            'method': 'GET',
            'path': url.split('h5api.m.taobao.com')[1],
            'scheme': 'https',
            'accept': '*/*',
            'accept-encoding': 'gzip, deflate, br',
            'accept-language': 'zh-CN,zh;q=0.9,en;q=0.8',
            'cache-control': 'no-cache',
            'cookie': 'miid=24666692082798684; thw=cn; tracknick=%5Cu4F17%5Cu751F%5Cu666E%5Cu6E21; _cc_=V32FPkk%2Fhw%3D%3D; tg=0; tfstk=cpuRBOjEBIC88y1C_CxmOR4oWRJGZ_9Lh_whJBK0Om16jY_di83iWVYvVW68kIC..; isg=BAwM24CXvjQqQaiq32TBD6bS3Ww-RbDv-wsr2GbMN7dR8a37jlD1f9LAlflJuehH; l=dBMfdTsPvpvP55hJBOfw5FocdE_OVIOb8sPyQ2PDFIB1t41QbdFWZHwEPQZXC3QQEt5AxetryLlA9dhwr4zLSVkDBeYI85iyVIpw8e1..; cna=Rzf3FjNUjHICAbcPty9iA0f8; enc=UY6B%2BsOgY43TW%2Be2jF4d4AjksBv1j9tAncstac1UxZUJ6IhMT7%2FtBJPtdSCGcNdcYi5%2BX7%2BKFGRJ%2Fy1jKkjBKA%3D%3D; _samesite_flag_=true; cookie2=18ab585438d550616f65fdc1dab51358; t=85aeffc3447ce057e176bd573b2152e2; _tb_token_=3b081380f7b35; _m_h5_tk=dab3bc5374595a30e716504c3b823a82_1585657982143; _m_h5_tk_enc=25af8089d26c1fd9bda0d95e9e117abd',
            'pragma': 'no-cache',
            'referer': 'https://market.m.taobao.com/app/tb-source-app/shop-auction/pages/auction?_w&sellerId=2200723654612&shopId=575905230&dynamicCard=true&disablePromotionTips=true&shop_navi=allitems&displayShopHeader=true',
            'sec-fetch-dest': 'script',
            'sec-fetch-mode': 'no-cors',
            'sec-fetch-site': 'same-site',
            'user-agent': 'Mozilla/5.0 (iPhone; CPU iPhone OS 13_2_3 like Mac OS X) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/13.0.3 Mobile/15E148 Safari/604.1',
        }
        # result = xxxx.request('GET', url, headers=headers) # 代理请求
        result = requests.request('GET', url, headers=headers) # 不使用代理

        cookies = requests.utils.dict_from_cookiejar(result.cookies)
        print(cookies)
        print(result.text)

        # 清洗数据
        data = result.text.split('mtopjsonp9(')[1][:-1]
        self.clean_data(json.loads(data))

    def sign(self, token, page, t):
        '''参数破解'''
        js = '''    function h(a) {
        function b(a, b) {
            return a << b | a >>> 32 - b
        }
        function c(a, b) {
            var c, d, e, f, g;
            return e = 2147483648 & a,
            f = 2147483648 & b,
            c = 1073741824 & a,
            d = 1073741824 & b,
            g = (1073741823 & a) + (1073741823 & b),
            c & d ? 2147483648 ^ g ^ e ^ f : c | d ? 1073741824 & g ? 3221225472 ^ g ^ e ^ f : 1073741824 ^ g ^ e ^ f : g ^ e ^ f
        }
        function d(a, b, c) {
            return a & b | ~a & c
        }
        function e(a, b, c) {
            return a & c | b & ~c
        }
        function f(a, b, c) {
            return a ^ b ^ c
        }
        function g(a, b, c) {
            return b ^ (a | ~c)
        }
        function h(a, e, f, g, h, i, j) {
            return a = c(a, c(c(d(e, f, g), h), j)),
            c(b(a, i), e)
        }
        function i(a, d, f, g, h, i, j) {
            return a = c(a, c(c(e(d, f, g), h), j)),
            c(b(a, i), d)
        }
        function j(a, d, e, g, h, i, j) {
            return a = c(a, c(c(f(d, e, g), h), j)),
            c(b(a, i), d)
        }
        function k(a, d, e, f, h, i, j) {
            return a = c(a, c(c(g(d, e, f), h), j)),
            c(b(a, i), d)
        }
        function l(a) {
            for (var b, c = a.length, d = c + 8, e = (d - d % 64) / 64, f = 16 * (e + 1), g = new Array(f - 1), h = 0, i = 0; c > i; )
                b = (i - i % 4) / 4,
                h = i % 4 * 8,
                g[b] = g[b] | a.charCodeAt(i) << h,
                i++;
            return b = (i - i % 4) / 4,
            h = i % 4 * 8,
            g[b] = g[b] | 128 << h,
            g[f - 2] = c << 3,
            g[f - 1] = c >>> 29,
            g
        }
        function m(a) {
            var b, c, d = "", e = "";
            for (c = 0; 3 >= c; c++)
                b = a >>> 8 * c & 255,
                e = "0" + b.toString(16),
                d += e.substr(e.length - 2, 2);
            return d
        }
        function n(a) {
            a = a.replace(/\\r\\n/g, "\\n");
            for (var b = "", c = 0; c < a.length; c++) {
                var d = a.charCodeAt(c);
                128 > d ? b += String.fromCharCode(d) : d > 127 && 2048 > d ? (b += String.fromCharCode(d >> 6 | 192),
                b += String.fromCharCode(63 & d | 128)) : (b += String.fromCharCode(d >> 12 | 224),
                b += String.fromCharCode(d >> 6 & 63 | 128),
                b += String.fromCharCode(63 & d | 128))
            }
            return b
        }
        var o, p, q, r, s, t, u, v, w, x = [], y = 7, z = 12, A = 17, B = 22, C = 5, D = 9, E = 14, F = 20, G = 4, H = 11, I = 16, J = 23, K = 6, L = 10, M = 15, N = 21;
        for (a = n(a),
        x = l(a),
        t = 1732584193,
        u = 4023233417,
        v = 2562383102,
        w = 271733878,
        o = 0; o < x.length; o += 16)
            p = t,
            q = u,
            r = v,
            s = w,
            t = h(t, u, v, w, x[o + 0], y, 3614090360),
            w = h(w, t, u, v, x[o + 1], z, 3905402710),
            v = h(v, w, t, u, x[o + 2], A, 606105819),
            u = h(u, v, w, t, x[o + 3], B, 3250441966),
            t = h(t, u, v, w, x[o + 4], y, 4118548399),
            w = h(w, t, u, v, x[o + 5], z, 1200080426),
            v = h(v, w, t, u, x[o + 6], A, 2821735955),
            u = h(u, v, w, t, x[o + 7], B, 4249261313),
            t = h(t, u, v, w, x[o + 8], y, 1770035416),
            w = h(w, t, u, v, x[o + 9], z, 2336552879),
            v = h(v, w, t, u, x[o + 10], A, 4294925233),
            u = h(u, v, w, t, x[o + 11], B, 2304563134),
            t = h(t, u, v, w, x[o + 12], y, 1804603682),
            w = h(w, t, u, v, x[o + 13], z, 4254626195),
            v = h(v, w, t, u, x[o + 14], A, 2792965006),
            u = h(u, v, w, t, x[o + 15], B, 1236535329),
            t = i(t, u, v, w, x[o + 1], C, 4129170786),
            w = i(w, t, u, v, x[o + 6], D, 3225465664),
            v = i(v, w, t, u, x[o + 11], E, 643717713),
            u = i(u, v, w, t, x[o + 0], F, 3921069994),
            t = i(t, u, v, w, x[o + 5], C, 3593408605),
            w = i(w, t, u, v, x[o + 10], D, 38016083),
            v = i(v, w, t, u, x[o + 15], E, 3634488961),
            u = i(u, v, w, t, x[o + 4], F, 3889429448),
            t = i(t, u, v, w, x[o + 9], C, 568446438),
            w = i(w, t, u, v, x[o + 14], D, 3275163606),
            v = i(v, w, t, u, x[o + 3], E, 4107603335),
            u = i(u, v, w, t, x[o + 8], F, 1163531501),
            t = i(t, u, v, w, x[o + 13], C, 2850285829),
            w = i(w, t, u, v, x[o + 2], D, 4243563512),
            v = i(v, w, t, u, x[o + 7], E, 1735328473),
            u = i(u, v, w, t, x[o + 12], F, 2368359562),
            t = j(t, u, v, w, x[o + 5], G, 4294588738),
            w = j(w, t, u, v, x[o + 8], H, 2272392833),
            v = j(v, w, t, u, x[o + 11], I, 1839030562),
            u = j(u, v, w, t, x[o + 14], J, 4259657740),
            t = j(t, u, v, w, x[o + 1], G, 2763975236),
            w = j(w, t, u, v, x[o + 4], H, 1272893353),
            v = j(v, w, t, u, x[o + 7], I, 4139469664),
            u = j(u, v, w, t, x[o + 10], J, 3200236656),
            t = j(t, u, v, w, x[o + 13], G, 681279174),
            w = j(w, t, u, v, x[o + 0], H, 3936430074),
            v = j(v, w, t, u, x[o + 3], I, 3572445317),
            u = j(u, v, w, t, x[o + 6], J, 76029189),
            t = j(t, u, v, w, x[o + 9], G, 3654602809),
            w = j(w, t, u, v, x[o + 12], H, 3873151461),
            v = j(v, w, t, u, x[o + 15], I, 530742520),
            u = j(u, v, w, t, x[o + 2], J, 3299628645),
            t = k(t, u, v, w, x[o + 0], K, 4096336452),
            w = k(w, t, u, v, x[o + 7], L, 1126891415),
            v = k(v, w, t, u, x[o + 14], M, 2878612391),
            u = k(u, v, w, t, x[o + 5], N, 4237533241),
            t = k(t, u, v, w, x[o + 12], K, 1700485571),
            w = k(w, t, u, v, x[o + 3], L, 2399980690),
            v = k(v, w, t, u, x[o + 10], M, 4293915773),
            u = k(u, v, w, t, x[o + 1], N, 2240044497),
            t = k(t, u, v, w, x[o + 8], K, 1873313359),
            w = k(w, t, u, v, x[o + 15], L, 4264355552),
            v = k(v, w, t, u, x[o + 6], M, 2734768916),
            u = k(u, v, w, t, x[o + 13], N, 1309151649),
            t = k(t, u, v, w, x[o + 4], K, 4149444226),
            w = k(w, t, u, v, x[o + 11], L, 3174756917),
            v = k(v, w, t, u, x[o + 2], M, 718787259),
            u = k(u, v, w, t, x[o + 9], N, 3951481745),
            t = c(t, p),
            u = c(u, q),
            v = c(v, r),
            w = c(w, s);
        var O = m(t) + m(u) + m(v) + m(w);
        return O.toLowerCase()
    }'''
        param = None
        if page == 1:
            param = f'{token}&{t}&12574478&' + '{"m":"shopitemsearch","vm":"nw","sversion":"4.6","shopId":"575905230","sellerId":"2200723654612","style":"wf","page":"%s","sort":"_coefp","catmap":"","wirelessShopCategoryList":""}' % page
        if page != 1:
            param = f'{token}&{t}&12574478&' + '{"m":"shopitemsearch","vm":"nw","sversion":"4.6","shopId":"575905230","sellerId":"2200723654612","style":"wf","page":%s,"sort":"_coefp","catmap":"","wirelessShopCategoryList":""}' % page
        ej = execjs.compile(js)
        result = ej.call('h', param)
        print(f'加密参数--> {param}')
        print(f'sign--> {result}')
        return result

    def clean_data(self, data):
        '''清洗参数'''
        '''
        data : json数据
        '''
        try:
            for i in data['data']['itemsArray']:
                data = {
                    'price': i['priceWap'],
                    'sold': i['sold'],
                    'name': i['title']
                }
                TianMao.lists.append(data)
            print(TianMao.lists)
        except BaseException as e:
            pass


if __name__ == '__main__':
    # token需要模拟登录,获取cookie的 _m_h5_tk字段值
    token = '..........'
    t = TianMao(token)
    for i in range(1, 11):
        print(f'正在请求第{i}页')
        t.get_goods_data(i)

  • 0
    点赞
  • 4
    收藏
    觉得还不错? 一键收藏
  • 6
    评论
Fiddler可以用于模拟请求,以便在测试过程中修改请求或响应数据,或者直接模拟服务器的响应。在Fiddler中,可以使用断点功能来完成这些操作。你可以针对不同的请求数据格式进行模拟。例如,对于四种POST请求数据格式,Fiddler提供了相应的请求模拟实现。另外,在一些情况下,如果不想使用实际返回的请求数据,或者服务暂时无法返回数据,你可以使用Fiddler来模拟返回数据。这样可以方便地进行测试和调试。所以,Fiddler提供了丰富的功能来模拟请求和响应数据。<span class="em">1</span><span class="em">2</span><span class="em">3</span> #### 引用[.reference_title] - *1* *3* [fiddler之模拟响应、修改请求或响应数据(断点)](https://blog.csdn.net/qq_38368494/article/details/108516167)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_2"}}] [.reference_item style="max-width: 50%"] - *2* [Fiddler模拟post四种请求数据](https://blog.csdn.net/weixin_34401479/article/details/89663311)[target="_blank" data-report-click={"spm":"1018.2226.3001.9630","extra":{"utm_source":"vip_chatgpt_common_search_pc_result","utm_medium":"distribute.pc_search_result.none-task-cask-2~all~insert_cask~default-1-null.142^v93^chatsearchT3_2"}}] [.reference_item style="max-width: 50%"] [ .reference_list ]

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论 6
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值