面向对象,国药监化妆品许可证爬取

面向函数——>面向对象;

只涉及列表页,详情页没有涉及;

所以只要一个类:CFDA,就够了。/20180225

__author__ = '姜枫渔火'

import requests, time, random
from fake_useragent import UserAgent

class CFDA():
    def __init__(self):
        self.url = 'http://125.35.6.84:81/xk/itownet/portalAction.do?method=getXkzsList'
        self.headers = {'User-Agent': UserAgent().random}
        self.data = {'on':'true',
                    'page':'1',
                    'pageSize':'15',
                    'productName':'',
                    'conditionType':'1',
                    'applyname':'',
                    'applysn':''}

    def getData(self, data):
        try:
            response = requests.post(self.url, data = data, headers = self.headers, timeout = 3)
            if response.status_code == 200:
                html = response.json()
                # print(html)
                return html
        except requests.exceptions.RequestException as e:
            print(e)
            return None
        except requests.exceptions.ConnectTimeout as e:
            print(e)
            return None

    def extractData(self, html):
        name_list = []
        try:
            for i in range(len(html['list'])):
                cfda_data = html['list'][i]['EPS_NAME']
                # print(cfda_data)
                name_list.append(cfda_data)
            return name_list
        except:
            print('!!!')
            return None

    def writeData(self, cfda_data):
        with open('CFDA.txt', 'a') as f:
            f.write(cfda_data + '\n')

    def run(self):
        for page in range(1, int(d)+1):
            print('第{}页:'.format(str(page)))
            time.sleep(random.uniform(1,3))
            self.data['page'] = str(page)
            one_page = self.getData(self.data)
            for i in self.extractData(one_page):
                print(i)
                self.writeData(i)
        print('任务完成')


if __name__ == '__main__':
    d = input('请输入爬取深度:')
    cfda = CFDA()
    cfda.run()




  • 0
    点赞
  • 3
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值