使用python爬取猫眼电影、房王、股吧论坛、百度翻译、有道翻译、高德天气、华夏基金、扇贝单词、糗事百科（股吧论坛）

物喜己悲

于 2019-05-26 22:11:25 发布

阅读量235

点赞数

分类专栏：爬虫文章标签：爬虫股吧

本文链接：https://blog.csdn.net/yu1860110/article/details/90581619

版权

爬虫专栏收录该内容

8 篇文章 0 订阅

订阅专栏

'''
翻页获取股吧数据
http://guba.eastmoney.com/
获取10页信息，然后放到指定文件夹中
'''
'''
爬取板块：国产芯片
思路：
    找规律
        第一页：http://so.eastmoney.com/web/s?keyword=%E5%9B%BD%E4%BA%A7%E8%8A%AF%E7%89%87
        第二页：http://so.eastmoney.com/web/s?keyword=%E5%9B%BD%E4%BA%A7%E8%8A%AF%E7%89%87&pageindex=2
        第三页：http://so.eastmoney.com/web/s?keyword=%E5%9B%BD%E4%BA%A7%E8%8A%AF%E7%89%87&pageindex=3
'''
import requests,os

def guba(pageindex):
    base_url = 'http://so.eastmoney.com/web/s?'
    # base_url = 'http://so.eastmoney.com/web/s?keyword=%E5%9B%BD%E4%BA%A7%E8%8A%AF%E7%89%87&pageindex=4'
    params = {
        'keyword': '%E5%9B%BD%E4%BA%A7%E8%8A%AF%E7%89%87',
    }
    path = './guba/'+pageindex+'/'
    if not os.path.exists(path):
        os.makedirs(path)

    for page in range(1,11):
        print(f'——————————————开始下载第{page}页——————————————')
        params['pageindex'] = str(page)
        file_path = path + str(page) +'.html'
        print(requests.get(base_url,params=params).url)
        with open(file_path,'w',encoding='utf-8')as f :
            f.write(requests.get(base_url,params=params).text)

    print('下载完成')


if __name__ == '__main__':
    pageindex = input('请输入文件夹名称')
    guba(pageindex)

物喜己悲

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
使用python爬取猫眼电影、房王、股吧论坛、百度翻译、有道翻译、高德天气、华夏基金、扇贝单词、糗事百科（股吧论坛）

'''翻页获取股吧数据http://guba.eastmoney.com/获取10页信息，然后放到指定文件夹中''''''爬取板块：国产芯片思路：找规律第一页：http://so.eastmoney.com/web/s?keyword=%E5%9B%BD%E4%BA%A7%E8%8A%AF%E7%89%87 第二页：http://so.ea...
复制链接

扫一扫

专栏目录