Python爬虫实战项目------爬取天天基金

最新推荐文章于 2024-08-26 11:17:15 发布

独一无二的VV

最新推荐文章于 2024-08-26 11:17:15 发布

阅读量1.8k

点赞数 4

分类专栏：爬虫 Python 文章标签： python 爬虫

本文链接：https://blog.csdn.net/qq_45976392/article/details/119765077

版权

本文介绍了一个使用Python爬虫技术抓取天天基金数据的实战项目，涉及Beautifulsoup解析HTML、js2py处理JavaScript、正则表达式提取数据以及PyQt5界面设计。通过伪装头获取排行榜数据，实现搜索功能，将数据以颜色区分正负并在QTableWidget中展示。项目总结强调实践对于编程学习的重要性。

摘要由CSDN通过智能技术生成

爬虫实战项目------爬取天天基金

功能展示

请添加图片描述

技术

语言：python
Beautifulsoup解析html
js2py解析JavaScript
python 爬虫
正则表达式提取数据
界面设计用的PyQt5

功能片段

伪装头

# headers
headers = {
   
    'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/79.0.3945.79 Safari/537.36',
    'Referer': 'http://fund.eastmoney.com/data/fundranking.html',
}

爬取天天基金排行榜数据

# 爬取排行榜基金数据
def getRankDatas():
    url = "http://fund.eastmoney.com/data/rankhandler.aspx?op=ph&dt=kf&ft=all&rs=&gs=0&sc=6yzf&st=desc&sd=2020-08-15&ed=2021-08-15&qdii=&tabSubtype=,,,,,&pi=1&pn=50&dx=1&v=0.6075346553325671"
    req = requests.get(url=url, headers=headers)
    datas = req.text
    db = js2py.eval_js(datas)#数据是JavaScript，要用js2py解析器解析
    data = db['datas']
    return data

搜索功能实现

# 搜索基金数据
def searchData(text):
    url = "http://fund.eastmoney.com/" + str(text) + ".html"
    req = urllib.request.Request(url=url, headers=headers)
    response = urllib.request.urlopen(req)
    bs = BeautifulSoup(response, "html.parser")#数据是HTML，要用BeautifulSoup解析器解析
    return bs

根据文本框内容爬取相应的基金信息页面，提取的是html中的数据

界面设计

class Table(QWidget):
    def __init__(self):
        super(Table, self).__init__()
        self.i

最低0.47元/天解锁文章

独一无二的VV

关注

4
点赞
踩
16

收藏

觉得还不错? 一键收藏
打赏
3
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录