Web Crawler: A Stock Example

enenenn

Published 2020-04-24 21:48:38

Tags: python

Article link: https://blog.csdn.net/enenenn/article/details/105740557

Get the list of stocks from East Money (东方财富), then get the detailed information for each stock from Baidu Stocks (百度股票).
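
As a rough illustration of the two-step flow described above, the sketch below pulls stock codes out of link targets with a regular expression and builds one detail-page URL per code. It needs no network access. The sample hrefs, the helper names `extract_codes` and `build_detail_urls`, and the base URL `https://example.com/stock/` are placeholders assumed for the example, not the real addresses or code of either site.

```python
import re

# Matches an exchange prefix (sh or sz) followed by a six-digit code.
STOCK_CODE = re.compile(r"s[hz]\d{6}")

def extract_codes(hrefs):
    """Return the first stock code found in each href, skipping hrefs without one."""
    codes = []
    for href in hrefs:
        match = STOCK_CODE.search(href)
        if match:
            codes.append(match.group(0))
    return codes

def build_detail_urls(codes, base_url):
    """Build one detail-page URL per code as base_url + code + '.html'."""
    return [base_url + code + ".html" for code in codes]

# Hypothetical hrefs, standing in for the links on a stock list page.
sample_hrefs = [
    "http://example.com/sh600000.html",
    "http://example.com/about.html",
    "http://example.com/sz000001.html",
]
codes = extract_codes(sample_hrefs)
urls = build_detail_urls(codes, "https://example.com/stock/")
print(codes)
print(urls)
```

Links that carry no code, such as the `about.html` entry, are simply skipped, which mirrors the `continue` in `getStockList`.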

#CrawBaiduStocksB.py
import requests
from bs4 import BeautifulSoup
import traceback
import re
 
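def getHTMLText(url, code="utf-8"):
    try:
        # A timeout keeps one slow page from blocking the whole crawl.
        r = requests.get(url, timeout=30)
        r.raise_for_status()
        # Set the encoding directly instead of using r.apparent_encoding,
        # which has to analyze the content; this is faster.
        r.encoding = code
        return r.text
    except requests.RequestException:
        # Any request failure yields an empty string for the caller to check.
        return ""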
 
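def getStockList(lst, stockURL):
    html = getHTMLText(stockURL, "GB2312")
    soup = BeautifulSoup(html, 'html.parser')
    a = soup.find_all('a')
    for i in a:
        try:
            href = i.attrs['href']
            # Keep the first sh/sz prefix plus six-digit code found in the link.
            lst.append(re.findall(r"s[hz]\d{6}", href)[0])
        except (KeyError, IndexError):
            # The link has no href, or the href contains no stock code.
            continue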
 
def getStockInfo(lst, stockURL, fpath):
    count = 0
    for stock in lst:
        url = stockURL + stock + ".html"
        html = getHTMLText(url)
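        try:
            if html == "":
                # getHTMLText returns "" on failure, so skip this stock.
                continue
            # The source listing is cut off here; the rest of the function is not available.
        except Exception:
            traceback.print_exc()
            continue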
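
Purely as a sketch of what the `fpath` and `count` names in `getStockInfo` suggest, and not the author's code, the helper below appends one record per stock to a text file and prints a running progress percentage. The function name `save_records`, the record layout, and the sample data are all assumptions made for illustration.

```python
import os
import tempfile

def save_records(records, fpath, total):
    """Append each record dict to fpath as one line; return how many were written."""
    count = 0
    with open(fpath, "a", encoding="utf-8") as f:
        for record in records:
            f.write(str(record) + "\n")
            count += 1
            # \r returns to the line start so the percentage updates in place.
            print("\rProgress: {:.2f}%".format(count * 100 / total), end="")
    print()
    return count

# Hypothetical records; real ones would come from parsing each detail page.
sample = [
    {"code": "sh600000", "name": "Example A"},
    {"code": "sz000001", "name": "Example B"},
]
out_path = os.path.join(tempfile.mkdtemp(), "stocks.txt")
written = save_records(sample, out_path, total=len(sample))
with open(out_path, encoding="utf-8") as f:
    lines = f.read().splitlines()
```

Opening the file in append mode means an interrupted crawl keeps what it has already saved, at the cost of duplicate lines if the same stock is processed twice.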
