python解析html

最新推荐文章于 2023-12-25 20:34:57 发布

wan_zaiyunduan

最新推荐文章于 2023-12-25 20:34:57 发布

阅读量157

点赞数

分类专栏：随笔

本文链接：https://blog.csdn.net/wan_zaiyunduan/article/details/89681407

版权

随笔专栏收录该内容

13 篇文章 0 订阅

订阅专栏

获取html中数据

# coding=utf-8
import sys
from bs4 import BeautifulSoup as bs


def read_html(filepath):
    '''
    用BeautifulSoup解析数据  python3 必须传入参数二'html.parser' 得到一个对象，接下来获取对象的相关属性
    :param filepath: 要解析的html文件路径
    :return: 返回文件内容
    '''
    try:
        f = open(filepath)
    except IOError as e:
        print(e)
    else:
        content = f.read()
    return content
# htmlpath2 = "/Users/.../Pycharms/DEL/Test/overview.html"
htmlpath = sys.argv[1]
htmlcontent = read_html(htmlpath)
html3 = bs(htmlcontent, 'html.parser')
passed = int(html3.find_all('td', attrs={"class": "passed number"})[0].string)
skiped = int(html3.find_all('td', attrs={"class": "zero number"})[0].string)
errored = int(html3.find_all('td', attrs={"class": "failed number"})[0].string)
print(passed)
print(skiped)
print(errored)

参考：https://blog.csdn.net/qq_36411874/article/details/83784101

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

wan_zaiyunduan

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python解析html

获取html中数据# coding=utf-8import sysfrom bs4 import BeautifulSoup as bsdef read_html(filepath): ''' 用BeautifulSoup解析数据 python3 必须传入参数二'html.parser' 得到一个对象，接下来获取对象的相关属性 :param filepat...
复制链接

扫一扫