Read a local HTML file, parse it, and write the result to a JSON file.
The implementation is shown in the following code:
from lxml import etree
import json
# Read the HTML file (closing it afterwards) and parse it into an element tree
with open("E:/devTask/icon/test.html", 'r', encoding="utf-8") as f:
    tree = etree.HTML(f.read())
print(tree)
nodes = tree.xpath("//div[@class='icon-wrapper']")
res = {
    "data": []
}
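for node in nodes:
    # First text node under <p> is the icon name; first class under <span> is the icon class.
    # Indexing with [0] raises IndexError if a wrapper has no <p> text or no <span> class.
    name = node.xpath("p//text()")[0]
    cls = node.xpath("span//@class")[0]
    temp = {
        "name": name,
        "cls": cls
    }
    res["data"].append(temp)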
print(res)
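# ensure_ascii=False keeps non-ASCII names readable in the output file
json_data = json.dumps(res, ensure_ascii=False)
# Open in "w" mode: "a" would append another JSON document on every rerun and leave the file invalid
with open('E:/devTask/icon/icon.json', 'w', encoding="utf-8") as f_six:
    f_six.write(json_data)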
Running this code produces the JSON file.
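
To try the same steps without the files on disk, here is a minimal sketch. It is not the original script: the sample markup, the `extract_icons` helper, and the temporary output path are all assumptions made for illustration, since the post does not show what test.html contains. The sketch also skips any wrapper that has no `p` text or no `span` class instead of failing on the `[0]` index, and it strips surrounding whitespace from the name.

```python
import json
import os
import tempfile

from lxml import etree

# Hypothetical sample markup; the real test.html is not shown in the post.
SAMPLE_HTML = """
<html><body>
  <div class="icon-wrapper"><span class="icon-home"></span><p>Home</p></div>
  <div class="icon-wrapper"><span class="icon-user"></span><p>User</p></div>
  <div class="icon-wrapper"><p>No icon here</p></div>
</body></html>
"""


def extract_icons(html_text):
    """Return {"data": [{"name": ..., "cls": ...}, ...]}, one entry per icon wrapper."""
    tree = etree.HTML(html_text)
    result = {"data": []}
    for node in tree.xpath("//div[@class='icon-wrapper']"):
        names = node.xpath("p//text()")
        classes = node.xpath("span//@class")
        # Skip wrappers that lack either piece instead of raising IndexError.
        if not names or not classes:
            continue
        result["data"].append({"name": names[0].strip(), "cls": classes[0]})
    return result


icons = extract_icons(SAMPLE_HTML)

# Write to a temporary file in "w" mode so a rerun replaces the old content.
out_path = os.path.join(tempfile.mkdtemp(), "icon.json")
with open(out_path, "w", encoding="utf-8") as f:
    json.dump(icons, f, ensure_ascii=False, indent=2)

# Read the file back to confirm it holds a single valid JSON document.
with open(out_path, encoding="utf-8") as f:
    loaded = json.load(f)
print(loaded)
```

Putting the extraction in a function that takes the HTML text makes it easy to check the XPath expressions against a small sample before pointing the script at the real file.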