环境:python2.7
安装lxml模块pip install lxml
例子:from lxml import etree
text = ‘‘‘
- first item
- second item
- third item
- fourth item
- fifth item
‘‘‘
html = etree.HTML(text) #这是一个地址
result = etree.tostring(html) #读出来源码,并且补全,如输出的《body》标签
print(result)
输出:
- first item
- second item
- third item
- fourth item
- fifth item
#读取文件里的内容
from lxml import etree
html = etree.parse(‘hello.html‘)
result = etree.tostring(html, pretty_print=True)
print(result)
获取li标签里的东西html=etree.parse(‘hello.html‘)
printtype(html)
result=html.xpath(‘//li‘)
printresult
printlen(result)
printtype(result)
printtype(result[0])
说明:此篇博客仅仅是为了自己学习lxml模块,故没好好写,下面是我微信二维码
本文出自 “天道酬勤” 博客,谢绝转载!