Python 解析XML

最新推荐文章于 2023-09-08 11:00:53 发布

iteye_16691

最新推荐文章于 2023-09-08 11:00:53 发布

阅读量257

点赞数

分类专栏： python 文章标签： python

本文链接：https://blog.csdn.net/iteye_16691/article/details/82647430

版权

python 专栏收录该内容

14 篇文章 0 订阅

订阅专栏

Python 解析XML

使用模块lxml

安装:

pip install lxml

pip install requests

from lxml import html
import requests
page = requests.get('http://econpy.pythonanywhere.com/ex/001.html')
tree = html.fromstring(page.content)
buyers = tree.xpath('//div[@title="buyer-name"]/text()')
prices = tree.xpath('//span[@class="item-price"]/text()')

参考： http://docs.python-guide.org/en/latest/scenarios/scrape/#web-scraping

如果xml里面带有命名空间，namespace, 可以这样：

如： <itunes:duration>14:00</itunes:duration>
duration= tree.xpath('//itunes:duration/text()', namespaces ={'itunes': 'http://www.itunes.com/DTDs/Podcast-1.0.dtd'})