Python web scraping: creating and parsing XML files
1. Creating an XML string in Python code (using the xml library):
import xml.etree.ElementTree as ET
# Root element <bookstore>
bookstore = ET.Element('bookstore')
# <book> is a child of the root; the dict supplies its attributes
book = ET.SubElement(bookstore, 'book', {'category': 'COOKING'})
title = ET.SubElement(book, 'title', {'lang': 'en'})
title.text = 'Everyday Italian'
author = ET.SubElement(book, 'author')
author.text = 'Giada De Laurentiis'
year = ET.SubElement(book, 'year')
year.text = '2005'
price = ET.SubElement(book, 'price')
price.text = '30.00'
# Write the serialized tree to stdout (intended for debugging)
ET.dump(bookstore)
Output:
<bookstore><book category="COOKING"><title lang="en">Everyday Italian</title><author>Giada De Laurentiis</author><year>2005</year><price>30.00</price></book></bookstore>
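Compared with the target file (bookstore.xml, shown in part 2), this output falls well short: it contains only one book, has no XML declaration, and is printed on a single line. Please bear with me.
If anything is unclear, feel free to discuss it with me; after all, I am only a beginner too.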
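One possible way to bring the generated string closer to the target layout is sketched below. It is only a sketch under a few assumptions: the helper names `build_bookstore` and `to_pretty_xml` are made up for illustration, `ET.indent` needs Python 3.9 or newer, and the XML declaration is simply prepended as text rather than produced by the library.

```python
import xml.etree.ElementTree as ET


def build_bookstore():
    """Build a one-book tree like the example above (hypothetical helper)."""
    bookstore = ET.Element('bookstore')
    book = ET.SubElement(bookstore, 'book', {'category': 'cooking'})
    ET.SubElement(book, 'title', {'lang': 'en'}).text = 'Everyday Italian'
    ET.SubElement(book, 'author').text = 'Giada De Laurentiis'
    ET.SubElement(book, 'year').text = '2005'
    ET.SubElement(book, 'price').text = '30.00'
    return bookstore


def to_pretty_xml(root):
    """Return an indented XML string with a declaration line (hypothetical helper)."""
    # ET.indent modifies the tree in place; available from Python 3.9
    ET.indent(root, space='  ')
    # encoding='unicode' makes tostring return str instead of bytes
    body = ET.tostring(root, encoding='unicode')
    # Assumption: the declaration is added by hand to mirror bookstore.xml
    return '<?xml version="1.0" encoding="utf-8"?>\n' + body


if __name__ == '__main__':
    print(to_pretty_xml(build_bookstore()))
```

To save the result, the returned string could be written to a file opened with `encoding='utf-8'`, so that the file contents agree with the declaration.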
2. Parsing an XML file (using the xml.etree library to parse bookstore.xml)
The contents of bookstore.xml are as follows:
<?xml version="1.0" encoding="utf-8"?>
<bookstore>
<book category="cooking">
<title lang="en">Everyday Italian</title>
<author>Giada De Laurentiis</author>
<year>2005</year>
<price>30.00</price>
</book>
<book category="children">
<title lang="en">Harry Potter</title>
<author>J K. Rowling</author>
<year>2005</year>
<price>29.99</price>
</book>
<book category="web">
<title lang="en">XQuery Kick Start</title>
<author>James McGovern</author>
<author>Per Bothner</author>
<author>Kurt Cagle</author>
<author>James Linn</author>
<author>Vaidyanathan Nagarajan</author>
<year>2003</year