处理xml的命令工具

最新推荐文章于 2021-05-12 21:59:14 发布

whackw

最新推荐文章于 2021-05-12 21:59:14 发布

阅读量1.2k

点赞数

分类专栏：自动化测试文章标签： xml

自动化测试专栏收录该内容

54 篇文章 0 订阅

订阅专栏

down vote

XMLStarlet or another XPath engine is the correct tool for this job.

For instance, with data.xml containing the following:

<root>
  <item> 
    <title>15:54:57 - George:</title>
    <description>Diane DeConn? You saw Diane DeConn!</description> 
  </item> 
  <item> 
    <title>15:55:17 - Jerry:</title> 
    <description>Something huh?</description>
  </item>
</root>

...you can extract only the first title with the following:

xmlstarlet sel -t -m '//title[1]' -v . -n <data.xml

Trying to use sed for this job is troublesome. For instance, the regex-based approaches won't work if the title has attributes; won't handle CDATA sections; won't correctly recognize namespace mappings; can't determine whether a portion of the XML documented is commented out; won't unescape attribute references (such as changing Brewster & Jobs to Brewster & Jobs), and so forth.
、、、、、、、、、、、、、、、、、、、、、、、、、、、、、

Do you really have to use only those tools? They're not designed for XML processing, and although it's possible to get something that works OK most of the time, it will fail on edge cases, like encoding, line breaks, etc.

I recommend xml_grep:

xml_grep 'job' jobs.xml --text_only

Which gives the output:

programming

On ubuntu/debian, xml_grep is in the xml-twig-tools package.

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

whackw

关注关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
处理xml的命令工具

down voteXMLStarlet or another XPath engine is the correct tool for this job.For instance, with data.xml containing the following: 15:54:57 - George: Diane DeConn? You sa
复制链接

扫一扫