首先请阅读文档 http://www.runoob.com/xpath/xpath-tutorial.html ,这里有背景和概念的介绍。
下面展示一些实际中的应用。
一、准备:
from lxml import etree # 加载模块
html='''
<html>
<head>
<title>哈哈测试一下</title>
<link type="text/css" rel="stylesheet" href="haha.css" />
<link type="text/css" rel="stylesheet" href="haha1.css" />
<link type="text/css" rel="stylesheet" href="haha2.css" />
<script type="text/javascript" src="haha.js"></script>
<script type="text/javascript" src="haha1.js"></script>
<script type="text/javascript" src="haha2.js"></script>
</head>
<body>
<div id="id1" class="class1">
<div id="id2" class="class2">
<ul class="cls_ul1">
<li class="cls_li1">
<div class="cls_3">
<span>span_text1</span>
<span>span_text2</span>
<i>text_1</i>
</div>
<div>
<a href="a_1.html">a_1</a>
<a href="a_2.html">a_2</a>
<a href="a_3.html">a_3</a>
</div>
<div class="cls_4">
<a href="a_4.html">
<img href="a_img1.jpg" />
</a>
</div>
</li>
<li class="cl