如何使用XPath解析网页

最新推荐文章于 2024-05-18 20:31:25 发布

cuke3186

最新推荐文章于 2024-05-18 20:31:25 发布

阅读量2.4k

点赞数

文章标签： css php html js xpath ViewUI

原文链接：https://www.script-tutorials.com/how-to-parse-web-pages-using-xpath/

版权

摘要由CSDN通过智能技术生成

Parsing pages with XPath. Today I will tell you how you can make parsers of remote HTML pages (in PHP). In this article I will show you how to perform xpath queries to Web pages. XPath – a query language to elements of xml or xhtml document. To obtain the necessary data, we just need to create the necessary query. For the work, we also need: browser Mozilla Firefox, firebug and firepath plugins. For our experiment, I suggest this webpage Google Sci/Tech News. Of course you can choose any other web page too.

使用XPath解析页面。今天，我将告诉您如何创建远程HTML页面(在PHP中)的解析器。在本文中，我将向您展示如何对网页执行xpath查询。 XPath – xml或xhtml文档元素的查询语言。为了获得必要的数据，我们只需要创建必要的查询。对于这项工作，我们还需要：浏览器Mozilla Firefox， firebug和firepath插件。对于我们的实验，我建议使用此网页Google Sci / Tech News 。当然，您也可以选择任何其他网页。

Here is downloadable package:

这是可下载的软件包：

[sociallocker]

[社交储物柜]

打包下载

[/sociallocker]

[/ sociallocker]

Ok, lets start, firstly make sure that both plugins installed in your browser. Then lets open our page with news (Google Sci/Tech News page). After – clicking the right mouse button at any description text, as example inside ‘A Samsung Electronics Co. Galaxy S smartphone, top…’ text, and in popup menu selecting ‘Inspect in Firepath’

好的，让我们开始吧，首先确保两个插件都已安装在浏览器中。然后，让我们打开包含新闻的页面(Google Sci / Tech新闻页面)。之后–在任何描述文字上单击鼠标右键，例如“ A Samsung Electronics Co. Galaxy S智能手机，顶部…”文字内的文字，然后在弹出菜单中选择“在Firepath中检查”

as result – we will see next:

结果–我们将看到下一个：

Make attention to XPath: .//*[@id=’top-stories’]/div[2]/div[3]/div[1]

注意XPath：.//*[@id='top-stories']/div[2]/div[3]/div[1]

The result will be highlighted by the dashed line. After cleanup all unnecessary indexes and small corrections – we will get next query: .//*[@id=’top-stories’]/div/div[@class=’body’]/div[1]

结果将以虚线突出显示。清除所有不必要的索引并进行较小的更正之后，我们将得到下一个查询：.//*[@id='top-

最低0.47元/天解锁文章

cuke3186

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
如何使用XPath解析网页

Parsing pages with XPath. Today I will tell you how you can make parsers of remote HTML pages (in PHP). In this article I will show you how to perform xpath queries to Web pages. XPath – a query langu...
复制链接

扫一扫