Parsing HTML in PHP

最新推荐文章于 2024-07-24 16:53:23 发布

newweapon

最新推荐文章于 2024-07-24 16:53:23 发布

阅读量768

点赞数

分类专栏： PHP 文章标签： parsing html object character attributes debugging

PHP 专栏收录该内容

4 篇文章 0 订阅

订阅专栏

Have you ever wanted to get a list of the links contained in a HTML page? Or a list of images, the title or every other non-nested tag for that matter? Then this is the class for you!

Example:

include("phpHTMLParser.php"); $content = file_get_contents("http://www.onderstekop.nl/"); $parser = new phpHTMLParser("$content"); $HTMLObject = $parser->parse_tags(array("a", "title")); $aTags = $HTMLObject->getTagsByName("a"); foreach ($aTags as $a) { if ($a->href != "") { echo $a->href . "<br/>"; echo $a->innerHTML . "<br/><br/>"; } } ?>

In this example the parser only keeps track of the 'a' and 'title' tag from which only the 'a' tag object is being requested afterwards. Running this code will parse the HTML page obtained from http://www.onderstekop.nl/, return an object containing all the information you need and output a list of links with their description. This makes the job of dealing with web pages pretty simple, because you can work with a page in an object oriented way instead of having to go through it character by character or with sophisticated and error-prone regular expressions.

Some other features

Each tag object in the object obtained by a getTagsByName call, currently supports href and innerHTML (as shown), but also id, src and innerTag (to get all the attributes as a string).

Another feature, most useful for dumping results and debugging is the output() function available on the object returned by parse() or parse_tags() ($HTMLObject in our example). Furthermore, for even more debugging, you could set $debug=True in the php file itself.

Download phpHTMLParser

newweapon

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Parsing HTML in PHP

Haveyou ever wanted to get a list of the links contained in a HTML page? Ora list of images, the title or every other non-nested tag for thatmatter? Then this is the class for you!Example:
复制链接

扫一扫

专栏目录