OX01: Web Speder 入门

最新推荐文章于 2022-08-01 15:35:02 发布

weeever

最新推荐文章于 2022-08-01 15:35:02 发布

阅读量600

点赞数

分类专栏： PHP之Spider 文章标签： phpspan idtransmarks wespan idtransmarksp spider

本文链接：https://blog.csdn.net/weeever/article/details/47029633

版权

PHP之Spider 专栏收录该内容

1 篇文章 0 订阅

订阅专栏

最近雨下得厉害，闲的百无聊赖，遂萌生码码最近看的《Webbots、Spiders 和 Screen Scrapers》的想法~~~

一下正文：

----------------------------------------------------------------------------------------------------------------------------------------------------------------

由于，初次接触PHP。不知配置的PHP的生猛，一连上网找了许多的资料。几经测试都是失败，后来试了wamp。（慕课上有教程的）nice！终于可以了！那个激动得！

还要说说本书含的LIB库。先从本书官网上下载该库，解压到任意路径。记住！

0x01 LIB_http定义了以下的默认变量

可以在源代码中修改

define("WEBBOTS_NAME","Test Webbot");
define("CURL_TIMEOUT",25);
define("COOKIE_FILE","c:\cookie.txt");

事实上，这个库只是curl的封装，更便于使用。

0x02 使用LIB_parse，获取特定html标签

<?php
include("D:\wamp\bin\php\php5.5.12\include\LIB_parse.php");
include("D:\wamp\bin\php\php5.5.12\include\LIB_http.php");

$web_page = http_get($target = "www.baidu.com",$referer ="");

$meta_tag_array = parse_array($web_page['FILE'],"<img",">");

for($xx = 0; $xx<count($meta_tag_array);$xx++)
{
	echo $meta_tag_array[$xx]."\n";
	$name = get_attribute($meta_tag_array[$xx],$attribute="src");
	echo $name ."\n";
}
?>

weeever

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
OX01: Web Speder 入门

最近雨下得厉害，闲的百无聊赖，遂萌生码码最近看的《Webbots、Spiders 和 Screen Scrapers》的想法~~~一下正文：------------------------------------------------------------------------------------------------------------------------
复制链接

扫一扫

专栏目录