【常见】web目录扫描 python实现

春暖花开.,

于 2023-05-17 15:25:42 发布

阅读量339

点赞数

文章标签： python 开发语言

本文链接：https://blog.csdn.net/m0_64494670/article/details/130726891

版权

该文章介绍了如何使用Python编程语言，结合requests库发送HTTP请求获取网页内容，然后利用BeautifulSoup解析HTML，提取网页中的所有链接。通过示例代码，展示了从指定URL抓取链接的流程。

摘要由CSDN通过智能技术生成

1.创建 .py 文件

2. pip 安装第三方依赖

pip install beautifulsoup4

pip install requests

3. python 文件名 // 启动

import requests
from bs4 import BeautifulSoup

def scan_website(url):
    # 发送GET请求获取网页内容
    response = requests.get(url)
    
    # 检查请求是否成功
    if response.status_code == 200:
        # 使用BeautifulSoup解析网页内容
        soup = BeautifulSoup(response.text, 'html.parser')
        
        # 查找所有链接标签<a>的href属性
        links = soup.find_all('a')
        
        # 打印所有链接
        for link in links:
            href = link.get('href')
            print(href)
    else:
        print("请求失败")

# 调用函数并传入目标网站的URL
scan_website("http://www.xiankabao.com")