Basic Usage of Web Crawlers

Latest recommended article published 2024-07-12 16:16:27

无心之局

Category: python. Tags: python, web crawler

Article link: https://blog.csdn.net/weixin_52243857/article/details/125589064

Table of Contents

Preface
1. Importing the libraries
2. Usage steps
Summary

Preface

This post walks through the basic usage of web crawlers in Python.

1. Importing the libraries

(1) Import requests.

import requests

(2) Import BeautifulSoup.

from bs4 import BeautifulSoup

(3) Import urllib.request.

import urllib.request

(4) Import etree from lxml, which provides XPath queries.

from lxml import etree
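
To show what each of these imports is for, here is a minimal sketch that exercises BeautifulSoup, lxml's etree and urllib.request on a small inline HTML string, so it runs without network access. The HTML snippet, the variable names and the User-Agent value are made up for illustration and do not come from the original post.

```python
import urllib.request

from bs4 import BeautifulSoup
from lxml import etree

# Hypothetical inline HTML so the example needs no network access.
HTML = """
<html><body>
  <h1 id="title">Hello</h1>
  <ul>
    <li><a href="/a">First</a></li>
    <li><a href="/b">Second</a></li>
  </ul>
</body></html>
"""

# BeautifulSoup: parse with the standard-library backend,
# then query by tag name (find_all) or CSS selector (select).
soup = BeautifulSoup(HTML, "html.parser")
link_texts = [a.get_text() for a in soup.find_all("a")]
hrefs = [a["href"] for a in soup.select("ul li a")]

# lxml: parse the same markup and query it with XPath expressions.
tree = etree.HTML(HTML)
title = tree.xpath('//h1[@id="title"]/text()')
xpath_hrefs = tree.xpath("//li/a/@href")

# urllib.request: build (but do not send) a request with a custom header.
req = urllib.request.Request(
    "https://www.sogou.com/",
    headers={"User-Agent": "Mozilla/5.0"},  # illustrative value
)

print(link_texts)
print(hrefs)
print(title)
print(xpath_hrefs)
print(req.get_header("User-agent"))
```

BeautifulSoup and lxml reach the same elements here; which one to use is mostly a matter of whether CSS selectors or XPath read more naturally for the page at hand.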

2. Usage steps

Using requests:

(1) Basic usage:

    url = 'https://www.sogou.com/'  # target URL
    response = requests.get(url=url)  # send a GET request
    page_text = response.text  # page source as text
    print(page_text)
    with open('
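
The snippet above is cut off at the point where the page is written to disk. As a rough sketch of the same fetch-then-save flow, the two helpers below download a page with requests and write it to a file. The function names `fetch_text` and `save_text`, the timeout value and the encoding handling are assumptions for illustration, not the original author's code.

```python
import requests


def fetch_text(url, timeout=10):
    """Download a page and return its decoded text."""
    response = requests.get(url, timeout=timeout)
    # Raise an exception on 4xx/5xx instead of silently saving an error page.
    response.raise_for_status()
    # Use the encoding guessed from the body, which helps with non-ASCII pages.
    response.encoding = response.apparent_encoding
    return response.text


def save_text(text, path):
    """Write text to the given path as UTF-8."""
    with open(path, "w", encoding="utf-8") as fp:
        fp.write(text)
```

A typical call would be `save_text(fetch_text('https://www.sogou.com/'), 'sogou.html')`, where `sogou.html` is a hypothetical output filename.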
