Python Part 11: Python Web Scraping

Latest recommended article published on 2024-09-05 10:28:02

别让星星等待

Tags: python, web scraping, programming language

Article link: https://blog.csdn.net/m1234l/article/details/122134808

Learning objectives:

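Steps for web scraping:

	import requests: import the library

	r = requests.get('url'): send the request and return a Response object

	print(r.status_code): check the status code (200 means the request succeeded)

	r.encoding: the page encoding taken from the HTTP response headers

	r.apparent_encoding: a fallback encoding estimated from the page content

	r.text: the full page content as text

	r.encoding = r.apparent_encoding: replace the header encoding with the fallback encoding

	r.text[-500:]: the last 500 characters of the page

	r.text[:1000]: the first 1000 characters of the page

	from bs4 import BeautifulSoup: import the HTML parser class

	demo = r.text[:1000]: take a sample of the page text

	soup = BeautifulSoup(demo, 'html.parser'): parse the text or page content with the html.parser backend

	print(soup.prettify()): print the parsed HTML with indentation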
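
Why the `r.encoding = r.apparent_encoding` step matters can be shown without any network access. When a server sends a text page without a charset in its headers, requests falls back to ISO-8859-1 for `r.text`, which garbles Chinese content. The sketch below imitates that situation with plain bytes; the sample HTML string is made up for illustration and does not come from a real page.

```python
# Bytes as a server might send them: a UTF-8 encoded page body (illustrative sample).
raw = '<title>蝙蝠侠</title>'.encode('utf-8')

# Decoding with the header fallback encoding always "succeeds" but garbles the text.
garbled = raw.decode('iso-8859-1')

# Decoding with the encoding that matches the content recovers the original text.
correct = raw.decode('utf-8')

print(garbled)
print(correct)
```

Setting `r.encoding` before reading `r.text` plays the same role as choosing the second decode here: the bytes are unchanged, only their interpretation differs.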

Learning content:

OK, let's put this into practice.
For example, let's fetch a Batman image from the web.

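# Image scraping

import os
import requests

url = 'https://pic2.zhimg.com/50/v2-76d77ea8cbf4a3fa50856451f1803049_720w.jpg?source=54b3c3a5'
path = '../picture/蝙蝠侠.jpg'

try:
    r = requests.get(url, timeout=10)
    r.raise_for_status()                               # raise on HTTP errors (4xx/5xx)
    os.makedirs(os.path.dirname(path), exist_ok=True)  # make sure the target folder exists
    with open(path, 'wb') as f:
        f.write(r.content)                             # write the raw bytes of the image
    print('File saved successfully')
except (requests.RequestException, OSError) as e:
    print('Scraping failed!', e)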
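
The example above saves to a hard-coded path. As an optional variation, the file name can be derived from the image URL instead. This is only a sketch: `filename_from_url` and its `default` parameter are hypothetical names introduced here, not part of requests or of the original post.

```python
import posixpath
from urllib.parse import urlsplit, unquote


def filename_from_url(url, default='download.bin'):
    """Return the last path segment of url, ignoring the query string.

    Hypothetical helper for illustration; falls back to `default`
    when the URL path has no file name (for example, it ends in '/').
    """
    path = urlsplit(url).path                 # drops '?source=...' and the host
    name = posixpath.basename(unquote(path))  # last segment, percent-decoded
    return name or default


url = 'https://pic2.zhimg.com/50/v2-76d77ea8cbf4a3fa50856451f1803049_720w.jpg?source=54b3c3a5'
print(filename_from_url(url))
```

The returned name could then be joined with the target folder, for example `os.path.join('../picture', filename_from_url(url))`, before opening the file for writing.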