python爬虫

最新推荐文章于 2022-06-21 22:07:30 发布

一朵麻花

最新推荐文章于 2022-06-21 22:07:30 发布

阅读量160

点赞数 1

本文链接：https://blog.csdn.net/qq_41959567/article/details/83277509

版权

网络爬虫的尺寸：
   1、爬取网页，玩转网页即可使用Request 库》90%
   2、爬取网站，系列网站使用Scrapy库
   3、爬取整个internet网站

更改头部信息：
1、模拟一个键值对，

kv={'useragent':'Mozilla/5.0'}
	r=requests.get(url,headers=kv)


import requests
url = ''
try:
	kv={'useragent':'Mozilla/5.0'}
	r=requests.get(url,headers=kv)

图片爬取代码：

import requests
import os
url = 's'
root = '/home/rym'
path = root+url.split('/')[-1]
try:
	if not os.path.exists(root):
		os.mkdir(root)
	if not os.path.exists(path):
		r=requests.get(url)
		with open(path,'wb') as f:
			f.write(r.content)
			f.close()
			print('文件保存成功')
	else:
		print('文件已存在')
except:
	print('爬取失败')

一朵麻花

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python爬虫

网络爬虫的尺寸： 1、爬取网页，玩转网页即可使用Request 库》90% 2、爬取网站，系列网站使用Scrapy库 3、爬取整个internet网站更改头部信息： 1、模拟一个键值对，kv={'useragent':'Mozilla/5.0'} r=requests.get(url,headers=kv)import r...
复制链接

扫一扫