python爬虫实例-cat_picture_download

最新推荐文章于 2024-08-09 00:05:38 发布

Hkpery

最新推荐文章于 2024-08-09 00:05:38 发布

阅读量1.5k

点赞数

分类专栏： python爬虫文章标签： python

本文链接：https://blog.csdn.net/Hkpery/article/details/119423413

版权

python爬虫专栏收录该内容

4 篇文章 0 订阅

订阅专栏

这段代码展示了如何利用Python的urllib库和随机选择的代理服务器获取placekitten网站上的猫咪图片。代码中设置了多种代理，并通过User-Agent伪装浏览器标识，最后将下载的图片保存为JPEG格式。

摘要由CSDN通过智能技术生成

如果你也喜欢猫猫(>^ω<)喵

import urllib.request
import random
import time

height = random.randint(1,1024)

weight = random.randint(1,1024)

new_url='http://placekitten.com/'+str(height)+'/'+str(weight)

ip_list=['14.116.213.100:8081','14.18.109.42:8081','47.107.128.69:888','47.108.155.96:80','183.7.29.244:9999','36.57.68.239:8888','171.15.65.120:8080']

dynamic_ip=random.choice(ip_list)

#自建代理
proxy_support = urllib.request.ProxyHandler({'https':dynamic_ip})	
opener = urllib.request.build_opener(proxy_support)
urllib.request.install_opener(opener)
opener.addheaders = [('User-Agent','Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.164 Safari/537.36')]


#建立User—Agent
'''
disguse_url ={}
disguse_url['User-Agent'] = 'Mozilla/5.0 (Windows NT 10.0;Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.164 Safari/537.36'
'''
response = urllib.request.urlopen(new_url)




#response.add_header('User-Agent','Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/91.0.4472.164 Safari/537.36')

cat_img = response.read()

with open ('cat_img_'+str(height)+str(weight)+'.jpg','wb') as f:
     f.write(cat_img) 

#代理使用成功
"""
response = urllib.request.urlopen('https://www.whatismyip.com.tw')

html = response.read().decode('utf-8')
"""

time.sleep(10)

Hkpery

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python爬虫实例-cat_picture_download

如果你也喜欢猫猫(>ω<)喵import urllib.requestimport randomimport timeheight = random.randint(1,1024)weight = random.randint(1,1024)new_url='http://placekitten.com/'+str(height)+'/'+str(weight)ip_list=['14.116.213.100:8081','14.18.109.42:8081','47.10
复制链接

扫一扫