python爬虫编程代码_python爬虫

最新推荐文章于 2024-06-19 17:27:45 发布

weixin_39968722

最新推荐文章于 2024-06-19 17:27:45 发布

阅读量860

点赞数

文章标签： python爬虫编程代码

爬取爬虫框架的通用代码

import requests

def getHtmlText(url):

try:

Headers = {

'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/79.0.3945.130 Safari/537.36'}

r = requests.get(url,headers=Headers)

r.raise_for_status() # 如果状态不是200，则为httpError异常

r.encoding = r.apparent_encoding

return r.text

except:

return "产生异常"

if __name__ == "__main__":

url ="http://news.fznews.com.cn/shehui/list.shtml"

HtmlText = getHtmlText(url)

print(HtmlText)

2、爬取照片代码

import requests

def getPicture(url):

try:

Headers = {'user-agent':

'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/79.0.3945.130 Safari/537.36'}

r = requests.get(url,headers\=Headers)

r.raise\_for\_status() \# 如果状态不是200，则为httpError异常

return r.content

except:

return "产生异常"

if \_\_name\_\_ == "\_\_main\_\_":

picurl ="http://img0.dili360.com/pic/2019/10/23/5db027e9441a73i93221149.jpg"

path="C://Users//fuxingyu//Desktop//abc.jpg"

Pic=getPicture(picurl)

with open(path,'wb')as f:

f.write(Pic)

f.close()

或者

import requests

import os

url="https://pic.rmb.bdstatic.com/1cf349c922d2e0faa054de841535a0788853.gif"

root="C://Users//fuxingyu//Desktop//"

path=root+url.split('/')[-1]

try:

if not os.path.split(root):

os.mkdir(root)

if not os.path.exists(path):

r=requests.get(url)

with open(path,'wb')as f:

f.write(r.content)

f.close()

print("文件保存成功")

else:

print("文件已经存在")

except:

print("爬取失败")

weixin_39968722

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
python爬虫编程代码_python爬虫

爬取爬虫框架的通用代码import requestsdef getHtmlText(url):try:Headers = {'user-agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/79.0.3945.130 Safari/537.36'}r = re...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。