py爬虫之获取b站广告一份

最新推荐文章于 2024-05-04 20:29:53 发布

肥宅乐乐水

最新推荐文章于 2024-05-04 20:29:53 发布

阅读量259

点赞数 1

文章标签： python xpath url

本文链接：https://blog.csdn.net/qq_46591743/article/details/118384387

版权

py爬虫之获取b站广告一份

本文是基于py的一则简单爬虫实例，通过调用request库和lxml库，实现对b站首页的访问，并提取一张首页广告区的图片保存到本地桌面。话不多说，上代码。

import requests
from lxml import etree
url='https://www.bilibili.com/'
headers={'user-agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.106 Safari/537.36'}
resp=requests.get(url=url,headers=headers)
print(resp)
tree=etree.HTML(resp.content.decode())
b=tree.xpath('//*[@id="reportFirst2"]/div[1]/div/div[1]/div/a/img/@src')[0]
print(b)
resp1=requests.get(url=b,headers=headers)
with open('C://users/administrator/desktop/123.jpg','wb') as f:
    f.write(resp1.content)
print('over')