python获取已打开的网页内容_用Python获取网页数据

最新推荐文章于 2024-02-04 10:15:00 发布

VIP文章李Moon

最新推荐文章于 2024-02-04 10:15:00 发布

阅读量2.1k

点赞数

文章标签： python获取已打开的网页内容

本文链接：https://blog.csdn.net/weixin_42515795/article/details/111963355

版权

# -coding: utf-8

imoprt urllib2

import urllib

import re

# 填写需要采集的网址

urlPath = '

# 设置网页头部信息，模拟浏览器

headers = {'User-Agent' : agent, 'Accept' : '*/*', 'Referer' : 'http://www.google.com'}

# 打开网页，并读取网页源码

request = urllib2.Request(urlPath, headers=headers)

response = urllib2.urlopen(request)

html = response.read()

# 构建图片标签正则表达式

img=re.compile(r"""""",re.I)

# 保存的图片名称和路径，需要自己设置

path = '~/Code/Python/img_splider/'

try:

# 使用正则匹配出所有的img标签

img_list = re.findAll(img, html)

# 遍历得到的所有标签，然后进行下载

for i in xrange(length(img_list)):

# 使用urllib读取打开图片

data = urllib.urlopen(img_list[i]).read()

优惠劵

关注关注