python学习之旅-爬虫

最新推荐文章于 2022-02-21 10:13:41 发布

liaolin147

最新推荐文章于 2022-02-21 10:13:41 发布

阅读量280

点赞数

本文链接：https://blog.csdn.net/liaolin147/article/details/73555536

版权

# -*- coding: utf-8 -*-
"""
Spyder Editor

This is a temporary script file.
"""

import re
import urllib.request

def getHtml(url):
    page = urllib.request.urlopen(url)
    html = page.read()
    return html

def getImg(html):    
    html = html.decode('utf_8')
    reg = r'src="(.*?\.jpg)" width'
    imgre = re.compile(reg)
    imglist = imgre.findall(html)
    return imglist

html = getHtml('https://movie.douban.com/')
x = 0
for imgurl in getImg(html):
    urllib.request.urlretrieve(imgurl,'%s.jpg' % x)
    x += 1

print(getImg(html))

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

liaolin147

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python学习之旅-爬虫

# -*- coding: utf-8 -*-"""Spyder EditorThis is a temporary script file."""import reimport urllib.requestdef getHtml(url): page = urllib.request.urlopen(url) html = page.r
复制链接

扫一扫