python实现下载_Python实现Http下载

最新推荐文章于 2024-05-11 05:22:34 发布

冷处理

最新推荐文章于 2024-05-11 05:22:34 发布

阅读量883

点赞数

文章标签： python实现下载

本文链接：https://blog.csdn.net/weixin_42501848/article/details/112895309

版权

一、背景：

1.每个文件的json格式的描述信息通过url指定，譬如

{"filename": "gcutil.tar.gz", "md5":

"506324a08271389f7d1ecb82090ef425", "filepath":

"http://szwg-stoff-hdfs.dmop.mango.com:8070/data/app/dpf/filecenter/1/gcutil.tar.gz?ugi=app,app"}

2. 希望从给定的url获取json格式的字符串，json解析，提取关键项(filename,md5,filepath)，下载并做md5校验

二、python程序

#!/usr/bin/python

coding:utf-8

#-------------------------------------------------------------------------------

Name: DownLoad

Purpose: download file from

url

Author: mango

Created: 18/7/2012

Licence:

licence>

#-------------------------------------------------------------------------------

import

urllib2,re,string

from

time import sleep

import

md5

import

json

########################

import

time

########################

#(1)function

get_md5：获取一个文件的MD5值

def

get_md5(full_filename):

f=file(full_filename,'rb')

return

md5.new(f.read()).hexdigest()

#(2)function

check_md5：判断一个文件的MD5值是否正确

def

check_md5(correct_md5,full_filename):

my_md5=

get_md5(full_filename)

if(my_md5.upper()==correct_md5.upper()):

return True

else:

return False

# (3)function

getItemValueFromURL：从网页上获取关键项item对应的值,不存在返回None

def

getItemValueFromURL(urlpath,item):

urlpath=urlpath

item=item

request=urllib2.Request(urlpath)

html_str=urllib2.urlopen(request).read()

#print

html_sr

jsonDict =json.loads(html_str)

for key in

jsonDict:

if(key==item):

return

jsonDict[item]

return

None

# (4)function downloadfile:用于下载文件

def

downloadfile(file_url,filename,download_path):

try:

request =

urllib2.Request(file_url)

f=open(download_path+filename,'wb')

start_time=time.time()

#print 'time stamp is : ',time.time()

print start_time

size =0

speed=0

data_lines =

urllib2.urlopen(request).readlines()

#data =

urllib2.urlopen(request).read()

for data in data_lines:

f.write(data)

size =

size + len(data)

dural_time=float(time.time()) - float(start_time)

if(dural_time>0):

speed =

float(size)/float(dural_time)/(1000*1000)

while(speed

>1):

print 'speed lagger than 1MB/s ,

sleep(0.1).....'

print 'sleep .....'

sleep(0.1)

dural_time=float(time.time()) -

float(start_time)

speed =

float(size)/float(dural_time)/(1000*1000)

print 'total time is : ',dural_time

,'seconds'

print 'size is : ',size ,'KB'

print 'speed is : ',speed ,'MB/s'

except

Exception,e:

print 'download error: ',e

return False

return

True

#(5) download: 调用上述函数下载文件

def

download(path='http://koala.dmop.mango.com:8080/fc/query/1/',download_path='c:/',attempt=5):

#1.init parameter

path=path

download_path=download_path

attempt=attempt

#2.

download

# download and check

md5

while(attempt>0):

attempt=attempt-1

#.get download parameter

filename=getItemValueFromURL(path,"filename")

correct_md5=getItemValueFromURL(path,"md5")

filepath=getItemValueFromURL(path,"filepath")

if(filename==None or filepath==None or

correct_md5==None):

#print "

download parameter not correct "

continue

#download

print 'begin the ',5-attempt,'time

download'

if(

(downloadfile(filepath,filename,download_path)) ==False

#print

"download failed ..continue"

continue

#check_md5

if(check_md5(correct_md5,download_path+filename)):

#print

"download ok ...."

return

True

return

False

#main

def main():

#init

parmeter

path='http://koala.dmop.mango.com:8080/fc/query/1/'

download_path='c:/'

#download

if(download(path,download_path)):

print "download commplete ..."

else:

print "download failed ..."

__name__=='__main__':

main()

冷处理

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python实现下载_Python实现Http下载

一、背景：1.每个文件的json格式的描述信息通过url指定，譬如{"filename": "gcutil.tar.gz", "md5":"506324a08271389f7d1ecb82090ef425", "filepath":"http://szwg-stoff-hdfs.dmop.mango.com:8070/data/app/dpf/filecenter/1/gcutil.tar.gz?...
复制链接

扫一扫