pdf_down_py

最新推荐文章于 2020-11-25 07:13:41 发布

xinghuaCai

最新推荐文章于 2020-11-25 07:13:41 发布

阅读量323

点赞数

分类专栏： py脚本

本文链接：https://blog.csdn.net/xinghuacai/article/details/49662341

版权

py脚本专栏收录该内容

1 篇文章 0 订阅

订阅专栏

#通过一个python的google搜索引擎模块实现简单关键字批量下载pdf工具

import google
import requests
def download_file(url,index):
	local_filename=index+"-"+url.split("/")[-1]
	r=requests.get(url,stream=True)
	with open(local_filename,"wb") as f:
		for chunk in r.iter_content(chunk_size=1024):
			if chunk:
				f.write(chunk)
				f.flush
	return local_filename			
g=google.search('site:*.gov.ph filetype:pdf',tld='com.hk')
index=1
for url in g:
	if url.endswith(".pdf"):
		file_path=download_file(url,str(index))
		print "downloading:"+url+"->"+file_path
		index+=1
print "all download finished"

xinghuaCai

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
pdf_down_py

#通过一个python的google搜索引擎模块实现简单关键字批量下载pdf工具import googleimport requestsdef download_file(url,index): local_filename=index+"-"+url.split("/")[-1] r=requests.get(url,stream=True) with open(local_fil
复制链接

扫一扫