R网络爬虫之批量下载

最新推荐文章于 2023-08-20 20:43:18 发布

honeyasong

最新推荐文章于 2023-08-20 20:43:18 发布

阅读量3.1k

点赞数

分类专栏： R 文章标签： R 网络爬虫批量下载

本文链接：https://blog.csdn.net/asongsongsong/article/details/45507843

版权

R 专栏收录该内容

73 篇文章 3 订阅

订阅专栏

setwd("E:/r_w/")
#设置工作目录
library(RCurl)
html=getURL("http://rfunction.com/code/1202/")
#下载页面
temp=strsplit(html,"<li><a href=\"")[[1]]
#分割页面
files=strsplit(temp,"\"")
#分割页面
files=lapply(files,function(x){x[1]})
#此时files为list类型，取files中的每个元素的第一个元素
files=unlist(files)
#转换成非list类型
files=files[-(1:2)]
#去除第一第二行


base="http://rfunction.com/code/1202/"
for(i in 1:length(files))
{
  url=paste(base,files[i],sep='') 
  temp=getBinaryURL(url)
  #下载文件
  note=file(paste("1202",files[i],sep='.'),open="wb")
  #设置目录
  writeBin(temp,note)
  #写入
  close(note)
  Sys.sleep(2)
}

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

honeyasong

关注关注

0
点赞
踩
5

收藏

觉得还不错? 一键收藏
0
评论
R网络爬虫之批量下载

setwd("E:/r_w/")#设置工作目录library(RCurl)html=getURL("http://rfunction.com/code/1202/")#下载页面temp=strsplit(html,"<a href=\"")[[1]]#分割页面files=strsplit(temp,"\"")#分割页面files=lapply(files,function(x){
复制链接

扫一扫