R爬取网页信息

最新推荐文章于 2020-10-09 11:06:19 发布

aniuzaixian_2017

最新推荐文章于 2020-10-09 11:06:19 发布

阅读量142

点赞数

原文链接：http://www.cnblogs.com/zhp2016/p/6005440.html

版权

#爬取电影票房信息
library(stringr)
library(XML)
library(maps)
#htmlParse()用来interpreting HTML
#创建一个object
movie_parsed<-htmlParse("http://58921.com/boxoffice/wangpiao/20161004",
                        encoding = "UTF-8")
#the next step:extract tables/data
#readHTMLTable() for identifying and reading out those tables
tables<-readHTMLTable(movie_parsed,stringsAsFactors=FALSE)
is.matrix(tables)
is.character(tables)
is.data.frame(tables)
is.list(tables)
#so we got an "list" format#

转载于:https://www.cnblogs.com/zhp2016/p/6005440.html

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

aniuzaixian_2017

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
R爬取网页信息

#爬取电影票房信息library(stringr)library(XML)library(maps)#htmlParse()用来interpreting HTML#创建一个objectmovie_parsed<-htmlParse("http://58921.com/boxoffice/wangpiao/20161004", ...
复制链接

扫一扫